CICC

author

title

Maxime Tarabichi

ClusterID-based consensus clustering

CICC

Description and example run

A description of the concepts and example runs with dummy data is available in the pdf and the description folder (corresponding knitr Rnw).

Installation

There is no installation required, the R scripts that can be run on their own but do have some dependencies. Compile the C code in the scripts directory with the following command:

R CMD SHLIB scoringlite.c

This will generate the dynamic library to make co-clustering matrices from hard assignments vectors.

Dependencies

R packages: BiocGenerics S4Vectors IRanges GenomeInfoDb GenomicRanges

Run CICC for PCAWG

Run - step1

run Step1.submitALL.R with:

Rscript Step1.submitALL.R

This pipeline goes through the PCAWG sample IDs and submit one job per sample on a slurm-based cluster to run CICC.
The job will run runCICC.R, which loads the required data using utility functions from loadData.R, where paths are encoded, and then runs CICC and saves the output in a consensus format using functions from loadData.R.

Run - step2

run Step2.plotClusterCCF.R with:

Rscript Step2.plotClusterCCF.R

This is useful for visualisation of the results. It plots histograms of CCF and the cluster positions for each method and for the consensus.

Produced outputs

writeResultsMA will write a vector of mutation assignments, i.e. integer values corresponding to consensus cluster IDs
writeResultsClusterCCF will write a consensus subclonal architecture: a dataframe with three columns ("cluster", "n_ssms","proportion") and as many rows as identified (sub)clones.

cluster is the cluster ID
n_ssms is the number of SNVs assigned to that cluster
proportion is the fraction of cancer cells sharing these SNVs

writeResultsClusterCCF2 will write a file very similar to writeResultsClusterCCF but with fraction of the total cells instead of fraction of cancer cells.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
description		description
CICC.PAC.R		CICC.PAC.R
ClusterID-based consensus clustering.pdf		ClusterID-based consensus clustering.pdf
LICENSE		LICENSE
README.Rmd		README.Rmd
README.md		README.md
Step1.submitALL.R		Step1.submitALL.R
Step2.plotClusterCCF.final.R		Step2.plotClusterCCF.final.R
loadData.R		loadData.R
runCICC.R		runCICC.R
scoringlite.c		scoringlite.c
scoringlite.o		scoringlite.o
scoringlite.so		scoringlite.so

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CICC

Description and example run

Installation

Dependencies

Run CICC for PCAWG

Run - step1

Run - step2

Produced outputs

About

Releases

Packages

Languages

License

galder-max/CICC

Folders and files

Latest commit

History

Repository files navigation

CICC

Description and example run

Installation

Dependencies

Run CICC for PCAWG

Run - step1

Run - step2

Produced outputs

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages