CanDI - A global cancer data integrator

Package Installation

First, you need to clone this repository to use CanDI.

git clone https://github.com/GilbertLabUCSF/CanDI.git

We suggest to use Conda as a package manager and environment management system. You can create a fresh conda environment with all CanDI’s requirements using bellow command:

conda env create -f CanDI/candi.yml -n candi

Prepare Datasets

The python command from CanDI will automatically download and modify datasets.

python CanDI/CanDI/setup/install.py

Downloaded and formatted datasets would organize this way:

.
├── config.ini # modified after Installation
├── depmap
│   ├── CCLE_expression.csv
│   ├── CCLE_fusions.csv
│   ├── CCLE_gene_cn.csv
│   ├── CCLE_mutations.csv
│   ├── CCLE_RNAseq_reads.csv
│   ├── CRISPR_gene_dependency.csv
│   ├── CRISPR_gene_effect.csv
│   └── sample_info.csv
├── genes
│   └── gene_info.csv
└── locations
    └── merged_locations.csv

Package Usage

Import CanDI into python

To import CanDI, your active directory in python must be same as the cloned folder.

from CanDI import candi

OR, you can add path to the CanDI directory if you want to use it from other directories.

import sys
sys.path.append("path-to-candi-directory")

from CanDI import candi

CanDI Objects

data : Container for all candi datasets. All access to datasets go through data object.
Gene : Provides cross dataset indexing from the gene perspective.
CellLine : Provides cross dataset indexing from the cell line perspective.
Cancer : Provides cross dataset indexing by a group of cell lines that are all the same tissue.
Organelle: Provides cross dataset indexing for a group of genes whose proteins localize to the same organelle.
CellLineCluster : Provides cross dataset indexing for a group of user defined cell lines.
GeneCluster : Provides cross dataset indexing for a group of user defined genes.

Name		Name	Last commit message	Last commit date
Latest commit History 130 Commits
.github/workflows		.github/workflows
CanDI		CanDI
docs		docs
scripts		scripts
tests		tests
.gitignore		.gitignore
.readthedocs.yaml		.readthedocs.yaml
LICENSE		LICENSE
README.rst		README.rst
brca_heatmap.ipynb		brca_heatmap.ipynb
candi.yml		candi.yml
deseq_setup.ipynb		deseq_setup.ipynb
get-started.ipynb		get-started.ipynb
kras_egfr_scatter.ipynb		kras_egfr_scatter.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CanDI - A global cancer data integrator

Package Installation

Prepare Datasets

Package Usage

Import CanDI into python

CanDI Objects

About

Releases

Packages

Languages

License

GenomicsNX/CanDI

Folders and files

Latest commit

History

Repository files navigation

CanDI - A global cancer data integrator

Package Installation

Prepare Datasets

Package Usage

Import CanDI into python

CanDI Objects

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages