Skip to content

GenomicsNX/CanDI

 
 

Repository files navigation

CanDI - A global cancer data integrator

Documentation Status DOI

Package Installation

First, you need to clone this repository to use CanDI.

git clone https://github.com/GilbertLabUCSF/CanDI.git

We suggest to use Conda as a package manager and environment management system. You can create a fresh conda environment with all CanDI’s requirements using bellow command:

conda env create -f CanDI/candi.yml -n candi

Prepare Datasets

The python command from CanDI will automatically download and modify datasets.

python CanDI/CanDI/setup/install.py

Downloaded and formatted datasets would organize this way:

.
├── config.ini # modified after Installation
├── depmap
│   ├── CCLE_expression.csv
│   ├── CCLE_fusions.csv
│   ├── CCLE_gene_cn.csv
│   ├── CCLE_mutations.csv
│   ├── CCLE_RNAseq_reads.csv
│   ├── CRISPR_gene_dependency.csv
│   ├── CRISPR_gene_effect.csv
│   └── sample_info.csv
├── genes
│   └── gene_info.csv
└── locations
    └── merged_locations.csv

Package Usage

Import CanDI into python

To import CanDI, your active directory in python must be same as the cloned folder.

from CanDI import candi

OR, you can add path to the CanDI directory if you want to use it from other directories.

import sys
sys.path.append("path-to-candi-directory")

from CanDI import candi

CanDI Objects

  • data : Container for all candi datasets. All access to datasets go through data object.
  • Gene : Provides cross dataset indexing from the gene perspective.
  • CellLine : Provides cross dataset indexing from the cell line perspective.
  • Cancer : Provides cross dataset indexing by a group of cell lines that are all the same tissue.
  • Organelle: Provides cross dataset indexing for a group of genes whose proteins localize to the same organelle.
  • CellLineCluster : Provides cross dataset indexing for a group of user defined cell lines.
  • GeneCluster : Provides cross dataset indexing for a group of user defined genes.

About

CanDI - A global cancer data integrator

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 87.1%
  • Python 12.7%
  • R 0.2%