Skip to content

Collection of scripts / notebooks to reliably select datasets

License

Notifications You must be signed in to change notification settings

openkinome/kinodata

Repository files navigation

datascripts

Collection of scripts / notebooks to reliably select and curate datasets

Current sources

  • data: Some definition data used to query other datasets. These files are mostly static because they don't change often, but they are versioned just in case.
  • human-kinases: Aggregates different sources that list the complete human kinome and provides a curated CSV file with Uniprot IDs and their origin.
  • kinases-in-chembl: Maps human kinases UniProt IDs (from human-kinases) to ChEMBL target IDs.
  • kinase-bioactivities-in-chembl: Retrieve and curate activities (IC50, Ki, Kd) of small compounds against human kinases in ChEMBL (requires local SQLite copy).

Installation

Create a conda environment using the included environment file:

conda env create -f devtools/environment.yml
conda activate kinodata