Skip to content

Tools for processing and analyzing pooled Genetic Interaction map data

License

Notifications You must be signed in to change notification settings

mhorlbeck/GImap_tools

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GI map tools

This repository contains scripts and notebooks used in "Mapping the Genetic Landscape of Human Cells", Horlbeck et al., Cell 2018 and "Exploring genetic interaction manifolds constructed from rich single-cell phenotypes", Norman et al., Science 2019. At present, these are simply presented as used to process the data in the manuscript, with notebooks serving as examples of their features and use.

Tripleseq_fastqgz_to_counts.py

Adapted from [https://github.com/mhorlbeck/ScreenProcessing/fastqgz_to_counts.py], this is a command line script that counts reads and barcodes from the sgRNAs and joint barcodes sequenced by the "triple sequencing" approach. Input files are of the format *.fastq.gz and filenames are expected to be sortable such that the read 1, read 2, and read 3 sequence files for each sample are in consecutive order (e.g. sampleA_r1.fastq.gz, sampleA_r2.fastq.gz, sampleA_r3.fastq.gz, sampleB_r1.fastq.gz, sampleB_r2.fastq.gz, sampleB_r3.fastq.gz). The output are three files for each sample containing read counts for every sgRNA pair as determined by counting sgRNAs, barcodes, or reads where the sgRNAs and barcodes were concordent.

GImap_analysis.py

Collection of functions for computing sgRNA-level and gene-level GI scores and GI correlations, as well as functions for generating key quality control plots.

About

Tools for processing and analyzing pooled Genetic Interaction map data

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages