SeQC

SeQC is still very much in the Alpha stage of development. A lot has changed over the last year since the videos you may have seen on vimeo (https://vimeo.com/123508180), so it is advised to not get too familar with the SeQC code until after my PhD has finished and I can actually work on this and other projects full-time.

SeQC is a "quality-control analysis" tool, but it is not like FASTQC or similar. Written entirely in Python, SeQC is designed to help you to investigate the nature of sequencing data at scale, get to grips with your raw data, and write your own modules to do custom analysis with very little extra code needed.

It differs from other tools by outputing data to either an SQLite or Postegres SQL database (SeQL format). Data in SeQL format can be vizulised through the SeQL webservice, which comes with a number of generic plugins to look at quantitative data. All of your QC data from your project, your lab, or even your whole institute, can be loaded into a single SeQL database, so all your QC data is in one place.

The SeQL webservice will also allow you to communicate with other SeQL databases (to compare data) and offers you the ability to host your data for others to see - either publicly or privately. For SeQC this means you can compare your mapping rates to other publications, look for odd trends in your data, etc etc.

Once the code has been finalized and we have settled on the first Beta release, a full guide to using and extending SeQC will be published :)

UPDATE 13th November 2016: SeQC will be getting a big update soon, notably:

pybam will be used instead of pysam/htspython, because its faster, 100% pypy complient, and works in the same way SeQC works (reading through the whole file once). This will also massively simplify the code, particularly for the BAM header, and SeQC will have 0 dependencies.
Modules will go from being python classes to json objects.
Modules will be able to have parameters (either required or optional), which will directly effect how argparse works so these parameters become 'native' when the module is loaded. For example ./SeQC --analysis GTF --GTF_FILE ./path/to/file.gtf
The output SQL databases made by SeQC are becoming their own project - SeQL databases. SeQC will drop all the javascript code as a result, but still contain the code to make/add data to a SeQL database. All the other projects (Signl, ACGTrie, BAM+, etc) will also switch over to SeQL output, so literally everything will be in the same self-describing SQL format, with vizulization modules built-in.
The master process will use select/fcntl as per the log.bio project to read subprocesses output rather than the ghetto ping.pong() method currently used. This is good because it means if a subprocess hangs or has a lengthy .before routine, it doesn't pause the status updating of all subprocesses. I anticipate .before and .after being more heavily used in the future, so this is important.
SeQC will go up on ac.gt finally, i'll spend a full week on just documentation, and then try and publish it in the journal of bioinformatics with all the project contributors getting authorship.

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
.gitignore		.gitignore
CHR.stat		CHR.stat
CIGAR.stat		CIGAR.stat
FIG.stat		FIG.stat
FLAG.stat		FLAG.stat
GC.stat		GC.stat
MAPQ.stat		MAPQ.stat
PNEXT.stat		PNEXT.stat
POS.stat		POS.stat
QNAME.stat		QNAME.stat
QUAL.stat		QUAL.stat
README.md		README.md
RGID.stat		RGID.stat
RNAME.stat		RNAME.stat
RNEXT.stat		RNEXT.stat
SAMECHR.stat		SAMECHR.stat
SEQ.stat		SEQ.stat
SeQC.js.py		SeQC.js.py
TAGS.stat		TAGS.stat
TAGS_FULL.stat		TAGS_FULL.stat
TAG_NAMES.stat		TAG_NAMES.stat
TLEN.stat		TLEN.stat
TMR.stat		TMR.stat
TYPE.stat		TYPE.stat
stats_explained.py		stats_explained.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SeQC

About

Releases

Packages

Contributors 2

Languages

JohnLonginotto/SeQC

Folders and files

Latest commit

History

Repository files navigation

SeQC

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages