JGA data analysis workflows

In the Japanese Genotype-phenotype Archive (JGA), most whole-genome sequencing (WGS) data are registered in FASTQ format, so data users have to download the WGS data and process them themselves. To make this more convenient, the germline WGS data registered in JGA were processed with the workflows described below to produce alignment results (CRAM), per-sample variant calls (gVCF), and per-dataset variant calls (aggregated VCF). The processed data are registered in JGA, and data users can download them from JGA.

Workflows for germline WGS data processing

JGA analysis per-sample workflow. This workflow takes FASTQ files as input, aligns them to the reference genome (GRCh38), and performs per-sample variant calling. It outputs the alignment results (CRAM), the per-sample variant calls (gVCF), and the quality control metrics (CRAM-level and gVCF-level) used in later steps.
JGA analysis QC. This program performs quality control (QC) by visualizing the CRAM-level and gVCF-level metrics calculated by the JGA analysis per-sample workflow.
JGA analysis multi-samples workflow. This workflow takes multiple gVCF files as input, performs joint calling and variant quality score recalibration (VQSR), and outputs the per-dataset variant calls (aggregated VCF). A summarized, sites-only aggregated VCF is then generated from this output.

The JGA analysis per-sample workflow can be executed with the JGA analysis job manager.
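
The data flow between the three steps above can be sketched in a few lines of Python. This is only an illustration of which outputs feed which step, not code from this repository: the function names, the dictionary keys, and every file name and extension are hypothetical, and no alignment or variant-calling tool is invoked.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class PerSampleOutputs:
    """Outputs the per-sample workflow is described as producing."""
    cram: str            # alignment results
    gvcf: str            # per-sample variant calls
    metrics: List[str]   # CRAM-level and gVCF-level QC metrics


def plan_per_sample(sample_id: str) -> PerSampleOutputs:
    # File names are hypothetical; the real workflow chooses its own.
    return PerSampleOutputs(
        cram=f"{sample_id}.cram",
        gvcf=f"{sample_id}.g.vcf.gz",
        metrics=[f"{sample_id}.cram_metrics", f"{sample_id}.gvcf_metrics"],
    )


def plan_dataset(dataset_id: str, sample_ids: List[str]) -> Dict[str, object]:
    """Lay out which per-sample outputs feed QC and the multi-samples step."""
    per_sample = {s: plan_per_sample(s) for s in sample_ids}
    return {
        "per_sample": per_sample,
        # QC visualizes the metrics from every sample.
        "qc_inputs": [m for o in per_sample.values() for m in o.metrics],
        # Joint calling and VQSR take all gVCFs of the dataset as input.
        "joint_call_inputs": [o.gvcf for o in per_sample.values()],
        "aggregated_vcf": f"{dataset_id}.vcf.gz",
        "sites_only_vcf": f"{dataset_id}.sites_only.vcf.gz",
    }


if __name__ == "__main__":
    plan = plan_dataset("DATASET1", ["SAMPLE1", "SAMPLE2"])
    for key in ("qc_inputs", "joint_call_inputs", "aggregated_vcf", "sites_only_vcf"):
        print(key, plan[key])
```

The point of the sketch is the dependency order: the per-sample step must finish for every sample in a dataset before the QC and multi-samples steps can run, since both consume outputs from all samples.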
