BigBacter is a pipeline aimed at simplifying bacterial genomic surveillance.
Main features
- Saves your samples to a personal database (can be stored locally or on the cloud)
- Includes database samples when they are closely related to new samples
- Optimized to reduce core genome shrinkage
- Optimized for speed and resource usage
- Multiple species/subtypes can be included in a single run
- Generates report-ready figures
Main analyses:
- Recombination-aware core SNP analysis (Snippy and Gubbins)
- Accessory distance analysis (PopPUNK)
- Phylogenetic analysis (IQTREE2 or Rapidnj)
Main outputs:
- Phylogenetic trees (Core SNPs)
- Distance matrices (Accessory distance and Core SNPs)
- Tabulated summary (QC metrics, linkage summaries, and more!)
- Outputs are summarized in Microreact files 🙌
Required Inputs
Column Name | Description |
---|---|
sample | sample name |
taxa | sample taxonomy (species or lower) |
assembly | assembly file |
fastq_1 | foward read |
fastq_2 | reverse read |
BigBacter also requires a taxon-specific PopPUNK database (pre-made databases can be found here). (https://www.bacpop.org/poppunk/))
Tip
BigBacter is designed to run after general bacterial analysis (e.g, PHoeNIx, Bactopia, TheiaProk.)
Checkout the wiki to learn more!
BigBacter was originally created by Jared Johnson for the Washington State Department of Health. See a full list of contributors here.