Skip to content

DOH-JDJ0303/bigbacter-nf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BigBacter

Overview

BigBacter is a pipeline aimed at simplifying bacterial genomic surveillance.

Main features

  1. Saves your samples to a personal database (can be stored locally or on the cloud)
  2. Includes database samples when they are closely related to new samples
  3. Optimized to reduce core genome shrinkage
  4. Optimized for speed and resource usage
  5. Multiple species/subtypes can be included in a single run
  6. Generates report-ready figures

Main analyses:

  1. Recombination-aware core SNP analysis (Snippy and Gubbins)
  2. Accessory distance analysis (PopPUNK)
  3. Phylogenetic analysis (IQTREE2 or Rapidnj)

Main outputs:

  1. Phylogenetic trees (Core SNPs)
  2. Distance matrices (Accessory distance and Core SNPs)
  3. Tabulated summary (QC metrics, linkage summaries, and more!)
  4. Outputs are summarized in Microreact files 🙌

Required Inputs

Column Name Description
sample sample name
taxa sample taxonomy (species or lower)
assembly assembly file
fastq_1 foward read
fastq_2 reverse read

BigBacter also requires a taxon-specific PopPUNK database (pre-made databases can be found here). (https://www.bacpop.org/poppunk/))

Tip

BigBacter is designed to run after general bacterial analysis (e.g, PHoeNIx, Bactopia, TheiaProk.)

How to use BigBacter:

  1. Quick Start
  2. Full Instructions

Checkout the wiki to learn more!

BigBacter was originally created by Jared Johnson for the Washington State Department of Health. See a full list of contributors here.