top of page
PHG logo.PNG

Practical Haplotype Graph 

The Practical Haplotype Graph (PHG) is a powerful tool for representing pangenomes. The PHG is optimized for plant breeding and genetics, where genomic diversity can be high, phased haplotypes are common (e.g., inbred lines), and imputation with low-density markers is essential for breeding efficiency. This complements other imputation tools (e.g., BEAGLE) designed explicitly for handling samples from unphased species characterized by low genetic diversity and high-density genotyping.

​

The PHG is a graph-based trellis representation of consecutive genic and intergenic regions (called reference ranges), representing diversity across and between samples. It can be used to:

​

  • Create custom genomes for alignment

  • Call rare alleles

  • Impute genotypes

  • Efficiently store genomic data from many samples (i.e., reference, assemblies, and other lines)

​

The PHG also works well with community standards, including the Breeding API (BrAPI) and efficient tools for R, such as rPHG2 for pangenome extraction and rTASSEL for connecting genotype to phenotype.

For more information on how to get started, please visit our GitHub repository.

phg_architecture_simple.png
bottom of page