[![PyPI Version](https://img.shields.io/pypi/v/pyGenomeTracks.svg?style=plastic)](https://pypi.org/project/pyGenomeTracks/) [![bioconda-badge](https://img.shields.io/conda/vn/bioconda/pyGenomeTracks.svg?style=plastic)](https://anaconda.org/bioconda/pygenometracks) [![bioconda-badge](https://img.shields.io/badge/install%20with-bioconda-brightgreen.svg?style=plastic)](http://bioconda.github.io) [![Build Status on Azure](https://dev.azure.com/wolffj/pyGenomeTracks/_apis/build/status/deeptools.pyGenomeTracks?branchName=master)](https://dev.azure.com/wolffj/pyGenomeTracks/_build/latest?definitionId=2&branchName=master) ![Coverage](./docs/coverage.svg) [![European Galaxy server](https://img.shields.io/badge/usegalaxy-.eu-brightgreen?logo=)](https://usegalaxy.eu/root?tool_id=pygenomeTracks) pyGenomeTracks ============== Standalone program and library to plot beautiful genome browser tracks ---------------------------------------------------------------------- pyGenomeTracks aims to produce high-quality genome browser tracks that are highly customizable. Currently, it is possible to plot: * bigwig * bed/gtf (many options) * bedgraph * bedgraph matrices (like TAD-separation scores) * epilogos * narrow peaks * links (represented as arcs, triangles or squares) * Hi-C matrices (as triangle or squares) * fasta * maf (multiple alignment format) Here is a scheme which describe how pyGenomeTracks is working (graphical abstract of [Lopez-Delisle et al. 2020](https://doi.org/10.1093/bioinformatics/btaa692)): ![pyGenomeTracks](https://github.com/deeptools/pyGenomeTracks/blob/b8918d43b0bd1ca9dd2939fdd0d93a5a297d0453/docs/content/images/graphicalabstract.png) pyGenomeTracks can make plots with or without Hi-C data. The following is an example output of pyGenomeTracks from [Ramírez et al. 2017](https://www.nature.com/articles/s41467-017-02525-w) ![pyGenomeTracks example](https://github.com/deeptools/pyGenomeTracks/blob/b8918d43b0bd1ca9dd2939fdd0d93a5a297d0453/docs/content/images/hic_example_nat_comm_small.png) Table of content ---------------- * [Installation](#installation) * [Usage](#usage) * [Citation](#citation) * [Documentation](#documentation) * [External users](#external-users) Installation ------------ pyGenomeTracks works with python >=3.8. The recommended way to install pyGenomeTracks is via conda ```bash conda create -n pygenometracks -c bioconda -c conda-forge pygenometracks ``` To get a specific version, one can specify it. For example: ```bash conda create -n pygenometracks -c bioconda -c conda-forge pygenometracks=3.5 python=3.7 ``` However, we noticed that conda installation can be quite slow so using mamba can help. You first need to create the environment and install mamba: ```bash conda create -n pygenometracks -c bioconda -c conda-forge mamba python=3.9 ``` Then activate the environment and install pygenometracks with mamba: ```bash conda activate pygenometracks mamba install -c conda-forge -c bioconda pygenometracks ``` or if you want a specific version: ```bash conda create -n pygenometracks -c bioconda -c conda-forge mamba python=3.7 conda activate pygenometracks mamba install -c conda-forge -c bioconda pygenometracks=3.5 ``` Also, pyGenomeTracks can be installed using pip ```bash pip install pyGenomeTracks ``` Since version 3.5, pyGenomeTracks uses BEDTools, don't forget to install it or load it into your environment. Usage ----- To run pyGenomeTracks a configuration file describing the tracks is required. The easiest way to create this file is using the program `make_tracks_file` which creates a configuration file with defaults that can be easily changed. The format is: ```bash make_tracks_file --trackFiles ... -o tracks.ini ``` `make_tracks_file` uses the file ending to guess the file type. Then, a region can be plotted using: ```bash pyGenomeTracks --tracks tracks.ini --region chr2:10,000,000-11,000,000 --outFileName nice_image.pdf ``` The ending `--outFileName` defines the image format. If `.pdf` is used, then the resulting image is a pdf. The options are pdf, png and svg. Description of other possible arguments: ``` text options: -h, --help show this help message and exit --tracks TRACKS File containing the instructions to plot the tracks. The tracks.ini file can be genarated using the `make_tracks_file` program. --region REGION Region to plot, the format is chr:start-end --BED BED Instead of a region, a file containing the regions to plot, in BED format, can be given. If this is the case, multiple files will be created. It will use the value of --outFileName as a template and put the coordinates between the file name and the extension. --width WIDTH figure width in centimeters (default is 40) --plotWidth PLOTWIDTH width in centimeters of the plotting (central) part --height HEIGHT Figure height in centimeters. If not given, the figure height is computed based on the heights of the tracks. If given, the track height are proportionally scaled to match the desired figure height. --title TITLE, -t TITLE Plot title --outFileName OUTFILENAME, -out OUTFILENAME File name to save the image, file prefix in case multiple images are stored --fontSize FONTSIZE Font size for the labels of the plot (default is 0.3 * figure width) --dpi DPI Resolution for the image in case the ouput is a raster graphics image (e.g png, jpg) (default is 72) --trackLabelFraction TRACKLABELFRACTION By default the space dedicated to the track labels is 0.05 of the plot width. This fraction can be changed with this parameter if needed. --trackLabelHAlign {left,right,center} By default, the horizontal alignment of the track labels is left. This alignemnt can be changed to right or center. --decreasingXAxis By default, the x-axis is increasing. Use this option if you want to see all tracks with a decreasing x-axis. --version show program's version number and exit ``` Citation -------- If you use pyGenomeTracks in your analysis, you can cite the following papers: Fidel Ramírez, Vivek Bhardwaj, Laura Arrigoni, Kin Chung Lam, Björn A. Grüning, José Villaveces, Bianca Habermann, Asifa Akhtar & Thomas Manke. High-resolution TADs reveal DNA sequences underlying genome organization in flies. Nature Communications (2018) [doi:10.1038/s41467-017-02525-w](https://www.nature.com/articles/s41467-017-02525-w) Lopez-Delisle L, Rabbani L, Wolff J, Bhardwaj V, Backofen R, Grüning B, Ramírez F, Manke T. pyGenomeTracks: reproducible plots for multivariate genomic data sets. Bioinformatics. 2020 Aug 3:btaa692. [doi: 10.1093/bioinformatics/btaa692](https://doi.org/10.1093/bioinformatics/btaa692). Epub ahead of print. PMID: 32745185. Documentation ------------- Our [documentation](http://pygenometracks.readthedocs.io/) provide [examples](http://pygenometracks.readthedocs.org/en/latest/content/examples.html), as well as the [full list of possible parameters](http://pygenometracks.readthedocs.org/en/latest/content/possible-parameters.html) and [guidelines for developers who would like to add a new track type](http://pygenometracks.readthedocs.org/en/latest/content/adding-new-tracks.html). External users -------------- * [CoolBox](https://github.com/GangCaoLab/CoolBox) is an interactive genomic data explorer for Jupyter Notebooks * [Galaxy](https://usegalaxy.eu/root?tool_id=toolshed.g2.bx.psu.edu/repos/iuc/pygenometracks/pygenomeTracks) integration offers a graphical user-interface to create PGT plots. It is also possible to include PGT into workflows and automatic pipelines.