Integrating Single-Cell Results for Exploring and Analyzing Methylation
Analysis and visualization of Whole Genome Bisulfite Sequencing (WGBS) data requires reading aligned sequencing data into formats that existing packages like BSseq and scMET can analyze. Getting the data from on-disk formats like bedfiles to a matrix of methylation values can be difficult because, with nearly 30 million CpGs, WGBS data can be quite large.
iscream aims to efficiently read alinged (sc)WGBS data into formats that can be used by other packages. iscream uses htslib to query genomic regions to make matrices for BSSeq or aggregate the methylated reads for scMET.
iscream depends on the htslib header files. These may be installed with your package manager:
- ubuntu/debian:
libhts-dev
- fedora/RHEL:
htslib-devel
- brew:
htslib
- nixpkgs:
htslib
They may also be found among your HPC modules - make sure the PKG_CONFIG_PATH
environment variable includes the pkgconfig
location for your installation of
htslib. You can verify that the htslib development libraries are installed
with pkg-config
:
pkg-config --cflags --libs htslib
You can install the development version from Github by cloning the repo and running
git clone https://github.com/huishenlab/iscream
R CMD INSTALL iscream
You can also use the R devtools
package:
devtools::install_github("huishenlab/iscream")
or pak
:
pak::pkg_install("huishenlab/iscream")
A user guide is available on the package website. Bug reports may be submitted through GitHub issues.