Skip to content

Pathway enrichment analysis / visualize for Single Cell Data

License

Unknown, MIT licenses found

Licenses found

Unknown
LICENSE
MIT
LICENSE.md
Notifications You must be signed in to change notification settings

noobCoding/CellEnrich

 
 

Repository files navigation

CellEnrich

Pathway Enrichment Analysis and Visualization for Single-cell Data


Installation

CellEnrich manual is available.

NOTE: on a fresh installation, users may need to install some required interpreter compilers for the system to install other R packages:

  • C++ compiler
  • gfortran compiler (FYI: tips for MAC users or other OS)
  • Seurat >= 5.0.0 is REQUIRED

Install Dependent Packages:

  • Some packages must be installed from sources that need compilation and a proper version of RTools.
  • It is recommended to install devtools, and BiocManager packages first before installing the following packages in Github/Bioconductor (not in CRAN).
# install required packages
install.packages('Seurat') # RStudio may need a RESTART for Seurat v5.0.1 to be activated
install.packages('remotes')
install.packages('waiter')
install.packages('farver')
remotes::install_github('vqv/ggbiplot')

# install CellEnrich
remotes::install_github('noobCoding/CellEnrich')

Example with PBMC_3K data

# Download data, if not downloaded
download.file('https://github.com/noobcoding/CellEnrich/raw/master/data/pbmcData.RData','pbmcData.RData', mode = 'wb')
download.file('https://github.com/noobcoding/CellEnrich/raw/master/data/pbmcClustInfo.RData','pbmcClustInfo.RData', mode = 'wb')
download.file('https://github.com/noobcoding/CellEnrich/raw/master/data/Human_Reactome.RData', 'Human_Reactome.RData', mode = 'wb')

# Load library and data
library(CellEnrich)
library(Seurat)

load("pbmcData.RData")
load("pbmcClustInfo.RData")

CountData <- pbmcData
GroupInfo <- pbmcClustInfo

# CellEnrich uses normalized count data as input
CountData <- NormalizeData(CountData)

# This will run CellEnrich
CellEnrich(CountData, GroupInfo)

Large datasets for testing

Two datasets for testing CellEnrich are too big for hosting on GitHub so you can directly download them at Zenodo link including:

  • 'GBM_sub' is the glioblastoma data from link, which contains human HFC(Highly functionally connected) and LFC(Low functionally connected) Glioblastoma cells with added information of GRIA2 expression.
# Download data directly at the Zenodo link above!

gbm_sub <- readRDS(file = "GBM_sub.rds")
gbm_sub$count<-NormalizeData(gbm_sub$count)

CellEnrich(gbm_sub$count,gbm_sub$class)

  • 'PD_dat' is the Parkinson's disease data from link, which contains human dopamine cell cluster information about Parkinson's disease vs. Control.
PD_dat <- readRDS("PD_dat.rds")
PD_dat$count <- NormalizeData(PD_dat$count)

CellEnrich(PD_dat$count,PD_dat$type)

Example with Alzheimer's data

# Download Alzheimer's data to the working directory
download.file('https://github.com/noobcoding/CellEnrich/raw/master/data/Alzheimer_Counts_sampled.RDS','Alzheimer_Counts_sampled.RDS', mode = 'wb')
download.file('https://github.com/noobcoding/CellEnrich/raw/master/data/Alzheimer_CellType_sampled.RDS','Alzheimer_CellType_sampled.RDS', mode = 'wb')
download.file('https://github.com/noobcoding/CellEnrich/raw/master/data/Human_Reactome.RData', 'Human_Reactome.RData', mode = 'wb')

GroupInfo <- readRDS("Alzheimer_CellType_sampled.RDS")
CountData <- readRDS("Alzheimer_Counts_sampled.RDS")

CountData <- NormalizeData(CountData)

CellEnrich(CountData, GroupInfo)

Dependency

Authors

License

This project is MIT licensed