Skip to content

Multi-task deep learning framework for multi-omics data analysis

License

Notifications You must be signed in to change notification settings

elliewith/OmiEmbed

 
 

Repository files navigation

OmiEmbed

DOI Codacy grade GitHub license Safe GitHub stars GitHub forks

OmiEmbed: A Unified Multi-task Deep Learning Framework for Multi-omics Data

Xiaoyu Zhang ([email protected])

Data Science Institute, Imperial College London

Introduction

OmiEmbed is a unified framework for deep learning-based omics data analysis, which supports:

  1. Multi-omics integration
  2. Dimensionality reduction
  3. Omics embedding learning
  4. Tumour type classification
  5. Phenotypic feature reconstruction
  6. Survival prediction
  7. Multi-task learning for aforementioned tasks

Paper Link: arXiv

Getting Started

Prerequisites

  • CPU or NVIDIA GPU + CUDA CuDNN
  • Python 3.6+
  • Python Package Manager
  • Python Packages
    • PyTorch 1.2+
    • TensorBoard 1.10+
    • Tables 3.6+
    • scikit-survival 0.6+
    • prefetch-generator 1.0+
  • Git 2.7+

Installation

  • Clone the repo
git clone https://github.com/zhangxiaoyu11/OmiEmbed.git
cd OmiEmbed
  • Install the dependencies
    • For conda users
    conda env create -f environment.yml
    conda activate omiembed
    • For pip users
    pip install -r requirements.txt

Try it out

  • Train and test using the built-in sample dataset with the default settings
python train_test.py
  • Check the output files
cd checkpoints/test/
  • Visualise the metrics and losses
tensorboard --logdir=tb_log --bind_all

Citation

If you use this code in your research, please cite our paper.

@misc{zhang2021omiembed,
      title={OmiEmbed: a unified multi-task deep learning framework for multi-omics data}, 
      author={Xiaoyu Zhang and Yuting Xing and Kai Sun and Yike Guo},
      year={2021},
      eprint={2102.02669},
      archivePrefix={arXiv},
      primaryClass={q-bio.GN}
}

License

This source code is licensed under the MIT license.

About

Multi-task deep learning framework for multi-omics data analysis

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 100.0%