Self-omics: A Self-supervised Learning Framework for Multi-omics Cancer Data

Architecture

To create a conda environment using the environment file given, run the command given below:

conda env create -f environment.yml
conda activate self-omics

Rename gene expression data as A.tsv, DNA methylation data as B.tsv, and miRNA expression dataset as C.tsv
Place the files in data folder
(Optional) Run cells in notebooks/preprocessing.ipynb to convert .tsv files to .npy files. This helps in loading data quicker as well as alleviating memory issues.

Clone this repository: git clone https://github.com/hashimsayed0/self-omics.git
Change directory to this project folder: cd self-omics
Edit scrips/train.sh as you like and run the script: sh ./scripts/train.sh
Logs will be uploaded to wandb once you login and models will be saved in checkpoints folder

Code for a few functions and networks was taken from the repository OmiEmbed and modified as needed.

Name		Name	Last commit message	Last commit date
Latest commit History 178 Commits
.vscode		.vscode
anno		anno
data		data
models		models
notebooks		notebooks
scripts		scripts
sweeps		sweeps
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
stand_ml.py		stand_ml.py
test.py		test.py
test_phases.py		test_phases.py
train.py		train.py
train_in_phases.py		train_in_phases.py