Skip to content

helicalAI/helical

Repository files navigation

What is Helical ?

Helical provides a framework for state-of-the-art pre-trained bio foundation models on genomics and transcriptomics modalities.

Helical simplifies the entire application lifecycle when building with bio foundation models. You will be able to:

  • Leverage the latest bio foundation models through our easy-to-use python package
  • Run example notebooks on key downstream tasks from examples

We will update this repo on a regular basis with new models, benchmarks, modalities and functions - so stay tuned. Let’s build the most exciting AI-for-Bio community together!

Installation

We recommend installing Helical within a conda environment with the commands below (run them in your terminal) - this step is optional:

conda create --name helical-package python=3.11.8
conda activate helical-package

To install the Helical package, you can run the command below:

pip install --upgrade --force-reinstall git+https://github.com/helicalAI/helical.git

Installation

RNA models:

DNA models:

Demo & Use Cases

To run examples, be sure to have installed the Helical package (see Installation) and that it is up-to-date.

You can look directly into the example folder above and download the script of your choice, look into our documentation for step-by-step guides or directly clone the repository using:

git clone https://github.com/helicalAI/helical.git

Within the examples/notebooks folder, open the notebook of your choice. We recommend starting with Quick-Start-Tutorial.ipynb

Current Examples:

Example Description Colab
Quick-Start-Tutorial.ipynb A tutorial to quickly get used to the helical package and environment. Open In Colab
Geneformer-vs-UCE.ipynb Zero-Shot Reference Mapping with Geneformer & UCE and compare the outcomes. Open In Colab
Hyena-DNA-Inference.ipynb An example how to do probing with HyenaDNA by training a neural network on 18 downstream classification tasks. Open In Colab
Cell-Type-Annotation.ipynb An example how to do probing with scGPT by training a neural network to predict cell type annotations. Open In Colab
Coming Soon New models such as SCimilarity, scVI; benchmarking scripts; new use cases; others

Stuck somewhere ? Other ideas ?

We are eager to help you and interact with you. Reach out via [email protected]. You can also open github issues here.

Why should I use Helical & what to expect in the future?

If you are (or plan to) working with bio foundation models s.a. Geneformer or UCE on RNA and DNA data, Helical will be your best buddy! We provide and improve on:

  • Up-to-date model library
  • A unified API for all models
  • User-facing abstractions tailored to computational biologists, researchers & AI developers
  • Innovative use case and application examples and ideas
  • Efficient data processing & code-base

We will continuously upload the latest model, publish benchmarks and make our code more efficient.

Acknowledgements

A lot of our models have been published by talend authors developing these exciting technologies. We sincerely thank the authors of the following open-source projects:

Licenses

You can find the Licenses for each model implementation in the model repositories:

Citation

Please use this BibTeX to cite this repository in your publications:

@software{helical,
  author = {Maxime Allard, Benoit Putzeys, Rick Schneider, Mathieu Klop},
  title = {Helical Python Package},
  year = {2024},
  publisher = {GitHub},
  journal = {GitHub Repository},
  howpublished = {\url{https://github.com/helicalAI/helical}},
}