@MachineLearningLifeScience

Machine Learning in Life Science

Welcome to the GitHub page for the Center for Basic Machine Learning Research in Life Science.

We conduct the basic machine learning research needed to estimate representations of biomedical data that are

  • Robust
  • Interpretable
  • Data efficient
  • Reflective of inherent data uncertainty
  • Able to leverage existing knowledge

These representations support both predictive and knowledge discovery tasks.

Research

Our research focuses on four themes. Each theme advances a different aspect of representation learning for life science, and the themes support each other:

  1. Meaningful representation of data, and development of the computational and mathematical tools needed to realize it.
  2. Geometric constructions to incorporate existing knowledge into representations and ensure that the result is understandable by humans.
  3. Representation of data types that often appear in life science, such as trees, graphs, and sequences.
  4. Inclusion of real data that is “noisy” and investigation of how associated uncertainty is best encoded.

Pinned

  1. meaningful-protein-representations Public

    Jupyter Notebook 102 9

  2. stochman Public

    Algorithms for computations on random manifolds made easier

    Python 80 9

  3. BEND Public

    BEND: Benchmarking DNA Language Models on Biologically Meaningful Tasks

    Python

  4. torchplot Public

    Plotting pytorch tensors made easy!

    Python 14 1

  5. poli Public

    Protein Objectives Library

    Python 13 1

Repositories

Showing 10 of 11 repositories
  • poli Public

    Protein Objectives Library

    MachineLearningLifeScience/poli’s past year of commit activity
    Python 13 MIT 1 35 2 Updated Jul 12, 2024
  • poli-baselines Public

    A collection of objective functions and black box optimization algorithms related to proteins and small molecules

    MachineLearningLifeScience/poli-baselines’s past year of commit activity
    Python 5 MIT 0 10 (1 issue needs help) 3 Updated Jul 9, 2024
  • poli-docs Public

    Documentation for poli and poli-baselines

    MachineLearningLifeScience/poli-docs’s past year of commit activity
    4 0 3 2 Updated Jul 8, 2024
  • hdbo_benchmark Public

    Code for "A survey and benchmark of high-dimensional Bayesian optimization of discrete sequences"

    MachineLearningLifeScience/hdbo_benchmark’s past year of commit activity
    Python 2 0 0 0 Updated Jun 25, 2024
  • protein_regression Public

    The codebase to replicate the analysis of "A systematic analysis of regression models for protein engineering" (2024).

    MachineLearningLifeScience/protein_regression’s past year of commit activity
    Jupyter Notebook 2 MIT 1 0 0 Updated Jun 12, 2024
  • corel Public
    MachineLearningLifeScience/corel’s past year of commit activity
    Python 1 MIT 0 4 0 Updated Apr 12, 2024
  • stochman Public

    Algorithms for computations on random manifolds made easier

    MachineLearningLifeScience/stochman’s past year of commit activity
    Python 80 Apache-2.0 9 10 0 Updated Dec 4, 2023
  • BEND Public

    BEND: Benchmarking DNA Language Models on Biologically Meaningful Tasks

    MachineLearningLifeScience/BEND’s past year of commit activity
    Python 0 BSD-3-Clause 0 0 0 Updated Nov 24, 2023
  • .github Public
    MachineLearningLifeScience/.github’s past year of commit activity
    1 0 0 0 Updated Aug 18, 2023
  • meaningful-protein-representations Public
    MachineLearningLifeScience/meaningful-protein-representations’s past year of commit activity
    Jupyter Notebook 102 BSD-3-Clause 9 4 0 Updated Mar 7, 2022
