Multi-Objective Bayesian Optimization with Independent Tanimoto Kernel Gaussian Processes for Diverse Pareto Front Exploration (README.md in construction)

GP-MOBO is a novel multi-objective Bayesian Optimization (MOBO) algorithm designed to optimize molecular properties using Gaussian Processes (GPs). Leveraging independent Tanimoto kernel GPs for each molecular objective, the model effectively explores the Pareto frontier, balancing exploration and exploitation to identify high-quality, diverse candidate molecules.

Key Features:

Independent Tanimoto Kernel GPs: Models each molecular objective separately, capturing the full dimensionality of molecular fingerprints without reducing complexity.
Efficient Pareto Front Exploration: Utilizes the Expected Hypervolume Improvement (EHVI) acquisition function, ensuring superior coverage of the chemical search space.
Scalable & Computationally Efficient: The model scales well for large datasets and is optimized for multi-objective tasks, making it suitable for drug discovery and molecular design.

Python Scripts to Run:

Dockstring Toy MPO Setup
GUACAMOL MPO Setup

For DockSTRING Toy MPO Setup, go to dockstring-test-implementation branch, run for 3 experiments:

python ehvi_mc_3_trials.py

or

python ehvi_mc.py

For GUACAMOL MPO Setup, go to guacamol-testbranch implementation, run:

python ehvi_{mpo_name}.py

Example:

python ehvi_fexofenadine.py

Datasets to Download:

DOCKSTRING (https://github.com/dockstring/dockstring)
GUACAMOL: EXTRACTED FROM GUACAMOL BENCHMARK (https://github.com/BenevolentAI/guacamol)

pip install dockstring

Pacakge Versions:

Running the code requires:

KERN_GP which consists of a minimal kernel-only GP package from https://github.com/AustinT/kernel-only-GP
Numpy
Rdkit
PyTDC which consists the oracle functions for all objectives required. https://github.com/mims-harvard/TDC

Running for code comparison to existing methods requires:

https://github.com/AustinT/basic-mol-bo-workshop2024

Development

Please use pre-commit for code formatting / linting.

Name		Name	Last commit message	Last commit date
Latest commit History 90 Commits
.vscode		.vscode
gp_mobo		gp_mobo
research		research
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
__init__.py		__init__.py
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-Objective Bayesian Optimization with Independent Tanimoto Kernel Gaussian Processes for Diverse Pareto Front Exploration (README.md in construction)

Pacakge Versions:

Development

About

Releases

Packages

Languages

IgnotaLabs/GP-MOBO

Folders and files

Latest commit

History

Repository files navigation

Multi-Objective Bayesian Optimization with Independent Tanimoto Kernel Gaussian Processes for Diverse Pareto Front Exploration (README.md in construction)

Pacakge Versions:

Development

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages