Accelerating Bayesian Optimization for Biological Sequence Design with Denoising Autoencoders

Stanton, Samuel; Maddox, Wesley; Gruver, Nate; Maffettone, Phillip; Delaney, Emily; Greenside, Peyton; Wilson, Andrew Gordon

Computer Science > Machine Learning

arXiv:2203.12742 (cs)

[Submitted on 23 Mar 2022 (v1), last revised 12 Jul 2022 (this version, v2)]

Title:Accelerating Bayesian Optimization for Biological Sequence Design with Denoising Autoencoders

Authors:Samuel Stanton, Wesley Maddox, Nate Gruver, Phillip Maffettone, Emily Delaney, Peyton Greenside, Andrew Gordon Wilson

View PDF

Abstract:Bayesian optimization (BayesOpt) is a gold standard for query-efficient continuous optimization. However, its adoption for drug design has been hindered by the discrete, high-dimensional nature of the decision variables. We develop a new approach (LaMBO) which jointly trains a denoising autoencoder with a discriminative multi-task Gaussian process head, allowing gradient-based optimization of multi-objective acquisition functions in the latent space of the autoencoder. These acquisition functions allow LaMBO to balance the explore-exploit tradeoff over multiple design rounds, and to balance objective tradeoffs by optimizing sequences at many different points on the Pareto frontier. We evaluate LaMBO on two small-molecule design tasks, and introduce new tasks optimizing \emph{in silico} and \emph{in vitro} properties of large-molecule fluorescent proteins. In our experiments LaMBO outperforms genetic optimizers and does not require a large pretraining corpus, demonstrating that BayesOpt is practical and effective for biological sequence design.

Comments:	ICML 2022. Code available at this https URL
Subjects:	Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Quantitative Methods (q-bio.QM); Machine Learning (stat.ML)
Cite as:	arXiv:2203.12742 [cs.LG]
	(or arXiv:2203.12742v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2203.12742

Submission history

From: Andrew Wilson [view email]
[v1] Wed, 23 Mar 2022 21:58:45 UTC (469 KB)
[v2] Tue, 12 Jul 2022 11:53:25 UTC (653 KB)

Computer Science > Machine Learning

Title:Accelerating Bayesian Optimization for Biological Sequence Design with Denoising Autoencoders

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Accelerating Bayesian Optimization for Biological Sequence Design with Denoising Autoencoders

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators