BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments

Yusuf H Roohani; Jian Vora; Qian Huang; Percy Liang; Jure Leskovec

BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments

Yusuf H Roohani, Jian Vora, Qian Huang, Percy Liang, Jure Leskovec

Published: 11 Mar 2024, Last Modified: 15 Mar 2024LLMAgents @ ICLR 2024 PosterEveryoneRevisionsBibTeXCC BY 4.0

Keywords: large language model, AI agent, computational biology, experiment design

TL;DR: We develop BioDiscoveryAgent, an LLM-based AI agent that can effectively design genetic perturbation experiments

Abstract: Genetic perturbation experiments play a crucial role in discovering the mechanisms behind diseases and informing drug development. These experiments aim to find a small subset out of many possible genes that yield a particular phenotype (e.g. cell growth) upon perturbation. However, the costs involved in each experiment limits the number of perturbations that can be tested. Here, we develop BioDiscoveryAgent, an AI agent that can strategically design genetic perturbation experiments to enhance the detection of perturbations that induce desired phenotypes. Our AI agent is based on large language models, which have rich biological knowledge, and generate explainable rationales while selecting genes to perturb. BioDiscoveryAgent attains an average of 23% improvement compared to existing Bayesian optimization baselines in detecting desired phenotypes across five datasets. This includes one dataset that is unpublished and therefore guaranteed to not appear in the language model's training data. Additionally, BioDiscoveryAgent is uniquely able to predict gene combinations to perturb, a task so far not explored in this setting. Overall, our approach represents a simple new paradigm in computational design of biological experiments, aimed at augmenting scientists' capabilities and accelerating scientific discovery.

Submission Number: 116

Loading