Generative Antibody Design for Complementary Chain Pairing Sequences through Encoder-Decoder Language Model

Published: 27 Oct 2023, Last Modified: 20 Nov 2023, GenBio@NeurIPS2023 Poster
Keywords: language model; protein; antibody
TL;DR: We present pAbT5, a protein language model that captures antibody chain pairing patterns and generates paired sequences, and that could serve as a tool for antibody research and therapeutic development.
Abstract: Current protein language models (pLMs) predominantly focus on single-chain protein sequences and often do not account for the constraints that protein-protein interactions impose on generative design. To address this gap, we present paired Antibody T5 (pAbT5), an encoder-decoder model that generates the complementary heavy or light chain from its pairing partner. We show that our model respects conservation in framework regions and variability in hypervariable domains, as demonstrated by agreement with sequence alignment and by variable-length CDR loops. We also show that our model captures chain pairing preferences by recovering the ground-truth chain type and gene families. Our results showcase the potential of pAbT5 for generative antibody design that incorporates biological constraints from chain pairing preferences.
Submission Number: 10