Molecule Language Model with Augmented Pairs and Expertise Transfer

Published: 06 Jul 2024, Last Modified: 28 Jul 2024Language and Molecules ACL 2024 PosterEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Molecule Language Model, Data Augmentation
Abstract: Understanding the molecules and their textual descriptions via molecule language models (MoLM) recently got a surge of interest among researchers. However, unique challenges exist in the field of MoLM due to 1) a limited amount of molecule-text paired data and 2) missing expertise that occurred due to the specialized areas of focus among the experts. To this end, we propose AMOLE, which 1) augments molecule-text pairs with structural similarity preserving loss, and 2) transfers the expertise between the molecules. Extensive experiments on various downstream tasks demonstrate the superiority of AMOLE in comprehending molecules and their descriptions, highlighting its potential for application in real-world drug discovery.
Archival Option: The authors of this submission do *not* want it to appear in the archival proceedings.
Submission Number: 13
Loading