Substructure-Atom Cross Attention for Molecular Representation Learning

Jiye Kim; Seungbeom Lee; Dongwoo Kim; Sungsoo Ahn; Jaesik Park

Published: 21 Oct 2022, Last Modified: 08 Feb 2026 | AI4Science Poster | Readers: Everyone

Keywords: Molecule representation learning

Abstract: Designing a neural network architecture for molecular representation is crucial for AI-driven drug discovery and molecule design. In this work, we propose a new framework for molecular representation learning. Our contribution is threefold: (a) demonstrating the usefulness of incorporating substructures into node-wise features from molecules, (b) designing a two-branch network, consisting of a transformer and a graph neural network, in which the branches are fused with asymmetric attention, and (c) not requiring heuristic features or computationally expensive information from molecules. Using 1.8 million molecules collected from the ChEMBL and PubChem databases, we pretrain our network to learn a general representation of molecules with minimal supervision. The experimental results show that our pretrained network achieves competitive performance on 11 downstream tasks for molecular property prediction.

TL;DR: This paper proposes a novel framework that effectively incorporates molecular substructure information into node-wise features.
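
To make the "asymmetric attention" mentioned in the abstract more concrete, the following is a minimal sketch of one possible reading: substructure embeddings act as queries and attend over atom (node-wise) features, while the atom features themselves are left unchanged. This is an illustration under stated assumptions, not the authors' implementation. The function names, the toy vectors, the residual update, and the absence of learned projection matrices are all hypothetical simplifications.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(substructures, atoms):
    """Each substructure vector (query) attends over all atom vectors (keys and values).

    Assumption: no learned projections; queries, keys, and values are the raw vectors.
    The update is one-directional (asymmetric): atoms are read but never modified.
    """
    dim = len(atoms[0])
    updated = []
    for q in substructures:
        # Scaled dot-product score between this substructure and every atom.
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(dim) for k in atoms]
        weights = softmax(scores)
        # Attention-weighted average of the atom vectors.
        attended = [sum(w * v[j] for w, v in zip(weights, atoms)) for j in range(dim)]
        # Residual connection (an assumption made for this sketch).
        updated.append([qi + ai for qi, ai in zip(q, attended)])
    return updated


# Toy inputs: 3 atoms and 2 substructures, each a 2-dimensional vector (hypothetical values).
atoms = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
substructures = [[0.5, 0.5], [1.0, 0.0]]
fused = cross_attention(substructures, atoms)
```

In a full model, the atom vectors would presumably come from the graph neural network branch and the substructure vectors from the transformer branch; here both are fixed toy values so the attention step can be read in isolation.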

Community Implementations: [5 code implementations](https://www.catalyzex.com/paper/substructure-atom-cross-attention-for/code)
