GELKcat: An Integration Learning of Substrate Graph with Enzyme Embedding for Kcat prediction

Published: 01 Jan 2023, Last Modified: 28 Sept 2024BIBM 2023EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Computational modeling and identification of the enzyme turnover number k cat are crucial for synthetic biology and early-stage lead optimization. Therefore, the accurate assessment of the k cat for enzyme-substrate pairs is essential. Considering wet-lab experiment is time-consuming, laborious, and expensive, in silico prediction of k cat is an alternative choice. However, few computational methods have been developed to address this task and other enzyme kinetics predictions. To address this, we develop a novel end-to-end dual-representation framework GELKcat by harnessing graph transformers for substrate molecular encoding and CNNs for enzyme word2vec embeddings. We further integrate substrate and enzyme features using the adaptive gate network, which assigns optimal weights to capture the most suitable feature combinations. The comparison with several state-of-the-art methods exhibits the superiority of our GELKcat. The Ablation studies further illuminate the invaluable roles of the word2vec embeddings of enzymes. It is anticipated that this work can bridge current gaps in enzyme-substrate representation, which can give some guidance for drug discovery and synthetic biology.
Loading