BatchDTA: Implicit batch alignment enhances deep learning-based drug-target affinity estimation

Hongyu Luo, Yingfei Xiang, Xiaomin Fang, Wei Lin, Fan Wang, Hua Wu, Haifeng Wang

Published: 23 Nov 2021, Last Modified: 07 Jan 2026CrossrefEveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: h3>Abstract</h3> <p>Candidate compounds with high binding affinities toward a target protein are likely to be developed as drugs. Deep neural networks (DNNs) have attracted increasing attention for drug-target affinity (DTA) estimation owning to their efficiency. However, the negative impact of batch effects caused by measure metrics, system technologies, and other assay information is seldom discussed when training a DNN model for DTA. Suffering from the data deviation caused by batch effects, the DNN models can only be trained on a small amount of “clean” data. Thus, it is challenging for them to provide precise and consistent estimations. We design a batch-sensitive training framework, namely BatchDTA, to train the DNN models. BatchDTA implicitly aligns multiple batches toward the same protein, alleviating the impact of the batch effects on the DNN models. Extensive experiments demonstrate that BatchDTA facilitates four mainstream DNN models to enhance the ability and robustness on multiple DTA datasets. The average concordance index (CI) of the DNN models achieves a relative improvement of 4.0%. BatchDTA can also be applied to the fused data collected from multiple sources to achieve further improvement.</p>
Loading