Denoising Drug Discovery ADMET Data for Improved Regression Task Performance

ICLR 2024 Workshop DMLR Submission25 Authors

Published: 04 Mar 2024, Last Modified: 02 May 2024DMLR @ ICLR 2024EveryoneRevisionsBibTeXCC BY 4.0
Keywords: Denoising, noise filter, noise reduction, denoising for regression, ADMET prediction, drug discovery
TL;DR: We propose denoising schemes to reduce the noise in the drug discovery ADMET data and improve model predictions for regression tasks.
Abstract: Predicting ADMET (absorption, distribution, metabolism, excretion, and toxicity) properties of small molecules is a key task in drug discovery. A major challenge in building better ADMET models is the experimental error inherent in the data. Here, we develop denoising schemes based on deep learning to address this. The most significant performance increase occurs when the original model is finetuned with the denoised data using training error as the noise detection metric. Our denoising scheme outperforms other literature schemes for ADMET data and has implications for improving models for experimental assay data in general.
Primary Subject Area: Active learning, Data cleaning, acquisition for ML
Paper Type: Extended abstracts: up to 2 pages
DMLR For Good Track: Participate in DMLR for Good Track
Participation Mode: In-person
Confirmation: I have read and agree with the workshop's policy on behalf of myself and my co-authors.
Submission Number: 25
Loading