Keywords: molecular property prediction, drug discovery, context conditioning, FiLM modulation, graph neural networks, multi-task learning, virtual screening, message passing neural networks, transfer learning, temporal generalization, benchmark evaluation, target-conditional modeling, distribution shift, hierarchical embeddings, activity prediction
TL;DR: First-ever systematic study definitively resolving when context conditioning helps molecular prediction. FiLM achieves unprecedented 24.2pp gains; enables breakthrough 2.9× improvement on data-scarce targets where traditional methods completely fail.
Abstract: We present the first systematic study of when target context helps molecular property prediction, evaluating context conditioning across 10 diverse protein families, 4 fusion architectures, data regimes spanning 67–9,409 training compounds, and both temporal and random evaluation splits. Using NESTDRUG, a FiLM-based nested-learning architecture that conditions molecular representations on target identity, we characterize both success and failure modes with three principal findings. First, fusion architecture dominates: FiLM outperforms concatenation by 24.2 percentage points and additive conditioning by 8.6 pp; how you incorporate context matters more than whether you include it. Second, context enables otherwise impossible predictions: on data-scarce CYP3A4 (67 training compounds), multi-task transfer achieves 0.686 AUC where per-target Random Forest collapses to 0.238, demonstrating that context conditioning unlocks viable prediction for novel targets where traditional approaches fail entirely. Third, context can systematically hurt: distribution mismatch causes 10.2 pp degradation on BACE1; few-shot adaptation consistently underperforms zero-shot. Beyond methodology, we expose fundamental flaws in standard benchmarking—1-nearest-neighbor Tanimoto achieves 0.991 AUC on DUD-E without any learning, and 50% of actives leak from training data, rendering absolute performance metrics meaningless. Our temporal split evaluation (train ≤2020, test 2021–2024) achieves stable 0.843 AUC with no degradation, providing the first rigorous evidence that context-conditional molecular representations generalize to future chemical space. These findings resolve a long-standing ambiguity in the field and establish clear decision boundaries for when context conditioning provides genuine value in drug discovery pipelines.
Submission Track: Full Paper
Submission Category: AI-Guided Design
Submission Number: 45
Loading