Doubly Protected Estimation for Survival Outcomes Utilizing External Controls for Randomized Clinical Trials

Chenyin Gao; Shu Yang; Mingyang Shan; Wenyu Ye; Ilya Lipkovich; Douglas Faries

Doubly Protected Estimation for Survival Outcomes Utilizing External Controls for Randomized Clinical Trials

Chenyin Gao, Shu Yang, Mingyang Shan, Wenyu Ye, Ilya Lipkovich, Douglas Faries

Published: 01 May 2025, Last Modified: 23 Jul 2025ICML 2025 posterEveryoneRevisionsBibTeXCC BY-NC 4.0

TL;DR: We propose a doubly protected integrative estimator for survival outcomes that is more efficient than trial-only estimators and mitigates biases from external data.

Abstract: Censored survival data are common in clinical trials, but small control groups can pose challenges, particularly in rare diseases or where balanced randomization is impractical. Recent approaches leverage external controls from historical studies or real-world data to strengthen treatment evaluation for survival outcomes. However, using external controls directly may introduce biases due to data heterogeneity. We propose a doubly protected estimator for the treatment-specific restricted mean survival time difference that is more efficient than trial-only estimators and mitigates biases from external data. Our method adjusts for covariate shifts via doubly robust estimation and addresses outcome drift using the DR-Learner for selective borrowing. The approach can incorporate machine learning to approximate survival curves and detect outcome drifts without strict parametric assumptions, borrowing only comparable external controls. Extensive simulation studies and a real-data application evaluating the efficacy of Galcanezumab in mitigating migraine headaches have been conducted to illustrate the effectiveness of our proposed framework.

Lay Summary: Clinical trials can take a long time and often don’t include enough patients to show if a drug helps people live longer, especially for rare diseases. To solve this problem, we developed a data-adaptive integrative approach that gathers information from previous studies or real-world data to strengthen the placebo (or control) groups. This means more patients can be assigned to the new drug, making it easier and quicker to see if the treatment is effective, and ultimately speeding up the process of getting important new medicines to people who need them.

Link To Code: https://github.com/Gaochenyin/SelectiveIntegrative

Primary Area: General Machine Learning->Causality

Keywords: Adaptive learning, Monotone coarsening, unmeasured confounding, data heterogeneity

Submission Number: 7977

Loading