SCOPES: Measuring Accuracy–Portability Trade-offs Across Microarray and RNA-Seq

08 Feb 2026 (modified: 04 Mar 2026)Submitted to ICLR 2026 Workshop LMRLEveryoneRevisionsBibTeXCC BY 4.0
Confirmation: I have read and agree with the workshop's policy on behalf of myself and my co-authors.
Track: tiny / short paper (2-4 pages excluding references; extended abstract format)
Keywords: cross-platform generalization, transcriptomics, microarray, RNA-seq, feature selection, distribution shift, selection leakage, multi-objective optimization, NSGA-II, MMD
TL;DR: SCOPES is a leak-free multi-objective framework that exposes and controls the accuracy–stability–portability trade-off when transferring gene signatures from microarray to RNA-seq.
Abstract: Deep models often fail under distribution shift, yet the role of feature selection in amplifying or mitigating shift is underexplored. We study this in a stringent setting: transferring a tumour--vs--normal classifier across measurement platforms (Agilent microarray $\rightarrow$ RNA--Seq) using the same patients and genes. We introduce SCOPES, a leak--free, multi--objective selection framework that optimizes three competing goals: (i) predictive performance (AUC) via patient--safe cross--validation, (ii) selection stability (Kuncheva), and (iii) cross--platform alignment (Maximum Mean Discrepancy, MMD). Viewed through a representation lens, SCOPES selects a compact gene subspace that is simultaneously discriminative and domain-aligned, explicitly exposing the accuracy--portability frontier under measurement shift. On matched TCGA--BRCA Agilent/RNA--Seq, a label--informed $F$--score slab produced an implausibly perfect source model ($\mathrm{AUC} \approx 1.0$) but lost $\sim 0.30$ AUC after transfer, revealing selection leakage plus platform shift. Replacing the slab with an unsupervised MAD prefilter makes the trade--off explicit on the Pareto front: a one--gene, alignment--first solution achieves modest AUC with small transfer loss ($0.69 \rightarrow 0.61$, $\Delta \mathrm{AUC} \approx -0.08$), while a 30--gene, accuracy--first solution reaches near--perfect source AUC but transfers poorly ($\Delta \mathrm{AUC} \approx -0.38$). SCOPES provides a simple protocol to measure and control this trade--off (report source/target AUC, $\Delta$AUC, and MMD), encouraging selections near a Pareto ``knee'' for portability. Finally, in the reverse direction (RNA--Seq $\rightarrow$ microarray), a 37-gene SCOPES signature attains $\mathrm{AUC}_{\mathrm{RNA}} = 0.654$ (CV) and $\mathrm{AUC}_{\mathrm{Agilent}} = 0.890$ ($\Delta \mathrm{AUC} = +0.236$), indicating directional shift. We argue that treating selection as a multi--objective design problem is a useful lens for the science of deep learning under shift.
Anonymization: This submission has been anonymized for double-blind review via the removal of identifying information such as names, affiliations, and identifying URLs.
Presenter: ~Abdullah_Nayem_Wasi_Emran1
Format: Maybe: the presenting author will attend in person, contingent on other factors that still need to be determined (e.g., visa, funding).
Funding: Yes, the presenting author of this submission falls under ICLR’s funding aims, and funding would significantly impact their ability to attend the workshop in person.
Submission Number: 61
Loading