Prototype-Based Selective Prediction for Multimodal Instruction Models

Published: 02 Mar 2026, Last Modified: 02 Mar 2026ICLR 2026 Trustworthy AIEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Selective prediction, prototype-based learning, multimodal instruction models, uncertainty estimation, trustworthy AI
Abstract: Selective prediction is critical for instruction-tuned multimodal models, yet common confidence heuristics often fail under heterogeneous inputs and offer limited interpretability. We show that prototype-based selective prediction provides a lightweight and transparent reliability mechanism, enabling principled abstention via distance- and margin-based confidence without retraining foundation models, and consistently outperforming maximum softmax probability and embedding-based baselines on CLINC150 and MedVQA. On MedVQA, well-separated prototype geometry and nearest-prototype explanations reveal clinically meaningful semantic ambiguity, supporting reliable and interpretable abstention decisions.
Submission Number: 177
Loading