UMP-Net: Uncertainty-Aware Mixture of Prompts Network for Efficient Instruction Tuning

TMLR Paper 5181 Authors

23 Jun 2025 (modified: 20 Jul 2025) · Under review for TMLR · CC BY 4.0
Abstract: Instruction tuning has greatly improved how large language models (LLMs) respond to human-like instructions. However, fully fine-tuning these models remains computationally demanding, and many existing parameter-efficient methods fall short—particularly in uncertainty estimation and in working effectively across modalities. To address this, we introduce UMP-Net (Uncertainty-Aware Mixture of Prompts Network), a new approach designed to enhance the instruction-following ability of LLaMA. UMP-Net combines a novel mixture of prompts (MoPs) technique with Latent Noise Prompting, KNN-based Heterogeneous Clustering, and Conformal Predictions to dynamically select the most reliable prompts while accounting for uncertainty. In addition, it features a CLIP-based multi-modal architecture to streamline vision-language integration. We evaluated UMP-Net on a range of benchmarks including ScienceQA, COCO Caption, and various zero-shot multi-modal tasks. The results show strong performance: an average accuracy of 88.41% on ScienceQA and a CIDEr score of 158.3 on COCO Caption—surpassing models such as LLaVA, LLaMA-Adapter, and LLaMA-Excitor. These findings suggest that UMP-Net offers both improved multi-modal capability and computational efficiency.
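The abstract names conformal prediction as the mechanism for selecting reliable prompts under uncertainty. As a rough illustration of that general idea (not the paper's actual method—the function, score convention, and data below are all hypothetical), a standard split-conformal procedure calibrates a quantile on held-out nonconformity scores and then keeps only candidate prompts whose scores fall inside the resulting prediction set:

```python
import numpy as np

def conformal_prompt_selection(cal_scores, test_scores, alpha=0.1):
    """Split-conformal selection sketch (hypothetical helper, not UMP-Net's API).

    cal_scores: nonconformity scores from a held-out calibration set
                (lower = the prompt behaved more reliably).
    test_scores: scores for candidate prompts at inference time.
    Returns indices of prompts inside the (1 - alpha) prediction set.
    """
    n = len(cal_scores)
    # Finite-sample corrected quantile level, standard in split-conformal prediction.
    q_level = min(np.ceil((n + 1) * (1 - alpha)) / n, 1.0)
    q_hat = np.quantile(cal_scores, q_level, method="higher")
    # A candidate prompt is retained if its score does not exceed the threshold.
    return [i for i, s in enumerate(test_scores) if s <= q_hat]

rng = np.random.default_rng(0)
cal = rng.normal(0.0, 1.0, size=200)      # synthetic calibration scores
test = np.array([-0.5, 0.2, 3.5])          # three candidate prompts
selected = conformal_prompt_selection(cal, test, alpha=0.1)
# the clear outlier (score 3.5) falls outside the 90% prediction set
```

The appeal of this style of filter is its distribution-free coverage guarantee: under exchangeability of calibration and test scores, a reliable prompt is excluded with probability at most alpha, regardless of the score distribution.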
Submission Length: Regular submission (no more than 12 pages of main content)
Assigned Action Editor: ~Chunyuan_Li1
Submission Number: 5181