Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback

Katherine Tian, Eric Mitchell, Allan Zhou, Archit Sharma, Rafael Rafailov, Huaxiu Yao, Chelsea Finn, Christopher D. Manning

Published: 2023, Last Modified: 29 Mar 2024, EMNLP 2023, Readers: Everyone

Citation: Katherine Tian, Eric Mitchell, Allan Zhou, Archit Sharma, Rafael Rafailov, Huaxiu Yao, Chelsea Finn, Christopher D. Manning. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. 2023.
