CAPSTONE: Capability Assessment Protocol for Systematic Testing of Natural Language Models Expertise
Abstract: Prompt-based language models have limitations in classification and often require users to test multiple prompts with varying temperatures to identify the best fit. A text annotation framework addresses this by introducing explicit prompt definition and validation, and can improve performance in labeling or retrieval tasks at scale.
0 Replies
Loading