Finding Memo: The Hidden Influence of Memorization in Large Language Models’ Performance – A Critical Analysis of Benchmark Evaluation

Published: 22 Sept 2025, Last Modified: 03 Jan 2026, Venue: WiML @ NeurIPS 2025, License: CC BY 4.0
Keywords: Large Language Models, LLMs, Memorization, Generalization, Evaluation, Benchmarks
Submission Number: 273