Distilling the Effects of Language Model Contamination

Published: 01 Jan 2024 · Last Modified: 19 Feb 2025 · ECAI 2024 · CC BY-SA 4.0
Abstract: The proportion of AI-generated content permeating the well of knowledge is increasing significantly. Large language models (LLMs) contribute to that contamination, but they also suffer from it. However, the effect of different sources of error, whether human-generated or LLM-generated, has yet to be clarified. Controlling for the percentage of error, we explore the impact on LLM fine-tuning when errors come from humans, from other language models, or are generated randomly from an aleatoric or epistemic source. In this paper, we compare these different types of error in in-distribution and out-of-distribution experimental settings. By analysing the levels of errors and their distribution, we obtain a nuanced picture: while in-distribution human-generated noise seems more benign than its LLM-generated counterpart, in the out-of-distribution case model-generated noise is not necessarily worse.
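
The abstract's setup, controlling the percentage of error while varying its source, can be illustrated with a minimal sketch. The `contaminate` helper, the toy label set, and `random_label_error` below are hypothetical stand-ins showing how a fixed fraction of a fine-tuning set might be corrupted by a chosen error source; they are not the authors' actual pipeline.

```python
import random

def contaminate(dataset, error_rate, error_source, seed=0):
    """Return a copy of `dataset` with `error_rate` of gold answers replaced.

    dataset      : list of (prompt, gold_answer) pairs
    error_rate   : fraction of examples to corrupt, e.g. 0.2
    error_source : callable (prompt, gold_answer) -> erroneous answer
    """
    rng = random.Random(seed)
    n_noisy = int(round(error_rate * len(dataset)))
    noisy_idx = set(rng.sample(range(len(dataset)), n_noisy))
    return [
        (p, error_source(p, a)) if i in noisy_idx else (p, a)
        for i, (p, a) in enumerate(dataset)
    ]

# One illustrative error source: a uniformly random wrong label
# (an "aleatoric"-style noise source).
LABELS = ("yes", "no", "maybe")

def random_label_error(prompt, answer):
    return random.choice([label for label in LABELS if label != answer])

# A human or LLM error source would instead return a recorded human mistake
# or a wrong answer sampled from another model; both are placeholders here.

toy_train = [(f"question {i}", random.choice(LABELS)) for i in range(100)]
noisy_train = contaminate(toy_train, error_rate=0.2, error_source=random_label_error)
print(sum(a != b for (_, a), (_, b) in zip(toy_train, noisy_train)))  # about 20 corrupted
```

With the error rate held fixed, swapping `error_source` between human, LLM, and random generators is what would allow the in-distribution and out-of-distribution comparisons the abstract describes.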