Label Smoothing is a Pragmatic Information Bottleneck

Published: 28 Jul 2025, Last Modified: 28 Jul 2025Accepted by TMLREveryoneRevisionsBibTeXCC BY 4.0
Abstract: This study revisits label smoothing via a form of information bottleneck. Under the assumption of sufficient model flexibility and no conflicting labels for the same input, we theoretically and experimentally demonstrate that the model output obtained through label smoothing explores the optimal solution of the information bottleneck. Based on this, label smoothing can be interpreted as a practical approach to the information bottleneck, enabling simple implementation. As an information bottleneck method, we experimentally show that label smoothing also exhibits the property of being insensitive to factors that do not contain information about the target, or to factors that provide no additional information about it when conditioned on another variable.
Submission Length: Regular submission (no more than 12 pages of main content)
Changes Since Last Submission: The beginning sentence of the abstract is revised in accordance with the editor's comment.
Supplementary Material: zip
Assigned Action Editor: ~Mohammad_Emtiyaz_Khan1
Submission Number: 4582
Loading