Detecting Label Errors in Token Classification Data

Detecting Label Errors in Token Classification Data

TMLR Paper1311 Authors

21 Jun 2023 (modified: 17 Sept 2024)Rejected by TMLREveryoneRevisionsBibTeXCC BY 4.0

Abstract: Mislabeled examples are a common issue in real-world data, particularly for tasks like token classification where many labels must be chosen on a fine-grained basis. Here we consider the task of finding sentences that contain label errors in token classification datasets. We study 11 different straightforward methods that score tokens/sentences based on the predicted class probabilities output by a (any) token classification model (trained via any procedure). In precision-recall evaluations based on real-world label errors in entity recognition data from CoNLL-2003, we identify a simple and effective method that consistently detects those sentences containing label errors when applied with different token classification models.

Submission Length: Regular submission (no more than 12 pages of main content)

Assigned Action Editor: ~Karthik_R_Narasimhan1

Submission Number: 1311

Loading