Keywords: Consensus Matrix, Corrective Learning, Noisy Label Learning, Cell Segmentation
TL;DR: We present a consensus-aware self-corrective learning framework for robust cell segmentation on datasets with noisy labels.
Abstract: Multi-class cell segmentation in high-resolution gigapixel whole slide images (WSIs) is crucial for various clinical applications. However, training such models typically requires labor-intensive, pixel-wise annotations by domain experts. Recent efforts have democratized this process by involving lay annotators without medical expertise, yet conventional non-corrective approaches struggle to handle the resulting annotation noise adaptively because they lack mechanisms to mitigate false positives (FP) and false negatives (FN) at both the image-feature and pixel levels. In this paper, we propose a consensus-aware self-corrective learning framework that leverages a Consensus Matrix to guide its learning process. The Consensus Matrix identifies regions where the AI and the annotators agree on cell and non-cell labels; these regions are prioritized with stronger supervision. Conversely, areas of disagreement are adaptively weighted based on their feature similarity to high-confidence consensus regions, with more similar regions receiving greater attention. Additionally, contrastive learning is employed to separate features of noisy regions from those of reliable consensus regions by maximizing their dissimilarity. This paradigm enables the model to iteratively refine noisy labels, enhancing its robustness. Validated on one real-world lay-annotated cell dataset and two reasoning-guided simulated noisy datasets, our method demonstrates improved segmentation performance, effectively correcting FP and FN errors and showcasing its potential for training robust models on noisy datasets. The official implementation and cell annotations are publicly available at https://github.com/ddrrnn123/CASC-AI.
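The abstract describes the mechanism only at a high level; the following is a minimal PyTorch sketch of one plausible reading of the consensus-and-weighting step, not the official implementation (see the linked repository for that). The function name build_consensus_weights, the prototype-based cosine similarity, and the temperature tau are illustrative assumptions.

```python
# Minimal sketch (illustrative, not the paper's code): derive a consensus mask
# and per-pixel weights from model predictions, noisy annotations, and features.
import torch
import torch.nn.functional as F

def build_consensus_weights(pred_logits, noisy_mask, features, tau=0.1):
    """
    pred_logits: (B, C, H, W) model outputs over C classes
    noisy_mask:  (B, H, W)    noisy / lay annotations as class indices
    features:    (B, D, H, W) pixel embeddings from the encoder
    Returns the consensus mask and per-pixel weights: 1 on consensus pixels,
    similarity-based weights on disagreement pixels.
    """
    pred = pred_logits.argmax(dim=1)          # (B, H, W) hard predictions
    consensus = (pred == noisy_mask)          # AI and annotator agree
    disagree = ~consensus

    feats = F.normalize(features, dim=1)      # unit-norm pixel embeddings
    B, D, H, W = feats.shape
    weights = torch.ones(B, H, W, device=feats.device)

    for b in range(B):
        if consensus[b].any() and disagree[b].any():
            # prototype embedding of the high-confidence consensus region
            proto = F.normalize(feats[b][:, consensus[b]].mean(dim=1), dim=0)   # (D,)
            # cosine similarity of each disagreement pixel to the prototype
            sim = (feats[b][:, disagree[b]] * proto[:, None]).sum(dim=0)
            # disagreement pixels similar to the consensus prototype get more weight
            weights[b][disagree[b]] = torch.sigmoid(sim / tau)
    return consensus.float(), weights
```

Under these assumptions, the returned weights could scale a pixel-wise cross-entropy loss so that consensus regions receive full supervision, while a separate contrastive term would push disagreement-pixel embeddings away from the consensus prototype, as the abstract describes.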
Primary Subject Area: Application: Histopathology
Secondary Subject Area: Learning with Noisy Labels and Limited Data
Paper Type: Both
Registration Requirement: Yes
Reproducibility: https://github.com/ddrrnn123/CASC-AI
Visa & Travel: Yes
MIDL LaTeX Submission Checklist:
- Ensured no LaTeX errors during compilation.
- Created a single midl25_NNN.zip file with midl25_NNN.tex, midl25_NNN.bib, and all necessary figures and files.
- Includes \documentclass{midl}, \jmlryear{2025}, \jmlrworkshop, \jmlrvolume, \editors, and the correct \bibliography command.
- Did not override options of the hyperref package.
- Did not use the times package.
- All authors and co-authors are correctly listed with proper spelling and without Unicode characters.
- Author and institution details are de-anonymized where needed. All author names, affiliations, and the paper title are correctly spelled and capitalized in the biography section.
- References use the .bib file. Did not override the bibliographystyle defined in midl.cls. Did not use \begin{thebibliography} directly to insert references.
- Tables and figures do not overflow margins; avoided \scalebox and used \resizebox when needed.
- Included all necessary figures and removed unused files from the zip archive.
- Removed special formatting, visual annotations, and highlights used during rebuttal.
- All special characters in the paper and .bib file use LaTeX commands (e.g., \'e for é).
- Appendices and supplementary material are included in the same PDF after the references.
- The main paper does not exceed 9 pages; acknowledgements, references, and appendix start on page 10 or later.
Latex Code: zip
Copyright Form: pdf
Submission Number: 35