Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing

Published: 2024, Last Modified: 16 May 2025BIBM 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: In daily clinical practice, clinicians exhibit robustness in identifying polyps with both location and size variations. It is uncertain if deep segmentation models can achieve comparable robustness in automated colonoscopic analysis. To benchmark the model robustness, we focus on evaluating the segmentation models on the polyps with various attributes (e.g. location and size) and healthy samples. Based on the Latent Diffusion Model, we perform attribute editing on real polyps and build a new dataset named Polyp-E. Our synthetic dataset boasts exceptional realism, to the extent that clinical experts find it challenging to discern them from real data. We evaluate various existing polyp segmentation models on the proposed benchmark. The results reveal most of the models are highly sensitive to attribute variations. As a novel data augmentation technique, the proposed editing pipeline can improve both in-distribution and out-ofdistribution generalization ability. The code and datasets has been released at https://github.com/RunpuWei/Polyp-E-Benchmark.
Loading