Spatial-Temporal Consistency Enhanced Segmentation for Laparoscopic Surgical videos

12 Apr 2025 (modified: 12 Apr 2025)MIDL 2025 Short Papers SubmissionEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Computer Vision, Segmentation, CRF, Machine learning
TL;DR: This paper introduces SSTC-Seg(Surgical Spatial-Temporal Consistency Segmentation), a lightweight yet effective segmentation framework designed to address the challenges of multi-organ segmentation in laparoscopic surgical videos.
Abstract: Accurate segmentation of laparoscopic surgical videos is essential for enhancing intraoperative guidance and improving patient outcomes. However, this task remains challenging due to the constrained field of view, visual clutter, frequent occlusions, and inconsistent illumination. To address these challenges, we propose SSTC-Seg (Surgical Spatial-Temporal Consistency Segmentation), a lightweight deep learning framework for video-based segmentation. It integrates a memory system and a Hierarchical Dense Conditional Random Field (HD-CRF) with skip connections for spatial details preservation to refine coarse predictions and model contextual relationships across frames. Evaluated on the Dresden Surgical Anatomy Dataset (DSAD), SSTC-Seg achieves competitive multi-organ segmentation performance with significantly fewer parameters compared to existing state-of-the-art methods.
Submission Number: 104
Loading

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview