\begin{figure}[t]
  \centering
  \includegraphics[width=\linewidth]{figures/overview.png}
  \caption{
  Overview of our centerline-aware 3D Gaussian mapping pipeline.
  \textbf{Inputs:} monocular colonoscopy RGB frames and externally provided
  poses. 
  \textbf{Reconstruction:} a thin geometric layer around a standard 3D
  Gaussian mapper maintains an online centerline $C(s)$ and Bishop frame,
  assigns tubular coordinates $(s,r,\theta)$, and updates keyframes and
  coverage counters in colon-intrinsic space.
  \textbf{Outputs:} a 3D Gaussian reconstruction with centerline and online
  coverage maps in $(s,\theta)$, providing segment-wise coverage summaries with
  minimal extra computation.
  }
  \label{fig:overview}
\end{figure}

\section{Related work}

\textbf{Per-frame assistance and quality assessment.}
Deep learning has enabled real-time computer-aided detection and diagnosis in colonoscopy, with systems that highlight polyps, classify lesion types, and estimate per-frame quality scores~\cite{urban2018deep,wang2019artificial}. Large datasets such as HyperKvasir~\cite{borgli2020hyperkvasir} have supported multi-class lesion detection and automated quality assessment, including withdrawal speed and bowel preparation~\cite{Chang2022ColonoscopyQualityAssessment}. These approaches operate mainly in image space, reasoning over individual frames or short clips. They do not maintain a persistent 3D representation of the colon or provide explicit, geometry-based coverage measures.

\vspace{0.5em}
\noindent \textbf{Dense SLAM and endoscopic reconstruction.}
Dense SLAM has evolved from volumetric fusion to neural implicit and 3D Gaussian representations~\cite{zhu2022niceslam,Matsuki2024GaussianSplattingSLAM,keetha2024splatam}. Endoscopic variants adapt these ideas to deformable and specular anatomy. RNNSLAM couples a recurrent depth-and-pose network with a SLAM backend for colon reconstruction~\cite{ma2021rnnslam}, while EndoGSLAM integrates 3D Gaussian splatting into endoscopic surgery, demonstrating real-time tracking and dense mapping~\cite{wang2024endogslam}. The C3VD dataset provides phantom colonoscopy videos with depth and ground-truth geometry, enabling quantitative evaluation~\cite{bobrow2023c3vd}. These systems focus on geometry and tracking accuracy; they typically treat the colon as a generic scene and do not organize the map in colon-intrinsic coordinates or expose coverage metrics as first-class outputs. Our work builds directly on this line, but holds pose fixed and instead modifies the representation and loss to be colon-aware.

\vspace{0.5em}
\noindent \textbf{Tubular and anatomical priors.}
Tubular priors are widely used to model elongated anatomical structures such as vessels and airways~\cite{Bauer2009TubularSegmentation,Chlebiej2023TubularModel}. For the colon, prior work has explored digital “unrolling” for visualization and registration~\cite{Rossides2021CycloramaColon} and tubular non-rigid structure-from-motion models for colonoscopic video~\cite{sengupta2021tubularnrsfm,Floor2022CapsuleColon3D}. These methods show the value of encoding tube-like anatomy, but are typically offline and not framed as small modifications to existing real-time reconstruction pipelines. In contrast, we integrate a simple tubular coordinate system and associated priors directly into a Gaussian mapper, and show that this modest geometric addition is enough to match strong baselines in geometry while providing online coverage information at negligible extra cost.

