Self-supervised learning of monocular depth and ego-motion estimation for non-rigid scenes in wireless capsule endoscopy videos

Published: 01 Jan 2024, Last Modified: 12 Apr 2025
Venue: Biomed. Signal Process. Control. 2024
License: CC BY-SA 4.0
Abstract: Highlights
• Transformer improves pose estimation with self-attention mechanism.
• Multiple frame sampling intervals augment training diversity.
• Binary learnable masks remove invalid self-supervisions.