On the pitfalls of Batch Normalization for end-to-end video learning: A study on surgical workflow analysis

Published: 01 Jan 2024, Last Modified: 02 Oct 2024 | Medical Image Anal. 2024 | CC BY-SA 4.0
Abstract:

Highlights
• Most video-based surgical workflow methods use CNNs with Batch Normalization.
• Batch Normalization has several pitfalls when applied to video data.
• Previous work resorts to complex multi-stage training strategies to avoid issues.
• Awareness of pitfalls enables effective training of simpler end-to-end approaches.
• Simple CNN-LSTMs beat the state of the art on 3 surgical workflow benchmarks.
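
The highlights do not spell out what the pitfalls are. As a rough illustration only, the sketch below assumes one commonly discussed issue: when a training batch holds consecutive frames from a single video, the batch statistics can differ sharply from the dataset-wide statistics used at inference. The activation values, the names `video_a` and `video_b`, and the helper functions are hypothetical and are not taken from the paper.

```python
# Minimal pure-Python sketch of batch-statistics mismatch (illustrative assumption,
# not the paper's code). Values are made-up activations of a single channel.

def mean(xs):
    return sum(xs) / len(xs)

def variance(xs):
    # Population variance, as used for normalization within a batch.
    m = mean(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

def normalize(xs, m, v, eps=1e-5):
    # Core of Batch Normalization without the learned scale and shift.
    return [(x - m) / (v + eps) ** 0.5 for x in xs]

# Hypothetical activations: frames within a video are nearly identical,
# while the two videos differ strongly from each other.
video_a = [4.9, 5.0, 5.1, 5.0]
video_b = [0.9, 1.0, 1.1, 1.0]
dataset = video_a + video_b

# Training-style normalization: the batch contains frames from one video only,
# so the statistics come from that video alone.
train_out = normalize(video_a, mean(video_a), variance(video_a))

# Inference-style normalization: statistics estimated over the whole dataset.
eval_out = normalize(video_a, mean(dataset), variance(dataset))

print("train-style:", [round(x, 3) for x in train_out])
print("eval-style: ", [round(x, 3) for x in eval_out])
```

Under these assumed values, the training-style outputs are centered on zero and spread out, while the inference-style outputs for the same frames sit in a narrow band well above zero. Layers after the normalization would therefore see differently distributed inputs in the two modes.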