On the pitfalls of Batch Normalization for end-to-end video learning: A study on surgical workflow analysis
Abstract: Highlights•Most video-based surgical workflow methods use CNNs with Batch Normalization.•Batch Normalization has several pitfalls when applied to video data.•Previous work resorts to complex multi-stage training strategies to avoid issues.•Awareness of pitfalls enables effective training of simpler end-to-end approaches.•Simple CNN-LSTMs beat the state of the art on 3 surgical workflow benchmarks.
Loading