Abstract: Highlights•Scene detection and annotation are solved from window view to avoid propagation errors.•Our method jointly learns scene detection and annotation.•Our method outperforms other methods on two public datasets.
Loading
OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2026 OpenReview