Video Swin Transformers for Egocentric Video Understanding @ Ego4D Challenges 2022

María Escobar, Laura Alexandra Daza, Cristina González, Jordi Pont-Tuset, Pablo Arbeláez

2022 (modified: 14 Sept 2022)CoRR 2022Readers: Everyone

Abstract: We implemented Video Swin Transformer as a base architecture for the tasks of Point-of-No-Return temporal localization and Object State Change Classification. Our method achieved competitive performance on both challenges.

0 Replies