Transformer feature collapse of Temporal Action Detection via Multi-granularity Semantic Enhancement
Abstract: Highlights
• Mitigate the temporal feature collapse issue for TAD.
• Model long temporal dependencies for multi-instance videos.
• Enhance feature representations from diverse semantic spaces.
• Extensive experiments demonstrate the effectiveness of our method.
External IDs: dblp:journals/ijon/AnZWZY25