Multi-Scale Adaptive Skeleton Transformer for action recognition

Published: 01 Jan 2025, Last Modified: 08 Apr 2025Comput. Vis. Image Underst. 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•MSAST: a novel Transformer model for skeleton-based action recognition.•ASPEM decouples position encoding to capture sample-specific latent dependencies.•MSEM generates multi-scale tokens for multi-scale feature extraction.•ARLM learns unique location information for various samples.•State-of-the-art results on NTU-60 dataset.
Loading