Abstract: Highlights•Hierarchical compositional representations include Sub-action and SAS-action.•SAS-action focuses on body parts and action-related cues.•Earth mover’s distance is employed to measure fine-grained patterns.•Experiments on HMDB51, UCF101, and Kinetics verify our effectiveness.
Loading