Abstract: Highlights•A SlowFastFormer framework with lower computational complexity is developed to solve the task of 3D human pose estimation.•Features are progressively improved via parallel encoding and blending stages.•A hierarchical supervision scheme is proposed to refine the predictions.•Different kinds of transformer blocks are employed to perform the relation modelling.
Loading