Abstract: Skeleton-based Human Activity Recognition has recently sparked a lot of attention because
skeleton data has proven resistant to changes in lighting, body sizes, dynamic camera perspectives, and complicated backgrounds. The Spatial-Temporal Graph Convolutional Networks
(ST-GCN) model has been exposed to study spatial and temporal dependencies effectively
from skeleton data. However, efficient use of 3D skeleton in-depth information remains a
significant challenge, specifically for human joint motion patterns and linkages information.
This study attempts a promising solution through a custom ST-GCN model and skeleton joints
for human activity recognition. Special attention was given to spatial & temporal features,
which were further fed to the classification model for better pose estimation. A comparative
study is presented for activity recognition using large-scale databases such as NTU-RGBD, Kinetics-Skeleton, and Florence 3D datasets. The Custom ST-GCN model outperforms
(Top-1 accuracy) the state-of-the-art method on NTU-RGB-D, Kinetics-Skeleton & Florence
3D dataset with a higher margin by 0.7%, 1.25%, and 1.92%, respectively. Similarly, with
Top-5 accuracy, the Custom ST-GCN model offers results hike by 0.5%, 0.73% & 1.52%,
respectively. It shows that the presented graph-based topologies capture the changing aspects
of a motion-based skeleton sequence better than some of the other approaches.
0 Replies
Loading