OpenCodePapers

action-recognition-in-videos-on-ntu-rgbd-120

Action Recognition
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracy (Cross-Setup)Accuracy (Cross-Subject)ModelNameReleaseDate
A Dense-Sparse Complementary Network for Human Action Recognition based on RGB and Skeleton Modalities✓ Link96.795.6DSCNet (RGB + Pose)2023-12-28
Revisiting Skeleton-based Action Recognition✓ Link96.495.3PoseC3D (RGB + Pose)2021-04-28
Just Add $π$! Pose Induced Video Transformers for Understanding Activities of Daily Living✓ Link96.195.1π-ViT (RGB + Pose)2023-11-30
MMNet: A Model-Based Multimodal Network for Human Action Recognition in RGB-D Videos✓ Link94.492.9MMNet (RGB + Pose)2022-05-26
Explore Human Parsing Modality for Action Recognition✓ Link92.891.1EPP-Net (Parsing + Pose)2024-01-04
STAR-Transformer: A Spatio-temporal Cross Attention Transformer for Human Action Recognition92.790.3STAR-Transformer (RGB + Pose)2022-10-14
EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognition✓ Link92.494.3EPAM-Net2024-08-10
Just Add $π$! Pose Induced Video Transformers for Understanding Activities of Daily Living✓ Link91.992.9π-ViT (RGB only)2023-11-30
Integrating Human Parsing and Pose Network for Human Action Recognition✓ Link91.790.0IPP-Net (Parsing + Pose)2023-07-16
Cross-Modal Learning with 3D Deformable Attention for Action Recognition91.490.53DA (RGB + Pose)2022-12-12
Joint-Partition Group Attention for skeleton-based action recognition✓ Link91.489.4JPFormer(Pose)2024-07-30
DSTSA-GCN: Advancing Skeleton-Based Gesture Recognition with Semantic-Aware Spatio-Temporal Topology Modeling✓ Link90.9789.12DSTSA-GCN2025-01-21
VPN++: Rethinking Video-Pose embeddings for understanding Activities of Daily Living✓ Link90.792.5VPN++ (RGB + Pose)2021-05-17
DVANet: Disentangling View and Action Features for Multi-View Action Recognition✓ Link90.491.6DVANet (RGB only)2023-12-10
Multi-View Action Recognition Using Contrastive Learning✓ Link87.585.6ViewCon (RGB)2023-01-03
VPN: Learning Video-Pose Embedding for Activities of Daily Living✓ Link86.387.8VPN (RGB + Pose)2020-07-06
Vertex Feature Encoding and Hierarchical Temporal Modeling in a Spatial-Temporal Graph Convolutional Network for Action Recognition78.379.2ST-GCN + AS-GCN w/DH-TCN2019-12-20
Gimme Signals: Discriminative signal encoding for multimodal activity recognition✓ Link70.871.59Gimme Signals (AIS)2020-03-13
Skeleton Image Representation for 3D Action Recognition based on Tree Structure and Reference Joints✓ Link67.962.8TSRJI2019-09-11
SkeleMotion: A New Representation of Skeleton Joint Sequences Based on Motion Information for 3D Action Recognition✓ Link66.967.7Skelemotion + Yang et al. (skeleton only)2019-07-30
Recognizing Human Actions as the Evolution of Pose Estimation Maps64.666.9Body Pose Evolution Map2018-06-01