OpenCodePapers

action-recognition-in-videos-on-ntu-rgbd

Action Recognition
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracy (CS)Accuracy (CV)ModelNameReleaseDate
A Dense-Sparse Complementary Network for Human Action Recognition based on RGB and Skeleton Modalities✓ Link97.499.4DSCNet (RGB + Pose)2023-12-28
Revisiting Skeleton-based Action Recognition✓ Link97.099.6PoseC3D (RGB + Pose)2021-04-28
Just Add $π$! Pose Induced Video Transformers for Understanding Activities of Daily Living✓ Link96.399.0π-ViT (RGB + Pose)2023-11-30
A Unified Multimodal De- and Re-coupling Framework for RGB-D Motion Recognition✓ Link96.298.0UMDR (RGB-D)2022-11-16
EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognition✓ Link96.199.0EPAM-Net2024-08-10
MMNet: A Model-Based Multimodal Network for Human Action Recognition in RGB-D Videos✓ Link96.098.8MMNet (RGB + Pose)2022-05-26
Hierarchical Action Classification with Network Pruning95.6698.79Hierarchical Action Classification (RGB + Pose)2020-07-30
VPN: Learning Video-Pose Embedding for Activities of Daily Living✓ Link95.598.0VPN (RGB + Pose)2020-07-06
Explore Human Parsing Modality for Action Recognition✓ Link94.797.7EPP-Net (Parsing + Pose)2024-01-04
Cross-Modal Learning with 3D Deformable Attention for Action Recognition94.397.93DA (RGB + Pose)2022-12-12
Action Machine: Rethinking Action Recognition in Trimmed Videos94.397.2Action Machine (RGB only)2018-12-14
Just Add $π$! Pose Induced Video Transformers for Understanding Activities of Daily Living✓ Link94.097.9π-ViT (RGB only)2023-11-30
Integrating Human Parsing and Pose Network for Human Action Recognition✓ Link93.897.1IPP-Net (Parsing + Pose)2023-07-16
Multi-View Action Recognition Using Contrastive Learning✓ Link93.798.9ViewCon (RGB + Pose)2023-01-03
DVANet: Disentangling View and Action Features for Multi-View Action Recognition✓ Link93.498.1DVANet (RGB only)2023-12-10
Joint-Partition Group Attention for skeleton-based action recognition✓ Link93.296.9JPFormer2024-07-30
DSTSA-GCN: Advancing Skeleton-Based Gesture Recognition with Semantic-Aware Spatio-Temporal Topology Modeling✓ Link92.7897.03DSTSA-GCN2025-01-21
Multimodal Fusion via Teacher-Student Network for Indoor Action Recognition✓ Link92.597.4TSMF (RGB + Pose)2021-05-18
MSAF: Multimodal Split Attention Fusion✓ Link92.24MSAF (RGB+Pose)2020-12-13
STAR-Transformer: A Spatio-temporal Cross Attention Transformer for Human Action Recognition92.096.5STAR-Transformer (RGB + Pose)2022-10-14
MMTM: Multimodal Transfer Module for CNN Fusion✓ Link91.99MMTM (RGB+Pose)2019-11-20
Infrared and 3D skeleton feature fusion for RGB-D action recognition✓ Link91.894.9FUSION (IR+Pose)2020-02-28
Recognizing Human Actions as the Evolution of Pose Estimation Maps91.795.2PoseMap (RGB+Pose)2018-06-01
B2C-AFM: Bi-Directional Co-Temporal and Cross-Spatial Attention Fusion Model for Human Action Recognition✓ Link91.7B2C-AFM(RGB+Pose)2023-08-30
Part-based Graph Convolutional Network for Action Recognition✓ Link87.593.2PB-GCN (Skeleton only)2018-09-13
Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points✓ Link86.693.2Glimpse Clouds (RGB only)2018-02-22
SkeleMotion: A New Representation of Skeleton Joint Sequences Based on Motion Information for 3D Action Recognition✓ Link76.584.7Skelemotion + Yang et al. (Skeleton only)2019-07-30
Deep Multimodal Feature Analysis for Action Recognition in RGB+D Videos74.9DSSCA-SSLM (RGB only)2016-03-23