OpenCodePapers

action-detection-on-ucf101-24

Action Detection
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeFrame-mAP 0.5Video-mAP 0.1Video-mAP 0.2Video-mAP 0.5ModelNameReleaseDate
End-to-End Spatio-Temporal Action Localisation with Video Transformers90.388.071.8STAR/L2023-04-24
Scaling Open-Vocabulary Action Detection✓ Link88.5SiA2025-04-04
You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization✓ Link87.386.178.653.1YOWO + LFB2019-11-15
Holistic Interaction Transformer Network for Action Detection✓ Link84.888.874.3HIT2022-10-23
You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization✓ Link80.482.575.848.8YOWO2019-11-15
Actions as Moving Points✓ Link77.881.853.9MOC2020-01-14
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions✓ Link76.359.9Faster-RCNN + two-stream I3D conv2017-05-23
STEP: Spatio-Temporal Progressive Learning for Video Action Detection✓ Link7583.176.6STEP2019-04-19
Stable Mean Teacher for Semi-supervised Video Action Detection✓ Link73.976.3Stable Mean Teacher (I3D)2024-12-10
Hierarchical Self-Attention Network for Action Localization in Videos73.7180.4249.50HISAN (VGG-16)2019-10-01
TACNet: Transition-Aware Context Network for Spatio-Temporal Action Detection72.177.552.9TACNet2019-05-31
End-to-End Semi-Supervised Learning for Video Action Detection✓ Link69.972.1E2E-SSL (I3D)2022-03-08
Tube Convolutional Neural Network (T-CNN) for Action Detection in Videos✓ Link41.3751.347.1T-CNN2017-03-30
Multi-region two-stream R-CNN for action detection39.94TS R-CNN2016-09-17
Multi-region two-stream R-CNN for action detection39.63MR-TS R-CNN2016-09-17
Hierarchical Self-Attention Network for Action Localization in Videos82.3051.47HISAN (ResNet-101 + FPN)2019-10-01
Dance with Flow: Two-in-One Stream Action Detection✓ Link78.4850.30Two-in-one Two Stream2019-04-01
Dance with Flow: Two-in-One Stream Action Detection✓ Link75.4848.31Two-in-one2019-04-01
Finding Action Tubes with a Sparse-to-Dense Framework54DTS2020-08-30