OpenCodePapers

Temporal Action Localization on THUMOS14

Leaderboard
| Paper | Code | Avg mAP (0.3:0.7) | mAP IOU@0.1 | mAP IOU@0.2 | mAP IOU@0.3 | mAP IOU@0.4 | mAP IOU@0.5 | mAP IOU@0.6 | mAP IOU@0.7 | Model Name | Release Date |
|---|---|---|---|---|---|---|---|---|---|---|---|
| End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames | ✓ Link | 76.9 | | | 89.7 | 86.7 | 80.9 | 71.0 | 56.1 | AdaTAD (VideoMAEv2-giant) | 2023-11-28 |
| Enhancing Temporal Action Localization: Advanced S6 Modeling with Recurrent Mechanism | ✓ Link | 74.2 | | | 88.7 | 84.6 | 78.2 | 66.6 | 51.9 | RDFA-S6 (InternVideo2-6B) | 2024-07-18 |
| Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding | ✓ Link | 72.72 | | | 86.89 | 83.09 | 76.90 | 65.91 | 50.82 | ActionMamba(InternVideo2-6B) | 2024-03-14 |
| InternVideo2: Scaling Foundation Models for Multimodal Video Understanding | ✓ Link | 72.0 | | | | | | | | InternVideo2-6B | 2024-03-22 |
| InternVideo: General Video Foundation Models via Generative and Discriminative Learning | ✓ Link | 71.58 | | | | | | | | ActionFormer (InternVideo features) | 2022-12-06 |
| Temporal Action Localization with Enhanced Instant Discriminability | ✓ Link | 70.1 | | | 84.8 | 80.0 | 73.3 | 63.8 | 48.8 | TriDet (VideoMAE v2-g feature) | 2023-09-11 |
| InternVideo2: Scaling Foundation Models for Multimodal Video Understanding | ✓ Link | 69.8 | | | | | | | | InternVideo2-1B | 2024-03-22 |
| VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking | ✓ Link | 69.6 | | | 84.0 | 79.6 | 73.0 | 63.5 | 47.7 | ActionFormer (VideoMAE V2-g features) | 2023-03-29 |
| TriDet: Temporal Action Detection with Relative Boundary Modeling | ✓ Link | 69.3 | | | 83.6 | 80.1 | 72.9 | 62.4 | 47.4 | TriDet (I3D features) | 2023-03-13 |
| Action Sensitivity Learning for Temporal Action Localization | | 67.9 | | | 83.1 | 79.0 | 71.7 | 59.7 | 45.8 | ASL(I3D features) | 2023-05-25 |
| TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization | ✓ Link | 67.7 | | | 82.8 | 78.9 | 71.8 | 60.5 | 44.7 | TemporalMaxer (I3D features) | 2023-03-16 |
| Dual DETRs for Multi-Label Temporal Action Detection | | 66.8 | | | 82.9 | 78.0 | 70.4 | 58.5 | 44.4 | DualDETR (I3D features) | 2024-03-31 |
| ActionFormer: Localizing Moments of Actions with Transformers | ✓ Link | 66.8 | | | 82.1 | 77.8 | 71.0 | 59.4 | 43.9 | ActionFormer (I3D features) | 2022-02-16 |
| TadML: A fast temporal action detection with Mechanics-MLP | ✓ Link | 59.70 | | | 73.29 | 69.73 | 62.53 | 53.36 | 39.60 | TadML(two-stream) | 2022-06-07 |
| BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection | ✓ Link | 59.6 | | | 75.5 | 70.8 | 63.5 | 50.9 | 37.4 | BasicTAD (160,6,192,R50-SlowOnly) | 2022-05-05 |
| End-to-end Temporal Action Detection with Transformer | ✓ Link | 56.7 | | | 74.8 | 69.1 | 60.1 | 46.6 | 32.8 | TadTR | 2021-06-18 |
| ReAct: Temporal Action Detection with Relational Queries | ✓ Link | 55.0 | | | 69.2 | 65.0 | 57.1 | 47.8 | 35.6 | ReAct (TSN features) | 2022-07-14 |
| BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection | ✓ Link | 54.9 | | | 68.4 | 65.0 | 58.6 | 49.2 | 33.5 | BasicTAD (112,3,96,R50-SlowOnly) | 2022-05-05 |
| An Empirical Study of End-to-End Temporal Action Detection | ✓ Link | 54.2 | | | 69.4 | 64.3 | 56.0 | 46.4 | 34.9 | E2E-TAD (SlowFast R50+TadTR) | 2022-04-06 |
| TadML: A fast temporal action detection with Mechanics-MLP | ✓ Link | 53.46 | | | 68.78 | 64.66 | 56.61 | 45.40 | 31.88 | TadML(rgb-only) | 2022-06-07 |
| Multi-shot Temporal Event Localization: a Benchmark | ✓ Link | 53.4 | | | 68.9 | 64.0 | 56.9 | 46.3 | 31.0 | MUSES | 2020-12-17 |
| Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action Localization | ✓ Link | 53.3 | | | 70.1 | 64.9 | 57.1 | 45.4 | 28.8 | AVFusion | 2021-06-27 |
| Proposal-Free Temporal Action Detection via Global Segmentation Mask Learning | ✓ Link | 52.8 | | | 68.6 | 63.8 | 57.0 | 46.3 | 31.8 | TAGS (I3D) | 2022-07-14 |
| DCAN: Improving Temporal Action Detection via Dual Context Aggregation | ✓ Link | 52.3 | | | 68.2 | 62.7 | 54.1 | 43.9 | 32.6 | DCAN (TSN features) | 2021-12-07 |
| TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks | ✓ Link | 50.46 | 74.02 | 72.29 | 69.1 | 63.3 | 53.5 | 40.4 | 26 | TSP | 2020-11-23 |
| Video Self-Stitching Graph Network for Temporal Action Localization | ✓ Link | 50.2 | | | 66.7 | 60.4 | 52.4 | 41.0 | 30.4 | VSGN | 2020-11-30 |
| RGB Stream Is Enough for Temporal Action Detection | ✓ Link | 50.0 | | | 62.8 | 59.5 | 53.8 | 43.6 | 30.1 | DaoTAD | 2021-07-09 |
| Decoupling Localization and Classification in Single Shot Temporal Action Detection | ✓ Link | 42.0 | | | 60.2 | 54.1 | 44.2 | 32.3 | 19.1 | Decouple-SSAD | 2019-04-16 |
| Rethinking the Faster R-CNN Architecture for Temporal Action Localization | | 39.8 | 59.8 | 57.1 | 53.2 | 48.5 | 42.8 | 33.8 | 20.8 | TAL-Net | 2018-04-20 |
| Graph Convolutional Module for Temporal Action Localization in Videos | | | 72.5 | 70.9 | 66.5 | 60.8 | 51.9 | | | GCM | 2021-12-01 |
| Activity Graph Transformer for Temporal Action Localization | | | 72.1 | 69.8 | 65 | 58.1 | 50.2 | | | AGT (Ours) | 2021-01-21 |
| Graph Convolutional Networks for Temporal Action Localization | ✓ Link | | 69.5 | 67.8 | 63.6 | 57.8 | 49.1 | | | P-GCN | 2019-09-07 |
| Weakly Supervised Temporal Action Localization Using Deep Metric Learning | ✓ Link | | 62.3 | | 46.8 | | 29.6 | | 9.7 | DeepMetricLearner | 2020-01-21 |
| Cascaded Boundary Regression for Temporal Action Detection | | | 60.1 | 56.7 | 50.1 | 41.3 | 31 | 19.1 | 9.9 | CBR-TS | 2017-05-02 |
| R-C3D: Region Convolutional 3D Network for Temporal Activity Detection | ✓ Link | | 54.5 | 51.5 | 44.8 | 35.6 | 28.9 | | | R-C3D | 2017-03-22 |
| TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals | ✓ Link | | 54 | 50.9 | 44.1 | 34.9 | 25.6 | | | TURN-FL-16 + S-CNN | 2017-03-17 |
| End-to-end Learning of Action Detection from Frame Glimpses in Videos | ✓ Link | | 48.9 | 44.0 | 36.0 | 26.4 | 17.1 | | | Yeung et al. | 2015-11-22 |
| Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs | ✓ Link | | 47.7 | 43.5 | 36.3 | 28.7 | 19 | | | S-CNN | 2016-01-09 |
| BSN: Boundary Sensitive Network for Temporal Action Proposal Generation | ✓ Link | | | | 53.5 | 45 | 36.9 | 28.4 | 20 | BSN UNet | 2018-06-08 |
| CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos | ✓ Link | | | | 40.1 | 29.4 | 23.3 | 13.1 | 7.9 | CDC | 2017-03-04 |
| G-TAD: Sub-Graph Localization for Temporal Action Detection | ✓ Link | | | | | | 40.2 | | | G-TAD | 2019-11-26 |
| BMN: Boundary-Matching Network for Temporal Action Proposal Generation | ✓ Link | | | | | | 32.2 | | | BMN | 2019-07-23 |
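
How the leaderboard metrics relate can be sketched in a few lines of Python. The page does not define its metrics, so two assumptions are made here: that "Avg mAP (0.3:0.7)" is the arithmetic mean of the mAP values at IoU thresholds 0.3, 0.4, 0.5, 0.6 and 0.7, and that IoU is the usual temporal intersection-over-union between a predicted and a ground-truth segment. The first assumption is consistent with the AdaTAD entry, whose five per-threshold values average to 76.88 against a listed 76.9. The function names `temporal_iou` and `average_map` are illustrative only and do not come from any of the listed codebases.

```python
from statistics import mean


def temporal_iou(pred, gt):
    """Temporal IoU of two (start, end) segments, e.g. in seconds (assumed definition)."""
    inter = max(0.0, min(pred[1], gt[1]) - max(pred[0], gt[0]))
    union = (pred[1] - pred[0]) + (gt[1] - gt[0]) - inter
    return inter / union if union > 0 else 0.0


def average_map(map_by_iou, thresholds=(0.3, 0.4, 0.5, 0.6, 0.7)):
    """Mean of per-threshold mAP values (assumed meaning of 'Avg mAP (0.3:0.7)')."""
    return mean(map_by_iou[t] for t in thresholds)


# Per-threshold mAP values taken from the AdaTAD (VideoMAEv2-giant) entry.
adatad = {0.3: 89.7, 0.4: 86.7, 0.5: 80.9, 0.6: 71.0, 0.7: 56.1}
avg = average_map(adatad)
print(round(avg, 1))  # 76.9, matching the listed Avg mAP

# A prediction covering 0-10 s against a ground truth of 5-15 s overlaps for 5 s
# out of a 15 s union, so it counts as correct at IoU 0.3 but not at 0.4 or above.
print(temporal_iou((0.0, 10.0), (5.0, 15.0)))
```

The same check can be applied to any row that lists all five thresholds. A few entries differ from the recomputed mean by more than rounding, which suggests the listed average was taken directly from the corresponding paper.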