OpenCodePapers

Action Recognition in Videos on AVA v2.1

Action Recognition
Leaderboard
| Paper | Code | mAP (Val) | GFlops | Params (M) | Model Name | Release Date |
|---|---|---|---|---|---|---|
| End-to-End Spatio-Temporal Action Localisation with Video Transformers | | 41.7 | | | STAR/L | 2023-04-24 |
| Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization | ✓ | 30.0 | | | ACAR-Net, SlowFast R-101 (Kinetics-400 pretraining) | 2020-06-14 |
| Pose And Joint-Aware Action Recognition | ✓ | 28.4 | | | JMRN + SlowFast-R101-NL | 2020-10-16 |
| SlowFast Networks for Video Recognition | ✓ | 28.3 | | | SlowFast++ (Kinetics-600 pretraining, NL) | 2018-12-10 |
| Long-Term Feature Banks for Detailed Video Understanding | ✓ | 27.7 | | | LFB (Kinetics-400 pretraining) | 2018-12-12 |
| Video Action Transformer Network | | 27.6 | 39.6 | 19.3 | I3D Tx HighRes | 2018-12-06 |
| SlowFast Networks for Video Recognition | ✓ | 27.3 | | | SlowFast (Kinetics-600 pretraining, NL) | 2018-12-10 |
| SlowFast Networks for Video Recognition | ✓ | 26.8 | | | SlowFast (Kinetics-600 pretraining) | 2018-12-10 |
| SlowFast Networks for Video Recognition | ✓ | 26.3 | | | SlowFast (Kinetics-400 pretraining) | 2018-12-10 |
| Video Action Transformer Network | | 23.4 | 6.5 | 16.2 | I3D I3D | 2018-12-06 |
| D3D: Distilled 3D Networks for Video Action Recognition | ✓ | 23 | | | D3D (ResNet RPN, Kinetics-400 pretraining) | 2018-12-19 |
| A Better Baseline for AVA | | 22.8 | | | I3D w/ RPN + JFT (Kinetics-400 pretraining) | 2018-07-26 |
| AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions | ✓ | 22.0 | | | S3D-G w/ ResNet RPN (Kinetics-400 pretraining) | 2017-05-23 |
| A Better Baseline for AVA | | 21.9 | | | I3D w/ RPN (Kinetics-400 pretraining) | 2018-07-26 |
| Actor-Centric Relation Network | ✓ | 17.4 | | | ACRN | 2018-07-28 |