OpenCodePapers

zero-shot-action-recognition-on-hmdb51

Zero-Shot Action Recognition
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeTop-1 AccuracyTop-5 AccuracyAccuracyModelNameReleaseDate
Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models64.7MOV (ViT-L/14)2022-07-15
Orthogonal Temporal Interpolation for Zero-Shot Video Recognition✓ Link64OTI(ViT-L/14)2023-08-14
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models✓ Link61.4BIKE2022-12-31
Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models60.8MOV (ViT-B/16)2022-07-15
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception59.1IMP-MoE-L2023-05-10
VideoCoCa: Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners58.784.5VideoCoCa2022-12-09
Revisiting Classifier: Transferring Vision-Language Models for Video Recognition✓ Link58.4Text4Vis2022-07-04
Leveraging Temporal Contextualization for Video Action Recognition✓ Link56.0TC-CLIP2024-04-15
OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition✓ Link55.9OST2023-11-30
MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge✓ Link52.3MAXI2023-03-15
VicTR: Video-conditioned Text Representations for Activity Recognition51.0VicTR (ViT-B/16)2023-04-05
LoCATe-GAT: Modeling Multi-Scale Local Context and Action Relationships for Zero-Shot Action Recognition✓ Link50.7LoCATe-GAT2024-11-27
Expanding Language-Image Pretrained Models for General Video Recognition✓ Link44.6X-CLIP2022-08-04
CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition43.2CLASTER2021-01-18
Cross-modal Representation Learning for Zero-shot Action Recognition41.1ResT2022-05-03
Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification✓ Link39AURL2022-03-29
Rethinking Zero-shot Action Recognition: Learning from Latent Atomic Actions✓ Link38.7JigsawNet2022-03-28
Synthetic Sample Selection for Generalized Zero-Shot Learning35.9SPOT2023-04-06
Elaborative Rehearsal for Zero-shot Action Recognition✓ Link35.3ER-ZSAR2021-08-05
Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications✓ Link32.7E2E2020-03-03
Towards Universal Representation for Unseen Action Recognition24.4UR2018-03-22
I Know the Relationships: Zero-Shot Action Recognition via Two-Stream Graph Convolutional Networks and Knowledge Graphs✓ Link23.2TS-GCN2019-07-17
Zero-Shot Action Recognition With Error-Correcting Output Codes22.6ZSECOC2017-07-01
Alternative Semantic Representations for Zero-Shot Human Action Recognition21.8ASR2017-06-28
Multi-Task Zero-Shot Action Recognition with Prioritised Data Augmentation19.7MTE2016-11-26
[]()18.5ESZSL
Objects2action: Classifying and localizing actions without any video example15.6O2A2015-10-23
Evaluation of Output Embeddings for Fine-Grained Image Classification✓ Link13.3SJE(word embedding)2014-09-30
Actor-agnostic Multi-label Action Recognition with Multi-modal Query✓ Link69.43MSQNet2023-07-20