OpenCodePapers

video-question-answering-on-ivqa

Video Question Answering
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracyModelNameReleaseDate
Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval✓ Link40.2Text + Text (no Multimodal Pretext Training)2022-06-05
Zero-Shot Video Question Answering via Frozen Bidirectional Language Models✓ Link39.6FrozenBiLM2022-06-16
VideoCoCa: Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners39.0VideoCoCa2022-12-09
Video Question Answering with Iterative Video-Text Co-Tokenization38.2Co-Tokenization2022-08-01
Just Ask: Learning to Answer Questions from Millions of Narrated Videos✓ Link35.4Just Ask (fine-tune)2020-12-01
Zero-Shot Video Question Answering via Frozen Bidirectional Language Models✓ Link26.8FrozenBiLM (0-shot)2022-06-16
Just Ask: Learning to Answer Questions from Millions of Narrated Videos✓ Link12.2Just Ask (0-shot)2020-12-01