OpenCodePapers

video-question-answering-on-how2qa

Video Question Answering
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracyModelNameReleaseDate
Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval✓ Link93.2Text + Text (no Multimodal Pretext Training)2022-06-05
Zero-Shot Video Question Answering via Frozen Bidirectional Language Models✓ Link86.7FrozenBiLM2022-06-16
Just Ask: Learning to Answer Questions from Millions of Narrated Videos✓ Link84.4Just Ask2020-12-01
[]()83.7SeViLA
HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training✓ Link77.75Hero w/ pre-training2020-05-01
Revisiting the "Video" in Video-Language Understanding✓ Link65.1ATP2022-06-03
Zero-Shot Video Question Answering via Frozen Bidirectional Language Models✓ Link58.4FrozenBiLM (0-shot)2022-06-16
Just Ask: Learning to Answer Questions from Millions of Narrated Videos✓ Link51.1Just Ask (0-shot)2020-12-01