OpenCodePapers

zero-shot-video-question-answer-on-next-gqa

Video Question AnsweringZero-Shot Video Question Answer
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAcc@GQAModelNameReleaseDate
Question-Answering Dense Video Events✓ Link28.9DeVi (Gemini 2.0)2024-09-06
VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning✓ Link28.2VideoMind(7B)2025-03-17
Question-Answering Dense Video Events✓ Link28.0DeVi (GPT-4)2024-09-06
A Simple LLM Framework for Long-Range Video Question-Answering✓ Link26.8LLoVi (GPT-4)2023-12-28
VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning✓ Link25.2VideoMind (2B)2025-03-17
Streaming Long Video Understanding with Large Language Models17.8VideoStreaming2024-05-25
Language Repository for Long Video Understanding✓ Link17.1LangRepo (12B)2024-03-21
A Simple LLM Framework for Long-Range Video Question-Answering✓ Link11.2LLoVi (7B)2023-12-28
Mistral 7B✓ Link9.2Mistral (7B)2023-10-10