OpenCodePapers

question-answering-on-next-qa-open-ended

Question Answering
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracyConfidence ScoreModelNameReleaseDate
Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams✓ Link61.63.4Flash-VStream2024-06-12
Vista-LLaMA: Reliable Video Narrator via Equal Distance to Visual Tokens60.73.4Vista-LLaMA2023-12-12
VideoChat: Chat-Centric Video Understanding✓ Link56.63.2VideoChat2023-05-10
MovieChat+: Question-aware Sparse Memory for Long Video Question Answering✓ Link54.83.0MovieChat+2024-04-26
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models✓ Link54.63.2Video-ChatGPT2023-06-08
MovieChat: From Dense Token to Sparse Memory for Long Video Understanding✓ Link49.92.7MovieChat2023-07-31