OpenCodePapers

visual-question-answering-vqa-on-5

Visual Question Answering (VQA)
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeOverall AccuracyModelNameReleaseDate
AutoHallusion: Automatic Generation of Hallucination Benchmarks for Vision-Language Models✓ Link66.0GPT-4V2024-06-16
[]()51.4Gemini Pro Vision
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models✓ Link51.0miniGPT42023-04-20
Improved Baselines with Visual Instruction Tuning✓ Link44.5LLaVA-1.52023-10-05
[]()37.1Claude 3