OpenCodePapers

visual-question-answering-vqa-on-3

Visual Question Answering (VQA)
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeQuestion Pair AccQuestion Pair AccModelNameReleaseDate
HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models✓ Link12.2047GPT-4V2023-10-23
[]()4.3307LLaVA-1.5
Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning✓ Link1.57LRV-Instruct2023-06-26
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality✓ Link2.36mPLUG-Owl2023-04-27