Visual Question Answering (VQA) on AI2D

Leaderboard

Paper	Code	EM (exact match, %)	Model	Release date
Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts		82.5	SMoLA-PaLI-X Specialist Model	2023-12-01
Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts		81.4	SMoLA-PaLI-X Generalist Model	2023-12-01
Gemini: A Family of Highly Capable Multimodal Models	✓ Link	79.5	Gemini Ultra	2023-12-19
DUBLIN -- Document Understanding By Language-Image Network		51.11	DUBLIN	2023-05-23
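
For readers who want to work with these numbers programmatically, the following is a minimal sketch that loads the four rows above and reports each model's gap to the top score. The scores, model names, and dates are copied from the table; the names LEADERBOARD_TSV, load_rows, and rank_by_em are hypothetical and chosen only for this illustration.

```python
import csv
import io

# Rows copied from the leaderboard above (EM, model name, release date).
# The variable and column names here are illustrative assumptions.
LEADERBOARD_TSV = (
    "EM\tModel\tReleaseDate\n"
    "82.5\tSMoLA-PaLI-X Specialist Model\t2023-12-01\n"
    "81.4\tSMoLA-PaLI-X Generalist Model\t2023-12-01\n"
    "79.5\tGemini Ultra\t2023-12-19\n"
    "51.11\tDUBLIN\t2023-05-23\n"
)


def load_rows(tsv_text):
    """Parse tab-separated leaderboard text into a list of dicts."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    return [
        {"model": r["Model"], "em": float(r["EM"]), "released": r["ReleaseDate"]}
        for r in reader
    ]


def rank_by_em(rows):
    """Return rows sorted by EM score, highest first."""
    return sorted(rows, key=lambda r: r["em"], reverse=True)


rows = rank_by_em(load_rows(LEADERBOARD_TSV))
best = rows[0]
for r in rows:
    gap = best["em"] - r["em"]
    print(f'{r["model"]}: EM {r["em"]:.2f} (gap to best: {gap:.2f})')
```

Sorting explicitly, rather than relying on the order in which the rows are listed, keeps the ranking correct if further entries are appended later.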
