OpenCodePapers

visual-question-answering-vqa-on-whoops

Visual Question Answering (VQA)

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	Exact Match	BEM	ModelName	ReleaseDate
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images		21	57	BLIP2 FlanT5-XXL (Fine-tuned)	2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images		20	55	BLIP2 FlanT5-XL (Fine-tuned)	2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images		15	55	BLIP2 FlanT5-XXL (Zero-shot)	2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images		8	38	OFA Large	2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images		6	39	BLIP Large	2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images		4	24	BLIP2 FlanT5-XXL (Text-only FT)	2023-03-13