OpenCodePapers

Image-to-Text Retrieval on WHOOPS!

Leaderboard

Paper	Code	Specificity	Model Name	Release Date
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images		94	BLIP2 FlanT5-XXL (Text-only FT)	2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images		84	BLIP2 FlanT5-XXL (Fine-tuned)	2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images		81	BLIP2 FlanT5-XL (Fine-tuned)	2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images		77	BLIP Large	2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images		72	CoCa ViT-L-14 MSCOCO	2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images		71	BLIP2 FlanT5-XXL (Zero-shot)	2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images		70	CLIP ViT-L/14	2023-03-13