OpenCodePapers

image-to-text-retrieval-on-whoops

Image-to-Text Retrieval
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeSpecificityModelNameReleaseDate
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images94BLIP2 FlanT5-XXL (Text-only FT)2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images84BLIP2 FlanT5-XXL (Fine-tuned)2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images81BLIP2 FlanT5-XL (Fine-tuned)2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images77BLIP Large2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images72CoCa ViT-L-14 MSCOCO2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images71BLIP2 FlanT5-XXL (Zero-shot)2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images70CLIP ViT-L/142023-03-13