OpenCodePapers

explanation-generation-on-whoops

Explanation Generation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeHuman (%)AccuracyModelNameReleaseDate
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images68Ground-truth Caption -> GPT3 (Oracle)2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images33Predicted Caption -> GPT32023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images27BLIP2 FlanT5-XXL (Fine-tuned)2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images15BLIP2 FlanT5-XL (Fine-tuned)2023-03-13
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images0BLIP2 FlanT5-XXL (Zero-shot)2023-03-13
VLIS: Unimodal Language Models Guide Multimodal Language Generation✓ Link80VLIS (Lynx)2023-10-15
VLIS: Unimodal Language Models Guide Multimodal Language Generation✓ Link73VLIS (LLaVA)2023-10-15