Paper | Code | Specificity | ModelName | ReleaseDate |
---|---|---|---|---|
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | 94 | BLIP2 FlanT5-XXL (Text-only FT) | 2023-03-13 | |
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | 84 | BLIP2 FlanT5-XXL (Fine-tuned) | 2023-03-13 | |
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | 81 | BLIP2 FlanT5-XL (Fine-tuned) | 2023-03-13 | |
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | 77 | BLIP Large | 2023-03-13 | |
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | 72 | CoCa ViT-L-14 MSCOCO | 2023-03-13 | |
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | 71 | BLIP2 FlanT5-XXL (Zero-shot) | 2023-03-13 | |
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | 70 | CLIP ViT-L/14 | 2023-03-13 |