OpenCodePapers

image-captioning-on-nocaps-val

Image Captioning
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeCIDErSPICEModelNameReleaseDate
Prismer: A Vision-Language Model with Multi-Task Experts✓ Link107.914.8Prismer2023-03-04
Language Models are General-Purpose Interfaces✓ Link58.78.6MetaLM2022-06-13
Unifying Vision-and-Language Tasks via Text Generation✓ Link4.4 5.3VL-T52021-02-04