OpenCodePapers

image-captioning-on-flickr30k-captions-test

Image Captioning
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeBLEU-4CIDErMETEORSPICEModelNameReleaseDate
Unified Vision-Language Pre-Training for Image Captioning and VQA✓ Link30.167.42317Unified VLP2019-09-24
Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention21.346.420.0-Cornia et al2017-06-26
Deep Visual-Semantic Alignments for Generating Image Descriptions✓ Link15.724.715.3-BRNN2014-12-07
[]()67.114.5KOSMOS-1 1.6B (zero-shot)
Language Models are General-Purpose Interfaces✓ Link43.311.7MetaLM2022-06-13
A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models✓ Link 31.010.0FewVLM2021-10-16
Unifying Vision-and-Language Tasks via Text Generation✓ Link2.62.0VL-T52021-02-04