OpenCodePapers

image-captioning-on-nocaps-xd-out-of-domain

Image Captioning
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeCIDErB1B2B3B4ROUGE-LMETEORSPICEModelNameReleaseDate
GIT: A Generative Image-to-text Transformer for Vision and Language✓ Link122.2786.2871.1552.3630.1560.9130.1515.62GIT22022-05-27
GIT: A Generative Image-to-text Transformer for Vision and Language✓ Link122.0485.9971.2852.6630.0460.9630.4515.7GIT2022-05-27
VIVO: Visual Vocabulary Pre-Training for Novel Object Captioning95.579.4461.1541.0321.7955.4926.5612.66Microsoft Cognitive Services team2020-09-28
[]()91.6274.8453.933.5116.651.526.8314.21Human
[]()90.3479.5961.0440.0919.6154.8626.1413.11VLAF2
[]()85.2875.5956.7135.6317.7251.9223.7711.28icp2ssi1_coco_si_0.02_5_test
[]()77.9474.553.6330.9113.4149.6623.4711.07test_cbs2
[]()66.6771.5748.5825.779.6847.1320.889.74UpDown + ELMo + CBS
[]()58.4865.9843.221.167.544.4719.048.77Neural Baby Talk + CBS
[]()48.7364.4542.821.487.9244.1118.318.2Neural Baby Talk
[]()30.0966.5444.2824.2310.1744.8418.298.08UpDown