OpenCodePapers

human-judgment-correlation-on-flickr8k-expert

Human Judgment Correlation
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeKendall's Tau-cModelNameReleaseDate
Mutual Information Divergence: A Unified Metric for Multimodal Generative Models✓ Link54.9MID2022-05-25
FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing✓ Link54.2SoftSPICE2023-05-27
CLIPScore: A Reference-free Evaluation Metric for Image Captioning✓ Link53.0RefCLIP-S2021-04-18
CLIPScore: A Reference-free Evaluation Metric for Image Captioning✓ Link51.2CLIP-S2021-04-18