OpenCodePapers

Phrase Grounding on Flickr30k Entities Dev

Phrase Grounding
Dataset Link
Leaderboard
| Paper | Code | R@1 | R@10 | R@5 | Model Name | Release Date |
| --- | --- | --- | --- | --- | --- | --- |
| Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone | ✓ Link | 87.1 | 97.4 | 96.1 | Fiber-B | 2022-06-15 |
| PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models | ✓ Link | 84.1 | | | PEVL | 2022-05-23 |
| VisualBERT: A Simple and Performant Baseline for Vision and Language | ✓ Link | 70.4 | 86.31 | 84.49 | VisualBERT | 2019-08-09 |