OpenCodePapers
image-retrieval-on-flickr30k
Image Retrieval
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
Show papers without code
Paper
Code
Recall@10
↕
Recall@5
↕
Recall@1
↕
Recall@Sum
↕
Image-to-text R@1
↕
Image-to-text R@10
↕
Image-to-text R@5
↕
QPS
↕
ModelName
ReleaseDate
↕
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
✓ Link
98.9
98.1
89.7
BLIP-2 ViT-G (zero-shot, 1K test set)
2023-01-30
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
✓ Link
98.9
97.6
88.6
BLIP-2 ViT-L (zero-shot, 1K test set)
2023-01-30
HADA: A Graph-based Amalgamation Framework in Image-text Retrieval
✓ Link
98.02
95.94
81.36
HADA
2023-01-11
MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks
✓ Link
98
96
82.5
94.9
99.9
99.5
MaMMUT (ours)
2023-03-29
HADA: A Graph-based Amalgamation Framework in Image-text Retrieval
✓ Link
97.72
95.3
79.76
ALBEF
2023-01-11
HADA: A Graph-based Amalgamation Framework in Image-text Retrieval
✓ Link
96.76
94.08
75.56
UNITER
2023-01-11
A Deep Local and Global Scene-Graph Matching for Image-Text Retrieval
✓ Link
90.2
84.1
57.4
231.7
LGSGM
2021-06-04
A Deep Local and Global Scene-Graph Matching for Image-Text Retrieval
✓ Link
89
82.3
228.7
GSMN
2021-06-04
VisualSparta: An Embarrassingly Simple Approach to Large-scale Text-to-Image Search with Weighted Bag-of-words
✓ Link
88.1
82.0
57.4
451.4
VisualSparta
2021-01-01