OpenCodePapers

image-retrieval-on-flickr30k

Image Retrieval

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	Recall@10	Recall@5	Recall@1	Recall@Sum	Image-to-text R@1	Image-to-text R@10	Image-to-text R@5	QPS	ModelName	ReleaseDate
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models	✓ Link	98.9	98.1	89.7						BLIP-2 ViT-G (zero-shot, 1K test set)	2023-01-30
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models	✓ Link	98.9	97.6	88.6						BLIP-2 ViT-L (zero-shot, 1K test set)	2023-01-30
HADA: A Graph-based Amalgamation Framework in Image-text Retrieval	✓ Link	98.02	95.94	81.36						HADA	2023-01-11
MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks	✓ Link	98	96	82.5		94.9	99.9	99.5		MaMMUT (ours)	2023-03-29
HADA: A Graph-based Amalgamation Framework in Image-text Retrieval	✓ Link	97.72	95.3	79.76						ALBEF	2023-01-11
HADA: A Graph-based Amalgamation Framework in Image-text Retrieval	✓ Link	96.76	94.08	75.56						UNITER	2023-01-11
A Deep Local and Global Scene-Graph Matching for Image-Text Retrieval	✓ Link	90.2	84.1	57.4	231.7					LGSGM	2021-06-04
A Deep Local and Global Scene-Graph Matching for Image-Text Retrieval	✓ Link	89	82.3		228.7					GSMN	2021-06-04
VisualSparta: An Embarrassingly Simple Approach to Large-scale Text-to-Image Search with Weighted Bag-of-words	✓ Link	88.1	82.0	57.4					451.4	VisualSparta	2021-01-01