OpenCodePapers

image-retrieval-on-coco

Image Retrieval
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCoderecall@1recall@5Recall@10QPSModelNameReleaseDate
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models✓ Link68.387.792.6BLIP-2 ViT-G (fine-tuned)2023-01-30
VisualSparta: An Embarrassingly Simple Approach to Large-scale Text-to-Image Search with Weighted Bag-of-words✓ Link68.291.896.3451.4VisualSparta2021-01-01
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models✓ Link66.386.591.8BLIP-2 ViT-L (fine-tuned)2023-01-30
FLAVA: A Foundational Language And Vision Alignment Model✓ Link38.3867.47FLAVA (zero-shot)2021-12-08
FLAVA: A Foundational Language And Vision Alignment Model✓ Link33.2962.47CLIP (zero-shot)2021-12-08
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks✓ Link98.3Oscar2020-04-13