OpenCodePapers

image-retrieval-on-flickr30k-1k-test

Image Retrieval
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeR@1R@5R@10ModelNameReleaseDate
Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts✓ Link86.997.398.7X-VLM (base)2021-11-16
Plug-and-Play Regulators for Image-Text Matching✓ Link62.685.891.1RCAR2023-03-23
Similarity Reasoning and Filtration for Image-Text Matching✓ Link58.583.088.8SGRAF2021-01-05
A Deep Local and Global Scene-Graph Matching for Image-Text Retrieval✓ Link57.484.190.2LGSGM2021-06-04
VisualSparta: An Embarrassingly Simple Approach to Large-scale Text-to-Image Search with Weighted Bag-of-words✓ Link57.482.088.1VisualSparta2021-01-01
Fine-grained Visual Textual Alignment for Cross-Modal Retrieval using Transformer Encoders✓ Link56.581.288.2TERAN MrSw2020-08-12
Fine-grained Visual Textual Alignment for Cross-Modal Retrieval using Transformer Encoders✓ Link55.783.189.3TERAN Symm.2020-08-12
Visual Semantic Reasoning for Image-Text Matching✓ Link54.781.888.2VSRN2019-09-06
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval✓ Link51.577.185.3CAMP2019-09-12
Stacked Cross Attention for Image-Text Matching✓ Link44.074.282.6SCAN i-t2018-03-21
Learning Semantic Concepts and Order for Image and Sentence Matching41.170.580.1SCO2017-12-06
Dual Attention Networks for Multimodal Reasoning and Matching✓ Link39.469.279.1DAN2016-11-02
Linking Image and Text with 2-Way Nets✓ Link36.02WayNet (VGG)2016-08-29
Instance-aware Image and Sentence Matching with Selective Multimodal LSTM30.272.3SM-LSTM (VGG)2016-11-17
Learning Deep Structure-Preserving Image-Text Embeddings29.760.172.1SPE2015-11-19
Multimodal Convolutional Neural Networks for Matching Image and Sentence✓ Link26.256.369.6mCNN2015-04-23
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models✓ Link24.753.466.8HGLMM FV2015-05-19
Deep Visual-Semantic Alignments for Generating Image Descriptions✓ Link15.250.5DVSA (R-CNN, AlexNet)2014-12-07