OpenCodePapers

Image Retrieval on PhotoChat

Image Retrieval
Leaderboard
| Paper | Code | R@1 | R@5 | R@10 | Sum(R@1,5,10) | Model | Release date |
|---|---|---|---|---|---|---|---|
| PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts | ✓ | 15.2 | 36.7 | 49.6 | 101.5 | PaCE | 2023-05-24 |
| VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts | ✓ | 11.5 | 30.0 | 39.4 | 83.2 | VLMo | 2021-11-03 |
| ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision | ✓ | 11.5 | 25.6 | 33.8 | 71.0 | ViLT | 2021-02-05 |
| Stacked Cross Attention for Image-Text Matching | ✓ | 10.4 | 27.0 | 37.1 | 74.5 | SCAN | 2018-03-21 |
| PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior for Joint Image-Text Modeling | | 9.0 | 26.4 | 35.7 | 71.1 | DE++ | 2021-07-06 |
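
The page does not define its metrics, so the following is only a minimal sketch under the usual retrieval convention: R@K is taken to be the percentage of dialogue queries whose ground-truth photo appears among the top K retrieved images, and Sum(R@1,5,10) is the sum of the three values. The function names `recall_at_k` and `recall_sum` and the example ranks are hypothetical and are not taken from any of the listed papers or their code.

```python
from typing import Sequence


def recall_at_k(ranks: Sequence[int], k: int) -> float:
    """Percentage of queries whose correct image is ranked within the top k.

    `ranks` holds the 1-based rank of the ground-truth image for each query
    (assumed convention; the leaderboard page does not spell this out).
    """
    if not ranks:
        raise ValueError("ranks must not be empty")
    hits = sum(1 for r in ranks if r <= k)
    return 100.0 * hits / len(ranks)


def recall_sum(ranks: Sequence[int], ks: Sequence[int] = (1, 5, 10)) -> float:
    """Sum of R@k over the given cutoffs, mirroring the Sum(R@1,5,10) column."""
    return sum(recall_at_k(ranks, k) for k in ks)


# Hypothetical ranks for five queries (illustrative only, not PhotoChat data).
example_ranks = [1, 3, 7, 12, 2]
r1 = recall_at_k(example_ranks, 1)    # 1 of 5 queries ranked first
r5 = recall_at_k(example_ranks, 5)    # 3 of 5 queries within the top 5
r10 = recall_at_k(example_ranks, 10)  # 4 of 5 queries within the top 10
total = recall_sum(example_ranks)
print(r1, r5, r10, total)
```

Under this definition the values can only grow with the cutoff, so R@1 ≤ R@5 ≤ R@10 for any single model evaluated on the same queries.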