OpenCodePapers

cross-modal-retrieval-on-rsicd

Cross-Modal Retrieval
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeMean RecallImage-to-text R@1text-to-image R@1ModelNameReleaseDate
Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment✓ Link38.95%20.52%15.84%HarMA (w/ GeoRSCLIP)2024-04-28
RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing✓ Link38.87%21.13%15.59%GeoRSCLIP-FT2023-06-20
Global–Local Information Soft-Alignment for Cross-Modal Remote-Sensing Image–Text Retrieval37.69%20.68%14.73%GLISA2024-05-14
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing✓ Link36.35%18.39%14.73%RemoteCLIP2023-06-19
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval✓ Link31.12%14.13%11.63%PE-RSITR (MRS-Adapter)2023-08-24
A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval✓ Link24.46%9.88%6.97%PIR2023-10-27
Direction-Oriented Visual-semantic Embedding Model for Remote Sensing Image-text Retrieval22.72%8.66%6.04%DOVE2023-10-12
Reducing Semantic Confusion: Scene-aware Aggregation Network for Remote Sensing Cross-modal Retrieval✓ Link20.61%7.41%5.56%SWAN2023-06-12
Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information✓ Link18.96%6.59%4.69%GaLR2022-04-21
Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval✓ Link15.53%5.21%4.08%AMFMN2022-04-21