OpenCodePapers
cross-modal-retrieval-on-rsicd
Cross-Modal Retrieval
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
Show papers without code
Paper
Code
Mean Recall
↕
Image-to-text R@1
↕
text-to-image R@1
↕
ModelName
ReleaseDate
↕
Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment
✓ Link
38.95%
20.52%
15.84%
HarMA (w/ GeoRSCLIP)
2024-04-28
RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing
✓ Link
38.87%
21.13%
15.59%
GeoRSCLIP-FT
2023-06-20
Global–Local Information Soft-Alignment for Cross-Modal Remote-Sensing Image–Text Retrieval
37.69%
20.68%
14.73%
GLISA
2024-05-14
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing
✓ Link
36.35%
18.39%
14.73%
RemoteCLIP
2023-06-19
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval
✓ Link
31.12%
14.13%
11.63%
PE-RSITR (MRS-Adapter)
2023-08-24
A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval
✓ Link
24.46%
9.88%
6.97%
PIR
2023-10-27
Direction-Oriented Visual-semantic Embedding Model for Remote Sensing Image-text Retrieval
22.72%
8.66%
6.04%
DOVE
2023-10-12
Reducing Semantic Confusion: Scene-aware Aggregation Network for Remote Sensing Cross-modal Retrieval
✓ Link
20.61%
7.41%
5.56%
SWAN
2023-06-12
Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information
✓ Link
18.96%
6.59%
4.69%
GaLR
2022-04-21
Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval
✓ Link
15.53%
5.21%
4.08%
AMFMN
2022-04-21