OpenCodePapers
cross-modal-retrieval-on-rsitmd
Cross-Modal Retrieval
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
Show papers without code
Paper
Code
Image-to-text R@1
↕
Mean Recall
↕
text-to-imageR@1
↕
ModelName
ReleaseDate
↕
Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment
✓ Link
32.74%
52.27%
25.62%
HarMA (w/ GeoRSCLIP)
2024-04-28
RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing
✓ Link
32.30%
51.81%
25.04%
GeoRSCLIP-FT
2023-06-20
Global–Local Information Soft-Alignment for Cross-Modal Remote-Sensing Image–Text Retrieval
32.08%
50.69%
23.36%
GLISA
2024-05-14
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing
✓ Link
28.76%
50.52%
23.76%
RemoteCLIP
2023-06-19
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval
✓ Link
23.67%
44.47%
20.10%
PE-RSITR (MRS-Adapter)
2023-08-24
A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval
✓ Link
18.14%
38.24%
12.17%
PIR
2023-10-27
Direction-Oriented Visual-semantic Embedding Model for Remote Sensing Image-text Retrieval
16.81%
37.73%
12.20%
DOVE
2023-10-12
Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information
✓ Link
14.82%
31.41%
11.15%
GaLR
2022-04-21
Reducing Semantic Confusion: Scene-aware Aggregation Network for Remote Sensing Cross-modal Retrieval
✓ Link
13.35%
34.11%
11.24%
SWAN
2023-06-12
Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval
✓ Link
10.63%
29.72%
11.51%
AMFMN
2022-04-21