OpenCodePapers

Cross-Modal Retrieval on RSITMD
Leaderboard
| Paper | Code | Image-to-text R@1 | Mean Recall | Text-to-image R@1 | Model Name | Release Date |
|---|---|---|---|---|---|---|
| Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment | ✓ | 32.74% | 52.27% | 25.62% | HarMA (w/ GeoRSCLIP) | 2024-04-28 |
| RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing | ✓ | 32.30% | 51.81% | 25.04% | GeoRSCLIP-FT | 2023-06-20 |
| Global–Local Information Soft-Alignment for Cross-Modal Remote-Sensing Image–Text Retrieval | | 32.08% | 50.69% | 23.36% | GLISA | 2024-05-14 |
| RemoteCLIP: A Vision Language Foundation Model for Remote Sensing | ✓ | 28.76% | 50.52% | 23.76% | RemoteCLIP | 2023-06-19 |
| Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval | ✓ | 23.67% | 44.47% | 20.10% | PE-RSITR (MRS-Adapter) | 2023-08-24 |
| A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval | ✓ | 18.14% | 38.24% | 12.17% | PIR | 2023-10-27 |
| Direction-Oriented Visual-semantic Embedding Model for Remote Sensing Image-text Retrieval | | 16.81% | 37.73% | 12.20% | DOVE | 2023-10-12 |
| Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information | ✓ | 14.82% | 31.41% | 11.15% | GaLR | 2022-04-21 |
| Reducing Semantic Confusion: Scene-aware Aggregation Network for Remote Sensing Cross-modal Retrieval | ✓ | 13.35% | 34.11% | 11.24% | SWAN | 2023-06-12 |
| Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval | ✓ | 10.63% | 29.72% | 11.51% | AMFMN | 2022-04-21 |
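
The page does not define its metrics, so the sketch below only illustrates how figures like R@1 and Mean Recall are commonly computed for image-text retrieval. It assumes Recall@K is the percentage of queries with at least one correct item among the top K results, and that Mean Recall averages R@1, R@5, and R@10 over both retrieval directions; that averaging convention is an assumption, not something stated here. The function names `recall_at_k` and `mean_recall` and the toy similarity matrix are hypothetical.

```python
import numpy as np

def recall_at_k(sim, gt, k):
    """Percentage of queries with at least one correct candidate in the top k.

    sim: (n_queries, n_candidates) similarity matrix, higher means more similar.
    gt:  list of sets; gt[i] holds the correct candidate indices for query i.
    """
    # Indices of the k most similar candidates for each query
    topk = np.argsort(-sim, axis=1)[:, :k]
    hits = [len(set(row.tolist()) & g) > 0 for row, g in zip(topk, gt)]
    return 100.0 * float(np.mean(hits))

def mean_recall(sim_i2t, gt_i2t, sim_t2i, gt_t2i, ks=(1, 5, 10)):
    """Average Recall@K over both directions (assumed convention)."""
    scores = [recall_at_k(sim_i2t, gt_i2t, k) for k in ks]
    scores += [recall_at_k(sim_t2i, gt_t2i, k) for k in ks]
    return float(np.mean(scores))

# Toy data (made up): 2 images, 4 captions, 2 captions per image.
sim_i2t = np.array([[0.9, 0.1, 0.2, 0.3],
                    [0.8, 0.2, 0.7, 0.1]])
gt_i2t = [{0, 1}, {2, 3}]      # captions belonging to each image
sim_t2i = sim_i2t.T            # caption-to-image similarities
gt_t2i = [{0}, {0}, {1}, {1}]  # image belonging to each caption

i2t_r1 = recall_at_k(sim_i2t, gt_i2t, 1)
t2i_r1 = recall_at_k(sim_t2i, gt_t2i, 1)
mr = mean_recall(sim_i2t, gt_i2t, sim_t2i, gt_t2i, ks=(1, 2))
print(i2t_r1, t2i_r1, mr)
```

In a real evaluation the similarity matrix would come from a model's image and text embeddings, and the ground-truth sets from the dataset's image-caption pairing.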