OpenCodePapers

Cross-Modal Retrieval on RSICD

Leaderboard

Paper	Code	Mean Recall	Image-to-Text R@1	Text-to-Image R@1	Model Name	Release Date
Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment	✓ Link	38.95%	20.52%	15.84%	HarMA (w/ GeoRSCLIP)	2024-04-28
RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing	✓ Link	38.87%	21.13%	15.59%	GeoRSCLIP-FT	2023-06-20
Global–Local Information Soft-Alignment for Cross-Modal Remote-Sensing Image–Text Retrieval		37.69%	20.68%	14.73%	GLISA	2024-05-14
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing	✓ Link	36.35%	18.39%	14.73%	RemoteCLIP	2023-06-19
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval	✓ Link	31.12%	14.13%	11.63%	PE-RSITR (MRS-Adapter)	2023-08-24
A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval	✓ Link	24.46%	9.88%	6.97%	PIR	2023-10-27
Direction-Oriented Visual-semantic Embedding Model for Remote Sensing Image-text Retrieval		22.72%	8.66%	6.04%	DOVE	2023-10-12
Reducing Semantic Confusion: Scene-aware Aggregation Network for Remote Sensing Cross-modal Retrieval	✓ Link	20.61%	7.41%	5.56%	SWAN	2023-06-12
Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information	✓ Link	18.96%	6.59%	4.69%	GaLR	2022-04-21
Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval	✓ Link	15.53%	5.21%	4.08%	AMFMN	2022-04-21
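
The leaderboard does not say how its columns are computed. As a rough sketch, assuming the convention common in remote-sensing image-text retrieval papers (Mean Recall is the average of R@1, R@5, and R@10 over both retrieval directions), the metrics could be derived from an image-by-text similarity matrix as follows. The function names, the `relevant` mapping, and the toy data are illustrative assumptions, not taken from any of the listed papers.

```python
from typing import Sequence, Set


def recall_at_k(sim: Sequence[Sequence[float]],
                relevant: Sequence[Set[int]],
                k: int) -> float:
    """Percentage of queries (rows of sim) whose top-k columns
    contain at least one relevant item."""
    hits = 0
    for scores, rel in zip(sim, relevant):
        # Column indices sorted by descending similarity, truncated to k.
        top_k = sorted(range(len(scores)),
                       key=lambda j: scores[j], reverse=True)[:k]
        if rel.intersection(top_k):
            hits += 1
    return 100.0 * hits / len(sim)


def mean_recall(sim: Sequence[Sequence[float]],
                img_to_txt: Sequence[Set[int]],
                ks: Sequence[int] = (1, 5, 10)) -> float:
    """Average of R@k over both directions (assumed convention)."""
    # Text-to-image direction: transpose the matrix and invert the mapping.
    sim_t = [list(col) for col in zip(*sim)]
    txt_to_img = [set() for _ in sim_t]
    for img_idx, captions in enumerate(img_to_txt):
        for txt_idx in captions:
            txt_to_img[txt_idx].add(img_idx)

    recalls = [recall_at_k(sim, img_to_txt, k) for k in ks]
    recalls += [recall_at_k(sim_t, txt_to_img, k) for k in ks]
    return sum(recalls) / len(recalls)


# Toy data (hypothetical): 2 images, 4 captions, 2 captions per image.
sim = [
    [0.9, 0.2, 0.8, 0.1],  # image 0 vs. captions 0..3
    [0.3, 0.7, 0.4, 0.6],  # image 1 vs. captions 0..3
]
img_to_txt = [{0, 1}, {2, 3}]

i2t_r1 = recall_at_k(sim, img_to_txt, k=1)
mr = mean_recall(sim, img_to_txt, ks=(1, 2))
```

An image-to-text query counts as a hit when any of the image's captions appears in the top k, which is why `relevant` holds a set per query rather than a single index. Under this assumed definition, Mean Recall also depends on R@5 and R@10, which would explain why the table's ordering by Mean Recall does not always match the ordering by R@1.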