OpenCodePapers

cross-modal-retrieval-on-rsitmd

Cross-Modal Retrieval

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	Image-to-text R@1	Mean Recall	text-to-imageR@1	ModelName	ReleaseDate
Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment	✓ Link	32.74%	52.27%	25.62%	HarMA (w/ GeoRSCLIP)	2024-04-28
RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing	✓ Link	32.30%	51.81%	25.04%	GeoRSCLIP-FT	2023-06-20
Global–Local Information Soft-Alignment for Cross-Modal Remote-Sensing Image–Text Retrieval		32.08%	50.69%	23.36%	GLISA	2024-05-14
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing	✓ Link	28.76%	50.52%	23.76%	RemoteCLIP	2023-06-19
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval	✓ Link	23.67%	44.47%	20.10%	PE-RSITR (MRS-Adapter)	2023-08-24
A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval	✓ Link	18.14%	38.24%	12.17%	PIR	2023-10-27
Direction-Oriented Visual-semantic Embedding Model for Remote Sensing Image-text Retrieval		16.81%	37.73%	12.20%	DOVE	2023-10-12
Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information	✓ Link	14.82%	31.41%	11.15%	GaLR	2022-04-21
Reducing Semantic Confusion: Scene-aware Aggregation Network for Remote Sensing Cross-modal Retrieval	✓ Link	13.35%	34.11%	11.24%	SWAN	2023-06-12
Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval	✓ Link	10.63%	29.72%	11.51%	AMFMN	2022-04-21