OpenCodePapers

Unsupervised Semantic Segmentation with Language-image Pre-training

Leaderboard

Paper	Code	mIoU	Pixel accuracy	Model name	Release date
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation	✓ Link	51.1		CorrCLIP	2024-11-15
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation	✓ Link	47.6		Trident	2024-11-14
ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation	✓ Link	42.0		ProxyCLIP	2024-08-09
COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training	✓ Link	34.7		COSMOS ViT-B/16	2024-12-02
TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias	✓ Link	32.0		TTD (MaskCLIP)	2024-03-30
TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification	✓ Link	27.5		TagAlign	2023-12-21
TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias	✓ Link	27.0		TTD (TCL)	2024-03-30
ReCo: Retrieve and Co-segment for Zero-shot Transfer	✓ Link	24.2	83.7	ReCo+	2022-06-14
Learning to Generate Text-grounded Mask for Open-world Semantic Segmentation from Only Image-Text Pairs	✓ Link	24.0		TCL	2022-12-01
ReCo: Retrieve and Co-segment for Zero-shot Transfer	✓ Link	19.3	74.6	ReCo	2022-06-14
Perceptual Grouping in Contrastive Vision-Language Models	✓ Link	18.1		CLIPpy ViT-B	2022-10-18
Extract Free Dense Labels from CLIP	✓ Link	10.0	35.9	MaskCLIP	2021-12-02