OpenCodePapers

unsupervised-semantic-segmentation-with-7

Semantic SegmentationUnsupervised Semantic SegmentationUnsupervised Semantic Segmentation with Language-image Pre-training

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	mIoU	ModelName	ReleaseDate
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation	✓ Link	91.8	CorrCLIP	2024-11-15
TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models	✓ Link	89.5	TextRegion	2025-05-29
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation	✓ Link	88.7	Trident	2024-11-14
TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification	✓ Link	87.9	TagAlign	2023-12-21
ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation	✓ Link	83.3	ProxyCLIP	2024-08-09
Learning to Generate Text-grounded Mask for Open-world Semantic Segmentation from Only Image-Text Pairs	✓ Link	83.2	TCL	2022-12-01
GroupViT: Semantic Segmentation Emerges from Text Supervision	✓ Link	79.7	GroupViT (RedCaps)	2022-02-22
COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training	✓ Link	77.7	COSMOS ViT-B/16	2024-12-02
Extract Free Dense Labels from CLIP	✓ Link	74.9	MaskCLIP	2021-12-02
ReCo: Retrieve and Co-segment for Zero-shot Transfer	✓ Link	57.7	ReCo	2022-06-14