Unsupervised Semantic Segmentation with Language-image Pre-training
Leaderboard
| Paper | Code | Mean IoU (val) | Model Name | Release Date |
| --- | --- | --- | --- | --- |
| CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation | ✓ | 30.7 | CorrCLIP | 2024-11-15 |
| TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models | ✓ | 27.3 | TextRegion | 2025-05-29 |
| Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation | ✓ | 26.7 | Trident | 2024-11-14 |
| ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation | ✓ | 24.2 | ProxyCLIP | 2024-08-09 |
| COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training | ✓ | 17.7 | COSMOS ViT-B/16 | 2024-12-02 |
| TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification | ✓ | 17.3 | TagAlign | 2023-12-21 |
| Learning to Generate Text-grounded Mask for Open-world Semantic Segmentation from Only Image-Text Pairs | ✓ | 17.1 | TCL | 2022-12-01 |
| TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias | ✓ | 17.0 | TTD (TCL) | 2024-03-30 |
| Perceptual Grouping in Contrastive Vision-Language Models | ✓ | 13.5 | CLIPpy ViT-B | 2022-10-18 |
| TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias | ✓ | 12.7 | TTD (MaskCLIP) | 2024-03-30 |
| ReCo: Retrieve and Co-segment for Zero-shot Transfer | ✓ | 11.2 | ReCo | 2022-06-14 |
| Extract Free Dense Labels from CLIP | ✓ | 9.8 | MaskCLIP | 2021-12-02 |
| GroupViT: Semantic Segmentation Emerges from Text Supervision | ✓ | 9.2 | GroupViT (RedCaps) | 2022-02-22 |