OpenCodePapers

unsupervised-semantic-segmentation-with-7

Semantic SegmentationUnsupervised Semantic SegmentationUnsupervised Semantic Segmentation with Language-image Pre-training
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodemIoUModelNameReleaseDate
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation✓ Link91.8CorrCLIP2024-11-15
TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models✓ Link89.5TextRegion2025-05-29
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation✓ Link88.7Trident2024-11-14
TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification✓ Link87.9TagAlign2023-12-21
ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation✓ Link83.3ProxyCLIP2024-08-09
Learning to Generate Text-grounded Mask for Open-world Semantic Segmentation from Only Image-Text Pairs✓ Link83.2TCL2022-12-01
GroupViT: Semantic Segmentation Emerges from Text Supervision✓ Link79.7GroupViT (RedCaps)2022-02-22
COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training✓ Link77.7COSMOS ViT-B/162024-12-02
Extract Free Dense Labels from CLIP✓ Link74.9MaskCLIP2021-12-02
ReCo: Retrieve and Co-segment for Zero-shot Transfer✓ Link57.7ReCo2022-06-14