Paper | Code | mIoU | Model Name | Release Date |
---|---|---|---|---|
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation | ✓ | 44.9 | CorrCLIP | 2024-11-15 |
TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models | ✓ | 41.2 | TextRegion | 2025-05-29 |
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation | ✓ | 40.1 | Trident | 2024-11-14 |
ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation | ✓ | 35.4 | ProxyCLIP | 2024-08-09 |