OpenCodePapers

open-vocabulary-semantic-segmentation-on-7

Open Vocabulary Semantic Segmentation

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	mIoU	ModelName	ReleaseDate
SILC: Improving Vision Language Pretraining with Self-Distillation		25.8	SILC	2023-10-20
UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding	✓ Link	25.2	UMG-CLIP-E/14	2024-01-12
MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation	✓ Link	23.9	MaskCLIP++	2024-12-16
CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation	✓ Link	23.8	CAT-Seg	2023-03-21
UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding	✓ Link	23.2	UMG-CLIP-L/14	2024-01-12
Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation	✓ Link	22.7	Mask-Adapter	2024-12-05
SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation	✓ Link	22.6	SED	2023-11-27
Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation	✓ Link	21.6	MAFT+	2024-08-01
Open-Vocabulary Semantic Segmentation with Image Embedding Balancing	✓ Link	21.0	EBSeg-L	2024-06-14
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP	✓ Link	18.2	FC-CLIP	2023-08-04
Open-Vocabulary Segmentation with Semantic-Assisted Calibration	✓ Link	16.7	SCAN	2023-12-07
Learning Mask-aware CLIP Representations for Zero-Shot Segmentation	✓ Link	15.7	MAFT-ViTL	2023-09-30
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models	✓ Link	14.5	ODISE	2023-03-08
Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP	✓ Link	12.4	OVSeg Swin-B	2022-10-09
Open-Vocabulary Universal Image Segmentation with MaskCLIP	✓ Link	10	MaskCLIP	2022-08-18