open-vocabulary-semantic-segmentation-on-5

Open Vocabulary Semantic Segmentation

Leaderboard

Paper	Code	mIoU	hIoU	Model Name	Release Date
UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding	✓ Link	97.9		UMG-CLIP-L/14	2024-01-12
SILC: Improving Vision Language Pretraining with Self-Distillation		97.6		SILC	2023-10-20
Open-Vocabulary Segmentation with Semantic-Assisted Calibration	✓ Link	97.2		SCAN	2023-12-07
CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation	✓ Link	97.0		CAT-Seg	2023-03-21
MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation	✓ Link	96.8		MaskCLIP++	2024-12-16
Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation	✓ Link	96.5		MAFT+	2024-08-01
Open-Vocabulary Semantic Segmentation with Image Embedding Balancing	✓ Link	96.4		EBSeg-L	2024-06-14
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP	✓ Link	95.4		FC-CLIP	2023-08-04
Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP	✓ Link	94.5		OVSeg Swin-B	2022-10-09
Learning Mask-aware CLIP Representations for Zero-Shot Segmentation	✓ Link	92.1		MAFT-ViTL	2023-09-30
HyperSeg: Towards Universal Visual Segmentation with Large Language Model	✓ Link	92.1		HyperSeg	2024-11-26
Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition	✓ Link	89.4	84.4	POMP	2023-04-10
TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification	✓ Link	87.9		TagAlign (trained with image-text pairs)	2023-12-21
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models	✓ Link	84.6		ODISE	2023-03-08
Learning to Generate Text-grounded Mask for Open-world Semantic Segmentation from Only Image-Text Pairs	✓ Link	83.2		TCL	2022-12-01
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation	✓ Link	82.5		LaVG	2024-08-09
Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning	✓ Link	72.3		PACL	2022-12-09
A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language Model	✓ Link		77.5	ZSSeg	2021-12-29
Decoupling Zero-Shot Semantic Segmentation	✓ Link		73.3	ZegFormer	2021-12-15

OpenCodePapers
