OpenCodePapers

Open Vocabulary Panoptic Segmentation

Leaderboard

Paper	Code	PQ	Model Name	Release Date
UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding	✓ Link	31.6	UMG-CLIP-E/14	2024-01-12
PosSAM: Panoptic Open-vocabulary Segment Anything	✓ Link	29.2	PosSAM	2024-03-14
UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding	✓ Link	29.1	UMG-CLIP-L/14	2024-01-12
Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation	✓ Link	27.1	MAFT+	2024-08-01
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP	✓ Link	26.8	FC-CLIP	2023-08-04
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction	✓ Link	23.7	CLIPSelf	2023-10-02
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models	✓ Link	23.4	ODISE (Caption)	2023-03-08
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models	✓ Link	22.6	ODISE (Label)	2023-03-08
FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation		16.3	FreeSeg	2023-03-30
Extract Free Dense Labels from CLIP	✓ Link	15.1	MaskCLIP	2021-12-02