open-vocabulary-object-detection-on-mscoco

Object DetectionOpen Vocabulary Object Detection

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	AP 0.5	ModelName	ReleaseDate
Enhancing Novel Object Detection via Cooperative Foundational Models	✓ Link	50.3	Cooperative Foundational Models	2023-11-19
Detect Everything with Few Examples	✓ Link	50	DE-ViT	2023-09-22
YOLOv8-Based Visual Detection of Road Hazards: Potholes, Sewer Covers, and Manholes		47.2	Yolov8-nano	2023-10-31
Region-centric Image-Language Pretraining for Open-Vocabulary Detection	✓ Link	46.1	DITO	2023-09-29
OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision	✓ Link	45.6	OV-DQUO(RN50x4)	2024-05-28
LP-OVOD: Open-Vocabulary Object Detection by Linear Probing	✓ Link	44.9	LP-OVOD (OWL-ViT Proposals)	2023-10-26
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction	✓ Link	44.3	CLIPSelf	2023-10-02
CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching	✓ Link	43.1	CORA+	2023-03-23
Aligning Bag of Regions for Open-Vocabulary Object Detection	✓ Link	42.7	BARON	2023-02-27
SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary Detection	✓ Link	41.9	SIA-OVD (RN50x4)	2024-10-08
CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching	✓ Link	41.7	CORA	2023-03-23
Retrieval-Augmented Open-Vocabulary Object Detection	✓ Link	41.3	RALF	2024-04-08
LP-OVOD: Open-Vocabulary Object Detection by Linear Probing	✓ Link	40.5	LP-OVOD	2023-10-26
RegionCLIP: Region-based Language-Image Pretraining	✓ Link	39.3	Region-CLIP (RN50x4-C4)	2021-12-16
OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision	✓ Link	39.2	OV-DQUO(R50)	2024-05-28
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection	✓ Link	36.9	Object-Centric-OVD	2022-07-07
CLIM: Contrastive Language-Image Mosaic for Region Representation	✓ Link	36.9	CLIM (RN50)	2023-12-18
Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection	✓ Link	35.6	OADP (G-OVD)	2023-03-10
SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary Detection	✓ Link	35.5	SIA-OVD (RN50)	2024-10-08
Exploiting Unlabeled Data with Vision and Language Models for Object Detection	✓ Link	34.4	VL-PLM (RN50)	2022-07-18
Contrastive Feature Masking Open-Vocabulary Vision Transformer		34.1	CFM-ViT	2023-09-02
Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization	✓ Link	32.6	MEDet (RN50)	2022-06-22
RegionCLIP: Region-based Language-Image Pretraining	✓ Link	31.4	Region-CLIP (RN50-C4)	2021-12-16
Open-vocabulary Attribute Detection	✓ Link	30.0	OVAD-Baseline	2022-11-23
Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection	✓ Link	30.0	OADP	2023-03-10
Open-Vocabulary DETR with Conditional Matching	✓ Link	29.4	OV-DERT	2022-03-22
Localized Vision-Language Matching for Open-vocabulary Object Detection	✓ Link	28.6	LocOv (RN50-C4)	2022-05-12
Detecting Twenty-thousand Classes using Image-level Supervision	✓ Link	27.8	Detic	2022-01-07
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation	✓ Link	27.6	ViLD	2021-04-28
Open-Vocabulary Object Detection Using Captions	✓ Link	22.8	OVR-CNN	2020-11-20
Open-Vocabulary One-Stage Detection with Hierarchical Visual-Language Knowledge Distillation	✓ Link	20.3	HierKD	2022-03-20
YOLOv8-AM: YOLOv8 Based on Effective Attention Mechanisms for Pediatric Wrist Fracture Detection	✓ Link	0.5	Yolov8	2024-02-14

OpenCodePapers

open-vocabulary-object-detection-on-mscoco