OpenCodePapers

Open Vocabulary Object Detection on MSCOCO

Object Detection > Open Vocabulary Object Detection
Leaderboard
Paper | Code | AP 0.5 | Model Name | Release Date
Enhancing Novel Object Detection via Cooperative Foundational Models | ✓ | 50.3 | Cooperative Foundational Models | 2023-11-19
Detect Everything with Few Examples | ✓ | 50 | DE-ViT | 2023-09-22
YOLOv8-Based Visual Detection of Road Hazards: Potholes, Sewer Covers, and Manholes | - | 47.2 | Yolov8-nano | 2023-10-31
Region-centric Image-Language Pretraining for Open-Vocabulary Detection | ✓ | 46.1 | DITO | 2023-09-29
OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision | ✓ | 45.6 | OV-DQUO(RN50x4) | 2024-05-28
LP-OVOD: Open-Vocabulary Object Detection by Linear Probing | ✓ | 44.9 | LP-OVOD (OWL-ViT Proposals) | 2023-10-26
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction | ✓ | 44.3 | CLIPSelf | 2023-10-02
CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching | ✓ | 43.1 | CORA+ | 2023-03-23
Aligning Bag of Regions for Open-Vocabulary Object Detection | ✓ | 42.7 | BARON | 2023-02-27
SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary Detection | ✓ | 41.9 | SIA-OVD (RN50x4) | 2024-10-08
CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching | ✓ | 41.7 | CORA | 2023-03-23
Retrieval-Augmented Open-Vocabulary Object Detection | ✓ | 41.3 | RALF | 2024-04-08
LP-OVOD: Open-Vocabulary Object Detection by Linear Probing | ✓ | 40.5 | LP-OVOD | 2023-10-26
RegionCLIP: Region-based Language-Image Pretraining | ✓ | 39.3 | Region-CLIP (RN50x4-C4) | 2021-12-16
OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision | ✓ | 39.2 | OV-DQUO(R50) | 2024-05-28
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection | ✓ | 36.9 | Object-Centric-OVD | 2022-07-07
CLIM: Contrastive Language-Image Mosaic for Region Representation | ✓ | 36.9 | CLIM (RN50) | 2023-12-18
Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection | ✓ | 35.6 | OADP (G-OVD) | 2023-03-10
SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary Detection | ✓ | 35.5 | SIA-OVD (RN50) | 2024-10-08
Exploiting Unlabeled Data with Vision and Language Models for Object Detection | ✓ | 34.4 | VL-PLM (RN50) | 2022-07-18
Contrastive Feature Masking Open-Vocabulary Vision Transformer | - | 34.1 | CFM-ViT | 2023-09-02
Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization | ✓ | 32.6 | MEDet (RN50) | 2022-06-22
RegionCLIP: Region-based Language-Image Pretraining | ✓ | 31.4 | Region-CLIP (RN50-C4) | 2021-12-16
Open-vocabulary Attribute Detection | ✓ | 30.0 | OVAD-Baseline | 2022-11-23
Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection | ✓ | 30.0 | OADP | 2023-03-10
Open-Vocabulary DETR with Conditional Matching | ✓ | 29.4 | OV-DETR | 2022-03-22
Localized Vision-Language Matching for Open-vocabulary Object Detection | ✓ | 28.6 | LocOv (RN50-C4) | 2022-05-12
Detecting Twenty-thousand Classes using Image-level Supervision | ✓ | 27.8 | Detic | 2022-01-07
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation | ✓ | 27.6 | ViLD | 2021-04-28
Open-Vocabulary Object Detection Using Captions | ✓ | 22.8 | OVR-CNN | 2020-11-20
Open-Vocabulary One-Stage Detection with Hierarchical Visual-Language Knowledge Distillation | ✓ | 20.3 | HierKD | 2022-03-20
YOLOv8-AM: YOLOv8 Based on Effective Attention Mechanisms for Pediatric Wrist Fracture Detection | ✓ | 0.5 | Yolov8 | 2024-02-14