OpenCodePapers

object-detection-on-coco-o

Object Detection
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAverage mAPEffective RobustnessModelNameReleaseDate
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale✓ Link57.828.86EVA2022-11-14
NMS Strikes Back✓ Link48.520.15DETA (Swin-L)2022-12-12
Grounded Language-Image Pre-training✓ Link48.024.89GLIP-L (Swin-L)2021-12-07
GRiT: A Generative Region-to-text Transformer for Object Understanding✓ Link42.915.72GRiT (ViT-H)2022-12-01
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection✓ Link42.115.76DINO (Swin-L)2022-03-07
CBNet: A Composite Backbone Network Architecture for Object Detection✓ Link39.012.36CBNetV2 (Swin-L)2021-07-01
A ConvNet for the 2020s✓ Link37.512.68ConvNeXt-XL (Cascade Mask R-CNN)2022-01-10
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions✓ Link37.011.72InternImage-L (Cascade Mask R-CNN)2022-11-10
Dynamic Head: Unifying Object Detection Heads with Attentions✓ Link35.310.00DyHead (Swin-L)2021-06-15
Exploring Plain Vision Transformer Backbones for Object Detection✓ Link34.3ViTDet (ViT-H)2022-03-30
Vision Transformer Adapter for Dense Predictions✓ Link34.257.79ViT-Adapter (BEiTv2-L)2022-05-17
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone✓ Link33.711.43FIBER-B (Swin-B)2022-06-15
Instances as Queries✓ Link33.28.26QueryInst (Swin-L)2021-05-05
YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications✓ Link32.56.73YOLOv6-L62022-09-07
YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors✓ Link32.06.42YOLOv7-E6E2022-07-06
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection✓ Link30.95.62MViTV2-H (Cascade Mask R-CNN)2021-12-02
Robust and Accurate Object Detection via Adversarial Learning✓ Link30.87.34Det-AdvProp (EfficientNet-B5)2021-03-23
YOLOv4: Optimal Speed and Accuracy of Object Detection✓ Link30.45.89YOLOv4-P62020-04-23
YOLOX: Exceeding YOLO Series in 2021✓ Link30.37.26YOLOX-X2021-07-18
Probabilistic two-stage detection✓ Link29.54.29CenterNet2 (R2-101-DCN)2021-03-12
Grounded Language-Image Pre-training✓ Link29.18.11GLIP-T (Swin-T)2021-12-07
EfficientDet: Scalable and Efficient Object Detection✓ Link28.55.44EfficientDet-D5 (EfficientNet-B5)2019-11-20
PVT v2: Improved Baselines with Pyramid Vision Transformer✓ Link28.26.85PVTv2-B5 (Mask R-CNN)2021-06-25
VarifocalNet: An IoU-aware Dense Object Detector✓ Link28.05.27VFNet (RX-101-64x4d)2020-08-31
GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond✓ Link26.04.38GCNet (RX-101-32x4d-DCN)2019-04-25
Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection✓ Link25.12.6GFLv2 (R2-101-DCN)2020-11-25
RepPoints V2: Verification Meets Regression for Object Detection✓ Link24.92.7RepPointsV2 (RX-101-64x4d-DCN)2020-07-16
USB: Universal-Scale Object Detection Benchmark✓ Link24.8UniverseNet (R2-101-DCN)2021-03-25
YOLOX: Exceeding YOLO Series in 2021✓ Link20.62.48YOLOX-S2021-07-18
You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection✓ Link20.01.05YOLOS-B (ViT-B)2021-06-01
Dynamic Head: Unifying Object Detection Heads with Attentions✓ Link19.30.16DyHead (ResNet-50)2021-06-15
Hybrid Task Cascade for Instance Segmentation✓ Link19.10.08HTC (ResNet-50)2019-01-22
Deformable DETR: Deformable Transformers for End-to-End Object Detection✓ Link18.5-1.49Deformable-DETR (ResNet-50)2020-10-08
Cascade R-CNN: High Quality Object Detection and Instance Segmentation✓ Link18.20.02Cascade R-CNN (ResNet-50)2019-06-24
Mask R-CNN✓ Link17.1Mask R-CNN (ResNet-50)2017-03-20
End-to-End Object Detection with Transformers✓ Link17.1-1.82DETR (ResNet-50)2020-05-26
Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection✓ Link16.8-0.91ATSS (ResNet-50)2019-12-05
FCOS: Fully Convolutional One-Stage Object Detection✓ Link16.70.25FCOS (ResNet-50)2019-04-02
Focal Loss for Dense Object Detection✓ Link16.60.18RetinaNet (ResNet-50)2017-08-07
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks✓ Link16.4-0.41Faster R-CNN (ResNet-50-FPN)2015-06-04
YOLOv3: An Incremental Improvement✓ Link14.8-0.37YOLOv3 (DarkNet-53)2018-04-08
SSD: Single Shot MultiBox Detector✓ Link13.60.36SSD (VGG-16)2015-12-08
Exploring Plain Vision Transformer Backbones for Object Detection✓ Link7.89ViTDet (ViT-H)2022-03-30
USB: Universal-Scale Object Detection Benchmark✓ Link1.86UniverseNet (R2-101-DCN)2021-03-25
Mask R-CNN✓ Link-0.11Mask R-CNN (ResNet-50)2017-03-20