Paper | Code | AP | ModelName | ReleaseDate |
---|---|---|---|---|
CP-DETR: Concept Prompt Guide DETR Toward Stronger Universal Object Detection | 73.1 | CP-DETR-L(only optimize prompt) | 2024-12-13 | |
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection | ✓ Link | 72.4 | Grounding DINO 1.5 Pro | 2024-05-16 |
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection | 72.1 | DetCLIPv3 | 2024-04-14 | |
Multi-modal Queried Object Detection in the Wild | ✓ Link | 71.3 | MQ-GLIP-L | 2023-05-30 |
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection | ✓ Link | 70.9 | Grounding DINO | 2023-03-09 |
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment | 70.4 | DetCLIPv2 | 2023-04-10 | |
GLIPv2: Unifying Localization and Vision-Language Understanding | ✓ Link | 70.4 | GLIPv2 | 2022-06-12 |
Grounded Language-Image Pre-training | ✓ Link | 68.9 | GLIP | 2021-12-07 |