Paper | Code | box AP | ModelName | ReleaseDate |
---|---|---|---|---|
DETRs with Collaborative Hybrid Assignments Training | ✓ Link | 72.0 | Co-DETR (single-scale) | 2022-11-22 |
CP-DETR: Concept Prompt Guide DETR Toward Stronger Universal Object Detection | 69.2 | CP-DETR-L Swin-L(with chunk) | 2024-12-13 | |
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection | ✓ Link | 68.1 | Grounding DINO 1.5 Pro | 2024-05-16 |
Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information | ✓ Link | 65.8 | M3I Pre-training (InternImage-H, single-scale) | 2022-11-17 |
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions | ✓ Link | 65.8 | InternImage-H | 2022-11-10 |
GLIPv2: Unifying Localization and Vision-Language Understanding | ✓ Link | 59.8 | GLIPv2 | 2022-06-12 |