OpenCodePapers

open-vocabulary-attribute-detection-on-ovad-1

Object DetectionOpen Vocabulary Object DetectionOpen Vocabulary Attribute Detection
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodemean average precisionModelNameReleaseDate
Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts✓ Link28.0X-VLM2021-11-16
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models✓ Link25.5BLIP 2 (pretrained)2023-01-30
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation✓ Link24.3BLIP2022-01-28
Open-vocabulary Attribute Detection✓ Link21.4OVAD-Baseline-Box2022-11-23
Align before Fuse: Vision and Language Representation Learning with Momentum Distillation✓ Link21.0 ALBEF2021-07-16
Reproducible scaling laws for contrastive language-image learning✓ Link17.0Open CLIP ViT-B322022-12-14
Learning Transferable Visual Models From Natural Language Supervision✓ Link16.6CLIP VIT-B162021-02-26