Paper | Code | Top 1 Accuracy | ModelName | ReleaseDate |
---|---|---|---|---|
The effectiveness of MAE pre-pretraining for billion-scale pretraining | ✓ Link | 96.2 | MAWS (ViT-2B) | 2023-03-23 |
EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters | ✓ Link | 95.8 | EVA-CLIP-18B | 2024-02-06 |
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks | ✓ Link | 95.3 | InternVL-C | 2023-12-21 |
EVA-CLIP: Improved Training Techniques for CLIP at Scale | ✓ Link | 94.9 | EVA-CLIP-E/14+ | 2023-03-27 |
Your Diffusion Model is Secretly a Zero-Shot Classifier | ✓ Link | 77.7 | Diffusion Classifier (zero-shot) | 2023-03-28 |