OpenCodePapers

zero-shot-transfer-image-classification-on-6

Zero-Shot Transfer Image Classification
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracy (Private)Accuracy (Public)Top 5 AccuracyModelNameReleaseDate
Scaling Vision Transformers to 22 Billion Parameters✓ Link87.6LiT-22B2023-02-10
PaLI: A Jointly-Scaled Multilingual Language-Image Model✓ Link84.9LiT ViT-e2022-09-14
CoCa: Contrastive Captioners are Image-Text Foundation Models✓ Link82.7CoCa2022-05-04
EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters✓ Link82.2EVA-CLIP-18B2024-02-06
LiT: Zero-Shot Transfer with Locked-image text Tuning✓ Link81.1 54.5LiT-tuning2021-11-15
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks✓ Link80.6InternVL-C2023-12-21
EVA-CLIP: Improved Training Techniques for CLIP at Scale✓ Link79.6EVA-CLIP-E/14+2023-03-27
Learning Transferable Visual Models From Natural Language Supervision✓ Link72.3-CLIP2021-02-26
PaLI: A Jointly-Scaled Multilingual Language-Image Model✓ Link42.6258.35PaLI2022-09-14