OpenCodePapers

zero-shot-transfer-image-classification-on-3

Zero-Shot Transfer Image Classification
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracy (Private)Accuracy (Public)ModelNameReleaseDate
[]()81.2BASIC (Lion)
Scaling Vision Transformers to 22 Billion Parameters✓ Link80.9LiT-22B2023-02-10
CoCa: Contrastive Captioners are Image-Text Foundation Models✓ Link80.7CoCa2022-05-04
Combined Scaling for Zero-shot Transfer Learning80.6BASIC2021-11-19
PaLI: A Jointly-Scaled Multilingual Language-Image Model✓ Link80.6LiT ViT-e2022-09-14
LiT: Zero-Shot Transfer with Locked-image text Tuning✓ Link78.7 66.6LiT-tuning2021-11-15
EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters✓ Link77.9EVA-CLIP-18B2024-02-06
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks✓ Link77.3InternVL-C2023-12-21
EVA-CLIP: Improved Training Techniques for CLIP at Scale✓ Link75.7EVA-CLIP-E/14+2023-03-27
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision✓ Link 70.1-ALIGN2021-02-11
Learning Transferable Visual Models From Natural Language Supervision✓ Link70.1-CLIP2021-02-26
AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities✓ Link68.1AltCLIP2022-11-12
PaLI: A Jointly-Scaled Multilingual Language-Image Model✓ Link64.46PaLI2022-09-14