Paper | Code | Accuracy | Model Name | Release Date |
---|---|---|---|---|
EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters | ✓ Link | 77.7 | EVA-CLIP-18B | 2024-02-06 |
Learning Transferable Visual Models From Natural Language Supervision | ✓ Link | 58.5 | CLIP | 2021-02-26 |
Learning Visual N-Grams from Web Data | | 23.0 | Visual N-Grams | 2016-12-29 |