Paper | Code | Accuracy | Model Name | Release Date |
---|---|---|---|---|
PaLI-X: On Scaling up a Multilingual Vision and Language Model | ✓ Link | 23.1 | PaLI-X | 2023-05-29 |
Open-domain Visual Entity Recognition: Towards Recognizing Millions of Wikipedia Entities | ✓ Link | 20.2 | PaLI (17B) | 2023-02-22 |
Retrieval-Enhanced Contrastive Vision-Text Models | | 12.6 | RECO | 2023-06-12 |
Open-domain Visual Entity Recognition: Towards Recognizing Millions of Wikipedia Entities | ✓ Link | 11.8 | PaLI (3B) | 2023-02-22 |
Open-domain Visual Entity Recognition: Towards Recognizing Millions of Wikipedia Entities | ✓ Link | 5.3 | CLIP2CLIP | 2023-02-22 |