Paper | Code | Top 1 Accuracy | ModelName | ReleaseDate |
---|---|---|---|---|
Revisiting Weakly Supervised Pre-Training of Visual Perception Models | ✓ Link | 60.7 | SWAG (ViT H/14) | 2022-01-20 |
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles | ✓ Link | 60.6 | Hiera-H (448px) | 2023-06-01 |
Masked Autoencoders Are Scalable Vision Learners | ✓ Link | 60.3 | MAE (ViT-H, 448) | 2021-11-11 |
WaveMix: A Resource-efficient Neural Network for Image Analysis | ✓ Link | 56.45 | WaveMix-240/12 (level 4) | 2022-05-28 |