OpenCodePapers

domain-generalization-on-imagenet-sketch

Domain Generalization
Leaderboard
| Paper | Code | Top-1 accuracy (%) | Model name | Release date |
|---|---|---|---|---|
| Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time | ✓ Link | 77.18 | Model soups (BASIC-L) | 2022-03-10 |
| Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time | ✓ Link | 74.24 | Model soups (ViT-G/14) | 2022-03-10 |
| Context-Aware Robust Fine-Tuning | | 65.5 | CAR-FT (CLIP, ViT-L/14@336px) | 2022-11-29 |
| A ConvNet for the 2020s | ✓ Link | 55.0 | ConvNeXt-XL (Im21k, 384) | 2022-01-10 |
| MetaFormer Baselines for Vision | ✓ Link | 54.5 | CAFormer-B36 (IN21K, 384) | 2022-10-24 |
| A Whac-A-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others | ✓ Link | 53.39 | LLE (ViT-H/14, MAE, Edge Aug) | 2022-12-09 |
| MetaFormer Baselines for Vision | ✓ Link | 52.9 | ConvFormer-B36 (IN21K, 384) | 2022-10-24 |
| MetaFormer Baselines for Vision | ✓ Link | 52.8 | CAFormer-B36 (IN21K) | 2022-10-24 |
| MetaFormer Baselines for Vision | ✓ Link | 52.7 | ConvFormer-B36 (IN21K) | 2022-10-24 |
| Masked Autoencoders Are Scalable Vision Learners | ✓ Link | 50.9 | MAE (ViT-H, 448) | 2021-11-11 |
| Enhance the Visual Representation via Discrete Adversarial Training | ✓ Link | 50.03 | MAE+DAT (ViT-H) | 2022-09-16 |
| Generalized Parametric Contrastive Learning | ✓ Link | 48.3 | GPaCo (ViT-L) | 2022-09-26 |
| Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models | ✓ Link | 46.1 | Discrete Adversarial Distillation (ViT-B, 224) | 2023-11-02 |
| Pyramid Adversarial Training Improves ViT Performance | ✓ Link | 46.03 | Pyramid Adversarial Training Improves ViT (Im21k) | 2021-11-30 |
| Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision | ✓ Link | 45.6 | SEER (RegNet10B) | 2022-02-16 |
| Discrete Representations Strengthen Vision Transformer Robustness | ✓ Link | 44.72 | DrViT | 2021-11-20 |
| MetaFormer Baselines for Vision | ✓ Link | 42.5 | CAFormer-B36 | 2022-10-24 |
| Pyramid Adversarial Training Improves ViT Performance | ✓ Link | 41.04 | Pyramid Adversarial Training Improves ViT | 2021-11-30 |
| MetaFormer Baselines for Vision | ✓ Link | 39.5 | ConvFormer-B36 | 2022-10-24 |
| Sequencer: Deep LSTM for Image Classification | ✓ Link | 35.8 | Sequencer2D-L | 2022-05-04 |