OpenCodePapers

Self-Supervised Image Classification
Leaderboard
Paper | Code | Top 1 Accuracy | Number of Params | Model Name | Release Date
DINOv2: Learning Robust Visual Features without Supervision | ✓ Link | 88.9% | 1100M | DINOv2 (ViT-g/14, 448) | 2023-04-14
Improving Visual Representation Learning through Perceptual Understanding | ✓ Link | 88.6% | 307M | PercMAE (ViT-L, dVAE) | 2022-12-30
DINOv2: Learning Robust Visual Features without Supervision | ✓ Link | 88.5% | 1100M | DINOv2 (ViT-g/14) | 2023-04-14
PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers | ✓ Link | 88.3% | 632M | PeCo (ViT-H/14, 448) | 2021-11-24
Improving Visual Representation Learning through Perceptual Understanding | ✓ Link | 88.1% | 307M | PercMAE (ViT-L) | 2022-12-30
Exploring Target Representations for Masked Autoencoders | ✓ Link | 88.0% | 632M | dBOT (ViT-H/14) | 2022-09-08
Masked Autoencoders Are Scalable Vision Learners | ✓ Link | 87.8% | 632M | MAE (ViT-H/14, 448) | 2021-11-11
iBOT: Image BERT Pre-Training with Online Tokenizer | ✓ Link | 87.8% | 307M | iBOT (ViT-L/16, 512) | 2021-11-15
Masking meets Supervision: A Strong Learning Alliance | ✓ Link | 87.2% | 632M | MAE + AugSub finetune (ViT-H/14) | 2023-06-20
SimMIM: A Simple Framework for Masked Image Modeling | ✓ Link | 87.1% | 658M | SimMIM (SwinV2-H, 512) | 2021-11-18
Masked Autoencoders Are Scalable Vision Learners | ✓ Link | 86.9% |  | MAE (ViT-H/14) | 2021-11-11
iBOT: Image BERT Pre-Training with Online Tokenizer | ✓ Link | 86.6% | 307M | iBOT (ViT-L/16) | 2021-11-15
Towards Sustainable Self-supervised Learning | ✓ Link | 86.5% |  | TEC_MAE (ViT-L/16, 224) | 2022-10-20
BEiT: BERT Pre-Training of Image Transformers | ✓ Link | 86.3% | 307M | BEiT-L (ViT) | 2021-06-15
Context Autoencoder for Self-Supervised Representation Learning | ✓ Link | 86.3% | 307M | CAE (ViT-L/16) | 2022-02-07
Masked Image Residual Learning for Scaling Deeper Vision Transformers | ✓ Link | 86.2% | 341M | MIRL (ViT-B-48) | 2023-09-25
Masking meets Supervision: A Strong Learning Alliance | ✓ Link | 86.1% | 304M | MAE + AugSub finetune (ViT-L/16) | 2023-06-20
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling | ✓ Link | 86.0% | 198M | SparK (ConvNeXt-Large, 384) | 2023-01-09
Bootstrapped Masked Autoencoders for Vision BERT Pretraining | ✓ Link | 85.9% | 307M | BootMAE (ViT-L) | 2022-07-14
Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision | ✓ Link | 85.8% | 10000M | SEER (Regnet10B) | 2022-02-16
Masked Feature Prediction for Self-Supervised Visual Pre-Training | ✓ Link | 85.7% | 307M | MaskFeat (ViT-L) | 2021-12-16
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework | ✓ Link | 85.6% | 473M | OFA (Large) | 2022-02-07
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling | ✓ Link | 85.4% | 198M | SparK (ConvNeXt-Large) | 2023-01-09
SimMIM: A Simple Framework for Masked Image Modeling | ✓ Link | 85.4% | 197M | SimMIM (Swin-L) | 2021-11-18
Mugs: A Multi-Granular Self-Supervised Learning Framework | ✓ Link | 85.2% | 307M | Mugs (ViT-L/16) | 2022-03-27
iBOT: Image BERT Pre-Training with Online Tokenizer | ✓ Link | 84.8% | 307M | iBOT (ViT-L/16) | 2021-11-15
Masked Image Residual Learning for Scaling Deeper Vision Transformers | ✓ Link | 84.8% | 96M | MIRL (ViT-S-54) | 2023-09-25
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling | ✓ Link | 84.8% | 89M | ConvNeXt-Base (SparK pre-training) | 2023-01-09
BEiT: BERT Pre-Training of Image Transformers | ✓ Link | 84.6% | 86M | BEiT-B (ViT) | 2021-06-15
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN | ✓ Link | 84.5% |  | A2MIM+ (ViT-B) | 2022-05-27
iBOT: Image BERT Pre-Training with Online Tokenizer | ✓ Link | 84.4% | 85M | iBOT (ViT-B/16) | 2021-11-15
Mugs: A Multi-Granular Self-Supervised Learning Framework | ✓ Link | 84.3% | 85M | Mugs (ViT-B/16) | 2022-03-27
Self-supervised Pretraining of Visual Features in the Wild | ✓ Link | 84.2% | 1.3B | SEER (RegNetY-256GF) | 2021-03-02
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN | ✓ Link | 84.2% |  | A2MIM (ViT-B) | 2022-05-27
An Empirical Study of Training Self-Supervised Vision Transformers | ✓ Link | 84.1% | 304M | MoCo v3 (ViT-L/16) | 2021-04-05
mc-BEiT: Multi-choice Discretization for Image BERT Pre-training | ✓ Link | 84.1% | 86M | mc-BEiT (ViT-B/16) | 2022-03-29
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling | ✓ Link | 84.1% | 50M | ConvNeXt-Small (SparK pre-training) | 2023-01-09
SimMIM: A Simple Framework for Masked Image Modeling | ✓ Link | 84.0% | 88M | SimMIM (Swin-B) | 2021-11-18
iBOT: Image BERT Pre-Training with Online Tokenizer | ✓ Link | 84.0% | 85M | iBOT (ViT-B/16) | 2021-11-15
Efficient Self-supervised Vision Transformers for Representation Learning | ✓ Link | 83.9% | 87M | EsViT (Swin-B) | 2021-06-17
Masking meets Supervision: A Strong Learning Alliance | ✓ Link | 83.9% | 87M | MAE + AugSub finetune (ViT-B/16) | 2023-06-20
Self-supervised Pretraining of Visual Features in the Wild | ✓ Link | 83.8% | 693M | SEER (RegNetY-128GF) | 2021-03-02
SimMIM: A Simple Framework for Masked Image Modeling | ✓ Link | 83.8% | 85M | SimMIM (ViT-B/16) | 2021-11-18
An Empirical Study of Training Self-Supervised Vision Transformers | ✓ Link | 83.2% | 86M | MoCo v3 (ViT-B/16) | 2021-04-05
Multiplexed Immunofluorescence Brain Image Analysis Using Self-Supervised Dual-Loss Adaptive Masked Autoencoder | ✓ Link | 83.2% |  | DAMA (ViT-B/16) | 2022-05-10
Big Self-Supervised Models are Strong Semi-Supervised Learners | ✓ Link | 83.1% | 795M | SimCLRv2 (ResNet-152, 3×+SK) | 2020-06-17
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling | ✓ Link | 83.1% | 65M | ResNet-200 (SparK pre-training) | 2023-01-09
Emerging Properties in Self-Supervised Vision Transformers | ✓ Link | 82.8% | 85M | DINO (ViT-B/16) | 2021-04-29
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling | ✓ Link | 82.7% | 60M | ResNet-152 (SparK pre-training) | 2023-01-09
Mugs: A Multi-Granular Self-Supervised Learning Framework | ✓ Link | 82.6% | 21M | Mugs (ViT-S/16) | 2022-03-27
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN | ✓ Link | 82.4% |  | A2MIM+ (ViT-S) | 2022-05-27
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling | ✓ Link | 82.2% | 44M | ResNet-101 (SparK pre-training) | 2023-01-09
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN | ✓ Link | 82.2% |  | A2MIM (ViT-S) | 2022-05-27
Unsupervised Learning of Visual Features by Contrasting Cluster Assignments | ✓ Link | 82.0% | 193M | SwAV (ResNeXt-101-32x16d) | 2020-06-17
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling | ✓ Link | 80.6% | 26M | ResNet-50 (SparK pre-training) | 2023-01-09
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN | ✓ Link | 80.5% |  | A2MIM+ (ResNet-50 RSB-A2) | 2022-05-27
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN | ✓ Link | 80.4% |  | A2MIM (ResNet-50 RSB-A2) | 2022-05-27
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN | ✓ Link | 78.9% |  | A2MIM+ (ResNet-50 RSB-A3) | 2022-05-27
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN | ✓ Link | 78.8% |  | A2MIM (ResNet-50 RSB-A3) | 2022-05-27
Divide and Contrast: Self-supervised Learning from Uncurated Data |  | 78.2% |  | DnC (Resnet-50) | 2021-05-17
Unsupervised Learning of Visual Features by Contrasting Cluster Assignments | ✓ Link | 77.8% | 182M | SwAV (Resnet-50) | 2020-06-17
Momentum Contrast for Unsupervised Visual Representation Learning | ✓ Link | 77.3% |  | MoCo (Resnet-50) | 2019-11-13
A Simple Framework for Contrastive Learning of Visual Representations | ✓ Link | 77.2% |  | SimCLR (Resnet-50) | 2020-02-13
Momentum Contrast for Unsupervised Visual Representation Learning | ✓ Link | 77.0% |  | MoCo (Resnet-50) | 2019-11-13
Unsupervised Pre-Training of Image Features on Non-Curated Data | ✓ Link | 74.9% | 138M | DeeperCluster (VGG16) | 2019-05-03