OpenCodePapers

Self-Supervised Image Classification
Leaderboard
| Paper | Code | Top-1 Accuracy (%) | Top-5 Accuracy (%) | Number of Params | Model Name | Release Date |
|---|---|---|---|---|---|---|
| Vision Transformers Need Registers | ✓ | 87.1 | | 1100M | DINOv2+reg (ViT-g/14) | 2023-09-28 |
| DINOv2: Learning Robust Visual Features without Supervision | ✓ | 86.7 | | 1100M | DINOv2 (ViT-g/14 @448) | 2023-04-14 |
| DINOv2: Learning Robust Visual Features without Supervision | ✓ | 86.5 | | 1100M | DINOv2 (ViT-g/14) | 2023-04-14 |
| DINOv2: Learning Robust Visual Features without Supervision | ✓ | 86.3 | | 307M | DINOv2 distilled (ViT-L/14) | 2023-04-14 |
| MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations | ✓ | 84.7 | | 632M | MIM-Refiner (D2V2-ViT-H/14) | 2024-02-15 |
| MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations | ✓ | 84.5 | | 1890M | MIM-Refiner (MAE-ViT-2B/14) | 2024-02-15 |
| DINOv2: Learning Robust Visual Features without Supervision | ✓ | 84.5 | | 85M | DINOv2 distilled (ViT-B/14) | 2023-04-14 |
| MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations | ✓ | 83.7 | | 632M | MIM-Refiner (MAE-ViT-H/14) | 2024-02-15 |
| MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations | ✓ | 83.5 | | 307M | MIM-Refiner (D2V2-ViT-L/16) | 2024-02-15 |
| MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations | ✓ | 82.8 | | 307M | MIM-Refiner (MAE-ViT-L/16) | 2024-02-15 |
| iBOT: Image BERT Pre-Training with Online Tokenizer | ✓ | 82.3 | | 307M | iBOT (ViT-L/16) (IN22k) | 2021-11-15 |
| Contrastive Tuning: A Little Help to Make Masked Autoencoders Forget | ✓ | 82.2 | | 632M | MAE-CT (ViT-H/16) | 2023-04-20 |
| Mugs: A Multi-Granular Self-Supervised Learning Framework | ✓ | 82.1 | | 307M | Mugs (ViT-L/16) | 2022-03-27 |
| Contrastive Tuning: A Little Help to Make Masked Autoencoders Forget | ✓ | 81.5 | | 307M | MAE-CT (ViT-L/16) | 2023-04-20 |
| Efficient Self-supervised Vision Transformers for Representation Learning | ✓ | 81.3 | 95.5 | 87M | EsViT (Swin-B) | 2021-06-17 |
| iBOT: Image BERT Pre-Training with Online Tokenizer | ✓ | 81.3 | | 307M | iBOT (ViT-L/16) | 2021-11-15 |
| DINOv2: Learning Robust Visual Features without Supervision | ✓ | 81.1 | | 21M | DINOv2 distilled (ViT-S/14) | 2023-04-14 |
| An Empirical Study of Training Self-Supervised Vision Transformers | ✓ | 81.0 | | 304M | MoCo v3 (ViT-BN-L/7) | 2021-04-05 |
| Efficient Self-supervised Vision Transformers for Representation Learning | ✓ | 80.8 | | 49M | EsViT (Swin-S) | 2021-06-17 |
| Masked Siamese Networks for Label-Efficient Learning | ✓ | 80.7 | | 306M | MSN (ViT-L/7) | 2022-04-14 |
| Pushing the limits of self-supervised ResNets: Can we outperform supervised learning without labels on ImageNet? | ✓ | 80.6 | | 250M | ReLICv2 (ResNet-200 x2) | 2022-01-13 |
| Masked Reconstruction Contrastive Learning with Information Bottleneck Principle | | 80.4 | | | MR BarTwins (MR BarTwins) | 2022-11-15 |
| Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective | ✓ | 80.3 | | 732M | DiGIT | 2024-10-16 |
| DINO as a von Mises-Fisher mixture model | | 80.3 | | 85M | iBOT-vMF (ViT-B/16) | 2024-05-17 |
| Emerging Properties in Self-Supervised Vision Transformers | ✓ | 80.3 | | 84M | DINO (xcit_medium_24_p8) | 2021-04-29 |
| Perceptual Group Tokenizer: Building Perception with Iterative Grouping | | 80.3 | | 70M | PGT (PGT-B w/ Flow) | 2023-11-30 |
| Emerging Properties in Self-Supervised Vision Transformers | ✓ | 80.1 | | 80M | DINO (ViT-B/8) | 2021-04-29 |
| Big Self-Supervised Models are Strong Semi-Supervised Learners | ✓ | 79.8 | 94.9 | 795M | SimCLRv2 (ResNet-152 x3, SK) | 2020-06-17 |
| Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision | ✓ | 79.8 | | 10000M | SEERv2 | 2022-02-16 |
| Improving Visual Representation Learning through Perceptual Understanding | ✓ | 79.8 | | 80M | PercMAE (ViT-B, dVAE) | 2022-12-30 |
| Pushing the limits of self-supervised ResNets: Can we outperform supervised learning without labels on ImageNet? | ✓ | 79.8 | | 63M | ReLICv2 (ResNet200) | 2022-01-13 |
| Emerging Properties in Self-Supervised Vision Transformers | ✓ | 79.7 | | 21M | DINO (ViT-S/8) | 2021-04-29 |
| Bootstrap your own latent: A new approach to self-supervised Learning | ✓ | 79.6 | 94.8 | 250M | BYOL (ResNet-200 x2) | 2020-06-13 |
| Pushing the limits of self-supervised ResNets: Can we outperform supervised learning without labels on ImageNet? | ✓ | 79.4 | | 375M | ReLICv2 (ResNet-50 4x) | 2022-01-13 |
| Pushing the limits of self-supervised ResNets: Can we outperform supervised learning without labels on ImageNet? | ✓ | 79.3 | | 58M | ReLICv2 (ResNet152) | 2022-01-13 |
| Unsupervised Representation Learning by Balanced Self Attention Matching | ✓ | 79.3 | | | BAM (CAFormer-M36) | 2024-08-04 |
| An Empirical Study of Training Self-Supervised Vision Transformers | ✓ | 79.1 | | 700M | MoCo v3 (ViT-BN-H) | 2021-04-05 |
| Unicom: Universal and Compact Representation Learning for Image Retrieval | ✓ | 79.1 | | 80M | Unicom (ViT-B/16) | 2023-04-12 |
| Unsupervised Visual Representation Learning by Synchronous Momentum Grouping | ✓ | 79.0 | 94.4 | 375M | SMoG (ResNet-50 x4) | 2022-07-13 |
| Pushing the limits of self-supervised ResNets: Can we outperform supervised learning without labels on ImageNet? | ✓ | 79 | | 94M | ReLICv2 (ResNet-50 x2) | 2022-01-13 |
| Compressive Visual Representations | ✓ | 78.8 | 94.5 | 94M | C-BYOL (ResNet-50 2x, 1000 epochs) | 2021-09-27 |
| DINO as a von Mises-Fisher mixture model | | 78.8 | | 85M | DINO-vMF (ViT-B/16) | 2024-05-17 |
| Pushing the limits of self-supervised ResNets: Can we outperform supervised learning without labels on ImageNet? | ✓ | 78.7 | | 44M | ReLICv2 (ResNet101) | 2022-01-13 |
| Bootstrap your own latent: A new approach to self-supervised Learning | ✓ | 78.6 | 94.2 | 375M | BYOL (ResNet-50 x4) | 2020-06-13 |
| Unsupervised Learning of Visual Features by Contrasting Cluster Assignments | ✓ | 78.5 | | 586M | SwAV (ResNet-50 x5) | 2020-06-17 |
| Emerging Properties in Self-Supervised Vision Transformers | ✓ | 78.2 | | 85M | DINO (ViT-B/16) | 2021-04-29 |
| An Empirical Study of Training Self-Supervised Vision Transformers | ✓ | 78.1 | | 632M | MoCo v3 (ViT-H) | 2021-04-05 |
| Improving Visual Representation Learning through Perceptual Understanding | ✓ | 78.1 | | 80M | PercMAE (ViT-B) | 2022-12-30 |
| Unsupervised Representation Learning by Balanced Self Attention Matching | ✓ | 78.1 | | 80M | BAM (ViT-B/16) | 2024-08-04 |
| Unsupervised Visual Representation Learning by Synchronous Momentum Grouping | ✓ | 78.0 | 93.9 | 94M | SMoG (ResNet-50 x2) | 2022-07-13 |
| An Empirical Study of Training Self-Supervised Vision Transformers | ✓ | 77.6 | | 307M | MoCo v3 (ViT-L) | 2021-04-05 |
| Self-supervised Pretraining of Visual Features in the Wild | ✓ | 77.5 | | 1300M | SEER | 2021-03-02 |
| Bootstrap your own latent: A new approach to self-supervised Learning | ✓ | 77.4 | 93.6 | 94M | BYOL (ResNet-50 x2) | 2020-06-13 |
| Unsupervised Learning of Visual Features by Contrasting Cluster Assignments | ✓ | 77.3 | | 94M | SwAV (ResNet-50 x2) | 2020-06-17 |
| Pushing the limits of self-supervised ResNets: Can we outperform supervised learning without labels on ImageNet? | ✓ | 77.1 | | 25M | ReLICv2 (ResNet-50) | 2022-01-13 |
| Emerging Properties in Self-Supervised Vision Transformers | ✓ | 77.0 | | 21M | DINO (ViT-S/16) | 2021-04-29 |
| DINO as a von Mises-Fisher mixture model | | 77.0 | | 21M | DINO-vMF (ViT-S/16) | 2024-05-17 |
| An Empirical Study of Training Self-Supervised Vision Transformers | ✓ | 76.7 | | 86M | MoCo v3 (ViT-B/16) | 2021-04-05 |
| Masked Autoencoders Are Scalable Vision Learners | ✓ | 76.6 | | 700M | MAE (ViT-H) | 2021-11-11 |
| A Simple Framework for Contrastive Learning of Visual Representations | ✓ | 76.5 | 93.2 | 375M | SimCLR (ResNet-50 4x) | 2020-02-13 |
| Unsupervised Visual Representation Learning by Online Constrained K-Means | ✓ | 76.4 | | 25M | CoKe (ResNet-50) | 2021-05-24 |
| Unsupervised Visual Representation Learning by Synchronous Momentum Grouping | ✓ | 76.4 | | 25M | SMoG (ResNet-50) | 2022-07-13 |
| Weak Augmentation Guided Relational Self-Supervised Learning | ✓ | 76.3 | | 24M | ReSSL (ResNet-50 w/ Predictor and Stronger Aug) | 2022-03-16 |
| Weak Augmentation Guided Relational Self-Supervised Learning | ✓ | 76.0 | | 24M | ReSSL (ResNet-50 w/ Predictor) | 2022-03-16 |
| Solving Inefficiency of Self-supervised Representation Learning | ✓ | 75.9 | | 23.56M | Triplet (ResNet-50) | 2021-04-18 |
| Masked Autoencoders Are Scalable Vision Learners | ✓ | 75.8 | | 306M | MAE (ViT-L) | 2021-11-11 |
| Divide and Contrast: Self-supervised Learning from Uncurated Data | | 75.8 | | 24M | DnC (ResNet-50) | 2021-05-17 |
| CaCo: Both Positive and Negative Samples are Directly Learnable via Cooperative-adversarial Contrastive Learning | ✓ | 75.7 | | 24M | CaCo (ResNet-50) | 2022-03-27 |
| Big Self-Supervised Models are Strong Semi-Supervised Learners | ✓ | 75.6 | 92.7 | 94M | SimCLRv2 (ResNet-50 x2) | 2020-06-17 |
| Compressive Visual Representations | ✓ | 75.6 | 92.7 | 25M | C-BYOL (ResNet-50, 1000 epochs) | 2021-09-27 |
| With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of Visual Representations | ✓ | 75.6 | 92.4 | 25M | NNCLR (ResNet-50, multi-crop) | 2021-04-29 |
| Self-supervised Pre-training with Hard Examples Improves Visual Representations | | 75.5 | | 24M | HEXA | 2020-12-25 |
| Similarity Contrastive Estimation for Self-Supervised Soft Contrastive Learning | ✓ | 75.4 | | 24M | SCE (ResNet-50, multi-crop) | 2021-11-29 |
| Unsupervised Learning of Visual Features by Contrasting Cluster Assignments | ✓ | 75.3 | | 24M | SwAV (ResNet-50) | 2020-06-17 |
| Emerging Properties in Self-Supervised Vision Transformers | ✓ | 75.3 | | 24M | DINO (ResNet-50) | 2021-04-29 |
| What Makes for Good Views for Contrastive Learning? | ✓ | 75.2 | | 120M | InfoMin (ResNeXt-152) | 2020-05-20 |
| Unsupervised Learning of Visual Features by Contrasting Cluster Assignments | ✓ | 75.2 | | 24M | DeepCluster-v2 (ResNet-50) | 2020-06-17 |
| Unicom: Universal and Compact Representation Learning for Image Retrieval | ✓ | 75.0 | | 80M | Unicom (ViT-B/32) | 2023-04-12 |
| Self-Supervised Learning with Swin Transformers | ✓ | 75 | | 29M | MoBY (Swin-T) | 2021-05-10 |
| Representation Learning via Invariant Causal Mechanisms | ✓ | 74.8 | | 24M | ReLIC (ResNet-50) | 2020-10-15 |
| ReSSL: Relational Self-Supervised Learning with Weak Augmentation | ✓ | 74.7 | 92.3 | 24M | ReSSL (ResNet-50) 200ep | 2021-07-20 |
| Weakly Supervised Contrastive Learning | ✓ | 74.7 | | 24M | WCL (ResNet-50) | 2021-10-10 |
| MV-MR: multi-views and multi-representations for self-supervised learning and knowledge distillation | ✓ | 74.5 | 92.1 | | MV-MR | 2023-03-21 |
| Boosting Contrastive Self-Supervised Learning with False Negative Cancellation | ✓ | 74.4 | 91.8 | 24M | FNC (ResNet-50) | 2020-11-23 |
| Bootstrap your own latent: A new approach to self-supervised Learning | ✓ | 74.3 | 91.6 | 24M | BYOL (ResNet-50) | 2020-06-13 |
| A Simple Framework for Contrastive Learning of Visual Representations | ✓ | 74.2 | 92.0 | 94M | SimCLR (ResNet-50 2x) | 2020-02-13 |
| Self-Supervised Classification Network | ✓ | 74.2 | | 24M | Self-Classifier (ResNet-50) | 2021-03-19 |
| Learning by Sorting: Self-supervised Learning with Group Ordering Constraints | ✓ | 73.9 | 91.6 | 25M | GroCo (ResNet-50) | 2023-01-05 |
| OBoW: Online Bag-of-Visual-Words Generation for Self-Supervised Learning | ✓ | 73.8 | 92.2 | 24M | OBoW (ResNet-50) | 2020-12-21 |
| VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised Learning | ✓ | 73.2 | 91.1 | 24M | VICReg (ResNet50) | 2021-05-11 |
| Barlow Twins: Self-Supervised Learning via Redundancy Reduction | ✓ | 73.2 | 91 | 24M | Barlow Twins (ResNet-50) | 2021-03-04 |
| What Makes for Good Views for Contrastive Learning? | ✓ | 73.0 | 91.1 | 24M | InfoMin (ResNet-50) | 2020-05-20 |
| ResMLP: Feedforward networks for image classification with data-efficient training | ✓ | 72.8 | | 30M | DINO (ResMLP-24) | 2021-05-07 |
| Self-Supervised Learning with Swin Transformers | ✓ | 72.8 | | 22M | MoBY (DeiT-S) | 2021-05-10 |
| VNE: An Effective Method for Improving Deep Representation by Manipulating Eigenvalue Distribution | ✓ | 72.1 | 91.0 | 25M | I-VNE+ (ResNet-50) | 2023-04-04 |
| Generative Pretraining from Pixels | ✓ | 72.0 | | 6801M | iGPT-XL (64x64, 15360 features) | 2020-07-17 |
| Big Self-Supervised Models are Strong Semi-Supervised Learners | ✓ | 71.7 | 90.4 | 24M | SimCLRv2 (ResNet-50) | 2020-06-17 |
| Data-Efficient Image Recognition with Contrastive Predictive Coding | ✓ | 71.5 | 90.1 | 305M | CPC v2 (ResNet-161) (arxiv v2) | 2019-05-22 |
| Exploring Simple Siamese Representation Learning | ✓ | 71.3 | | 24M | SimSiam (ResNet-50) | 2020-11-20 |
| Improved Baselines with Momentum Contrastive Learning | ✓ | 71.1 | 90.1 | 24M | MoCo v2 (ResNet-50) | 2020-03-09 |
| SynCo: Synthetic Hard Negatives in Contrastive Learning for Better Unsupervised Visual Representations | ✓ | 70.6 | 89.8 | 24M | SynCo (ResNet-50) 800ep | 2024-10-03 |
| Contrastive Multiview Coding | ✓ | 70.6 | 89.7 | 188M | CMC (ResNet-50 x2) (arxiv v5) | 2019-06-13 |
| A Simple Framework for Contrastive Learning of Visual Representations | ✓ | 69.3 | 89.0 | 24M | SimCLR (ResNet-50) | 2020-02-13 |
| Generative Pretraining from Pixels | ✓ | 68.7 | | 6800M | iGPT-XL (64x64, 3072 features) | 2020-07-17 |
| Momentum Contrast for Unsupervised Visual Representation Learning | ✓ | 68.6 | | 375M | MoCo (ResNet-50 4x) | 2019-11-13 |
| Learning Representations by Maximizing Mutual Information Across Views | ✓ | 68.1 | | 626M | AMDIM (large) (arxiv v2) | 2019-06-03 |
| Masked Autoencoders Are Scalable Vision Learners | ✓ | 68.0 | | 80M | MAE (ViT-B) | 2021-11-11 |
| SynCo: Synthetic Hard Negatives in Contrastive Learning for Better Unsupervised Visual Representations | ✓ | 67.9 | 88 | 24M | SynCo (ResNet-50) 200ep | 2024-10-03 |
| ResMLP: Feedforward networks for image classification with data-efficient training | ✓ | 67.5 | | 15M | DINO (ResMLP-12) | 2021-05-07 |
| Contrastive Multiview Coding | ✓ | 66.2 | 87.0 | 47M | CMC (ResNet-50) (arxiv v5) | 2019-06-13 |
| Prototypical Contrastive Learning of Unsupervised Representations | ✓ | 65.9 | | 25M | PCL (ResNet-50) | 2020-05-11 |
| Momentum Contrast for Unsupervised Visual Representation Learning | ✓ | 65.4 | | 94M | MoCo (ResNet-50 2x) | 2019-11-13 |
| Generative Pretraining from Pixels | ✓ | 65.2 | | 1400M | iGPT-L (48x48) | 2020-07-17 |
| Contrastive Multiview Coding | ✓ | 65.0 | 86.0 | | CMC (ResNet-101) (arxiv v3) | 2019-06-13 |
| Data-Efficient Image Recognition with Contrastive Predictive Coding | ✓ | 63.8 | 85.3 | 24M | CPC v2 (ResNet-50) (arxiv v2) | 2019-05-22 |
| Max-Margin Contrastive Learning | ✓ | 63.8 | | | MMCL (100 epoch, 256 batch size) | 2021-12-21 |
| Self-Supervised Learning of Pretext-Invariant Representations | ✓ | 63.6 | | 24M | PIRL | 2019-12-04 |
| Learning Representations by Maximizing Mutual Information Across Views | ✓ | 63.5 | | 194M | AMDIM (small) (arxiv v2) | 2019-06-03 |
| Self-labelling via simultaneous clustering and representation learning | ✓ | 61.5 | 84.0 | 24M | SeLa (ResNet50) (arxiv v3) | 2019-11-13 |
| Large Scale Adversarial Representation Learning | ✓ | 61.3 | 81.9 | 86M | BigBiGAN (RevNet-50 ×4, BN+CReLU) | 2019-07-04 |
| Data-Efficient Image Recognition with Contrastive Predictive Coding | ✓ | 61.0 | 83.0 | 305M | CPC v2 (ResNet-161) (arxiv v1) | 2019-05-22 |
| Large Scale Adversarial Representation Learning | ✓ | 60.8 | 81.4 | 86M | BigBiGAN (RevNet-50 ×4) | 2019-07-04 |
| Momentum Contrast for Unsupervised Visual Representation Learning | ✓ | 60.6 | | 24M | MoCo (ResNet-50) | 2019-11-13 |
| Generative Pretraining from Pixels | ✓ | 60.3 | | 1400M | iGPT-L (32x32) | 2020-07-17 |
| Learning Representations by Maximizing Mutual Information Across Views | ✓ | 60.2 | | 337M | AMDIM (arxiv v1) | 2019-06-03 |
| Local Aggregation for Unsupervised Learning of Visual Embeddings | ✓ | 60.2 | | 24M | LocalAgg (ResNet-50) | 2019-03-29 |
| Contrastive Multiview Coding | ✓ | 60.1 | 82.8 | 44M | CMC (ResNet-101) | 2019-06-13 |
| Large Scale Adversarial Representation Learning | ✓ | 56.6 | 78.6 | 24M | BigBiGAN (ResNet-50, BN+CReLU) | 2019-07-04 |
| Self-labelling via simultaneous clustering and representation learning | ✓ | 55.7 | 79.5 | 24M | SeLa (ResNet50) | 2019-11-13 |
| Revisiting Self-Supervised Visual Representation Learning | ✓ | 55.4 | 77.9 | 86M | Revisited Rotation (RevNet-50 ×4) | 2019-01-25 |
| Large Scale Adversarial Representation Learning | ✓ | 55.4 | 77.4 | 25M | BigBiGAN (ResNet-50) | 2019-07-04 |
| Revisiting Self-Supervised Visual Representation Learning | ✓ | 51.4 | 74.0 | 94M | Revisited Rel.Patch.Loc (ResNet50 ×2) | 2019-01-25 |
| Self-labelling via simultaneous clustering and representation learning | ✓ | 50.0 | | 61M | SeLa (AlexNet) (arxiv v3) | 2019-11-13 |
| Representation Learning with Contrastive Predictive Coding | ✓ | 48.7 | 73.6 | 44M | CPC (ResNet-101 V2) | 2018-07-10 |
| Revisiting Self-Supervised Visual Representation Learning | ✓ | 46.0 | 68.8 | 211M | Revisited Exemplar (ResNet-50 ×3) | 2019-01-25 |
| Revisiting Self-Supervised Visual Representation Learning | ✓ | 44.6 | 68.0 | 94M | Revisited Jigsaw (ResNet50 ×2) | 2019-01-25 |
| Contrastive Multiview Coding | ✓ | 42.6 | | 30M | CMC (Alexnet/2) | 2019-06-13 |
| Deep Clustering for Unsupervised Learning of Visual Features | ✓ | 41.0 | | 61M | DeepCluster (AlexNet) | 2018-07-15 |
| Multi-task Self-Supervised Visual Learning | | 39.6 | 62.5 | 44M | Colorisation (improved) (ResNet-101) | 2017-08-25 |
| Unsupervised Representation Learning by Predicting Image Rotations | ✓ | 38.7 | | 86M | Rotation (AlexNet) | 2018-03-21 |
| Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction | ✓ | 35.4 | | 61M | Split-Brain (AlexNet) | 2016-11-29 |
| Representation Learning by Learning to Count | ✓ | 34.3 | | 61M | Counting (AlexNet) | 2017-08-22 |
| Colorful Image Colorization | ✓ | 32.6 | | 61M | Colorization (AlexNet) | 2016-03-28 |
| Multi-task Self-Supervised Visual Learning | | | 70.2 | 44M | Multi-task SSL (ResNet-101) | 2017-08-25 |
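
Because the leaderboard reports both Top-1 accuracy and parameter count, one natural way to read it is to ask which entry is strongest under a given parameter budget. The following is a minimal sketch of that query in Python, using a handful of rows copied from the leaderboard above. The `Entry` type, the `best_under` helper, and the chosen budgets are illustrative assumptions, not part of the leaderboard or of any tooling behind it.

```python
from typing import NamedTuple, Optional, Sequence


class Entry(NamedTuple):
    """One leaderboard row (hypothetical container, for illustration only)."""
    model: str
    top1: float      # Top-1 accuracy in percent
    params_m: float  # parameter count in millions


# A few rows copied from the leaderboard; this is not the full table.
ENTRIES = [
    Entry("DINOv2 (ViT-g/14)", 86.5, 1100),
    Entry("DINOv2 distilled (ViT-L/14)", 86.3, 307),
    Entry("DINOv2 distilled (ViT-B/14)", 84.5, 85),
    Entry("DINOv2 distilled (ViT-S/14)", 81.1, 21),
    Entry("DINO (ViT-S/8)", 79.7, 21),
    Entry("ReLICv2 (ResNet-50)", 77.1, 25),
    Entry("SimCLR (ResNet-50)", 69.3, 24),
]


def best_under(entries: Sequence[Entry], max_params_m: float) -> Optional[Entry]:
    """Return the highest Top-1 entry with at most max_params_m million params."""
    eligible = [e for e in entries if e.params_m <= max_params_m]
    # default=None covers the case where no entry fits the budget.
    return max(eligible, key=lambda e: e.top1, default=None)


if __name__ == "__main__":
    for budget in (25, 100, 500):
        best = best_under(ENTRIES, budget)
        print(f"<= {budget}M params: {best.model} ({best.top1}%)")
```

Rows with a missing parameter count (for example BAM (CAFormer-M36)) would need to be skipped or handled explicitly before running such a query over the full table.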