OpenCodePapers

image-generation-on-imagenet-512x512

Image Generation
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeFIDNFEInception scoreModelNameReleaseDate
Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator✓ Link1.2150EDM2-L + DDO (SD-VAE, 25 steps, DPM-Solver-v3)2025-03-03
Unified Continuous Generative Models✓ Link1.24300DDT-XL/2 + UCGM-S (SD-VAE + 150 sampling steps + CFG)2025-05-12
Unified Continuous Generative Models✓ Link1.25200DDT-XL/2 + UCGM-S (SD-VAE + 100 sampling steps + CFG)2025-05-12
Guiding a Diffusion Model with a Bad Version of Itself✓ Link1.25EDM2-XXL Autoguidance2024-06-04
DDT: Decoupled Diffusion Transformer✓ Link1.28500305DDT-XL/2(22en6de 675M + guidance interval )2025-04-08
Guiding a Diffusion Model with a Bad Version of Itself✓ Link1.34EDM2- S Autoguidance (XS, T /16)2024-06-04
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step✓ Link1.3661SiDA-EDM2-XXL (1.5B)2024-10-19
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step✓ Link1.3791SiDA-EDM2-XL (1.1B)2024-10-19
Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models✓ Link1.40EDM2-XXL w/ guidance interval2024-04-11
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step✓ Link1.4131SiDA-EDM2-L (777M)2024-10-19
Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion1.48SiD22024-10-25
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step✓ Link1.4881SiDA-EDM2-M (498M)2024-10-19
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step✓ Link1.6691SiDA-EDM2-S (280M)2024-10-19
Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models✓ Link1.68EDM2-S w/ guidance interval2024-04-11
Generative Modeling with Explicit Memory✓ Link1.71GMem2024-12-11
Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models✓ Link1.72DC-AE-f32 + USiT-2B2024-10-14
Autoregressive Image Generation without Vector Quantization✓ Link1.73MAR-L, Diff Loss2024-06-17
Self-Improving Diffusion Models with Synthetic Data1.73SIMS2024-08-29
PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher✓ Link1.80PaGoDA2024-05-23
Analyzing and Improving the Training Dynamics of Diffusion Models✓ Link1.81126EDM2-XXL2023-12-05
Analyzing and Improving the Training Dynamics of Diffusion Models✓ Link1.85126EDM2-XL2023-12-05
Analyzing and Improving the Training Dynamics of Diffusion Models✓ Link1.88126EDM2-L2023-12-05
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step✓ Link1.8881SiD-EDM2-XL (1.1B)2024-10-19
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step✓ Link1.9071SiD-EDM2-L (777M)2024-10-19
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation✓ Link1.91324.3MAGVIT-v22023-10-09
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step✓ Link1.9691SiD-EDM2-XXL (1.5B)2024-10-19
Analyzing and Improving the Training Dynamics of Diffusion Models✓ Link2.01126EDM2-M2023-12-05
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step✓ Link2.061SiD-EDM2-M (498M)2024-10-19
An Image is Worth 32 Tokens for Reconstruction and Generation✓ Link2.13TiTok-B-1282024-06-11
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step✓ Link2.1561SiDA-EDM2-XS (125M)2024-10-19
Analyzing and Improving the Training Dynamics of Diffusion Models✓ Link2.23126EDM2-S2023-12-05
CADS: Unleashing the Diversity of Diffusion Models through Condition-Annealed Sampling2.31DiT-XL/2 with CADS2023-10-26
StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets✓ Link2.40StyleGAN-XL2022-02-01
An Image is Worth 32 Tokens for Reconstruction and Generation✓ Link2.49TiTok-L-642024-06-11
DiffiT: Diffusion Vision Transformers for Image Generation✓ Link2.67252.12DiffiT2023-12-04
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step✓ Link2.7071SiD-EDM2-S (280M)2024-10-19
SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models✓ Link2.80DiT-XL/2 with SA-Solver2023-09-10
Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models and Time-Dependent Layer Normalization✓ Link2.89DiMR-XL/3R2024-06-13
Analyzing and Improving the Training Dynamics of Diffusion Models✓ Link2.91126EDM2-XS2023-12-05
GIVT: Generative Infinite-Vocabulary Transformers✓ Link2.92GIVT-Causal-L+A2023-12-04
Scalable Diffusion Models with Transformers✓ Link3.04240.82DiT-XL/22022-12-19
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation✓ Link3.07213.1MAGVIT-v2 (w/o guidance)2023-10-09
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step✓ Link3.3531SiD-EDM2-XS (125M)2024-10-19
Discrete Predictor-Corrector Diffusion Models for Image Synthesis3.54350.2DPC-U2022-09-29
High-Resolution Image Synthesis with Latent Diffusion Models✓ Link3.60247.67Latent Diffusion (LDM-4-G)2021-12-20
Polynomial Implicit Neural Representations For Large Diverse Datasets✓ Link3.81Poly-INR2023-03-20
Diffusion Models Beat GANs on Image Synthesis✓ Link3.85221.72ADM-G, ADM-U2021-05-11
Simple diffusion: End-to-end diffusion for high resolution images✓ Link4.28171simple diffusion (U-Net)2023-01-26
MaskGIT: Masked Generative Image Transformer✓ Link4.46342.0MaskGIT (a=0.05)2022-02-08
Simple diffusion: End-to-end diffusion for high resolution images✓ Link4.53205.3simple diffusion (U-ViT, L)2023-01-26
MaskGIT: Masked Generative Image Transformer✓ Link7.32156.0MaskGIT2022-02-08
Diffusion Models Beat GANs on Image Synthesis✓ Link7.72172.71ADM-G2021-05-11