OpenCodePapers

text-to-image-generation-on-multi-modal

Text-to-Image Generation
Dataset Link
Leaderboard
| Paper | Code | FID | LPIPS | Acc | Real | Model Name | Release Date |
|---|---|---|---|---|---|---|---|
| Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation | | 10.31 | | | | Swinv2-Imagen | 2022-10-18 |
| LAFITE: Towards Language-Free Training for Text-to-Image Generation | ✓ | 12.54 | | | | Lafite | 2021-11-27 |
| Shifted Diffusion for Text-to-image Generation | ✓ | 19.74 | | | | Corgi | 2022-11-24 |
| Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models | ✓ | 26.09 | 0.519 | | | Unite and Conquer | 2022-12-01 |
| Towards Open-World Text-Guided Face Image Generation and Manipulation | ✓ | 101.42 | 0.461 | 20.4 | 21.0 | TediGAN-B | 2021-04-18 |
| TediGAN: Text-Guided Diverse Face Image Generation and Manipulation | ✓ | 106.37 | 0.456 | 18.4 | 22.6 | TediGAN-A | 2020-12-06 |
| Controllable Text-to-Image Generation | ✓ | 116.32 | 0.522 | 14.6 | 13.1 | ControlGAN | 2019-09-16 |
| AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks | ✓ | 125.98 | 0.512 | 13.0 | 11.9 | AttnGAN | 2017-11-28 |
| DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis | ✓ | 131.05 | 0.544 | 16.4 | 16.9 | DM-GAN | 2019-04-02 |
| DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis | ✓ | 137.60 | 0.581 | 17.3 | 14.5 | DFGAN | 2020-08-13 |