OpenCodePapers
Text-to-Image Generation on Multi-Modal
Leaderboard
| Paper | Code | FID | LPIPS | Acc | Real | Model name | Release date |
|---|---|---|---|---|---|---|---|
| Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation | - | 10.31 | - | - | - | Swinv2-Imagen | 2022-10-18 |
| LAFITE: Towards Language-Free Training for Text-to-Image Generation | ✓ | 12.54 | - | - | - | Lafite | 2021-11-27 |
| Shifted Diffusion for Text-to-image Generation | ✓ | 19.74 | - | - | - | Corgi | 2022-11-24 |
| Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models | ✓ | 26.09 | 0.519 | - | - | Unite and Conquer | 2022-12-01 |
| Towards Open-World Text-Guided Face Image Generation and Manipulation | ✓ | 101.42 | 0.461 | 20.4 | 21.0 | TediGAN-B | 2021-04-18 |
| TediGAN: Text-Guided Diverse Face Image Generation and Manipulation | ✓ | 106.37 | 0.456 | 18.4 | 22.6 | TediGAN-A | 2020-12-06 |
| Controllable Text-to-Image Generation | ✓ | 116.32 | 0.522 | 14.6 | 13.1 | ControlGAN | 2019-09-16 |
| AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks | ✓ | 125.98 | 0.512 | 13.0 | 11.9 | AttnGAN | 2017-11-28 |
| DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis | ✓ | 131.05 | 0.544 | 16.4 | 16.9 | DM-GAN | 2019-04-02 |
| DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis | ✓ | 137.60 | 0.581 | 17.3 | 14.5 | DFGAN | 2020-08-13 |