OpenCodePapers

text-to-image-generation-on-multi-modal

Text-to-Image Generation

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	FID	LPIPS	Acc	Real	ModelName	ReleaseDate
Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation		10.31				Swinv2-Imagen	2022-10-18
LAFITE: Towards Language-Free Training for Text-to-Image Generation	✓ Link	12.54				Lafite	2021-11-27
Shifted Diffusion for Text-to-image Generation	✓ Link	19.74				Corgi	2022-11-24
Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models	✓ Link	26.09	0.519			Unite and Conquer	2022-12-01
Towards Open-World Text-Guided Face Image Generation and Manipulation	✓ Link	101.42	0.461	20.4	21.0	TediGAN-B	2021-04-18
TediGAN: Text-Guided Diverse Face Image Generation and Manipulation	✓ Link	106.37	0.456	18.4	22.6	TediGAN-A	2020-12-06
Controllable Text-to-Image Generation	✓ Link	116.32	0.522	14.6	13.1	ControlGAN	2019-09-16
AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks	✓ Link	125.98	0.512	13.0	11.9	AttnGAN	2017-11-28
DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis	✓ Link	131.05	0.544	16.4	16.9	DM-GAN	2019-04-02
DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis	✓ Link	137.60	0.581	17.3	14.5	DFGAN	2020-08-13