OpenCodePapers

on-wise

Text-to-Image Generation

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	Overall	Cultural	Time	Space	Biology	Physics	Chemistry	ModelName	ReleaseDate
Transfer between Modalities with MetaQueries		0.55	0.56	0.55	0.62	0.49	0.63	0.41	MetaQuery-XL	2025-04-08
UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation	✓ Link	0.55	0.53	0.55	0.73	0.45	0.59	0.41	UniWorld-V1	2025-06-03
Emerging Properties in Unified Multimodal Pretraining	✓ Link	0.52	0.44	0.55	0.68	0.44	0.60	0.39	BAGEL	2025-05-20
Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in Text-to-Image Generation		0.49	0.49	0.58	0.55	0.43	0.48	0.33	playground-v2.5	2024-02-27
PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis	✓ Link	0.47	0.45	0.50	0.48	0.49	0.56	0.34	PixArt-Alpha	2023-09-30
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis	✓ Link	0.46	0.44	0.50	0.58	0.44	0.56	0.31	SD3.5-large	2024-03-05
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis	✓ Link	0.43	0.43	0.48	0.47	0.44	0.45	0.27	SDXL	2023-07-04
Emu3: Next-Token Prediction is All You Need	✓ Link	0.39	0.34	0.45	0.48	0.41	0.45	0.27	Emu3	2024-09-27
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation	✓ Link	0.35	0.28	0.40	0.48	0.30	0.46	0.30	Show-o	2024-08-22
Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling	✓ Link	0.35	0.30	0.37	0.49	0.36	0.42	0.26	Janus-Pro-7B	2025-01-29
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation	✓ Link	0.23	0.16	0.26	0.35	0.28	0.30	0.14	Janus	2024-10-17