OpenCodePapers

image-generation-on-textatlaseval

Image Generation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeTextVsionBlend OCR (F1 Score)TextVisionBlend OCR (Accuracy)TextVisionBlend OCR (Cer)TextVisionBlend FIDTextVisionBlend Clip ScoreStyledTextSynth OCR (F1 Score)StyledTextSynth OCR (Accuracy)StyledTextSynth OCR (Cer)StyledTextSynth FIDStyledTextSynth Clip ScoreTextScenesHQ OCR (F1 Score)TextScenesHQ OCR (Accuracy)TextScenesHQ OCR (Cer)TextScenesHQ FIDTextScenesHQ Clip ScoreModelNameReleaseDate
[]()44.2241.540.57-0.169721.4015.820.7380.330.293837.9435.070.57-0.3197Grok3
[]()16.2514.550.88118.850.184633.8627.210.7371.090.284924.4519.030.7364.440.2363SD3.5 Large
[]()7.948.380.93153.210.193838.2530.580.7890.700.293851.6369.26-86.730.3367Dalle3
Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data✓ Link3.442.980.8395.690.19791.420.800.9384.950.27271.741.060.8871.590.2346Infinity-2B2024-10-24
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation✓ Link1.572.400.8381.290.18910.620.420.9082.830.27640.530.340.9172.620.2347PixArt-Sigma2024-03-07
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering-----1.460.760.99114.310.25101.250.660.9684.100.2252TextDiffuser22023-11-28
AnyText: Multilingual Visual Text Generation And Editing✓ Link-----0.660.350.98117.710.25010.80.420.95101.320.2174Anytext2023-11-06