OpenCodePapers
Language Modelling on OpenWebText
[Chart: Results over time]
Leaderboard
| Paper | Code | eval_perplexity | eval_loss | parameters | ModelName | ReleaseDate |
|---|---|---|---|---|---|---|
| Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking | - | 15.36 | - | 131M | MDLM-Prime | 2025-05-24 |
| Simple and Effective Masked Diffusion Language Models | ✓ | 17.54 | - | 131M | ARM | 2024-06-11 |
| Energy-Based Diffusion Language Models for Text Generation | - | 17.58 | - | 131M | EDLM-coAR | 2024-10-28 |
| Polynomial, trigonometric, and tropical activations | ✓ | 18.39 | 2.91 | 124M | GPT2-Hermite | 2025-02-03 |
| Polynomial, trigonometric, and tropical activations | ✓ | 18.64 | 2.92 | 124M | GPT2-Tropical | 2025-02-03 |
| Polynomial, trigonometric, and tropical activations | ✓ | 18.72 | 2.93 | 124M | GPT2-Fourier | 2025-02-03 |
| Polynomial, trigonometric, and tropical activations | ✓ | 19.24 | 2.95 | 124M | GPT2-GELU | 2025-02-03 |
| Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models | ✓ | 20.73 | - | 131M | BD3-LMs | 2025-03-12 |
| Energy-Based Diffusion Language Models for Text Generation | - | 21.52 | - | 131M | EDLM-NCE | 2024-10-28 |
| Simplified and Generalized Masked Diffusion for Discrete Data | ✓ | 21.80 | - | 131M | GenMD4 | 2024-06-06 |
| Simple and Effective Masked Diffusion Language Models | ✓ | 22.98 | - | 131M | MDLM | 2024-06-11 |
| Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution | ✓ | 24.10 | - | 131M | SEDD | 2023-10-25 |
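
The four GPT2 entries report both eval_perplexity and eval_loss. The page does not say how the two metrics are related, so the sketch below rests on an assumption: eval_loss is the mean per-token cross-entropy in nats, and eval_perplexity is its exponential. Under that assumption, the natural log of each listed perplexity should land close to the listed loss. The `rows` list and the helper `loss_from_perplexity` are illustrative names, not part of the leaderboard; the numbers are copied from the entries above.

```python
import math

# (ModelName, eval_perplexity, eval_loss) copied from the leaderboard entries
# that report both metrics.
rows = [
    ("GPT2-Hermite", 18.39, 2.91),
    ("GPT2-Tropical", 18.64, 2.92),
    ("GPT2-Fourier", 18.72, 2.93),
    ("GPT2-GELU", 19.24, 2.95),
]


def loss_from_perplexity(perplexity: float) -> float:
    """Assumed relation: perplexity = exp(loss), so loss = ln(perplexity)."""
    return math.log(perplexity)


for name, perplexity, reported_loss in rows:
    implied_loss = loss_from_perplexity(perplexity)
    gap = implied_loss - reported_loss
    print(f"{name}: reported loss {reported_loss:.2f}, "
          f"ln(perplexity) {implied_loss:.4f}, gap {gap:+.4f}")
```

For these four entries the implied and reported losses differ by less than 0.01, which is consistent with the assumed relation once the two-decimal reporting is taken into account. It does not confirm how either metric was actually computed.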