OpenCodePapers
video-prediction-on-kinetics-600-12-frames
Video Prediction
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
Show papers without code
Paper
Code
FVD
↕
IS
↕
Cond
↕
Pred
↕
ModelName
ReleaseDate
↕
Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion
2.3
5
11
SiD2
2024-10-25
Photorealistic Video Generation with Diffusion Models
3.3
W.A.L.T.-L
2023-12-11
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
✓ Link
4.3±0.1
MAGVIT-v2
2023-10-09
LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior
✓ Link
5.1
5
11
LARP
2024-10-28
MAGVIT: Masked Generative Video Transformer
✓ Link
9.9±0.3
5
11
MAGVIT (-L-FP)
2022-12-10
Scalable Adaptive Computation for Iterative Generation
✓ Link
10.8
17.7
RIN (1000 steps)
2022-12-22
Scalable Adaptive Computation for Iterative Generation
✓ Link
11.5
17.7
RIN (400 steps)
2022-12-22
Diffusion Models for Video Prediction and Infilling
✓ Link
16.46
5
11
RaMViD
2022-06-15
MAGVIT: Masked Generative Video Transformer
✓ Link
24.5±0.9
5
11
MAGVIT (-B-FP)
2022-12-10
Transformation-based Adversarial Video Prediction on Large-Scale Data
25.74±0.66
12.54±0.06
5
11
TriVD-GAN-FP
2020-03-09
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
✓ Link
32.9
OmniTokenizer-AR
2024-06-13
CCVS: Context-aware Controllable Video Synthesis
✓ Link
55±1
5
11
CCVS
2021-07-16
Predicting Video with VQVAE
✓ Link
64.30±2.04
4
12
Video VQ-VAE FVD
2021-03-02
Adversarial Video Generation on Complex Datasets
✓ Link
69.15±0.78
5
11
DVD-GAN-FP
2019-07-15
Scaling Autoregressive Video Models
✓ Link
170±5
5
11
Video Transformer
2019-06-06
Latent Video Transformer
✓ Link
224.73
5
11
LVT
2020-06-18