OpenCodePapers

Video Prediction on Kinetics-600, 12 frames

Video Prediction

Leaderboard

Paper	Code	FVD	IS	Cond. frames	Pred. frames	Model	Release date
Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion		2.3		5	11	SiD2	2024-10-25
Photorealistic Video Generation with Diffusion Models		3.3				W.A.L.T.-L	2023-12-11
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation	✓ Link	4.3±0.1				MAGVIT-v2	2023-10-09
LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior	✓ Link	5.1		5	11	LARP	2024-10-28
MAGVIT: Masked Generative Video Transformer	✓ Link	9.9±0.3		5	11	MAGVIT (-L-FP)	2022-12-10
Scalable Adaptive Computation for Iterative Generation	✓ Link	10.8	17.7			RIN (1000 steps)	2022-12-22
Scalable Adaptive Computation for Iterative Generation	✓ Link	11.5	17.7			RIN (400 steps)	2022-12-22
Diffusion Models for Video Prediction and Infilling	✓ Link	16.46		5	11	RaMViD	2022-06-15
MAGVIT: Masked Generative Video Transformer	✓ Link	24.5±0.9		5	11	MAGVIT (-B-FP)	2022-12-10
Transformation-based Adversarial Video Prediction on Large-Scale Data		25.74±0.66	12.54±0.06	5	11	TriVD-GAN-FP	2020-03-09
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation	✓ Link	32.9				OmniTokenizer-AR	2024-06-13
CCVS: Context-aware Controllable Video Synthesis	✓ Link	55±1		5	11	CCVS	2021-07-16
Predicting Video with VQVAE	✓ Link	64.30±2.04		4	12	Video VQ-VAE FVD	2021-03-02
Adversarial Video Generation on Complex Datasets	✓ Link	69.15±0.78		5	11	DVD-GAN-FP	2019-07-15
Scaling Autoregressive Video Models	✓ Link	170±5		5	11	Video Transformer	2019-06-06
Latent Video Transformer	✓ Link	224.73		5	11	LVT	2020-06-18
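
The FVD and IS columns mix plain values (`16.46`), mean±spread values (`4.3±0.1`), and empty cells, so they cannot be sorted as plain numbers. Below is a minimal Python sketch of one way to parse such cells and rank entries by mean FVD. The names `Metric` and `parse_metric` are hypothetical helpers introduced for illustration, and the sample cells are copied from the table above.

```python
from typing import NamedTuple, Optional


class Metric(NamedTuple):
    mean: float
    std: Optional[float]  # None when the leaderboard reports no spread


def parse_metric(cell: str) -> Optional[Metric]:
    """Parse a cell such as '4.3±0.1', '16.46', or '' (missing value)."""
    cell = cell.strip()
    if not cell:
        return None
    # partition() returns an empty separator when '±' is absent.
    mean, sep, std = cell.partition("±")
    return Metric(float(mean), float(std) if sep else None)


# (model name, FVD cell) pairs copied from the leaderboard above.
fvd_cells = [
    ("MAGVIT-v2", "4.3±0.1"),
    ("RaMViD", "16.46"),
    ("CCVS", "55±1"),
    ("SiD2", "2.3"),
]

# Rank by mean FVD, lowest first (the order the leaderboard uses).
ranked = sorted(fvd_cells, key=lambda item: parse_metric(item[1]).mean)
for name, cell in ranked:
    print(name, parse_metric(cell))
```

Keeping the spread in a separate field, rather than discarding it, leaves room to check whether two entries overlap before treating one as better than the other.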