OpenCodePapers

Video Generation on BAIR Robot Pushing

Leaderboard
| Paper | Code | FVD score | SSIM | PSNR | LPIPS | Cond frames | Train frames | Pred frames | Notes | Model name | Release date |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| MAGVIT: Masked Generative Video Transformer | ✓ | 62 | | | | 1 | 15 | 15 | | MAGVIT | 2022-12-10 |
| Diffusion Models for Video Prediction and Infilling | ✓ | 84.20 | | | | 1 | 20 | 15 | | RaMViD | 2022-06-15 |
| NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion | ✓ | 86.9 | | | | 1 | 15 | 15 | | NUWA | 2021-11-24 |
| MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation | ✓ | 87.9 | 0.838 | 19.1 | | 2 | 5 | 14 | | MCVD : c2t5p14 | 2022-05-19 |
| MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation | ✓ | 89.5 | 0.78 | 16.9 | | 1 | 5 | 15 | | MCVD : c1t5p15 | 2022-05-19 |
| FitVid: Overfitting in Pixel-Level Video Prediction | ✓ | 93.6 | | | | 1 | 15 | 15 | Uses 100 times more fake than real samples (atypical) | FitVid | 2021-06-24 |
| Scaling Autoregressive Video Models | ✓ | 94 ± 2 | | | | 1 | 15 | 15 | FVD on only leftmost samples is 94, FVD on unrolled (all subsequences) is 96 | Video Transformer | 2019-06-06 |
| CCVS: Context-aware Controllable Video Synthesis | ✓ | 99 ± 2 | | | | 1 | 15 | 15 | | CCVS | 2021-07-16 |
| VideoGPT: Video Generation using VQ-VAE and Transformers | ✓ | 103.3 | | | | 1 | 15 | 15 | | VideoGPT | 2021-04-20 |
| Transformation-based Adversarial Video Prediction on Large-Scale Data | | 103.3 | | | | 1 | 15 | 15 | | TrIVD-GAN-FP | 2020-03-09 |
| Adversarial Video Generation on Complex Datasets | ✓ | 109.8 | | | | 1 | 15 | 15 | | DVD-GAN-FP | 2019-07-15 |
| Stochastic Adversarial Video Prediction | ✓ | 116.4 | | | | 2 | 14 | 14 | | SAVP (from FVD) | 2018-04-04 |
| MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation | ✓ | 118.4 | 0.745 | 16.2 | | 2 | 5 | 28 | | MCVD : c2t5p28 | 2022-05-19 |
| Latent Video Transformer | ✓ | 125.76 ± 2.90 | | | | 1 | 15 | 15 | | LVT | 2020-06-18 |
| VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation | ✓ | 131 ± 5 | | | | 3 | 10 | 14 (total 16) | | VideoFlow | 2019-03-04 |
| Improved Conditional VRNNs for Video Prediction | ✓ | 143.4 | 0.822 ± 0.06 | | 0.055 ± 0.03 | 2 | 10 | 28 | | Hier-VRNN | 2019-04-27 |
| Stochastic Adversarial Video Prediction | ✓ | 143.43 | 0.795 ± 0.07 | | 0.062 ± 0.03 | 2 | 10 | 28 | | SAVP (from vRNN) | 2018-04-04 |
| Improved Conditional VRNNs for Video Prediction | ✓ | 149.22 | 0.829 ± 0.06 | | 0.058 ± 0.03 | 2 | 10 | 28 | | VRNN 1L | 2019-04-27 |
| Stochastic Adversarial Video Prediction | ✓ | 152 ± 9 | 0.7887 ± 0.0092 | 18.44 ± 0.25 | 0.0634 ± 0.0026 | 2 | 12 | 28 | | SAVP (from SRVP) | 2018-04-04 |
| Exploring Spatial-Temporal Multi-Frequency Analysis for High-Fidelity and Temporal-Consistency Video Prediction | ✓ | 159.6 | 0.844 | 21.02 | 0.0936 | 2 | 14 | 28 | | WAM | 2020-02-23 |
| Stochastic Latent Residual Video Prediction | ✓ | 162 ± 4 | 0.8196 ± 0.0084 | 19.59 ± 0.27 | 0.0574 ± 0.0032 | 2 | 12 | 28 | | SRVP | 2020-02-21 |
| SLAMP: Stochastic Latent Appearance and Motion Prediction | ✓ | 245 ± 5 | 0.8175 ± 0.084 | 19.67 ± 0.26 | 0.0596 ± 0.0032 | 2 | 10 | 28 | | SLAMP | 2021-08-05 |
| Stochastic Video Generation with a Learned Prior | ✓ | 255 ± 4 | 0.8058 ± 0.0088 | 18.95 ± 0.26 | 0.0609 ± 0.0034 | 2 | 12 | 28 | | SVG (from SRVP) | 2018-02-21 |
| Stochastic Video Generation with a Learned Prior | ✓ | 256.62 | 0.816 ± 0.07 | | 0.061 ± 0.03 | 2 | 10 | 28 | | SVG-LP (from vRNN) | 2018-02-21 |
| Stochastic Variational Video Prediction | ✓ | 262.5 | | | | 2 | 14 | 14 | | SV2P (from FVD) | 2017-10-30 |
| Unsupervised Learning for Physical Interaction through Video Prediction | ✓ | 296.5 | | | | 2 | 14 | 14 | | CDNA (from FVD) | 2016-05-23 |
| Stochastic Video Generation with a Learned Prior | ✓ | 315.5 | | | | 2 | 14 | 14 | | SVG-FP (from FVD) | 2018-02-21 |
| Latent Video Transformer | ✓ | 320.9 | | | | 1 | 15 | 15 | | Baseline (from LVT) | 2020-06-18 |
| MoCoGAN: Decomposing Motion and Content for Video Generation | ✓ | 503 | | | | 4 | 12 | 12 | | MoCoGAN | 2017-07-17 |
| Stochastic Variational Video Prediction | ✓ | 965 ± 17 | 0.8169 ± 0.0086 | 20.39 ± 0.27 | 0.0912 ± 0.0053 | 2 | 12 | 28 | | SV2P (from SRVP) | 2017-10-30 |
| Stochastic Adversarial Video Prediction | ✓ | | 0.815 | 19.09 | | 2 | 14 | 28 | | SAVP-VAE (from WAM) | 2018-04-04 |