OpenCodePapers

atari-games-on-atari-2600-skiing

Video GamesAtari Games
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeScoreModelNameReleaseDate
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link0Best Learner2012-07-19
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link0Full Tree2012-07-19
Implicit Quantile Networks for Distributional Reinforcement Learning✓ Link-9289IQN2018-06-14
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model✓ Link-29968.36MuZero2019-11-19
Recurrent Experience Replay in Distributed Reinforcement Learning✓ Link-30021.7R2D22019-05-01
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures✓ Link-10180.38IMPALA (deep)2018-02-05
Fully Parameterized Quantile Function for Distributional Reinforcement Learning✓ Link-9085.3FQF2019-11-05
Noisy Networks for Exploration✓ Link-7550NoisyNet-Dueling2017-06-30
Evolving simple programs for playing Atari games✓ Link-9011CGP2018-06-14
Distributional Reinforcement Learning with Quantile Regression✓ Link-9324QR-DQN-12017-10-27
Distributed Prioritized Experience Replay✓ Link-10789.9Ape-X2018-03-02
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link-13264.51Advantage Learning2015-12-15
First return, then explore✓ Link-3660Go-Explore2020-04-27
Agent57: Outperforming the Atari Human Benchmark✓ Link-4202.6Agent572020-03-30
Mastering Atari with Discrete World Models✓ Link-9299DreamerV22020-10-05
Adaptive Rational Activations to Boost Deep Reinforcement Learning✓ Link-23582Recurrent Rational DQN Average2021-02-18
Adaptive Rational Activations to Boost Deep Reinforcement Learning✓ Link-23487Rational DQN Average2021-02-18
Online and Offline Reinforcement Learning by Planning with a Learned Model✓ Link-30000MuZero (Res2 Adam)2021-04-13
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning-6774GDI-I32021-06-11
Generalized Data Distribution Iteration-6774GDI-I32022-06-07
Generalized Data Distribution Iteration-6025GDI-H32022-06-07
DNA: Proximal Policy Optimization with a Dual Network Architecture✓ Link-29974DNA2022-06-20
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity✓ Link-8295.4ASL DDQN2023-05-07