OpenCodePapers

atari-games-on-atari-2600-tutankham

Video GamesAtari Games
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeScoreModelNameReleaseDate
Agent57: Outperforming the Atari Human Benchmark✓ Link2354.91Agent572020-03-30
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model✓ Link491.48MuZero2019-11-19
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning423.9GDI-I32021-06-11
Generalized Data Distribution Iteration423.9GDI-I32022-06-07
Generalized Data Distribution Iteration418.2GDI-H32022-06-07
Recurrent Experience Replay in Distributed Reinforcement Learning✓ Link395.3R2D22019-05-01
Online and Offline Reinforcement Learning by Planning with a Learned Model✓ Link347.99MuZero (Res2 Adam)2021-04-13
Self-Imitation Learning✓ Link340.5A2C + SIL2018-06-14
Distributional Reinforcement Learning with Quantile Regression✓ Link297QR-DQN-12017-10-27
Implicit Quantile Networks for Distributional Reinforcement Learning✓ Link293IQN2018-06-14
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures✓ Link292.11IMPALA (deep)2018-02-05
A Distributional Perspective on Reinforcement Learning✓ Link280.0C51 noop2017-07-21
Distributed Prioritized Experience Replay✓ Link272.6Ape-X2018-03-02
Noisy Networks for Exploration✓ Link269NoisyNet-Dueling2017-06-30
Mastering Atari with Discrete World Models✓ Link264DreamerV22020-10-05
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity✓ Link252.9ASL DDQN2023-05-07
Dueling Network Architectures for Deep Reinforcement Learning✓ Link245.9Prior+Duel noop2015-11-20
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link245.22Advantage Learning2015-12-15
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization✓ Link241.21POP3D2018-07-02
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link225.5UCT2012-07-19
Dueling Network Architectures for Deep Reinforcement Learning✓ Link218.4DDQN (tuned) noop2015-11-20
Deep Exploration via Bootstrapped DQN✓ Link214.8Bootstrapped DQN2016-02-15
Dueling Network Architectures for Deep Reinforcement Learning✓ Link211.4Duel noop2015-11-20
Prioritized Experience Replay✓ Link204.6Prior noop2015-11-18
Deep Attention Recurrent Q-Network✓ Link197DARQN soft2015-12-05
Human level control through deep reinforcement learning✓ Link186.7Nature DQN2015-02-25
Adaptive Rational Activations to Boost Deep Reinforcement Learning✓ Link184Recurrent Rational DQN Average2021-02-18
Learning values across many orders of magnitude183.9DDQN+Pop-Art noop2016-02-24
Adaptive Rational Activations to Boost Deep Reinforcement Learning✓ Link179Rational DQN Average2021-02-18
Asynchronous Methods for Deep Reinforcement Learning✓ Link156.3A3C FF hs2016-02-04
Asynchronous Methods for Deep Reinforcement Learning✓ Link144.2A3C LSTM hs2016-02-04
Evolution Strategies as a Scalable Alternative to Reinforcement Learning✓ Link130.3ES FF (1 hour) noop2017-03-10
DNA: Proximal Policy Optimization with a Dual Network Architecture✓ Link127DNA2022-06-20
Massively Parallel Methods for Deep Reinforcement Learning✓ Link118.5Gorila2015-07-15
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link114.3Best Learner2012-07-19
Deep Reinforcement Learning with Double Q-learning✓ Link108.6Prior+Duel hs2015-09-22
[]()98.2SARSA
Deep Reinforcement Learning with Double Q-learning✓ Link92.2DDQN (tuned) hs2015-09-22
Deep Reinforcement Learning with Double Q-learning✓ Link68.1DQN noop2015-09-22
Prioritized Experience Replay✓ Link56.9Prior hs2015-11-18
Dueling Network Architectures for Deep Reinforcement Learning✓ Link48.0Duel hs2015-11-20
Deep Reinforcement Learning with Double Q-learning✓ Link45.6DQN hs2015-09-22
Asynchronous Methods for Deep Reinforcement Learning✓ Link26.1A3C FF (1 day) hs2016-02-04
Evolving simple programs for playing Atari games✓ Link0CGP2018-06-14