OpenCodePapers

atari-games-on-atari-2600-time-pilot

Video GamesAtari Games
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeScoreModelNameReleaseDate
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model✓ Link476763.90MuZero2019-11-19
Generalized Data Distribution Iteration450810GDI-H32022-06-07
Recurrent Experience Replay in Distributed Reinforcement Learning✓ Link445377.3R2D22019-05-01
Online and Offline Reinforcement Learning by Planning with a Learned Model✓ Link424011.16MuZero (Res2 Adam)2021-04-13
Agent57: Outperforming the Atari Human Benchmark✓ Link405425.31Agent572020-03-30
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning216770GDI-I32021-06-11
Generalized Data Distribution Iteration216770GDI-I32022-06-07
Distributed Prioritized Experience Replay✓ Link87085Ape-X2018-03-02
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link63854.5UCT2012-07-19
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures✓ Link48481.50IMPALA (deep)2018-02-05
Mastering Atari with Discrete World Models✓ Link37945DreamerV22020-10-05
Asynchronous Methods for Deep Reinforcement Learning✓ Link27202.0A3C LSTM hs2016-02-04
Adaptive Rational Activations to Boost Deep Reinforcement Learning✓ Link17632Rational DQN Average2021-02-18
Noisy Networks for Exploration✓ Link17301NoisyNet-Dueling2017-06-30
Adaptive Rational Activations to Boost Deep Reinforcement Learning✓ Link13261Recurrent Rational DQN Average2021-02-18
DNA: Proximal Policy Optimization with a Dual Network Architecture✓ Link12774DNA2022-06-20
Asynchronous Methods for Deep Reinforcement Learning✓ Link12679.0A3C FF hs2016-02-04
Implicit Quantile Networks for Distributional Reinforcement Learning✓ Link12236IQN2018-06-14
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity✓ Link12071ASL DDQN2023-05-07
Evolving simple programs for playing Atari games✓ Link12040CGP2018-06-14
Dueling Network Architectures for Deep Reinforcement Learning✓ Link11666.0Duel noop2015-11-20
Self-Imitation Learning✓ Link10811.7A2C + SIL2018-06-14
Distributional Reinforcement Learning with Quantile Regression✓ Link10345QR-DQN-12017-10-27
Prioritized Experience Replay✓ Link9197.0Prior noop2015-11-18
Deep Exploration via Bootstrapped DQN✓ Link9079.4Bootstrapped DQN2016-02-15
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link8969.12Advantage Learning2015-12-15
Dueling Network Architectures for Deep Reinforcement Learning✓ Link8339.0DDQN (tuned) noop2015-11-20
A Distributional Perspective on Reinforcement Learning✓ Link8329.0C51 noop2017-07-21
Massively Parallel Methods for Deep Reinforcement Learning✓ Link8267.8Gorila2015-07-15
Dueling Network Architectures for Deep Reinforcement Learning✓ Link7553.0Prior+Duel noop2015-11-20
Deep Reinforcement Learning with Double Q-learning✓ Link6608.0DDQN (tuned) hs2015-09-22
Dueling Network Architectures for Deep Reinforcement Learning✓ Link6601.0Duel hs2015-11-20
Prioritized Experience Replay✓ Link5963.0Prior hs2015-11-18
Human level control through deep reinforcement learning✓ Link5947.0Nature DQN2015-02-25
Asynchronous Methods for Deep Reinforcement Learning✓ Link5825.0A3C FF (1 day) hs2016-02-04
Evolution Strategies as a Scalable Alternative to Reinforcement Learning✓ Link4970.0ES FF (1 hour) noop2017-03-10
Deep Reinforcement Learning with Double Q-learning✓ Link4871.0Prior+Duel hs2015-09-22
Deep Reinforcement Learning with Double Q-learning✓ Link4870.0DQN noop2015-09-22
Learning values across many orders of magnitude4870.0DDQN+Pop-Art noop2016-02-24
Deep Reinforcement Learning with Double Q-learning✓ Link4786.0DQN hs2015-09-22
Playing Atari with Six Neurons✓ Link4600IDVQ + DRSC + XNES2018-06-04
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization✓ Link3770.33POP3D2018-07-02
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link3741.2Best Learner2012-07-19
[]()24.9SARSA