OpenCodePapers

Atari Games on Atari 2600 Boxing

Leaderboard
| Paper | Code | Score | Model Name | Release Date |
|---|---|---|---|---|
| Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model | ✓ | 100.00 | MuZero | 2019-11-19 |
| Distributed Prioritized Experience Replay | ✓ | 100 | Ape-X | 2018-03-02 |
| Noisy Networks for Exploration | ✓ | 100 | NoisyNet-Dueling | 2017-06-30 |
| The Arcade Learning Environment: An Evaluation Platform for General Agents | ✓ | 100 | UCT | 2012-07-19 |
| Agent57: Outperforming the Atari Human Benchmark | ✓ | 100 | Agent57 | 2020-03-30 |
| Online and Offline Reinforcement Learning by Planning with a Learned Model | ✓ | 100 | MuZero (Res2 Adam) | 2021-04-13 |
| GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning | | 100 | GDI-H3 | 2021-06-11 |
| Generalized Data Distribution Iteration | | 100 | GDI-I3 | 2022-06-07 |
| Generalized Data Distribution Iteration | | 100 | GDI-H3 | 2022-06-07 |
| IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures | ✓ | 99.96 | IMPALA (deep) | 2018-02-05 |
| Distributional Reinforcement Learning with Quantile Regression | ✓ | 99.9 | QR-DQN-1 | 2017-10-27 |
| DNA: Proximal Policy Optimization with a Dual Network Architecture | ✓ | 99.9 | DNA | 2022-06-20 |
| Implicit Quantile Networks for Distributional Reinforcement Learning | ✓ | 99.8 | IQN | 2018-06-14 |
| Self-Imitation Learning | ✓ | 99.6 | A2C + SIL | 2018-06-14 |
| Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity | ✓ | 99.6 | ASL DDQN | 2023-05-07 |
| Dueling Network Architectures for Deep Reinforcement Learning | ✓ | 99.4 | Duel noop | 2015-11-20 |
| The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning | | 99.4 | Reactor 500M | 2017-04-15 |
| Learning values across many orders of magnitude | | 99.3 | DDQN+Pop-Art noop | 2016-02-24 |
| Dueling Network Architectures for Deep Reinforcement Learning | ✓ | 98.9 | Prior+Duel noop | 2015-11-20 |
| Recurrent Experience Replay in Distributed Reinforcement Learning | ✓ | 98.5 | R2D2 | 2019-05-01 |
| Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes | ✓ | 98 | DDRL A3C | 2018-01-09 |
| A Distributional Perspective on Reinforcement Learning | ✓ | 97.8 | C51 noop | 2017-07-21 |
| Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization | ✓ | 97.23 | POP3D | 2018-07-02 |
| Prioritized Experience Replay | ✓ | 95.6 | Prior noop | 2015-11-18 |
| Increasing the Action Gap: New Operators for Reinforcement Learning | ✓ | 94.3 | Persistent AL | 2015-12-15 |
| Increasing the Action Gap: New Operators for Reinforcement Learning | ✓ | 93.94 | Advantage Learning | 2015-12-15 |
| Deep Exploration via Bootstrapped DQN | ✓ | 93.2 | Bootstrapped DQN | 2016-02-15 |
| Mastering Atari with Discrete World Models | ✓ | 92 | DreamerV2 | 2020-10-05 |
| Dueling Network Architectures for Deep Reinforcement Learning | ✓ | 91.6 | DDQN (tuned) noop | 2015-11-20 |
| Deep Reinforcement Learning with Double Q-learning | ✓ | 88.0 | DQN noop | 2015-09-22 |
| Deep Reinforcement Learning with Double Q-learning | ✓ | 79.2 | Prior+Duel hs | 2015-09-22 |
| Dueling Network Architectures for Deep Reinforcement Learning | ✓ | 77.3 | Duel hs | 2015-11-20 |
| Massively Parallel Methods for Deep Reinforcement Learning | ✓ | 74.2 | Gorila | 2015-07-15 |
| Deep Reinforcement Learning with Double Q-learning | ✓ | 73.5 | DDQN (tuned) hs | 2015-09-22 |
| Prioritized Experience Replay | ✓ | 72.3 | Prior hs | 2015-11-18 |
| Human level control through deep reinforcement learning | ✓ | 71.8 | Nature DQN | 2015-02-25 |
| Deep Reinforcement Learning with Double Q-learning | ✓ | 70.3 | DQN hs | 2015-09-22 |
| Asynchronous Methods for Deep Reinforcement Learning | ✓ | 59.8 | A3C FF hs | 2016-02-04 |
| Evolution Strategies as a Scalable Alternative to Reinforcement Learning | ✓ | 49.8 | ES FF (1 hour) noop | 2017-03-10 |
| The Arcade Learning Environment: An Evaluation Platform for General Agents | ✓ | 44 | Best Learner | 2012-07-19 |
| Evolving simple programs for playing Atari games | ✓ | 38.4 | CGP | 2018-06-14 |
| Asynchronous Methods for Deep Reinforcement Learning | ✓ | 37.3 | A3C LSTM hs | 2016-02-04 |
| Asynchronous Methods for Deep Reinforcement Learning | ✓ | 33.7 | A3C FF (1 day) hs | 2016-02-04 |
| | | 9.8 | SARSA | |
| CURL: Contrastive Unsupervised Representations for Reinforcement Learning | ✓ | 4.8 | CURL | 2020-04-08 |