OpenCodePapers

atari-games-on-atari-2600-frostbite

Video GamesAtari Games
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeScoreBest ScoreModelNameReleaseDate
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model✓ Link631378.53MuZero2019-11-19
Agent57: Outperforming the Atari Human Benchmark✓ Link541280.88Agent572020-03-30
Online and Offline Reinforcement Learning by Planning with a Learned Model✓ Link374769.76MuZero (Res2 Adam)2021-04-13
Recurrent Experience Replay in Distributed Reinforcement Learning✓ Link315456.4R2D22019-05-01
Fully Parameterized Quantile Function for Distributional Reinforcement Learning✓ Link214060Fearlessmrx2019-11-05
Mastering Atari with Discrete World Models✓ Link11384DreamerV22020-10-05
Generalized Data Distribution Iteration11330GDI-H3(200M frames)2022-06-07
Generalized Data Distribution Iteration11330GDI-H32022-06-07
Generalized Data Distribution Iteration10485GDI-I32022-06-07
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning10485GDI-I32021-06-11
Distributed Prioritized Experience Replay✓ Link9328.6Ape-X2018-03-02
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity✓ Link8616.4ASL DDQN2023-05-07
Dueling Network Architectures for Deep Reinforcement Learning✓ Link7413.0Prior+Duel noop2015-11-20
Self-Imitation Learning✓ Link6289.8A2C + SIL2018-06-14
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning✓ Link5214.0TRPO-hash2016-11-15
Dueling Network Architectures for Deep Reinforcement Learning✓ Link4672.8Duel noop2015-11-20
Distributional Reinforcement Learning with Quantile Regression✓ Link4384QR-DQN-12017-10-27
Prioritized Experience Replay✓ Link4380.1Prior noop2015-11-18
Implicit Quantile Networks for Distributional Reinforcement Learning✓ Link4324IQN2018-06-14
Deep Reinforcement Learning with Double Q-learning✓ Link4038.4Prior+Duel hs2015-09-22
A Distributional Perspective on Reinforcement Learning✓ Link3965.0C51 noop2017-07-21
Value Prediction Network✓ Link3811VPN2017-07-11
Prioritized Experience Replay✓ Link3510.0Prior hs2015-11-18
Learning values across many orders of magnitude3469.6DDQN+Pop-Art noop2016-02-24
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link3248.96Persistent AL2015-12-15
Noisy Networks for Exploration✓ Link2923NoisyNet-Dueling2017-06-30
Count-Based Exploration in Feature Space for Reinforcement Learning✓ Link2770.1Sarsa-φ-EB2017-06-25
Model-Free Episodic Control with State Aggregation23944020MFEC2020-08-21
Dueling Network Architectures for Deep Reinforcement Learning✓ Link2332.4Duel hs2015-11-20
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link2305.82Advantage Learning2015-12-15
Deep Exploration via Bootstrapped DQN✓ Link2181.4Bootstrapped DQN2016-02-15
Dueling Network Architectures for Deep Reinforcement Learning✓ Link1683.3DDQN (tuned) noop2015-11-20
Deep Reinforcement Learning with Double Q-learning✓ Link1448.1DDQN (tuned) hs2015-09-22
Count-Based Exploration in Feature Space for Reinforcement Learning✓ Link1394.3Sarsa-ε2017-06-25
CURL: Contrastive Unsupervised Representations for Reinforcement Learning✓ Link924CURL2020-04-08
Deep Reinforcement Learning with Double Q-learning✓ Link797.4DQN noop2015-09-22
Evolving simple programs for playing Atari games✓ Link782CGP2018-06-14
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models✓ Link507.0MP-EB2015-07-03
Deep Reinforcement Learning with Double Q-learning✓ Link496.1DQN hs2015-09-22
Massively Parallel Methods for Deep Reinforcement Learning✓ Link426.6Gorila2015-07-15
Evolution Strategies as a Scalable Alternative to Reinforcement Learning✓ Link370.0ES FF (1 hour) noop2017-03-10
Human level control through deep reinforcement learning✓ Link328.3Nature DQN2015-02-25
DNA: Proximal Policy Optimization with a Dual Network Architecture✓ Link320DNA2022-06-20
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures✓ Link317.75IMPALA (deep)2018-02-05
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization✓ Link316.87POP3D2018-07-02
Playing Atari with Six Neurons✓ Link300IDVQ + DRSC + XNES2018-06-04
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link270.5UCT2012-07-19
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link216.9Best Learner2012-07-19
Asynchronous Methods for Deep Reinforcement Learning✓ Link197.6A3C LSTM hs2016-02-04
Asynchronous Methods for Deep Reinforcement Learning✓ Link190.5A3C FF hs2016-02-04
[]()180.9SARSA
Asynchronous Methods for Deep Reinforcement Learning✓ Link180.1A3C FF (1 day) hs2016-02-04
Soft Actor-Critic for Discrete Action Settings✓ Link59.4SAC2019-10-16