OpenCodePapers

atari-games-on-atari-2600-gravitar

Video GamesAtari Games
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeScoreModelNameReleaseDate
Agent57: Outperforming the Atari Human Benchmark✓ Link19213.96Agent572020-03-30
Recurrent Experience Replay in Distributed Reinforcement Learning✓ Link15680.7R2D22019-05-01
Online and Offline Reinforcement Learning by Planning with a Learned Model✓ Link8006.93MuZero (Res2 Adam)2021-04-13
First return, then explore✓ Link7588Go-Explore2020-04-27
Self-supervised network distillation: an effective approach to exploration in sparse reward environments✓ Link6712SND-VIC2023-02-22
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model✓ Link6682.70MuZero2019-11-19
Generalized Data Distribution Iteration5915GDI-H32022-06-07
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning5905GDI-I32021-06-11
Generalized Data Distribution Iteration5905GDI-I32022-06-07
Self-supervised network distillation: an effective approach to exploration in sparse reward environments✓ Link4643SND-STD2023-02-22
Exploration by Random Network Distillation✓ Link3906RND2018-10-30
Mastering Atari with Discrete World Models✓ Link3789DreamerV22020-10-05
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link2850UCT2012-07-19
Self-supervised network distillation: an effective approach to exploration in sparse reward environments✓ Link2741SND-V2023-02-22
Evolving simple programs for playing Atari games✓ Link2350CGP2018-06-14
Noisy Networks for Exploration✓ Link2209NoisyNet-Dueling2017-06-30
DNA: Proximal Policy Optimization with a Dual Network Architecture✓ Link2190DNA2022-06-20
Self-Imitation Learning✓ Link1874.2A2C + SIL2018-06-14
Distributed Prioritized Experience Replay✓ Link1598.5Ape-X2018-03-02
Fully Parameterized Quantile Function for Distributional Reinforcement Learning✓ Link1406.0FQF2019-11-05
Large-Scale Study of Curiosity-Driven Learning✓ Link1165.1Intrinsic Reward Agent2018-08-13
Count-Based Exploration with the Successor Representation✓ Link1078.3DQNMMCe2018-07-31
Distributional Reinforcement Learning with Quantile Regression✓ Link995QR-DQN-12017-10-27
Implicit Quantile Networks for Distributional Reinforcement Learning✓ Link911IQN2018-06-14
Evolution Strategies as a Scalable Alternative to Reinforcement Learning✓ Link805.0ES FF (1 hour) noop2017-03-10
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity✓ Link760ASL DDQN2023-05-07
Dueling Network Architectures for Deep Reinforcement Learning✓ Link588.0Duel noop2015-11-20
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization✓ Link557.17POP3D2018-07-02
Prioritized Experience Replay✓ Link548.5Prior noop2015-11-18
Massively Parallel Methods for Deep Reinforcement Learning✓ Link538.4Gorila2015-07-15
Count-Based Exploration with Neural Density Models✓ Link498.3DQN-PixelCNN2017-03-03
Learning values across many orders of magnitude483.5DDQN+Pop-Art noop2016-02-24
Deep Reinforcement Learning with Double Q-learning✓ Link473.0DQN noop2015-09-22
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link446.92Persistent AL2015-12-15
A Distributional Perspective on Reinforcement Learning✓ Link440.0C51 noop2017-07-21
[]()429.0SARSA
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link417.65Advantage Learning2015-12-15
Dueling Network Architectures for Deep Reinforcement Learning✓ Link412.0DDQN (tuned) noop2015-11-20
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link387.7Best Learner2012-07-19
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures✓ Link359.50IMPALA (deep)2018-02-05
Asynchronous Methods for Deep Reinforcement Learning✓ Link320.0A3C LSTM hs2016-02-04
Human level control through deep reinforcement learning✓ Link306.7Nature DQN2015-02-25
Asynchronous Methods for Deep Reinforcement Learning✓ Link303.5A3C FF hs2016-02-04
Deep Reinforcement Learning with Double Q-learning✓ Link298.0DQN hs2015-09-22
Dueling Network Architectures for Deep Reinforcement Learning✓ Link297.0Duel hs2015-11-20
Deep Exploration via Bootstrapped DQN✓ Link286.1Bootstrapped DQN2016-02-15
Prioritized Experience Replay✓ Link269.5Prior hs2015-11-18
Asynchronous Methods for Deep Reinforcement Learning✓ Link269.5A3C FF (1 day) hs2016-02-04
Unifying Count-Based Exploration and Intrinsic Motivation✓ Link238.68A3C-CTS2016-06-06
Dueling Network Architectures for Deep Reinforcement Learning✓ Link238.0Prior+Duel noop2015-11-20
Count-Based Exploration with Neural Density Models✓ Link238.0DQN-CTS2017-03-03
Deep Reinforcement Learning with Double Q-learning✓ Link200.5DDQN (tuned) hs2015-09-22
Deep Reinforcement Learning with Double Q-learning✓ Link167.0Prior+Duel hs2015-09-22