OpenCodePapers

atari-games-on-atari-2600-centipede

Video GamesAtari Games
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeScoreModelNameReleaseDate
First return, then explore✓ Link1422628Go-Explore2020-04-27
GDI: Rethinking What Makes Reinforcement Learning Different from Supervised Learning1359533GDI-H3(1B frames)2021-11-24
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model✓ Link1159049.27MuZero2019-11-19
Online and Offline Reinforcement Learning by Planning with a Learned Model✓ Link874301.64MuZero (Res2 Adam)2021-04-13
Recurrent Experience Replay in Distributed Reinforcement Learning✓ Link599140.3R2D22019-05-01
Agent57: Outperforming the Atari Human Benchmark✓ Link412847.86Agent572020-03-30
Generalized Data Distribution Iteration195630GDI-H32022-06-07
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning155830GDI-I32021-06-11
Generalized Data Distribution Iteration155830GDI-I32022-06-07
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link125123Full Tree2012-07-19
DNA: Proximal Policy Optimization with a Dual Network Architecture✓ Link100194DNA2022-06-20
Learning values across many orders of magnitude49065.8DDQN+Pop-Art noop2016-02-24
Evolving simple programs for playing Atari games✓ Link24708CGP2018-06-14
Distributed Prioritized Experience Replay✓ Link12974Ape-X2018-03-02
Distributional Reinforcement Learning with Quantile Regression✓ Link12447QR-DQN-12017-10-27
Mastering Atari with Discrete World Models✓ Link11883DreamerV22020-10-05
Implicit Quantile Networks for Distributional Reinforcement Learning✓ Link11561IQN2018-06-14
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures✓ Link11049.75IMPALA (deep)2018-02-05
A Distributional Perspective on Reinforcement Learning✓ Link9646.0C51 noop2017-07-21
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link8803.8Best Learner2012-07-19
Human level control through deep reinforcement learning✓ Link8309.0Nature DQN2015-02-25
Evolution Strategies as a Scalable Alternative to Reinforcement Learning✓ Link7783.9ES FF (1 hour) noop2017-03-10
Dueling Network Architectures for Deep Reinforcement Learning✓ Link7687.5Prior+Duel noop2015-11-20
Noisy Networks for Exploration✓ Link7596NoisyNet-Dueling2017-06-30
Dueling Network Architectures for Deep Reinforcement Learning✓ Link7561.4Duel noop2015-11-20
Self-Imitation Learning✓ Link7559.5A2C + SIL2018-06-14
Massively Parallel Methods for Deep Reinforcement Learning✓ Link6296.9Gorila2015-07-15
Deep Reinforcement Learning with Double Q-learning✓ Link5570.2Prior+Duel hs2015-09-22
Dueling Network Architectures for Deep Reinforcement Learning✓ Link5409.4DDQN (tuned) noop2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning✓ Link4881.0Duel hs2015-11-20
Deep Reinforcement Learning with Double Q-learning✓ Link4657.7DQN noop2015-09-22
[]()4647.0SARSA
Deep Exploration via Bootstrapped DQN✓ Link4553.5Bootstrapped DQN2016-02-15
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link4539.55Persistent AL2015-12-15
Prioritized Experience Replay✓ Link4463.2Prior noop2015-11-18
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link4225.18Advantage Learning2015-12-15
Deep Reinforcement Learning with Double Q-learning✓ Link3973.9DQN hs2015-09-22
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity✓ Link3899.8ASL DDQN2023-05-07
Deep Reinforcement Learning with Double Q-learning✓ Link3853.5DDQN (tuned) hs2015-09-22
Asynchronous Methods for Deep Reinforcement Learning✓ Link3755.8A3C FF hs2016-02-04
Prioritized Experience Replay✓ Link3489.1Prior hs2015-11-18
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning3422.0Reactor 500M2017-04-15
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization✓ Link3315.44POP3D2018-07-02
Asynchronous Methods for Deep Reinforcement Learning✓ Link3306.5A3C FF (1 day) hs2016-02-04
Asynchronous Methods for Deep Reinforcement Learning✓ Link1997.0A3C LSTM hs2016-02-04