OpenCodePapers

atari-games-on-atari-2600-pong

Video GamesAtari Games
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeScoreModelNameReleaseDate
Dueling Network Architectures for Deep Reinforcement Learning✓ Link21.0Duel noop2015-11-20
Evolution Strategies as a Scalable Alternative to Reinforcement Learning✓ Link21.0ES FF (1 hour) noop2017-03-10
Implicit Quantile Networks for Distributional Reinforcement Learning✓ Link21IQN2018-06-14
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model✓ Link21.00MuZero2019-11-19
Recurrent Experience Replay in Distributed Reinforcement Learning✓ Link21.0R2D22019-05-01
Noisy Networks for Exploration✓ Link21NoisyNet-Dueling2017-06-30
Playing Atari with Deep Reinforcement Learning✓ Link21DQN Best2013-12-19
Distributional Reinforcement Learning with Quantile Regression✓ Link21QR-DQN-12017-10-27
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link21UCT2012-07-19
Generalized Data Distribution Iteration21.0GDI-H3(200M frames)2022-06-07
Generalized Data Distribution Iteration21.0GDI-I3(200M frames)2022-06-07
Generalized Data Distribution Iteration21GDI-I32022-06-07
Generalized Data Distribution Iteration21GDI-H32022-06-07
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity✓ Link21ASL DDQN2023-05-07
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures✓ Link20.98IMPALA (deep)2018-02-05
Online and Offline Reinforcement Learning by Planning with a Learned Model✓ Link20.95MuZero (Res2 Adam)2021-04-13
Dueling Network Architectures for Deep Reinforcement Learning✓ Link20.9DDQN (tuned) noop2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning✓ Link20.9Prior+Duel noop2015-11-20
A Distributional Perspective on Reinforcement Learning✓ Link20.9C51 noop2017-07-21
Deep Exploration via Bootstrapped DQN✓ Link20.9Bootstrapped DQN2016-02-15
Distributed Prioritized Experience Replay✓ Link20.9Ape-X2018-03-02
Self-Imitation Learning✓ Link20.9A2C + SIL2018-06-14
Agent57: Outperforming the Atari Human Benchmark✓ Link20.67Agent572020-03-30
Prioritized Experience Replay✓ Link20.6Prior noop2015-11-18
Learning values across many orders of magnitude20.6DDQN+Pop-Art noop2016-02-24
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization✓ Link20.5POP3D2018-07-02
Smaller World Models for Reinforcement Learning20.2Discrete Latent Space World Model (VQ-VAE)2020-10-12
Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes✓ Link20DDRL A3C2018-01-09
Evolving simple programs for playing Atari games✓ Link20CGP2018-06-14
Mastering Atari with Discrete World Models✓ Link20DreamerV22020-10-05
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link19.76Persistent AL2015-12-15
DNA: Proximal Policy Optimization with a Dual Network Architecture✓ Link19.7DNA2022-06-20
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link19.66Advantage Learning2015-12-15
Deep Reinforcement Learning with Double Q-learning✓ Link19.5DQN noop2015-09-22
Deep Reinforcement Learning with Double Q-learning✓ Link19.1DDQN (tuned) hs2015-09-22
Human level control through deep reinforcement learning✓ Link18.9Nature DQN2015-02-25
Prioritized Experience Replay✓ Link18.9Prior hs2015-11-18
Dueling Network Architectures for Deep Reinforcement Learning✓ Link18.8Duel hs2015-11-20
Deep Reinforcement Learning with Double Q-learning✓ Link18.4Prior+Duel hs2015-09-22
Adaptive Rational Activations to Boost Deep Reinforcement Learning✓ Link18.13Recurrent Rational DQN Average2021-02-18
Adaptive Rational Activations to Boost Deep Reinforcement Learning✓ Link18.04Rational DQN Average2021-02-18
Deep Reinforcement Learning with Double Q-learning✓ Link18.0DQN hs2015-09-22
Decision Transformer: Reinforcement Learning via Sequence Modeling✓ Link17.1DT2021-06-02
Massively Parallel Methods for Deep Reinforcement Learning✓ Link16.7Gorila2015-07-15
Asynchronous Methods for Deep Reinforcement Learning✓ Link11.4A3C FF (1 day) hs2016-02-04
Asynchronous Methods for Deep Reinforcement Learning✓ Link10.7A3C LSTM hs2016-02-04
Mean Actor Critic✓ Link10.6MAC2017-09-01
Asynchronous Methods for Deep Reinforcement Learning✓ Link5.6A3C FF hs2016-02-04
CURL: Contrastive Unsupervised Representations for Reinforcement Learning✓ Link2.1CURL2020-04-08
[]()-17.4SARSA
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link-19Best Learner2012-07-19
Soft Actor-Critic for Discrete Action Settings✓ Link-20.98SAC2019-10-16