OpenCodePapers

atari-games-on-atari-2600-james-bond

Video GamesAtari Games
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeScoreMedium Human-Normalized ScoreModelNameReleaseDate
Generalized Data Distribution Iteration620780GDI-H32022-06-07
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning594500GDI-I32021-06-11
Generalized Data Distribution Iteration594500GDI-I32022-06-07
Agent57: Outperforming the Atari Human Benchmark✓ Link135784.96Agent572020-03-30
Fully Parameterized Quantile Function for Distributional Reinforcement Learning✓ Link87291.7FQF2019-11-05
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model✓ Link41063.25MuZero2019-11-19
Mastering Atari with Discrete World Models✓ Link40445DreamerV22020-10-05
Implicit Quantile Networks for Distributional Reinforcement Learning✓ Link35108IQN2018-06-14
Online and Offline Reinforcement Learning by Planning with a Learned Model✓ Link28626.23MuZero (Res2 Adam)2021-04-13
Recurrent Experience Replay in Distributed Reinforcement Learning✓ Link25354.0R2D22019-05-01
Distributed Prioritized Experience Replay✓ Link21322.5Ape-X2018-03-02
DNA: Proximal Policy Optimization with a Dual Network Architecture✓ Link14102DNA2022-06-20
Evolving simple programs for playing Atari games✓ Link6130CGP2018-06-14
Prioritized Experience Replay✓ Link5148.0Prior noop2015-11-18
Distributional Reinforcement Learning with Quantile Regression✓ Link4703QR-DQN-12017-10-27
Prioritized Experience Replay✓ Link3961.0Prior hs2015-11-18
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity✓ Link2237ASL DDQN2023-05-07
A Distributional Perspective on Reinforcement Learning✓ Link1909.0C51 noop2017-07-21
Deep Exploration via Bootstrapped DQN✓ Link1663.5Bootstrapped DQN2016-02-15
Dueling Network Architectures for Deep Reinforcement Learning✓ Link1358.0DDQN (tuned) noop2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning✓ Link1312.5Duel noop2015-11-20
Adaptive Rational Activations to Boost Deep Reinforcement Learning✓ Link1137Recurrent Rational DQN Average2021-02-18
Adaptive Rational Activations to Boost Deep Reinforcement Learning✓ Link1122Rational DQN Average2021-02-18
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link848.46Advantage Learning2015-12-15
Dueling Network Architectures for Deep Reinforcement Learning✓ Link835.5Duel hs2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning✓ Link812.0Prior+Duel noop2015-11-20
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link772.09Persistent AL2015-12-15
Deep Reinforcement Learning with Double Q-learning✓ Link768.5DQN noop2015-09-22
Deep Reinforcement Learning with Double Q-learning✓ Link697.5DQN hs2015-09-22
Asynchronous Methods for Deep Reinforcement Learning✓ Link613.0A3C LSTM hs2016-02-04
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures✓ Link601.50IMPALA (deep)2018-02-05
Deep Reinforcement Learning with Double Q-learning✓ Link585.0Prior+Duel hs2015-09-22
Human level control through deep reinforcement learning✓ Link576.7Nature DQN2015-02-25
Deep Reinforcement Learning with Double Q-learning✓ Link573.0DDQN (tuned) hs2015-09-22
Asynchronous Methods for Deep Reinforcement Learning✓ Link541.0A3C FF hs2016-02-04
Learning values across many orders of magnitude507.5DDQN+Pop-Art noop2016-02-24
Massively Parallel Methods for Deep Reinforcement Learning✓ Link444.0Gorila2015-07-15
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization✓ Link358.54POP3D2018-07-02
[]()354.1SARSA
Asynchronous Methods for Deep Reinforcement Learning✓ Link351.5A3C FF (1 day) hs2016-02-04
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link330UCT2012-07-19
Self-Imitation Learning✓ Link310.8A2C + SIL2018-06-14
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link202.8Best Learner2012-07-19
Soft Actor-Critic for Discrete Action Settings✓ Link68.3SAC2019-10-16
CURL: Contrastive Unsupervised Representations for Reinforcement Learning✓ Link400.1CURL2020-04-08