OpenCodePapers

atari-games-on-atari-2600-double-dunk

Leaderboard
| Paper | Code | Score | Model Name | Release Date |
| --- | --- | --- | --- | --- |
| The Arcade Learning Environment: An Evaluation Platform for General Agents | ✓ | 24 | UCT | 2012-07-19 |
| GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning | | 24 | GDI-H3 | 2021-06-11 |
| Generalized Data Distribution Iteration | | 24 | GDI-I3 | 2022-06-07 |
| Generalized Data Distribution Iteration | | 24 | GDI-H3 | 2022-06-07 |
| Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model | ✓ | 23.94 | MuZero | 2019-11-19 |
| Agent57: Outperforming the Atari Human Benchmark | ✓ | 23.93 | Agent57 | 2020-03-30 |
| Online and Offline Reinforcement Learning by Planning with a Learned Model | ✓ | 23.91 | MuZero (Res2 Adam) | 2021-04-13 |
| Recurrent Experience Replay in Distributed Reinforcement Learning | ✓ | 23.7 | R2D2 | 2019-05-01 |
| Distributed Prioritized Experience Replay | ✓ | 23.5 | Ape-X | 2018-03-02 |
| The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning | | 23.0 | Reactor 500M | 2017-04-15 |
| Distributional Reinforcement Learning with Quantile Regression | ✓ | 21.9 | QR-DQN-1 | 2017-10-27 |
| Self-Imitation Learning | ✓ | 21.5 | A2C + SIL | 2018-06-14 |
| Prioritized Experience Replay | ✓ | 18.5 | Prior noop | 2015-11-18 |
| Mastering Atari with Discrete World Models | ✓ | 17 | DreamerV2 | 2020-10-05 |
| Prioritized Experience Replay | ✓ | 16.0 | Prior hs | 2015-11-18 |
| Implicit Quantile Networks for Distributional Reinforcement Learning | ✓ | 5.6 | IQN | 2018-06-14 |
| Deep Exploration via Bootstrapped DQN | ✓ | 3 | Bootstrapped DQN | 2016-02-15 |
| A Distributional Perspective on Reinforcement Learning | ✓ | 2.5 | C51 noop | 2017-07-21 |
| Evolving simple programs for playing Atari games | ✓ | 2 | CGP | 2018-06-14 |
| Noisy Networks for Exploration | ✓ | 1 | NoisyNet-Dueling | 2017-06-30 |
| Evolution Strategies as a Scalable Alternative to Reinforcement Learning | ✓ | 0.2 | ES FF (1 hour) noop | 2017-03-10 |
| Dueling Network Architectures for Deep Reinforcement Learning | ✓ | 0.1 | Duel noop | 2015-11-20 |
| Asynchronous Methods for Deep Reinforcement Learning | ✓ | 0.1 | A3C FF (1 day) hs | 2016-02-04 |
| Asynchronous Methods for Deep Reinforcement Learning | ✓ | 0.1 | A3C LSTM hs | 2016-02-04 |
| Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity | ✓ | 0.1 | ASL DDQN | 2023-05-07 |
| | | -16.0 | SARSA | |
| Human level control through deep reinforcement learning | ✓ | -18.1 | Nature DQN | 2015-02-25 |
| Massively Parallel Methods for Deep Reinforcement Learning | ✓ | -11.3 | Gorila | 2015-07-15 |
| Deep Reinforcement Learning with Double Q-learning | ✓ | -6.0 | DQN hs | 2015-09-22 |
| Deep Reinforcement Learning with Double Q-learning | ✓ | -6.6 | DQN noop | 2015-09-22 |
| Dueling Network Architectures for Deep Reinforcement Learning | ✓ | -0.8 | Duel hs | 2015-11-20 |
| Dueling Network Architectures for Deep Reinforcement Learning | ✓ | -5.5 | DDQN (tuned) noop | 2015-11-20 |
| Deep Reinforcement Learning with Double Q-learning | ✓ | -0.3 | DDQN (tuned) hs | 2015-09-22 |
| Deep Reinforcement Learning with Double Q-learning | ✓ | -10.7 | Prior+Duel hs | 2015-09-22 |
| Learning values across many orders of magnitude | | -11.5 | DDQN+Pop-Art noop | 2016-02-24 |
| Dueling Network Architectures for Deep Reinforcement Learning | ✓ | -12.5 | Prior+Duel noop | 2015-11-20 |
| Asynchronous Methods for Deep Reinforcement Learning | ✓ | -0.1 | A3C FF hs | 2016-02-04 |
| IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures | ✓ | -0.33 | IMPALA (deep) | 2018-02-05 |
| Increasing the Action Gap: New Operators for Reinforcement Learning | ✓ | -0.15 | Advantage Learning | 2015-12-15 |
| Increasing the Action Gap: New Operators for Reinforcement Learning | ✓ | -2.51 | Persistent AL | 2015-12-15 |
| Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization | ✓ | -7.89 | POP3D | 2018-07-02 |
| The Arcade Learning Environment: An Evaluation Platform for General Agents | ✓ | -13.1 | Best Learner | 2012-07-19 |
| DNA: Proximal Policy Optimization with a Dual Network Architecture | ✓ | -1.3 | DNA | 2022-06-20 |