OpenCodePapers

atari-games-on-atari-2600-battle-zone

Video GamesAtari Games
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeScoreModelNameReleaseDate
Agent57: Outperforming the Atari Human Benchmark✓ Link934134.88Agent572020-03-30
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model✓ Link848623.00MuZero2019-11-19
Generalized Data Distribution Iteration824360GDI-H32022-06-07
Recurrent Experience Replay in Distributed Reinforcement Learning✓ Link751880.0R2D22019-05-01
Generalized Data Distribution Iteration478830GDI-I32022-06-07
Online and Offline Reinforcement Learning by Planning with a Learned Model✓ Link178716.9MuZero (Res2 Adam)2021-04-13
Distributed Prioritized Experience Replay✓ Link98895Ape-X2018-03-02
Fully Parameterized Quantile Function for Distributional Reinforcement Learning✓ Link87928.6FQF2019-11-05
DNA: Proximal Policy Optimization with a Dual Network Architecture✓ Link71003DNA2022-06-20
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link70333.3UCT2012-07-19
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning64070.0Reactor 500M2017-04-15
Noisy Networks for Exploration✓ Link52262NoisyNet-Dueling2017-06-30
Implicit Quantile Networks for Distributional Reinforcement Learning✓ Link42244IQN2018-06-14
Mastering Atari with Discrete World Models✓ Link40325DreamerV22020-10-05
Distributional Reinforcement Learning with Quantile Regression✓ Link39268QR-DQN-12017-10-27
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity✓ Link38986ASL DDQN2023-05-07
Deep Exploration via Bootstrapped DQN✓ Link38666.7Bootstrapped DQN2016-02-15
Dueling Network Architectures for Deep Reinforcement Learning✓ Link37150.0Duel noop2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning✓ Link35520.0Prior+Duel noop2015-11-20
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link34583.07Persistent AL2015-12-15
Evolving simple programs for playing Atari games✓ Link34200CGP2018-06-14
Dueling Network Architectures for Deep Reinforcement Learning✓ Link31700.0DDQN (tuned) noop2015-11-20
Prioritized Experience Replay✓ Link31530.0Prior noop2015-11-18
Dueling Network Architectures for Deep Reinforcement Learning✓ Link31320.0Duel hs2015-11-20
Deep Reinforcement Learning with Double Q-learning✓ Link30650.0Prior+Duel hs2015-09-22
Deep Reinforcement Learning with Double Q-learning✓ Link29900.0DQN noop2015-09-22
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link28789.29Advantage Learning2015-12-15
A Distributional Perspective on Reinforcement Learning✓ Link28742.0C51 noop2017-07-21
Human level control through deep reinforcement learning✓ Link26300.0Nature DQN2015-02-25
Adaptive Rational Activations to Boost Deep Reinforcement Learning✓ Link25749Recurrent Rational DQN Average2021-02-18
Prioritized Experience Replay✓ Link25520.0Prior hs2015-11-18
Self-Imitation Learning✓ Link25075A2C + SIL2018-06-14
Deep Reinforcement Learning with Double Q-learning✓ Link24740.0DDQN (tuned) hs2015-09-22
Deep Reinforcement Learning with Double Q-learning✓ Link23750.0DQN hs2015-09-22
Adaptive Rational Activations to Boost Deep Reinforcement Learning✓ Link23403Rational DQN Average2021-02-18
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures✓ Link20885.00IMPALA (deep)2018-02-05
Asynchronous Methods for Deep Reinforcement Learning✓ Link20760.0A3C LSTM hs2016-02-04
Massively Parallel Methods for Deep Reinforcement Learning✓ Link19938.0Gorila2015-07-15
Evolution Strategies as a Scalable Alternative to Reinforcement Learning✓ Link16600.0ES FF (1 hour) noop2017-03-10
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link15819.7Best Learner2012-07-19
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization✓ Link15466.67POP3D2018-07-02
Asynchronous Methods for Deep Reinforcement Learning✓ Link12950.0A3C FF hs2016-02-04
Asynchronous Methods for Deep Reinforcement Learning✓ Link11340.0A3C FF (1 day) hs2016-02-04
CURL: Contrastive Unsupervised Representations for Reinforcement Learning✓ Link11208CURL2020-04-08
Learning values across many orders of magnitude8220.0DDQN+Pop-Art noop2016-02-24
Soft Actor-Critic for Discrete Action Settings✓ Link4386.7SAC2019-10-16
[]()16.2SARSA