OpenCodePapers

atari-games-on-atari-2600-kung-fu-master

Video GamesAtari Games
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeScoreModelNameReleaseDate
Generalized Data Distribution Iteration1666665GDI-H32022-06-07
GDI: Rethinking What Makes Reinforcement Learning Different from Supervised Learning1666000GDI-H3 (200M)2021-11-24
Recurrent Experience Replay in Distributed Reinforcement Learning✓ Link233413.3R2D22019-05-01
Agent57: Outperforming the Atari Human Benchmark✓ Link206845.82Agent572020-03-30
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model✓ Link204824.00MuZero2019-11-19
Generalized Data Distribution Iteration140440GDI-I32022-06-07
Online and Offline Reinforcement Learning by Planning with a Learned Model✓ Link116726.96MuZero (Res2 Adam)2021-04-13
Fully Parameterized Quantile Function for Distributional Reinforcement Learning✓ Link111138.5FQF2019-11-05
DNA: Proximal Policy Optimization with a Dual Network Architecture✓ Link110962DNA2022-06-20
Distributed Prioritized Experience Replay✓ Link97829.5Ape-X2018-03-02
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity✓ Link85182ASL DDQN2023-05-07
Distributional Reinforcement Learning with Quantile Regression✓ Link76642QR-DQN-12017-10-27
Implicit Quantile Networks for Distributional Reinforcement Learning✓ Link73512IQN2018-06-14
Mastering Atari with Discrete World Models✓ Link62741DreamerV22020-10-05
Evolving simple programs for playing Atari games✓ Link57400CGP2018-06-14
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link48854.5UCT2012-07-19
Dueling Network Architectures for Deep Reinforcement Learning✓ Link48375.0Prior+Duel noop2015-11-20
A Distributional Perspective on Reinforcement Learning✓ Link48192.0C51 noop2017-07-21
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures✓ Link43375.50IMPALA (deep)2018-02-05
Noisy Networks for Exploration✓ Link41672NoisyNet-Dueling2017-06-30
Asynchronous Methods for Deep Reinforcement Learning✓ Link40835.0A3C LSTM hs2016-02-04
Prioritized Experience Replay✓ Link39581.0Prior noop2015-11-18
Deep Reinforcement Learning with Double Q-learning✓ Link37484.0Prior+Duel hs2015-09-22
Deep Exploration via Bootstrapped DQN✓ Link36733.3Bootstrapped DQN2016-02-15
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link34650.91Persistent AL2015-12-15
Self-Imitation Learning✓ Link34449.2A2C + SIL2018-06-14
Learning values across many orders of magnitude34393.0DDQN+Pop-Art noop2016-02-24
Dueling Network Architectures for Deep Reinforcement Learning✓ Link34294.0Duel noop2015-11-20
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization✓ Link33728POP3D2018-07-02
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link32182.99Advantage Learning2015-12-15
Prioritized Experience Replay✓ Link31676.0Prior hs2015-11-18
Deep Reinforcement Learning with Double Q-learning✓ Link30207.0DDQN (tuned) hs2015-09-22
Dueling Network Architectures for Deep Reinforcement Learning✓ Link29710.0DDQN (tuned) noop2015-11-20
[]()29151.0SARSA
Asynchronous Methods for Deep Reinforcement Learning✓ Link28819.0A3C FF hs2016-02-04
Deep Reinforcement Learning with Double Q-learning✓ Link26059.0DQN noop2015-09-22
Dueling Network Architectures for Deep Reinforcement Learning✓ Link24288.0Duel hs2015-11-20
Human level control through deep reinforcement learning✓ Link23270.0Nature DQN2015-02-25
Deep Reinforcement Learning with Double Q-learning✓ Link20882.0DQN hs2015-09-22
Massively Parallel Methods for Deep Reinforcement Learning✓ Link20620.0Gorila2015-07-15
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link19544Best Learner2012-07-19
CURL: Contrastive Unsupervised Representations for Reinforcement Learning✓ Link14280CURL2020-04-08
Asynchronous Methods for Deep Reinforcement Learning✓ Link3046.0A3C FF (1 day) hs2016-02-04