OpenCodePapers

atari-games-on-atari-2600-montezumas-revenge

Video GamesAtari Games
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeScoreModelNameReleaseDate
First return, then explore✓ Link43791Go-Explore2020-04-27
Go-Explore: a New Approach for Hard-Exploration Problems✓ Link43763Go-Explore2019-01-30
Self-supervised network distillation: an effective approach to exploration in sparse reward environments✓ Link21565SND-V2023-02-22
Agent57: Outperforming the Atari Human Benchmark✓ Link9352.01Agent572020-03-30
Exploration by Random Network Distillation✓ Link8152RND2018-10-30
Self-supervised network distillation: an effective approach to exploration in sparse reward environments✓ Link7838SND-VIC2023-02-22
Self-supervised network distillation: an effective approach to exploration in sparse reward environments✓ Link7212SND-STD2023-02-22
Contingency-Aware Exploration in Reinforcement Learning6635A2C+CoEX2018-11-05
Count-Based Exploration with Neural Density Models✓ Link3705.5DQN-PixelCNN2017-03-03
Unifying Count-Based Exploration and Intrinsic Motivation✓ Link3459DDQN-PC2016-06-06
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning3000GDI-I32021-06-11
Generalized Data Distribution Iteration3000GDI-I32022-06-07
Count-Based Exploration in Feature Space for Reinforcement Learning✓ Link2745.4Sarsa-φ-EB2017-06-25
Large-Scale Study of Curiosity-Driven Learning✓ Link2504.6Intrinsic Reward Agent2018-08-13
Distributed Prioritized Experience Replay✓ Link2500.0Ape-X2018-03-02
Online and Offline Reinforcement Learning by Planning with a Learned Model✓ Link2500MuZero (Res2 Adam)2021-04-13
Generalized Data Distribution Iteration2500GDI-H32022-06-07
Recurrent Experience Replay in Distributed Reinforcement Learning✓ Link2061.3R2D22019-05-01
Count-Based Exploration with the Successor Representation✓ Link1778.8DQN+SR2018-07-31
Count-Based Exploration with the Successor Representation✓ Link1778.6DQNMMCe+SR2018-07-31
Self-Imitation Learning✓ Link1100A2C + SIL2018-06-14
Count-Based Exploration in Feature Space for Reinforcement Learning✓ Link399.5Sarsa-ε2017-06-25
Unifying Count-Based Exploration and Intrinsic Motivation✓ Link273.7A3C-CTS2016-06-06
[]()259SARSA
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models✓ Link142MP-EB2015-07-03
Deep Exploration via Bootstrapped DQN✓ Link100Bootstrapped DQN2016-02-15
Massively Parallel Methods for Deep Reinforcement Learning✓ Link84Gorila2015-07-15
Mastering Atari with Discrete World Models✓ Link81DreamerV22020-10-05
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning✓ Link75TRPO-hash2016-11-15
Asynchronous Methods for Deep Reinforcement Learning✓ Link67A3C FF hs2016-02-04
Noisy Networks for Exploration✓ Link57NoisyNet-Dueling2017-06-30
Asynchronous Methods for Deep Reinforcement Learning✓ Link53A3C FF (1 day) hs2016-02-04
Prioritized Experience Replay✓ Link51Prior hs2015-11-18
Deep Reinforcement Learning with Double Q-learning✓ Link47.0DQN hs2015-09-22
Deep Reinforcement Learning with Double Q-learning✓ Link42.0DDQN (tuned) hs2015-09-22
Asynchronous Methods for Deep Reinforcement Learning✓ Link41A3C LSTM hs2016-02-04
Deep Reinforcement Learning with Double Q-learning✓ Link24.0Prior+Duel hs2015-09-22
Dueling Network Architectures for Deep Reinforcement Learning✓ Link22.0Duel hs2015-11-20
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link10.7Best Learner2012-07-19
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link1.72Persistent AL2015-12-15
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link0.42Advantage Learning2015-12-15
Implicit Quantile Networks for Distributional Reinforcement Learning✓ Link0IQN2018-06-14
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model✓ Link0.00MuZero2019-11-19
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures✓ Link0.00IMPALA (deep)2018-02-05
Evolving simple programs for playing Atari games✓ Link0CGP2018-06-14
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization✓ Link0POP3D2018-07-02
Distributional Reinforcement Learning with Quantile Regression✓ Link0QR-DQN-12017-10-27
DNA: Proximal Policy Optimization with a Dual Network Architecture✓ Link0DNA2022-06-20
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity✓ Link0ASL DDQN2023-05-07
Human level control through deep reinforcement learning✓ Link0Nature DQN2015-02-25