OpenCodePapers

atari-games-on-atari-2600-pitfall

Video GamesAtari Games
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeScoreModelNameReleaseDate
Go-Explore: a New Approach for Hard-Exploration Problems✓ Link102571Go-Explore2019-01-30
Agent57: Outperforming the Atari Human Benchmark✓ Link18756.01Agent572020-03-30
First return, then explore✓ Link6954Go-Explore2020-04-27
Implicit Quantile Networks for Distributional Reinforcement Learning✓ Link0IQN2018-06-14
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model✓ Link0.00MuZero2019-11-19
Recurrent Experience Replay in Distributed Reinforcement Learning✓ Link0.0R2D22019-05-01
Evolving simple programs for playing Atari games✓ Link0CGP2018-06-14
Noisy Networks for Exploration✓ Link0NoisyNet-Dueling2017-06-30
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization✓ Link0POP3D2018-07-02
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link0Advantage Learning2015-12-15
Distributional Reinforcement Learning with Quantile Regression✓ Link0QR-DQN-12017-10-27
Mastering Atari with Discrete World Models✓ Link0DreamerV22020-10-05
Online and Offline Reinforcement Learning by Planning with a Learned Model✓ Link0MuZero (Res2 Adam)2021-04-13
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning0GDI-I32021-06-11
Generalized Data Distribution Iteration0GDI-I32022-06-07
DNA: Proximal Policy Optimization with a Dual Network Architecture✓ Link0DNA2022-06-20
Self-supervised network distillation: an effective approach to exploration in sparse reward environments✓ Link0SND-V2023-02-22
Self-supervised network distillation: an effective approach to exploration in sparse reward environments✓ Link0SND-VIC2023-02-22
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity✓ Link0ASL DDQN2023-05-07
Exploration by Random Network Distillation✓ Link-3RND2018-10-30
Distributed Prioritized Experience Replay✓ Link-0.6Ape-X2018-03-02
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures✓ Link-1.66IMPALA (deep)2018-02-05
Generalized Data Distribution Iteration-4.345GDI-H32022-06-07