OpenCodePapers

atari-games-on-atari-2600-private-eye

Video GamesAtari Games
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeScoreModelNameReleaseDate
First return, then explore✓ Link95756Go-Explore2020-04-27
Agent57: Outperforming the Atari Human Benchmark✓ Link79716.46Agent572020-03-30
Self-supervised network distillation: an effective approach to exploration in sparse reward environments✓ Link17313SND-VIC2023-02-22
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model✓ Link15299.98MuZero2019-11-19
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning15100GDI-I32021-06-11
Generalized Data Distribution Iteration15100GDI-I32022-06-07
Generalized Data Distribution Iteration15100GDI-H32022-06-07
A Distributional Perspective on Reinforcement Learning✓ Link15095.0C51 noop2017-07-21
Self-supervised network distillation: an effective approach to exploration in sparse reward environments✓ Link15089SND-STD2023-02-22
Evolving simple programs for playing Atari games✓ Link12702.2CGP2018-06-14
Exploration by Random Network Distillation✓ Link8666RND2018-10-30
Count-Based Exploration with Neural Density Models✓ Link8358.7DQN-PixelCNN2017-03-03
Recurrent Experience Replay in Distributed Reinforcement Learning✓ Link5322.7R2D22019-05-01
Increasing the Action Gap: New Operators for Reinforcement Learning✓ Link5276.16Advantage Learning2015-12-15
Self-supervised network distillation: an effective approach to exploration in sparse reward environments✓ Link4213SND-V2023-02-22
Large-Scale Study of Curiosity-Driven Learning✓ Link3036.5Intrinsic Reward Agent2018-08-13
Massively Parallel Methods for Deep Reinforcement Learning✓ Link2598.6Gorila2015-07-15
Mastering Atari with Discrete World Models✓ Link2198DreamerV22020-10-05
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link1947.3Best Baseline2012-07-19
Deep Exploration via Bootstrapped DQN✓ Link1812.5Bootstrapped DQN2016-02-15
Human level control through deep reinforcement learning✓ Link1788.0Nature DQN2015-02-25
Deep Reinforcement Learning with Double Q-learning✓ Link1277.6Prior+Duel hs2015-09-22
The Arcade Learning Environment: An Evaluation Platform for General Agents✓ Link684.3Best Learner2012-07-19
Prioritized Experience Replay✓ Link670.7Prior hs2015-11-18
Self-Imitation Learning✓ Link661.2A2C + SIL2018-06-14
Asynchronous Methods for Deep Reinforcement Learning✓ Link421.1A3C LSTM hs2016-02-04
Distributional Reinforcement Learning with Quantile Regression✓ Link350QR-DQN-12017-10-27
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity✓ Link349.7ASL DDQN2023-05-07
Dueling Network Architectures for Deep Reinforcement Learning✓ Link292.6Duel hs2015-11-20
Learning values across many orders of magnitude286.7DDQN+Pop-Art noop2016-02-24
Noisy Networks for Exploration✓ Link279NoisyNet-Dueling2017-06-30
Deep Reinforcement Learning with Double Q-learning✓ Link207.9DQN hs2015-09-22
Asynchronous Methods for Deep Reinforcement Learning✓ Link206.9A3C FF hs2016-02-04
Dueling Network Architectures for Deep Reinforcement Learning✓ Link206.0Prior+Duel noop2015-11-20
Count-Based Exploration with Neural Density Models✓ Link206.0DQN-CTS2017-03-03
Prioritized Experience Replay✓ Link200.0Prior noop2015-11-18
Implicit Quantile Networks for Distributional Reinforcement Learning✓ Link200IQN2018-06-14
Asynchronous Methods for Deep Reinforcement Learning✓ Link194.4A3C FF (1 day) hs2016-02-04
Deep Reinforcement Learning with Double Q-learning✓ Link146.7DQN noop2015-09-22
Dueling Network Architectures for Deep Reinforcement Learning✓ Link129.7DDQN (tuned) noop2015-11-20
CURL: Contrastive Unsupervised Representations for Reinforcement Learning✓ Link105.2CURL2020-04-08
Dueling Network Architectures for Deep Reinforcement Learning✓ Link103.0Duel noop2015-11-20
Evolution Strategies as a Scalable Alternative to Reinforcement Learning✓ Link100.0ES FF (1 hour) noop2017-03-10
Online and Offline Reinforcement Learning by Planning with a Learned Model✓ Link100MuZero (Res2 Adam)2021-04-13
DNA: Proximal Policy Optimization with a Dual Network Architecture✓ Link100DNA2022-06-20
Unifying Count-Based Exploration and Intrinsic Motivation✓ Link99.32A3C-CTS2016-06-06
Count-Based Exploration with the Successor Representation✓ Link99.1DQNMMCe+SR2018-07-31
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures✓ Link98.50IMPALA (deep)2018-02-05
[]()86.0SARSA
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization✓ Link79.67POP3D2018-07-02
Distributed Prioritized Experience Replay✓ Link49.8Ape-X2018-03-02
Deep Reinforcement Learning with Double Q-learning✓ Link-575.5DDQN (tuned) hs2015-09-22