atari-games-on-atari-2600-time-pilot

Video GamesAtari Games

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	Score	ModelName	ReleaseDate
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model	✓ Link	476763.90	MuZero	2019-11-19
Generalized Data Distribution Iteration		450810	GDI-H3	2022-06-07
Recurrent Experience Replay in Distributed Reinforcement Learning	✓ Link	445377.3	R2D2	2019-05-01
Online and Offline Reinforcement Learning by Planning with a Learned Model	✓ Link	424011.16	MuZero (Res2 Adam)	2021-04-13
Agent57: Outperforming the Atari Human Benchmark	✓ Link	405425.31	Agent57	2020-03-30
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning		216770	GDI-I3	2021-06-11
Generalized Data Distribution Iteration		216770	GDI-I3	2022-06-07
Distributed Prioritized Experience Replay	✓ Link	87085	Ape-X	2018-03-02
The Arcade Learning Environment: An Evaluation Platform for General Agents	✓ Link	63854.5	UCT	2012-07-19
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures	✓ Link	48481.50	IMPALA (deep)	2018-02-05
Mastering Atari with Discrete World Models	✓ Link	37945	DreamerV2	2020-10-05
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	27202.0	A3C LSTM hs	2016-02-04
Adaptive Rational Activations to Boost Deep Reinforcement Learning	✓ Link	17632	Rational DQN Average	2021-02-18
Noisy Networks for Exploration	✓ Link	17301	NoisyNet-Dueling	2017-06-30
Adaptive Rational Activations to Boost Deep Reinforcement Learning	✓ Link	13261	Recurrent Rational DQN Average	2021-02-18
DNA: Proximal Policy Optimization with a Dual Network Architecture	✓ Link	12774	DNA	2022-06-20
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	12679.0	A3C FF hs	2016-02-04
Implicit Quantile Networks for Distributional Reinforcement Learning	✓ Link	12236	IQN	2018-06-14
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity	✓ Link	12071	ASL DDQN	2023-05-07
Evolving simple programs for playing Atari games	✓ Link	12040	CGP	2018-06-14
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	11666.0	Duel noop	2015-11-20
Self-Imitation Learning	✓ Link	10811.7	A2C + SIL	2018-06-14
Distributional Reinforcement Learning with Quantile Regression	✓ Link	10345	QR-DQN-1	2017-10-27
Prioritized Experience Replay	✓ Link	9197.0	Prior noop	2015-11-18
Deep Exploration via Bootstrapped DQN	✓ Link	9079.4	Bootstrapped DQN	2016-02-15
Increasing the Action Gap: New Operators for Reinforcement Learning	✓ Link	8969.12	Advantage Learning	2015-12-15
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	8339.0	DDQN (tuned) noop	2015-11-20
A Distributional Perspective on Reinforcement Learning	✓ Link	8329.0	C51 noop	2017-07-21
Massively Parallel Methods for Deep Reinforcement Learning	✓ Link	8267.8	Gorila	2015-07-15
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	7553.0	Prior+Duel noop	2015-11-20
Deep Reinforcement Learning with Double Q-learning	✓ Link	6608.0	DDQN (tuned) hs	2015-09-22
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	6601.0	Duel hs	2015-11-20
Prioritized Experience Replay	✓ Link	5963.0	Prior hs	2015-11-18
Human level control through deep reinforcement learning	✓ Link	5947.0	Nature DQN	2015-02-25
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	5825.0	A3C FF (1 day) hs	2016-02-04
Evolution Strategies as a Scalable Alternative to Reinforcement Learning	✓ Link	4970.0	ES FF (1 hour) noop	2017-03-10
Deep Reinforcement Learning with Double Q-learning	✓ Link	4871.0	Prior+Duel hs	2015-09-22
Deep Reinforcement Learning with Double Q-learning	✓ Link	4870.0	DQN noop	2015-09-22
Learning values across many orders of magnitude		4870.0	DDQN+Pop-Art noop	2016-02-24
Deep Reinforcement Learning with Double Q-learning	✓ Link	4786.0	DQN hs	2015-09-22
Playing Atari with Six Neurons	✓ Link	4600	IDVQ + DRSC + XNES	2018-06-04
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization	✓ Link	3770.33	POP3D	2018-07-02
The Arcade Learning Environment: An Evaluation Platform for General Agents	✓ Link	3741.2	Best Learner	2012-07-19
[]()		24.9	SARSA

OpenCodePapers

atari-games-on-atari-2600-time-pilot