atari-games-on-atari-2600-up-and-down

Video GamesAtari Games

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	Score	ModelName	ReleaseDate
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning		986440	GDI-I3	2021-06-11
Generalized Data Distribution Iteration		986440	GDI-I3	2022-06-07
Generalized Data Distribution Iteration		966590	GDI-H3	2022-06-07
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model	✓ Link	715545.61	MuZero	2019-11-19
Mastering Atari with Discrete World Models	✓ Link	653662	DreamerV2	2020-10-05
Online and Offline Reinforcement Learning by Planning with a Learned Model	✓ Link	634898.18	MuZero (Res2 Adam)	2021-04-13
Agent57: Outperforming the Atari Human Benchmark	✓ Link	623805.73	Agent57	2020-03-30
Recurrent Experience Replay in Distributed Reinforcement Learning	✓ Link	589226.9	R2D2	2019-05-01
Distributed Prioritized Experience Replay	✓ Link	401884.3	Ape-X	2018-03-02
Recurrent Independent Mechanisms	✓ Link	390000	RIMs-PPO	2019-09-24
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures	✓ Link	332546.75	IMPALA (deep)	2018-02-05
DNA: Proximal Policy Optimization with a Dual Network Architecture	✓ Link	291934	DNA	2022-06-20
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization	✓ Link	242701.51	POP3D	2018-07-02
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	105728.7	A3C LSTM hs	2016-02-04
Implicit Quantile Networks for Distributional Reinforcement Learning	✓ Link	88148	IQN	2018-06-14
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	74705.7	A3C FF hs	2016-02-04
The Arcade Learning Environment: An Evaluation Platform for General Agents	✓ Link	74473.6	UCT	2012-07-19
Distributional Reinforcement Learning with Quantile Regression	✓ Link	71260	QR-DQN-1	2017-10-27
Evolution Strategies as a Scalable Alternative to Reinforcement Learning	✓ Link	67974.0	ES FF (1 hour) noop	2017-03-10
Noisy Networks for Exploration	✓ Link	61326	NoisyNet-Dueling	2017-06-30
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	54525.4	A3C FF (1 day) hs	2016-02-04
Self-Imitation Learning	✓ Link	53314.6	A2C + SIL	2018-06-14
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	44939.6	Duel noop	2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	33879.1	Prior+Duel noop	2015-11-20
Deep Exploration via Bootstrapped DQN	✓ Link	26231	Bootstrapped DQN	2016-02-15
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity	✓ Link	25127.4	ASL DDQN	2023-05-07
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	24759.2	Duel hs	2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	22972.2	DDQN (tuned) noop	2015-11-20
Deep Reinforcement Learning with Double Q-learning	✓ Link	22681.3	Prior+Duel hs	2015-09-22
Learning values across many orders of magnitude		22474.4	DDQN+Pop-Art noop	2016-02-24
Deep Reinforcement Learning with Double Q-learning	✓ Link	19086.9	DDQN (tuned) hs	2015-09-22
Prioritized Experience Replay	✓ Link	16154.1	Prior noop	2015-11-18
A Distributional Perspective on Reinforcement Learning	✓ Link	15612.0	C51 noop	2017-07-21
Evolving simple programs for playing Atari games	✓ Link	14524	CGP	2018-06-14
Increasing the Action Gap: New Operators for Reinforcement Learning	✓ Link	13909.74	Advantage Learning	2015-12-15
Prioritized Experience Replay	✓ Link	12157.4	Prior hs	2015-11-18
Deep Reinforcement Learning with Double Q-learning	✓ Link	9989.9	DQN noop	2015-09-22
Massively Parallel Methods for Deep Reinforcement Learning	✓ Link	8747.7	Gorila	2015-07-15
Human level control through deep reinforcement learning	✓ Link	8456.0	Nature DQN	2015-02-25
Deep Reinforcement Learning with Double Q-learning	✓ Link	8038.5	DQN hs	2015-09-22
The Arcade Learning Environment: An Evaluation Platform for General Agents	✓ Link	3532.7	Best Learner	2012-07-19
CURL: Contrastive Unsupervised Representations for Reinforcement Learning	✓ Link	2735.2	CURL	2020-04-08
[]()		2449.0	SARSA
Soft Actor-Critic for Discrete Action Settings	✓ Link	250.7	SAC	2019-10-16

OpenCodePapers

atari-games-on-atari-2600-up-and-down