atari-games-on-atari-2600-pong

Video GamesAtari Games

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	Score	ModelName	ReleaseDate
The Arcade Learning Environment: An Evaluation Platform for General Agents	✓ Link	21	UCT	2012-07-19
Playing Atari with Deep Reinforcement Learning	✓ Link	21	DQN Best	2013-12-19
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	21.0	Duel noop	2015-11-20
Evolution Strategies as a Scalable Alternative to Reinforcement Learning	✓ Link	21.0	ES FF (1 hour) noop	2017-03-10
Noisy Networks for Exploration	✓ Link	21	NoisyNet-Dueling	2017-06-30
Distributional Reinforcement Learning with Quantile Regression	✓ Link	21	QR-DQN-1	2017-10-27
Implicit Quantile Networks for Distributional Reinforcement Learning	✓ Link	21	IQN	2018-06-14
Recurrent Experience Replay in Distributed Reinforcement Learning	✓ Link	21.0	R2D2	2019-05-01
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model	✓ Link	21.00	MuZero	2019-11-19
Generalized Data Distribution Iteration		21.0	GDI-H3(200M frames)	2022-06-07
Generalized Data Distribution Iteration		21.0	GDI-I3(200M frames)	2022-06-07
Generalized Data Distribution Iteration		21	GDI-I3	2022-06-07
Generalized Data Distribution Iteration		21	GDI-H3	2022-06-07
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity	✓ Link	21	ASL DDQN	2023-05-07
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures	✓ Link	20.98	IMPALA (deep)	2018-02-05
Online and Offline Reinforcement Learning by Planning with a Learned Model	✓ Link	20.95	MuZero (Res2 Adam)	2021-04-13
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	20.9	DDQN (tuned) noop	2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	20.9	Prior+Duel noop	2015-11-20
Deep Exploration via Bootstrapped DQN	✓ Link	20.9	Bootstrapped DQN	2016-02-15
A Distributional Perspective on Reinforcement Learning	✓ Link	20.9	C51 noop	2017-07-21
Distributed Prioritized Experience Replay	✓ Link	20.9	Ape-X	2018-03-02
Self-Imitation Learning	✓ Link	20.9	A2C + SIL	2018-06-14
Agent57: Outperforming the Atari Human Benchmark	✓ Link	20.67	Agent57	2020-03-30
Prioritized Experience Replay	✓ Link	20.6	Prior noop	2015-11-18
Learning values across many orders of magnitude		20.6	DDQN+Pop-Art noop	2016-02-24
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization	✓ Link	20.5	POP3D	2018-07-02
Smaller World Models for Reinforcement Learning		20.2	Discrete Latent Space World Model (VQ-VAE)	2020-10-12
Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes	✓ Link	20	DDRL A3C	2018-01-09
Evolving simple programs for playing Atari games	✓ Link	20	CGP	2018-06-14
Mastering Atari with Discrete World Models	✓ Link	20	DreamerV2	2020-10-05
Increasing the Action Gap: New Operators for Reinforcement Learning	✓ Link	19.76	Persistent AL	2015-12-15
DNA: Proximal Policy Optimization with a Dual Network Architecture	✓ Link	19.7	DNA	2022-06-20
Increasing the Action Gap: New Operators for Reinforcement Learning	✓ Link	19.66	Advantage Learning	2015-12-15
Deep Reinforcement Learning with Double Q-learning	✓ Link	19.5	DQN noop	2015-09-22
Deep Reinforcement Learning with Double Q-learning	✓ Link	19.1	DDQN (tuned) hs	2015-09-22
Human level control through deep reinforcement learning	✓ Link	18.9	Nature DQN	2015-02-25
Prioritized Experience Replay	✓ Link	18.9	Prior hs	2015-11-18
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	18.8	Duel hs	2015-11-20
Deep Reinforcement Learning with Double Q-learning	✓ Link	18.4	Prior+Duel hs	2015-09-22
Adaptive Rational Activations to Boost Deep Reinforcement Learning	✓ Link	18.13	Recurrent Rational DQN Average	2021-02-18
Adaptive Rational Activations to Boost Deep Reinforcement Learning	✓ Link	18.04	Rational DQN Average	2021-02-18
Deep Reinforcement Learning with Double Q-learning	✓ Link	18.0	DQN hs	2015-09-22
Decision Transformer: Reinforcement Learning via Sequence Modeling	✓ Link	17.1	DT	2021-06-02
Massively Parallel Methods for Deep Reinforcement Learning	✓ Link	16.7	Gorila	2015-07-15
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	11.4	A3C FF (1 day) hs	2016-02-04
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	10.7	A3C LSTM hs	2016-02-04
Mean Actor Critic	✓ Link	10.6	MAC	2017-09-01
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	5.6	A3C FF hs	2016-02-04
CURL: Contrastive Unsupervised Representations for Reinforcement Learning	✓ Link	2.1	CURL	2020-04-08
[]()		-17.4	SARSA
The Arcade Learning Environment: An Evaluation Platform for General Agents	✓ Link	-19	Best Learner	2012-07-19
Soft Actor-Critic for Discrete Action Settings	✓ Link	-20.98	SAC	2019-10-16

OpenCodePapers

atari-games-on-atari-2600-pong