atari-games-on-atari-2600-kangaroo

Video GamesAtari Games

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	Score	ModelName	ReleaseDate
Agent57: Outperforming the Atari Human Benchmark	✓ Link	24034.16	Agent57	2020-03-30
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model	✓ Link	16763.60	MuZero	2019-11-19
Prioritized Experience Replay	✓ Link	16200.0	Prior noop	2015-11-18
Implicit Quantile Networks for Distributional Reinforcement Learning	✓ Link	15487	IQN	2018-06-14
Distributional Reinforcement Learning with Quantile Regression	✓ Link	15356	QR-DQN-1	2017-10-27
Noisy Networks for Exploration	✓ Link	15227	NoisyNet-Dueling	2017-06-30
Deep Exploration via Bootstrapped DQN	✓ Link	14862.5	Bootstrapped DQN	2016-02-15
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	14854.0	Duel noop	2015-11-20
Generalized Data Distribution Iteration		14636	GDI-H3	2022-06-07
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning		14500	GDI-I3	2021-06-11
Generalized Data Distribution Iteration		14500	GDI-I3	2022-06-07
DNA: Proximal Policy Optimization with a Dual Network Architecture	✓ Link	14373	DNA	2022-06-20
Recurrent Experience Replay in Distributed Reinforcement Learning	✓ Link	14130.7	R2D2	2019-05-01
Mastering Atari with Discrete World Models	✓ Link	14064	DreamerV2	2020-10-05
Online and Offline Reinforcement Learning by Planning with a Learned Model	✓ Link	13838	MuZero (Res2 Adam)	2021-04-13
Learning values across many orders of magnitude		13150.0	DDQN+Pop-Art noop	2016-02-24
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity	✓ Link	13027	ASL DDQN	2023-05-07
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	12992.0	DDQN (tuned) noop	2015-11-20
A Distributional Perspective on Reinforcement Learning	✓ Link	12853.0	C51 noop	2017-07-21
Prioritized Experience Replay	✓ Link	12185.0	Prior hs	2015-11-18
Increasing the Action Gap: New Operators for Reinforcement Learning	✓ Link	11478.46	Persistent AL	2015-12-15
Deep Reinforcement Learning with Double Q-learning	✓ Link	11204.0	DDQN (tuned) hs	2015-09-22
Evolution Strategies as a Scalable Alternative to Reinforcement Learning	✓ Link	11200.0	ES FF (1 hour) noop	2017-03-10
Increasing the Action Gap: New Operators for Reinforcement Learning	✓ Link	10809.16	Advantage Learning	2015-12-15
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	10334.0	Duel hs	2015-11-20
Deep Reinforcement Learning with Double Q-learning	✓ Link	7259.0	DQN noop	2015-09-22
Human level control through deep reinforcement learning	✓ Link	6740.0	Nature DQN	2015-02-25
Adaptive Rational Activations to Boost Deep Reinforcement Learning	✓ Link	5266	Recurrent Rational DQN Average	2021-02-18
Deep Reinforcement Learning with Double Q-learning	✓ Link	4496.0	DQN hs	2015-09-22
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization	✓ Link	3891.67	POP3D	2018-07-02
Adaptive Rational Activations to Boost Deep Reinforcement Learning	✓ Link	2941	Rational DQN Average	2021-02-18
Self-Imitation Learning	✓ Link	2888.3	A2C + SIL	2018-06-14
The Arcade Learning Environment: An Evaluation Platform for General Agents	✓ Link	1990	UCT	2012-07-19
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	1792.0	Prior+Duel noop	2015-11-20
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures	✓ Link	1632.00	IMPALA (deep)	2018-02-05
The Arcade Learning Environment: An Evaluation Platform for General Agents	✓ Link	1622.1	Best Learner	2012-07-19
Massively Parallel Methods for Deep Reinforcement Learning	✓ Link	1431.0	Gorila	2015-07-15
Distributed Prioritized Experience Replay	✓ Link	1416	Ape-X	2018-03-02
Evolving simple programs for playing Atari games	✓ Link	1400	CGP	2018-06-14
Playing Atari with Six Neurons	✓ Link	1200	IDVQ + DRSC + XNES	2018-06-04
Deep Reinforcement Learning with Double Q-learning	✓ Link	861.0	Prior+Duel hs	2015-09-22
CURL: Contrastive Unsupervised Representations for Reinforcement Learning	✓ Link	345.3	CURL	2020-04-08
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	125.0	A3C LSTM hs	2016-02-04
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	106.0	A3C FF (1 day) hs	2016-02-04
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	94.0	A3C FF hs	2016-02-04
Soft Actor-Critic for Discrete Action Settings	✓ Link	29.3	SAC	2019-10-16
[]()		8.8	SARSA

OpenCodePapers

atari-games-on-atari-2600-kangaroo