atari-games-on-atari-2600-centipede

Video GamesAtari Games

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	Score	ModelName	ReleaseDate
First return, then explore	✓ Link	1422628	Go-Explore	2020-04-27
GDI: Rethinking What Makes Reinforcement Learning Different from Supervised Learning		1359533	GDI-H3(1B frames)	2021-11-24
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model	✓ Link	1159049.27	MuZero	2019-11-19
Online and Offline Reinforcement Learning by Planning with a Learned Model	✓ Link	874301.64	MuZero (Res2 Adam)	2021-04-13
Recurrent Experience Replay in Distributed Reinforcement Learning	✓ Link	599140.3	R2D2	2019-05-01
Agent57: Outperforming the Atari Human Benchmark	✓ Link	412847.86	Agent57	2020-03-30
Generalized Data Distribution Iteration		195630	GDI-H3	2022-06-07
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning		155830	GDI-I3	2021-06-11
Generalized Data Distribution Iteration		155830	GDI-I3	2022-06-07
The Arcade Learning Environment: An Evaluation Platform for General Agents	✓ Link	125123	Full Tree	2012-07-19
DNA: Proximal Policy Optimization with a Dual Network Architecture	✓ Link	100194	DNA	2022-06-20
Learning values across many orders of magnitude		49065.8	DDQN+Pop-Art noop	2016-02-24
Evolving simple programs for playing Atari games	✓ Link	24708	CGP	2018-06-14
Distributed Prioritized Experience Replay	✓ Link	12974	Ape-X	2018-03-02
Distributional Reinforcement Learning with Quantile Regression	✓ Link	12447	QR-DQN-1	2017-10-27
Mastering Atari with Discrete World Models	✓ Link	11883	DreamerV2	2020-10-05
Implicit Quantile Networks for Distributional Reinforcement Learning	✓ Link	11561	IQN	2018-06-14
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures	✓ Link	11049.75	IMPALA (deep)	2018-02-05
A Distributional Perspective on Reinforcement Learning	✓ Link	9646.0	C51 noop	2017-07-21
The Arcade Learning Environment: An Evaluation Platform for General Agents	✓ Link	8803.8	Best Learner	2012-07-19
Human level control through deep reinforcement learning	✓ Link	8309.0	Nature DQN	2015-02-25
Evolution Strategies as a Scalable Alternative to Reinforcement Learning	✓ Link	7783.9	ES FF (1 hour) noop	2017-03-10
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	7687.5	Prior+Duel noop	2015-11-20
Noisy Networks for Exploration	✓ Link	7596	NoisyNet-Dueling	2017-06-30
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	7561.4	Duel noop	2015-11-20
Self-Imitation Learning	✓ Link	7559.5	A2C + SIL	2018-06-14
Massively Parallel Methods for Deep Reinforcement Learning	✓ Link	6296.9	Gorila	2015-07-15
Deep Reinforcement Learning with Double Q-learning	✓ Link	5570.2	Prior+Duel hs	2015-09-22
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	5409.4	DDQN (tuned) noop	2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	4881.0	Duel hs	2015-11-20
Deep Reinforcement Learning with Double Q-learning	✓ Link	4657.7	DQN noop	2015-09-22
[]()		4647.0	SARSA
Deep Exploration via Bootstrapped DQN	✓ Link	4553.5	Bootstrapped DQN	2016-02-15
Increasing the Action Gap: New Operators for Reinforcement Learning	✓ Link	4539.55	Persistent AL	2015-12-15
Prioritized Experience Replay	✓ Link	4463.2	Prior noop	2015-11-18
Increasing the Action Gap: New Operators for Reinforcement Learning	✓ Link	4225.18	Advantage Learning	2015-12-15
Deep Reinforcement Learning with Double Q-learning	✓ Link	3973.9	DQN hs	2015-09-22
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity	✓ Link	3899.8	ASL DDQN	2023-05-07
Deep Reinforcement Learning with Double Q-learning	✓ Link	3853.5	DDQN (tuned) hs	2015-09-22
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	3755.8	A3C FF hs	2016-02-04
Prioritized Experience Replay	✓ Link	3489.1	Prior hs	2015-11-18
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning		3422.0	Reactor 500M	2017-04-15
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization	✓ Link	3315.44	POP3D	2018-07-02
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	3306.5	A3C FF (1 day) hs	2016-02-04
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	1997.0	A3C LSTM hs	2016-02-04

OpenCodePapers

atari-games-on-atari-2600-centipede