atari-games-on-atari-2600-battle-zone

Video GamesAtari Games

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	Score	ModelName	ReleaseDate
Agent57: Outperforming the Atari Human Benchmark	✓ Link	934134.88	Agent57	2020-03-30
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model	✓ Link	848623.00	MuZero	2019-11-19
Generalized Data Distribution Iteration		824360	GDI-H3	2022-06-07
Recurrent Experience Replay in Distributed Reinforcement Learning	✓ Link	751880.0	R2D2	2019-05-01
Generalized Data Distribution Iteration		478830	GDI-I3	2022-06-07
Online and Offline Reinforcement Learning by Planning with a Learned Model	✓ Link	178716.9	MuZero (Res2 Adam)	2021-04-13
Distributed Prioritized Experience Replay	✓ Link	98895	Ape-X	2018-03-02
Fully Parameterized Quantile Function for Distributional Reinforcement Learning	✓ Link	87928.6	FQF	2019-11-05
DNA: Proximal Policy Optimization with a Dual Network Architecture	✓ Link	71003	DNA	2022-06-20
The Arcade Learning Environment: An Evaluation Platform for General Agents	✓ Link	70333.3	UCT	2012-07-19
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning		64070.0	Reactor 500M	2017-04-15
Noisy Networks for Exploration	✓ Link	52262	NoisyNet-Dueling	2017-06-30
Implicit Quantile Networks for Distributional Reinforcement Learning	✓ Link	42244	IQN	2018-06-14
Mastering Atari with Discrete World Models	✓ Link	40325	DreamerV2	2020-10-05
Distributional Reinforcement Learning with Quantile Regression	✓ Link	39268	QR-DQN-1	2017-10-27
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity	✓ Link	38986	ASL DDQN	2023-05-07
Deep Exploration via Bootstrapped DQN	✓ Link	38666.7	Bootstrapped DQN	2016-02-15
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	37150.0	Duel noop	2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	35520.0	Prior+Duel noop	2015-11-20
Increasing the Action Gap: New Operators for Reinforcement Learning	✓ Link	34583.07	Persistent AL	2015-12-15
Evolving simple programs for playing Atari games	✓ Link	34200	CGP	2018-06-14
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	31700.0	DDQN (tuned) noop	2015-11-20
Prioritized Experience Replay	✓ Link	31530.0	Prior noop	2015-11-18
Dueling Network Architectures for Deep Reinforcement Learning	✓ Link	31320.0	Duel hs	2015-11-20
Deep Reinforcement Learning with Double Q-learning	✓ Link	30650.0	Prior+Duel hs	2015-09-22
Deep Reinforcement Learning with Double Q-learning	✓ Link	29900.0	DQN noop	2015-09-22
Increasing the Action Gap: New Operators for Reinforcement Learning	✓ Link	28789.29	Advantage Learning	2015-12-15
A Distributional Perspective on Reinforcement Learning	✓ Link	28742.0	C51 noop	2017-07-21
Human level control through deep reinforcement learning	✓ Link	26300.0	Nature DQN	2015-02-25
Adaptive Rational Activations to Boost Deep Reinforcement Learning	✓ Link	25749	Recurrent Rational DQN Average	2021-02-18
Prioritized Experience Replay	✓ Link	25520.0	Prior hs	2015-11-18
Self-Imitation Learning	✓ Link	25075	A2C + SIL	2018-06-14
Deep Reinforcement Learning with Double Q-learning	✓ Link	24740.0	DDQN (tuned) hs	2015-09-22
Deep Reinforcement Learning with Double Q-learning	✓ Link	23750.0	DQN hs	2015-09-22
Adaptive Rational Activations to Boost Deep Reinforcement Learning	✓ Link	23403	Rational DQN Average	2021-02-18
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures	✓ Link	20885.00	IMPALA (deep)	2018-02-05
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	20760.0	A3C LSTM hs	2016-02-04
Massively Parallel Methods for Deep Reinforcement Learning	✓ Link	19938.0	Gorila	2015-07-15
Evolution Strategies as a Scalable Alternative to Reinforcement Learning	✓ Link	16600.0	ES FF (1 hour) noop	2017-03-10
The Arcade Learning Environment: An Evaluation Platform for General Agents	✓ Link	15819.7	Best Learner	2012-07-19
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization	✓ Link	15466.67	POP3D	2018-07-02
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	12950.0	A3C FF hs	2016-02-04
Asynchronous Methods for Deep Reinforcement Learning	✓ Link	11340.0	A3C FF (1 day) hs	2016-02-04
CURL: Contrastive Unsupervised Representations for Reinforcement Learning	✓ Link	11208	CURL	2020-04-08
Learning values across many orders of magnitude		8220.0	DDQN+Pop-Art noop	2016-02-24
Soft Actor-Critic for Discrete Action Settings	✓ Link	4386.7	SAC	2019-10-16
[]()		16.2	SARSA

OpenCodePapers

atari-games-on-atari-2600-battle-zone