Paper | Code | Average Return | ModelName | ReleaseDate |
---|---|---|---|---|
Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow | ✓ Link | 6923.22 | MEow | 2024-05-22 |
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor | ✓ Link | 6211.50 | SAC | 2018-01-04 |
Proximal Policy Optimization Algorithms | ✓ Link | 925.89 | PPO | 2017-07-20 |
Addressing Function Approximation Error in Actor-Critic Methods | ✓ Link | 198.44 | TD3 | 2018-02-26 |
Continuous control with deep reinforcement learning | ✓ Link | 139.14 | DDPG | 2015-09-09 |