Paper | Code | Average Return | Model Name | Release Date |
---|---|---|---|---|
Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow | ✓ Link | 6586.33 | MEow | 2024-05-22 |
Addressing Function Approximation Error in Actor-Critic Methods | ✓ Link | 5942.55 | TD3 | 2018-02-26 |
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor | ✓ Link | 5208.09 | SAC | 2018-01-04 |
Continuous control with deep reinforcement learning | ✓ Link | 1712.12 | DDPG | 2015-09-09 |
Proximal Policy Optimization Algorithms | ✓ Link | 608.97 | PPO | 2017-07-20 |