Paper | Code | Average Return | Model | Release Date |
---|---|---|---|---|
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor | ✓ Link | 15836.04 | SAC | 2018-01-04 |
Continuous control with deep reinforcement learning | ✓ Link | 14934.86 | DDPG | 2015-09-09 |
Addressing Function Approximation Error in Actor-Critic Methods | ✓ Link | 12026.73 | TD3 | 2018-02-26 |
Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow | ✓ Link | 10981.47 | MEow | 2024-05-22 |
Proximal Policy Optimization Algorithms | ✓ Link | 6006.11 | PPO | 2017-07-20 |