Paper | Code | Average Return | Model Name | Release Date |
---|---|---|---|---|
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor | ✓ Link | 5745.27 | SAC | 2018-01-04 |
Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow | ✓ Link | 5526.66 | MEow | 2024-05-22 |
Continuous control with deep reinforcement learning | ✓ Link | 2994.54 | DDPG | 2015-09-09 |
Proximal Policy Optimization Algorithms | ✓ Link | 2739.81 | PPO | 2017-07-20 |
Addressing Function Approximation Error in Actor-Critic Methods | ✓ Link | 2612.74 | TD3 | 2018-02-26 |