| Paper | Code | Average Return | ModelName | ReleaseDate |
|---|---|---|---|---|
| Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow | ✓ Link | 6586.33 | MEow | 2024-05-22 |
| Addressing Function Approximation Error in Actor-Critic Methods | ✓ Link | 5942.55 | TD3 | 2018-02-26 |
| Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor | ✓ Link | 5208.09 | SAC | 2018-01-04 |
| Continuous control with deep reinforcement learning | ✓ Link | 1712.12 | DDPG | 2015-09-09 |
| Proximal Policy Optimization Algorithms | ✓ Link | 608.97 | PPO | 2017-07-20 |