Paper | Code | Average Reward | Model Name | Release Date |
---|---|---|---|---|
Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics | | 81.8 | KFC | 2021-11-02 |
Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning | ✓ Link | 81 | ADMPO | 2024-05-27 |
Decision Transformer: Reinforcement Learning via Sequence Modeling | ✓ Link | 73.5 | Decision Transformer (DT) | 2021-06-02 |