We’ve recently posted several working papers that I hope you’ll find interesting:
- Amin, S., Gomrokchi, M., Satija, H., van Hoof, H., & Precup, D. (2021). A Survey of Exploration Methods in Reinforcement Learning. arXiv preprint arXiv:2109.00157.
- Kool, W., van Hoof, H., Gromicho, J., & Welling, M. (2021). Deep Policy Dynamic Programming for Vehicle Routing Problems. arXiv preprint arXiv:2102.11756.