Herke van Hoof

Bio

Herke van Hoof is currently associate professor at the University of Amsterdam in the Netherlands, where he is part of the Amlab. He is interested in modular reinforcement learning. Reinforcement learning is a very general framework, but this tends to result in extremely data-hungry algorithms. Exploiting modular structures, including hierarchical structures, allows sharing information between tasks and exploiting prior knowledge, to learn more with less data.

Before joining the University of Amsterdam, Herke van Hoof was a postdoc at McGill University in Montreal, Canada, where he worked with Professors Joelle Pineau, Dave Meger, and Gregory Dudek. He obtained his PhD at TU Darmstadt, Germany, under the supervision of Professor Jan Peters, where he graduated in November 2016. Herke got his bachelor and master degrees in Artificial Intelligence at the University of Groningen in the Netherlands.

Recent news

Paper Matthew accepted at ICLR (2/18/2026)

Matthew’s paper Gradient-Based Program Synthesis with Neurally Interpreted Languages, with Clément Bonnet and Levi Lelis, was accepted to ICLR! Congrats, Matthew!
SIKS course on “Reinforcement Learning for Adaptive Hybrid Intelligence” (9/8/2025)

I am co-organizing a two-day graduate course on “Reinforcement Learning for Adaptive Hybrid Intelligence”. We will discuss RL basics and specific challenges for using RL as an assistant or collaborator. Registration and more information via the SIKS website.
Deadline extensions “AI for safety-critical infrastructure workshop” (6/13/2025)

The deadline is now extended for submissions to the AI for safety-critical infrastructure workshop at ECML PKDD.

📅 15th September 2025
📍Porto, Portugal
⏰ NEW Submission deadline: 21st June 2025

Info and submission 👉 https://lnkd.in/drjkZ5Rt

An archive of news items can be found on the News page.

Highlighted publications

Macfarlane, M.; Bonnet, C.; van Hoof, H.; Lelis, L.: Gradient-Based Program Synthesis with Neurally Interpreted Languages. In: Proceedings of the International Conference on Learning Representations, Forthcoming. (Type: Proceedings Article | Links | BibTeX)

Kuric, D.; Infante, G.; Gómez, V.; Jonsson, A.; van Hoof, H.: Planning with a Learned Policy Basis to Optimally Solve Complex Tasks. In: International Conference on Automated Planning and Scheduling, 2024. (Type: Proceedings Article | Links | BibTeX)

Gagrani, Mukul; Rainone, Corrado; Yang, Yang; Teague, Harris; Jeon, Wonseok; Hoof, Herke; Zeng, Weiliang Will; Zappi, Piero; Lott, Christopher; Bondesan, Roberto: Neural Topological Ordering for Computation Graphs. In: Advances in Neural Information Processing Systems, 2022. (Type: Proceedings Article | Links | BibTeX)

Kool, Wouter; Hoof, Herke Van; Welling, Max: Stochastic Beams and Where To Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement. In: International Conference on Machine Learning, pp. 3499–3508, 2019. (Type: Proceedings Article | Links | BibTeX)

Hoof, H. Van; Neumann, G.; Peters, J.: Non-parametric Policy Search with Limited Information Loss. In: Journal of Machine Learning Research, vol. 18, no. 73, pp. 1-46, 2017. (Type: Journal Article | Links | BibTeX)