Herke van Hoof

Bio

Herke van Hoof is currently associate professor at the University of Amsterdam in the Netherlands, where he is part of the Amlab. He is interested in modular reinforcement learning. Reinforcement learning is a very general framework, but this tends to result in extremely data-hungry algorithms. Exploiting modular structures, including hierarchical structures, allows sharing information between tasks and exploiting prior knowledge, to learn more with less data.

Before joining the University of Amsterdam, Herke van Hoof was a postdoc at McGill University in Montreal, Canada, where he worked with Professors Joelle Pineau, Dave Meger, and Gregory Dudek. He obtained his PhD at TU Darmstadt, Germany, under the supervision of Professor Jan Peters, where he graduated in November 2016. Herke got his bachelor and master degrees in Artificial Intelligence at the University of Groningen in the Netherlands.

Recent news

SIKS course on “Reinforcement Learning for Adaptive Hybrid Intelligence” (9/8/2025)

I am co-organizing a two-day graduate course on “Reinforcement Learning for Adaptive Hybrid Intelligence”. We will discuss RL basics and specific challenges for using RL as an assistant or collaborator. Registration and more information via the SIKS website.
Deadline extensions “AI for safety-critical infrastructure workshop” (6/13/2025)

The deadline is now extended for submissions to the AI for safety-critical infrastructure workshop at ECML PKDD.

📅 15th September 2025
📍Porto, Portugal
⏰ NEW Submission deadline: 21st June 2025

Info and submission 👉 https://lnkd.in/drjkZ5Rt
Open PhD position on interactive robot learning at VU (5/23/2025)

We are recruiting a PhD candidate within the The Hybrid Intelligence Centre on the topic of interactive robot learning with flexible human input.

The student will be based at the Vrije Universiteit with Kim Baraka as main supervisor (I will be co-supervisor).

Deadline is June 15, 2025.

More information and application through this website.

An archive of news items can be found on the News page.

Highlighted publications

Kuric, D.; Infante, G.; Gómez, V.; Jonsson, A.; van Hoof, H.: Planning with a Learned Policy Basis to Optimally Solve Complex Tasks. In: International Conference on Automated Planning and Scheduling, 2024. (Type: Proceedings Article | Links | BibTeX)

Gagrani, Mukul; Rainone, Corrado; Yang, Yang; Teague, Harris; Jeon, Wonseok; Hoof, Herke; Zeng, Weiliang Will; Zappi, Piero; Lott, Christopher; Bondesan, Roberto: Neural Topological Ordering for Computation Graphs. In: Advances in Neural Information Processing Systems, 2022. (Type: Proceedings Article | Links | BibTeX)

Kool, Wouter; Hoof, Herke Van; Welling, Max: Stochastic Beams and Where To Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement. In: International Conference on Machine Learning, pp. 3499–3508, 2019. (Type: Proceedings Article | Links | BibTeX)

Smith, M.; Hoof, H.; Pineau, J.: An Inference-Based Policy Gradient Method for Learning Options. In: International Conference on Machine Learning, pp. 4703-4712, 2018. (Type: Proceedings Article | Links | BibTeX)

Hoof, H. Van; Neumann, G.; Peters, J.: Non-parametric Policy Search with Limited Information Loss. In: Journal of Machine Learning Research, vol. 18, no. 73, pp. 1-46, 2017. (Type: Journal Article | Links | BibTeX)