Bio
Herke van Hoof is currently an assistant professor at the University of Amsterdam in the Netherlands, where he is part of AMLab. He is interested in reinforcement learning (RL) with structured data and prior knowledge. Reinforcement learning is a very general framework, but this generality tends to result in extremely data-hungry algorithms. Exploiting structured prior knowledge, or using value function or policy parametrizations that respect known structural properties, is a promising avenue for learning more from less data. Examples of this line of work include RL for combinatorial optimisation, RL with symbolic prior knowledge, and equivariant RL.
Before joining the University of Amsterdam, Herke van Hoof was a postdoc at McGill University in Montreal, Canada, where he worked with Professors Joelle Pineau, Dave Meger, and Gregory Dudek. He obtained his PhD at TU Darmstadt, Germany, under the supervision of Professor Jan Peters, graduating in November 2016. Herke received his bachelor's and master's degrees in Artificial Intelligence from the University of Groningen in the Netherlands.
Recent news
- David & Guillermo present their work at ICAPS (6/3/2024)
Tomorrow, June 4th, David & Guillermo will present their work at ICAPS. The paper proposes a new way to learn sub-policies that can optimally solve complex tasks expressed in linear temporal logic, even in stochastic environments. They'd love to tell you all about it. Or read our paper, which is linked under Highlighted publications.
- BeNeRL 2024 in Amsterdam on June 10th (4/19/2024)
Together with Maryam Tavakol & Vincent Francois-Lavet, we are organizing the 2024 edition of the Belgian-Netherlands Reinforcement Learning Workshop (BeNeRL) in Amsterdam! It will take place on June 10th and will be a free event thanks to generous support from the ELLIS Unit Amsterdam and the NWO. Registration is required, though.
More info & sign-up at the event website.
- Masoud started as assistant professor at TU Delft (4/18/2024)
After his postdoc, Masoud recently started as an assistant professor in the Multimedia Computing Group of TU Delft. Congratulations, Masoud!
An archive of news items can be found on the News page.
Highlighted publications
Kuric, D.; Infante, G.; Gómez, V.; Jonsson, A.; van Hoof, H.: Planning with a Learned Policy Basis to Optimally Solve Complex Tasks. In: International Conference on Automated Planning and Scheduling, 2024.
@inproceedings{kuric2024planning,
title = {Planning with a Learned Policy Basis to Optimally Solve Complex Tasks},
author = {Kuric, D. and Infante, G. and Gómez, V. and Jonsson, A. and van Hoof, H.},
url = {https://openreview.net/forum?id=6N1uCtBhcL},
year = {2024},
date = {2024-06-01},
urldate = {2024-06-01},
booktitle = {International Conference on Automated Planning and Scheduling},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Gagrani, Mukul; Rainone, Corrado; Yang, Yang; Teague, Harris; Jeon, Wonseok; van Hoof, Herke; Zeng, Weiliang Will; Zappi, Piero; Lott, Christopher; Bondesan, Roberto: Neural Topological Ordering for Computation Graphs. In: Advances in Neural Information Processing Systems, 2022.
@inproceedings{gagrani2022neural,
title = {Neural Topological Ordering for Computation Graphs},
author = {Mukul Gagrani and Corrado Rainone and Yang Yang and Harris Teague and Wonseok Jeon and Herke van Hoof and Weiliang Will Zeng and Piero Zappi and Christopher Lott and Roberto Bondesan},
url = {https://arxiv.org/abs/2207.05899},
year = {2022},
date = {2022-11-29},
booktitle = {Advances in Neural Information Processing Systems},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
van der Pol, Elise; van Hoof, Herke; Oliehoek, Frans; Welling, Max: Multi-Agent MDP Homomorphic Networks. In: Proceedings of the International Conference on Learning Representations, 2022.
@inproceedings{pol2022multi,
title = {Multi-Agent MDP Homomorphic Networks},
author = {Elise van der Pol and Herke van Hoof and Frans Oliehoek and Max Welling},
url = {https://openreview.net/forum?id=H7HDG--DJF0},
year = {2022},
date = {2022-04-25},
booktitle = {Proceedings of the International Conference on Learning Representations},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Kool, Wouter; van Hoof, Herke; Welling, Max: Estimating Gradients for Discrete Random Variables by Sampling without Replacement. In: International Conference on Learning Representations, 2020.
@inproceedings{kool2020estimating,
title = {Estimating Gradients for Discrete Random Variables by Sampling without Replacement},
author = {Wouter Kool and Herke van Hoof and Max Welling},
url = {https://openreview.net/pdf?id=rklEj2EFvB
https://youtu.be/KtP-Z2bvPPE},
year = {2020},
date = {2020-04-26},
booktitle = {International Conference on Learning Representations},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Smith, M.; van Hoof, H.; Pineau, J.: An Inference-Based Policy Gradient Method for Learning Options. In: International Conference on Machine Learning, pp. 4703-4712, 2018.
@inproceedings{smith2018inference,
title = {An Inference-Based Policy Gradient Method for Learning Options},
author = {M. Smith and H. van Hoof and J. Pineau},
url = {http://proceedings.mlr.press/v80/smith18a/smith18a.pdf},
year = {2018},
date = {2018-07-10},
booktitle = {International Conference on Machine Learning},
pages = {4703-4712},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
van Hoof, H.; Neumann, G.; Peters, J.: Non-parametric Policy Search with Limited Information Loss. In: Journal of Machine Learning Research, vol. 18, no. 73, pp. 1-46, 2017.
@article{hoof2017nonparametric,
title = {Non-parametric Policy Search with Limited Information Loss},
author = {H. van Hoof and G. Neumann and J. Peters},
editor = {K. Murphy},
url = {http://jmlr.org/papers/volume18/16-142/16-142.pdf},
year = {2017},
date = {2017-08-01},
journal = {Journal of Machine Learning Research},
volume = {18},
number = {73},
pages = {1-46},
keywords = {},
pubstate = {published},
tppubtype = {article}
}