Publications

Journal papers

2020

Kool, W; van Hoof, H; Welling, M

Ancestral Gumbel-Top-k Sampling for Sampling without Replacement

Journal of Machine Learning Research, 21 (47), pp. 1–36, 2020.

Links | BibTeX

Akata, Zeynep ; Balliet, Dan ; de Rijke, Maarten ; Dignum, Frank ; Dignum, Virginia ; Eiben, Guszti ; others,

A Research Agenda for Hybrid Intelligence: Augmenting Human Intellect With Collaborative, Adaptive, Responsible, and Explainable Artificial Intelligence

Computer, 53 (8), pp. 18–28, 2020.

Links | BibTeX

2017

Van Hoof, H; Neumann, G; Peters, J

Non-parametric Policy Search with Limited Information Loss

Journal of Machine Learning Research, 18 (73), pp. 1-46, 2017.

Links | BibTeX

van Hoof, Herke ; Tanneberg, Daniel ; Peters, Jan

Generalized Exploration in Policy Search

Machine Learning - Special issue ECML PKDD, 106 (9--10), pp. 1705–1724, 2017, ISSN: 1573-0565.

Links | BibTeX

2016

Daniel, Christian; van Hoof, Herke; Neumann, Gerhard; Peters, Jan

Probabilistic Inference for Determining Options in Reinforcement Learning

Machine Learning - Special issue ECML PKDD, 104 (2--3), pp. 337–357, 2016.

Links | BibTeX

2014

van Hoof, Herke; Kroemer, Oliver; Peters, Jan

Probabilistic Segmentation and Targeted Exploration of Objects in Cluttered Environments

IEEE Transactions on Robotics (TRo), 5 , pp. 1198-1209, 2014.

Links | BibTeX

Conference papers

2021

Wöhlke, J; Schmitt, F; van Hoof, H

Hierarchies of Planning and Reinforcement Learning for Robot Navigation Forthcoming

IEEE International Conference on Robotics and Automation, Forthcoming.

BibTeX

2020

Mollinga, Jasper ; van Hoof, Herke

An Autonomous Free Airspace En-route Controller using Deep Reinforcement Learning Techniques

International Conference on Research in Air Transportation, 2020.

Links | BibTeX

Huang, Jin ; Oosterhuis, Harrie ; de Rijke, Maarten ; van Hoof, Herke

Keeping Dataset Biases out of the Simulation: A Debiased Simulator for Reinforcement Learning based Recommender Systems

The ACM Conference on Recommender Systems, 2020.

Links | BibTeX

Wöhlke, Jan ; Schmitt, Felix ; van Hoof, Herke

A Performance-Based Start State Curriculum Framework for Reinforcement Learning

International Conference on Autonomous Agents and Multi-Agent Systems, 2020.

Links | BibTeX

Kool, Wouter ; van Hoof, Herke ; Welling, Max

Estimating Gradients for Discrete Random Variables by Sampling without Replacement

International Conference on Learning Representations, 2020.

Links | BibTeX

van der Heide, Tessa ; Mirus, Florian ; van Hoof, Herke

Social Navigation with Human Empowerment Driven Reinforcement Learning

International Conference on Artificial Neural Networks, 2020.

Links | BibTeX

Wang, Qi ; van Hoof, Herke

Doubly Stochastic Variational Inference for Neural Processes with Hierarchical Latent Variables

International Conference on Machine Learning, 2020.

Links | BibTeX

van der Pol, Elise ; Worrall, Daniel ; van Hoof, Herke ; Oliehoek, Frans ; Welling, Max

MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning

Advances in Neural Information Processing Systems, 2020.

Links | BibTeX

Bakker, Tim ; van Hoof, Herke ; Welling, Max

Experimental design for MRI by greedy policy search

Advances in Neural Information Processing Systems, 2020.

Links | BibTeX

2019

Caccia, Lucas; van Hoof, Herke ; Courville, Aaron C; Pineau, Joelle

Deep Generative Modeling of LiDAR Data

IEEE International Conference on Intelligent Robots and Systems, 2019.

Links | BibTeX

Shang, Wenling; van der Wal, Douwe; van Hoof, Herke; Welling, Max

Stochastic Activation Actor Critic Methods

European Conference on Machine Learning, 2019.

Links | BibTeX

Kool, Wouter ; Van Hoof, Herke ; Welling, Max

Stochastic Beams and Where To Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement

International Conference on Machine Learning, pp. 3499–3508, 2019.

Links | BibTeX

Kool, Wouter; van Hoof, Herke; Welling, Max

Attention! Learn to solve routing problems!

International Conference on Learning Representations, 2019.

Links | BibTeX

Thakur, S; van Hoof, H; Gamboa Higuera, J C; Precup, D; Meger, D

Uncertainty aware Imitation Learning on Multiple Tasks using Bayesian Neural Networks

International Conference on Robotics and Automation, 2019.

Links | BibTeX

2018

Manjanna, S; van Hoof, H; Dudek, G

Policy Search on Aggregated State Space for Active Sampling

International Symposium on Experimental Robotics, 2018.

Links | BibTeX

Dong, Y; Shen, Y; Crawford, E; van Hoof, H; Cheung, J C K

BanditSum: Extractive Summarization as a Contextual Bandit

Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 3739–3748, 2018.

Links | BibTeX

Manjanna, S; van Hoof, H; Dudek, G

Reinforcement Learning with Non-uniform State Representations for Adaptive Search

IEEE International Symposium on Safety, Security, and Rescue Robotics, 2018.

Links | BibTeX

Fujimoto, S; van Hoof, H; Meger, D

Addressing function approximation error in actor-critic methods

International Conference on Machine Learning, pp. 1587–1596, 2018.

Links | BibTeX

Smith, M; van Hoof, H; Pineau, J

An Inference-Based Policy Gradient Method for Learning Options

International Conference on Machine Learning, pp. 4703-4712, 2018.

Links | BibTeX

Barbaros, V; van Hoof, H; Abdolmaleki, A; Meger, D

Eager and Memory-Based Non-Parametric Stochastic Search Methods for Learning Control

International Conference on Robotics and Automation, 2018.

Links | BibTeX

2017

Tangkaratt, V; van Hoof, H; Parisi, S; Neumann, G; Peters, J; Sugiyama, M

Policy Search with High-Dimensional Context Variables

Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), pp. 2632–2638, 2017.

Links | BibTeX

2016

van Hoof, Herke; Chen, Nutan; Karl, Maximilian; van der Smart, Patrick; Peters, Jan

Stable Reinforcement Learning with Auto-Encoders for Tactile and Visual Data

International Conference on Intelligent Robots and Systems, pp. 3928–3934, 2016.

Links | BibTeX

Yi, Z; Calandra, R; Veiga, F; van Hoof, H; Hermans, T; Zhang, Y; Peters, J

Active Tactile Object Exploration with Gaussian Processes

Proceedings of the IEEE/RSJ Conference on Intelligent Robots and Systems (IROS), pp. 4925–4930, 2016.

Links | BibTeX

2015

van Hoof, Herke; Peters, Jan; Neumann, Gerhard

Learning of Non-Parametric Control Policies with High-Dimensional State Features

International Conference on Artificial Intelligence and Statistics, pp. 1004–1012, 2015.

Links | BibTeX

van Hoof, H; Hermans, T; Neumann, G; Peters, J

Learning Robot In-Hand Manipulation with Tactile Features

Proceedings of the International Conference on Humanoid Robots (HUMANOIDS), 2015.

Links | BibTeX

Veiga, F F; van Hoof, H; Peters, J; Hermans, T

Stabilizing Novel Objects by Learning to Predict Tactile Slip

Proceedings of the IEEE/RSJ Conference on Intelligent Robots and Systems (IROS), pp. 5065–5072, 2015.

Links | BibTeX

Kroemer, O; Daniel, C; Neumann, G; van Hoof, H; Peters, J

Towards Learning Hierarchical Skills for Multi-Phase Manipulation Tasks

Proceedings of the International Conference on Robotics and Automation (ICRA), 2015.

Links | BibTeX

2014

Kroemer, O; van Hoof, H; Neumann, G; Peters, J

Learning to Predict Phases of Manipulation Tasks as Hidden States

Proceedings of 2014 IEEE International Conference on Robotics and Automation (ICRA), 2014.

Links | BibTeX

Bischoff, B; Nguyen-Tuong, D; van Hoof, H; McHutchon, A; Rasmussen, C E; Knoll, A; Peters, J; Deisenroth, M P

Policy Search For Learning Robot Control Using Sparse Data

Proceedings of 2014 IEEE International Conference on Robotics and Automation (ICRA), 2014.

Links | BibTeX

2013

van Hoof, H; Kroemer, O; Peters, J

Probabilistic Interactive Segmentation for Anthropomorphic Robots in Cluttered Environments

Proceedings of the International Conference on Humanoid Robots (HUMANOIDS), 2013.

Links | BibTeX

2012

van Hoof, H; Kroemer, O; Ben Amor, H; Peters, J

Maximally Informative Interaction Learning for Scene Exploration

Proceedings of the International Conference on Robot Systems (IROS), 2012.

Links | BibTeX

2011

van Hoof, H; van der Zant, T; Wiering, M A

Adaptive Visual Face Tracking for an Autonomous Robot

Proceedings of the Belgian-Dutch Artificial Intelligence Conference (BNAIC 11), 2011.

Links | BibTeX

Ph.D. Thesis

2016

van Hoof, H

Machine Learning through Exploration for Perception-Driven Robotics

2016.

Links | BibTeX