“Dialogue generation: From imitation learning to inverse reinforcement learning” by Ziming Li, Julia Kiseleva, and Maarten de Rijke is now online. The performance of adversarial dialogue generation models depends on the quality of the reward signal produced by the discriminator. The reward signal from a poor discriminator can be very sparse and unstable,…
Open Science
I’m a professor. My job description is very simple: to create new knowledge and to transfer it. To students, colleagues, and anyone else, really. To academia, industry, governments, and the rest of society. I do my job by working with a large team of very talented PhD students and postdocs from around the planet and…