“Relative upper confidence bound for the K-armed dueling bandit problem” by Masrour Zoghi, Shimon Whiteson, Remi Munos and Maarten de Rijke is available as a preprint on arXiv. The paper proposes a new method for the K-armed dueling bandit problem, a variant of the standard K-armed bandit problem that offers only relative feedback about pairs of…