An Information Processing & Management paper on burst-aware data fusion for microblog search by Shangsong Liang and Maarten de Rijke is online now.
We consider the problem of searching posts in microblog environments. We frame this microblog post search problem as a late data fusion problem. Previous work on data fusion has mainly focused on aggregating document lists based on retrieval status values or ranks of documents without fully utilizing temporal features of the set of documents being fused. Additionally, previous work on data fusion has often worked on the assumption that only documents that are highly ranked in many of the lists are likely to be of relevance. We propose BurstFuseX, a fusion model that not only utilizes a microblog post’s ranking information but also exploits its publication time. BurstFuseX builds on an existing fusion method and rewards posts that are published in or near a burst of posts that are highly ranked in many of the lists being aggregated. We experimentally verify the effectiveness of the proposed late data fusion algorithm, and demonstrate that in terms of mean average precision it significantly outperforms the standard, state-of-the-art fusion approaches as well as burst or time-sensitive retrieval methods.
Our ECIR 2015 paper on user behavior in location search on mobile devices by Yaser Norouzzadeh Ravari, Ilya Markov, Artem Grotov, Maarten Clements and Maarten de Rijke is online now.
Location search engines are an important part of GPS-enabled devices such as mobile phones and tablet computers. In this paper, we study how users behave when they interact with a location search engine by analyzing logs from a popular GPS-navigation service to find out whether mobile users’ location search characteristics differ from those of regular web search. In particular, we analyze query- and session-based characteristics and the temporal distribution of location searches performed on smart phones and tablet computers. Our findings may be used to improve the design of search interfaces in order to help users perform location search more effectively and improve the overall experience on GPS-enabled mobile devices.
Our ECIR 2015 paper on multi-emotion detection in user-generated reviews by Lars Buitinck, Jesse van Amerongen, Ed Tan and Maarten de Rijke is online now.
Expressions of emotion abound in user-generated content, whether it be in blogs, reviews, or on social media. Much work has been devoted to detecting and classifying these emotions, but little of it has acknowledged the fact that emotionally charged text may express multiple emotions at the same time. We describe a new dataset of user-generated movie reviews annotated for emotional expressions, and experimentally validate two algorithms that can detect multiple emotions in each sentence of these reviews.
Our ECIR 2015 paper on automatically assessing article quality by exploiting article-bitor networks by Xinyi Li, Jintao Tang, Ting Wang, Zhunchen Luo and Maarten de Rijke is online now.
We consider the problem of automatically assessing Wikipedia article quality. We develop several models to rank articles by using the editing relations between articles and editors. First, we create a basic model by modeling the article-editor network. Then we design measures of an editor’s contribution and build weighted models that improve the ranking performance. Finally, we use a combination of featured article information and the weighted models to obtain the best performance. We find that using manual evaluation to assist automatic evaluation is a viable solution for the article quality assessment task on Wikipedia.
In the run-up to the Buma Music meets Tech Award at Noorderslag 2015, an update to our music discovery demonstrator Streamwatchr has gone live. An improved interface that is easier on your device’s battery life, some new functionality and a Twitter bot called @lyricswatchr are the most important ingredients of the update.