TY - GEN
T1 - What Snippets Say About Pages in Federated Web Search
AU - Demeester, Thomas
AU - Nguyen, Dong-Phuong
AU - Trieschnigg, Rudolf Berend
AU - Develder, Chris
AU - Hiemstra, Djoerd
N1 - http://airs2012.sinaapp.com/
PY - 2012/12
Y1 - 2012/12
N2 - What is the likelihood that a Web page is considered relevant to a query, given the relevance assessment of the corresponding snippet? Using a new federated IR test collection that contains search results from over a hundred search engines on the internet, we are able to investigate such research questions from a global perspective. Our test collection covers the main Web search engines like Google, Yahoo!, and Bing, as well as a number of smaller search engines dedicated to multimedia, shopping, etc., and as such reflects a realistic Web environment. Using a large set of relevance assessments, we are able to investigate the connection between snippet quality and page relevance. The dataset is strongly inhomogeneous, and although the assessors’ consistency is shown to be satisfying, care is required when comparing resources. To this end, a number of probabilistic quantities, based on snippet and page relevance, are introduced and evaluated.
KW - METIS-296096
KW - EWI-22303
KW - IR-81857
U2 - 10.1007/978-3-642-35341-3_21
DO - 10.1007/978-3-642-35341-3_21
M3 - Conference contribution
SN - 978-3-642-35340-6
T3 - Lecture Notes in Computer Science
SP - 250
EP - 261
BT - Proceedings of the 8th Asia Information Retrieval Societies Conference (AIRS 2012)
A2 - Hou, Yuexian
A2 - Nie, Jian-Yun
A2 - Sun, Le
A2 - Wang, Bo
A2 - Zhang, Peng
PB - Springer
CY - Tianjin, China
T2 - 8th Asia Information Retrieval Societies Conference (AIRS 2012), Tianjin, China
Y2 - 17 December 2012 through 19 December 2012
ER -