What Snippets Say About Pages in Federated Web Search

Thomas Demeester, Dong-Phuong Nguyen, Rudolf Berend Trieschnigg, Chris Develder, Djoerd Hiemstra

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review


Abstract

What is the likelihood that a Web page is considered relevant to a query, given the relevance assessment of the corresponding snippet? Using a new federated IR test collection that contains search results from over a hundred search engines on the internet, we are able to investigate such research questions from a global perspective. Our test collection covers the main Web search engines such as Google, Yahoo!, and Bing, as well as a number of smaller search engines dedicated to multimedia, shopping, etc., and as such reflects a realistic Web environment. Using a large set of relevance assessments, we investigate the connection between snippet quality and page relevance. The dataset is strongly inhomogeneous, and although the assessors’ consistency is shown to be satisfactory, care is required when comparing resources. To this end, a number of probabilistic quantities, based on snippet and page relevance, are introduced and evaluated.
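
The opening question of the abstract can be read as a conditional probability, P(page relevant | snippet judgment). The abstract does not define the paper's actual probabilistic quantities, so the following is only a minimal sketch of one plausible estimate: a plain relative-frequency count over paired judgments. The binary labels, the function name `page_given_snippet`, and the toy data are all assumptions made for illustration and are not taken from the paper or its test collection.

```python
from collections import Counter


def page_given_snippet(judgments):
    """Estimate P(page relevant | snippet label) by relative frequency.

    `judgments` is an iterable of (snippet_relevant, page_relevant) pairs.
    Binary labels are an assumption; the paper may use graded relevance.
    """
    totals = Counter()    # number of results seen per snippet label
    relevant = Counter()  # of those, how many had a relevant page
    for snippet_rel, page_rel in judgments:
        totals[snippet_rel] += 1
        if page_rel:
            relevant[snippet_rel] += 1
    return {label: relevant[label] / totals[label] for label in totals}


# Toy, made-up judgments (not data from the test collection).
judgments = [
    (True, True), (True, True), (True, False),
    (False, False), (False, True), (False, False), (False, False),
]
estimates = page_given_snippet(judgments)
print(estimates)
```

Computing the estimate separately per search engine would be one way to compare resources, although with an inhomogeneous dataset the per-resource counts can be small and the estimates correspondingly noisy.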
Original language: Undefined
Title of host publication: Proceedings of the 8th Asia Information Retrieval Societies Conference (AIRS 2012)
Editors: Yuexian Hou, Jian-Yun Nie, Le Sun, Bo Wang, Peng Zhang
Place of Publication: Tianjin, China
Publisher: Springer
Pages: 250-261
Number of pages: 12
ISBN (Print): 978-3-642-35340-6
DOIs: https://doi.org/10.1007/978-3-642-35341-3_21
Publication status: Published - Dec 2012

Publication series

Name: Lecture Notes in Computer Science
Publisher: Springer
Volume: 7675
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Keywords

  • METIS-296096
  • EWI-22303
  • IR-81857

Cite this

Demeester, T., Nguyen, D-P., Trieschnigg, R. B., Develder, C., & Hiemstra, D. (2012). What Snippets Say About Pages in Federated Web Search. In Y. Hou, J-Y. Nie, L. Sun, B. Wang, & P. Zhang (Eds.), Proceedings of the 8th Asia Information Retrieval Societies Conference (AIRS 2012) (pp. 250-261). (Lecture Notes in Computer Science; Vol. 7675). Tianjin, China: Springer. https://doi.org/10.1007/978-3-642-35341-3_21