What snippets say about pages

Thomas Demeester, Dong-Phuong Nguyen, Rudolf Berend Trieschnigg, Chris Develder, Djoerd Hiemstra

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

19 Downloads (Pure)

Abstract

What is the likelihood that a Web page is considered relevant to a query, given the relevance assessment of the corresponding snippet? Using a new FederatedWeb Search test collection that contains search results from over a hundred search engines on the internet, we are able to investigate such research questions from a global perspective. Our test collection covers the main Web search engines like Google, Yahoo!, and Bing, as well as smaller search engines dedicated to multimedia, shopping, etc., and as such reflects a realistic Web environment. Using a large set of relevance assessments, we are able to investigate the connection between snippet quality and page relevance. The dataset is strongly heterogeneous, and care is required when comparing resources. To this end, a number of probabilistic variables, based on snippet and page relevance, are introduced and discussed.
Original languageUndefined
Title of host publicationProceedings of the 13th Dutch-Belgian Information Retrieval Workshop, DIR 2013
PublisherCEUR
Pages34-35
Number of pages2
Publication statusPublished - Apr 2013
Event13th Dutch-Belgian Information Retrieval Workshop, DIR 2013 - Delft, Netherlands
Duration: 26 Apr 201326 Apr 2013
Conference number: 13

Publication series

NameCEUR Workshop Proceedings
PublisherCEUR
Volume986
ISSN (Print)1613-0073

Workshop

Workshop13th Dutch-Belgian Information Retrieval Workshop, DIR 2013
Abbreviated titleDIR
CountryNetherlands
CityDelft
Period26/04/1326/04/13

Keywords

  • EWI-24060
  • METIS-300202
  • IR-88460

Cite this

Demeester, T., Nguyen, D-P., Trieschnigg, R. B., Develder, C., & Hiemstra, D. (2013). What snippets say about pages. In Proceedings of the 13th Dutch-Belgian Information Retrieval Workshop, DIR 2013 (pp. 34-35). (CEUR Workshop Proceedings; Vol. 986). CEUR.
Demeester, Thomas ; Nguyen, Dong-Phuong ; Trieschnigg, Rudolf Berend ; Develder, Chris ; Hiemstra, Djoerd. / What snippets say about pages. Proceedings of the 13th Dutch-Belgian Information Retrieval Workshop, DIR 2013. CEUR, 2013. pp. 34-35 (CEUR Workshop Proceedings).
@inproceedings{b07b9ff10b104fa8b3d1b7710c6cfd07,
title = "What snippets say about pages",
abstract = "What is the likelihood that a Web page is considered relevant to a query, given the relevance assessment of the corresponding snippet? Using a new FederatedWeb Search test collection that contains search results from over a hundred search engines on the internet, we are able to investigate such research questions from a global perspective. Our test collection covers the main Web search engines like Google, Yahoo!, and Bing, as well as smaller search engines dedicated to multimedia, shopping, etc., and as such reflects a realistic Web environment. Using a large set of relevance assessments, we are able to investigate the connection between snippet quality and page relevance. The dataset is strongly heterogeneous, and care is required when comparing resources. To this end, a number of probabilistic variables, based on snippet and page relevance, are introduced and discussed.",
keywords = "EWI-24060, METIS-300202, IR-88460",
author = "Thomas Demeester and Dong-Phuong Nguyen and Trieschnigg, {Rudolf Berend} and Chris Develder and Djoerd Hiemstra",
year = "2013",
month = "4",
language = "Undefined",
series = "CEUR Workshop Proceedings",
publisher = "CEUR",
pages = "34--35",
booktitle = "Proceedings of the 13th Dutch-Belgian Information Retrieval Workshop, DIR 2013",

}

Demeester, T, Nguyen, D-P, Trieschnigg, RB, Develder, C & Hiemstra, D 2013, What snippets say about pages. in Proceedings of the 13th Dutch-Belgian Information Retrieval Workshop, DIR 2013. CEUR Workshop Proceedings, vol. 986, CEUR, pp. 34-35, 13th Dutch-Belgian Information Retrieval Workshop, DIR 2013, Delft, Netherlands, 26/04/13.

What snippets say about pages. / Demeester, Thomas; Nguyen, Dong-Phuong; Trieschnigg, Rudolf Berend; Develder, Chris; Hiemstra, Djoerd.

Proceedings of the 13th Dutch-Belgian Information Retrieval Workshop, DIR 2013. CEUR, 2013. p. 34-35 (CEUR Workshop Proceedings; Vol. 986).

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

TY - GEN

T1 - What snippets say about pages

AU - Demeester, Thomas

AU - Nguyen, Dong-Phuong

AU - Trieschnigg, Rudolf Berend

AU - Develder, Chris

AU - Hiemstra, Djoerd

PY - 2013/4

Y1 - 2013/4

N2 - What is the likelihood that a Web page is considered relevant to a query, given the relevance assessment of the corresponding snippet? Using a new FederatedWeb Search test collection that contains search results from over a hundred search engines on the internet, we are able to investigate such research questions from a global perspective. Our test collection covers the main Web search engines like Google, Yahoo!, and Bing, as well as smaller search engines dedicated to multimedia, shopping, etc., and as such reflects a realistic Web environment. Using a large set of relevance assessments, we are able to investigate the connection between snippet quality and page relevance. The dataset is strongly heterogeneous, and care is required when comparing resources. To this end, a number of probabilistic variables, based on snippet and page relevance, are introduced and discussed.

AB - What is the likelihood that a Web page is considered relevant to a query, given the relevance assessment of the corresponding snippet? Using a new FederatedWeb Search test collection that contains search results from over a hundred search engines on the internet, we are able to investigate such research questions from a global perspective. Our test collection covers the main Web search engines like Google, Yahoo!, and Bing, as well as smaller search engines dedicated to multimedia, shopping, etc., and as such reflects a realistic Web environment. Using a large set of relevance assessments, we are able to investigate the connection between snippet quality and page relevance. The dataset is strongly heterogeneous, and care is required when comparing resources. To this end, a number of probabilistic variables, based on snippet and page relevance, are introduced and discussed.

KW - EWI-24060

KW - METIS-300202

KW - IR-88460

M3 - Conference contribution

T3 - CEUR Workshop Proceedings

SP - 34

EP - 35

BT - Proceedings of the 13th Dutch-Belgian Information Retrieval Workshop, DIR 2013

PB - CEUR

ER -

Demeester T, Nguyen D-P, Trieschnigg RB, Develder C, Hiemstra D. What snippets say about pages. In Proceedings of the 13th Dutch-Belgian Information Retrieval Workshop, DIR 2013. CEUR. 2013. p. 34-35. (CEUR Workshop Proceedings).