Improving Web Page Retrieval using Search Context from Clicked Domain Names

Rongmei Li

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

3 Downloads (Pure)

Abstract

Search context is a crucial factor that helps to understand a user’s information need in ad-hoc Web page retrieval. A query log of a search engine contains rich information on issued queries and their corresponding clicked Web pages. The clicked data implies its relevance to the query and can be used to define the topical context. However, the log is usually not completely available due to privacy concerns. In this paper, we derive clicked pages from clicked domains and use the surrounding query context to enhance retrieval performance. One strategy is to promote clicked pages directly in the initial retrieval result. Another strategy is to expand the original query using selected terms from the clicked pages. Our experimental results on the TREC GOV2 data and a query log of a major search engine show that both strategies can boost retrieval performance compared to the standard language model and pseudo relevance feedback (PRF) model. Their good performance on early precision allows us to apply PRF further for even more accurate result that is comparable to the performance of true relevance feedback.
Original languageEnglish
Title of host publicationDEXA 2009 : 20th International Conference on Database and Expert Systems Applications
Subtitle of host publicationProceedings, 31 August - 4 September 2009, Linz, Austria
Place of PublicationLos Alamitos, NJ
PublisherIEEE Computer Society
Pages393-397
Number of pages5
ISBN (Print)978-0-7695-3763-4
DOIs
Publication statusPublished - Sep 2009

Fingerprint

Websites
Search engines
Feedback

Keywords

  • METIS-263884
  • EWI-15432
  • IR-62834
  • DB-IR: INFORMATION RETRIEVAL

Cite this

Li, R. (2009). Improving Web Page Retrieval using Search Context from Clicked Domain Names. In DEXA 2009 : 20th International Conference on Database and Expert Systems Applications: Proceedings, 31 August - 4 September 2009, Linz, Austria (pp. 393-397). Los Alamitos, NJ: IEEE Computer Society. https://doi.org/10.1109/DEXA.2009.59
Li, Rongmei. / Improving Web Page Retrieval using Search Context from Clicked Domain Names. DEXA 2009 : 20th International Conference on Database and Expert Systems Applications: Proceedings, 31 August - 4 September 2009, Linz, Austria. Los Alamitos, NJ : IEEE Computer Society, 2009. pp. 393-397
@inproceedings{c4692dd0229941b9b515a0eab8d98d96,
title = "Improving Web Page Retrieval using Search Context from Clicked Domain Names",
abstract = "Search context is a crucial factor that helps to understand a user’s information need in ad-hoc Web page retrieval. A query log of a search engine contains rich information on issued queries and their corresponding clicked Web pages. The clicked data implies its relevance to the query and can be used to define the topical context. However, the log is usually not completely available due to privacy concerns. In this paper, we derive clicked pages from clicked domains and use the surrounding query context to enhance retrieval performance. One strategy is to promote clicked pages directly in the initial retrieval result. Another strategy is to expand the original query using selected terms from the clicked pages. Our experimental results on the TREC GOV2 data and a query log of a major search engine show that both strategies can boost retrieval performance compared to the standard language model and pseudo relevance feedback (PRF) model. Their good performance on early precision allows us to apply PRF further for even more accurate result that is comparable to the performance of true relevance feedback.",
keywords = "METIS-263884, EWI-15432, IR-62834, DB-IR: INFORMATION RETRIEVAL",
author = "Rongmei Li",
note = "10.1109/DEXA.2009.59",
year = "2009",
month = "9",
doi = "10.1109/DEXA.2009.59",
language = "English",
isbn = "978-0-7695-3763-4",
pages = "393--397",
booktitle = "DEXA 2009 : 20th International Conference on Database and Expert Systems Applications",
publisher = "IEEE Computer Society",
address = "United States",

}

Li, R 2009, Improving Web Page Retrieval using Search Context from Clicked Domain Names. in DEXA 2009 : 20th International Conference on Database and Expert Systems Applications: Proceedings, 31 August - 4 September 2009, Linz, Austria. IEEE Computer Society, Los Alamitos, NJ, pp. 393-397. https://doi.org/10.1109/DEXA.2009.59

Improving Web Page Retrieval using Search Context from Clicked Domain Names. / Li, Rongmei.

DEXA 2009 : 20th International Conference on Database and Expert Systems Applications: Proceedings, 31 August - 4 September 2009, Linz, Austria. Los Alamitos, NJ : IEEE Computer Society, 2009. p. 393-397.

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

TY - GEN

T1 - Improving Web Page Retrieval using Search Context from Clicked Domain Names

AU - Li, Rongmei

N1 - 10.1109/DEXA.2009.59

PY - 2009/9

Y1 - 2009/9

N2 - Search context is a crucial factor that helps to understand a user’s information need in ad-hoc Web page retrieval. A query log of a search engine contains rich information on issued queries and their corresponding clicked Web pages. The clicked data implies its relevance to the query and can be used to define the topical context. However, the log is usually not completely available due to privacy concerns. In this paper, we derive clicked pages from clicked domains and use the surrounding query context to enhance retrieval performance. One strategy is to promote clicked pages directly in the initial retrieval result. Another strategy is to expand the original query using selected terms from the clicked pages. Our experimental results on the TREC GOV2 data and a query log of a major search engine show that both strategies can boost retrieval performance compared to the standard language model and pseudo relevance feedback (PRF) model. Their good performance on early precision allows us to apply PRF further for even more accurate result that is comparable to the performance of true relevance feedback.

AB - Search context is a crucial factor that helps to understand a user’s information need in ad-hoc Web page retrieval. A query log of a search engine contains rich information on issued queries and their corresponding clicked Web pages. The clicked data implies its relevance to the query and can be used to define the topical context. However, the log is usually not completely available due to privacy concerns. In this paper, we derive clicked pages from clicked domains and use the surrounding query context to enhance retrieval performance. One strategy is to promote clicked pages directly in the initial retrieval result. Another strategy is to expand the original query using selected terms from the clicked pages. Our experimental results on the TREC GOV2 data and a query log of a major search engine show that both strategies can boost retrieval performance compared to the standard language model and pseudo relevance feedback (PRF) model. Their good performance on early precision allows us to apply PRF further for even more accurate result that is comparable to the performance of true relevance feedback.

KW - METIS-263884

KW - EWI-15432

KW - IR-62834

KW - DB-IR: INFORMATION RETRIEVAL

U2 - 10.1109/DEXA.2009.59

DO - 10.1109/DEXA.2009.59

M3 - Conference contribution

SN - 978-0-7695-3763-4

SP - 393

EP - 397

BT - DEXA 2009 : 20th International Conference on Database and Expert Systems Applications

PB - IEEE Computer Society

CY - Los Alamitos, NJ

ER -

Li R. Improving Web Page Retrieval using Search Context from Clicked Domain Names. In DEXA 2009 : 20th International Conference on Database and Expert Systems Applications: Proceedings, 31 August - 4 September 2009, Linz, Austria. Los Alamitos, NJ: IEEE Computer Society. 2009. p. 393-397 https://doi.org/10.1109/DEXA.2009.59