Abstract
Search context is a crucial factor that helps to understand a user’s information need in ad-hoc Web page retrieval. A query log of a search engine contains rich information on issued queries and their corresponding clicked Web pages. The clicked data implies its relevance to the query and can be used to define the topical context. However, the log is usually not completely available due to privacy concerns. In this paper, we derive clicked pages from clicked domains and use the surrounding query context to enhance retrieval performance. One strategy is to promote clicked pages directly in the initial retrieval result. Another strategy is to expand the original query using selected terms from the clicked pages. Our experimental results on the TREC GOV2 data and a query log of a major search engine show that both strategies can boost retrieval performance compared to the standard language model and pseudo relevance feedback (PRF) model. Their good performance on early precision allows us to apply PRF further for even more accurate result that is comparable to the performance of true relevance feedback.
Original language | English |
---|---|
Title of host publication | DEXA 2009 : 20th International Conference on Database and Expert Systems Applications |
Subtitle of host publication | Proceedings, 31 August - 4 September 2009, Linz, Austria |
Place of Publication | Los Alamitos, NJ |
Publisher | IEEE |
Pages | 393-397 |
Number of pages | 5 |
ISBN (Print) | 978-0-7695-3763-4 |
DOIs | |
Publication status | Published - Sept 2009 |
Event | DEXA Workshop on Text-based Information Retrieval (TIR2009) - Linz, Austria Duration: 31 Aug 2009 → 4 Sept 2009 |
Workshop
Workshop | DEXA Workshop on Text-based Information Retrieval (TIR2009) |
---|---|
Abbreviated title | TIR 2009 |
Country/Territory | Austria |
City | Linz |
Period | 31/08/09 → 4/09/09 |
Keywords
- METIS-263884
- EWI-15432
- IR-62834
- DB-IR: INFORMATION RETRIEVAL