Abstract
This paper presents WikiTranslate, a system that performs query translation for cross-lingual information retrieval (CLIR) using Wikipedia as its only translation resource. Queries are mapped to Wikipedia concepts, and the translations of these concepts in the target language are used to build the final query. WikiTranslate is evaluated by searching an English data collection with topics formulated in Dutch, French and Spanish. The system achieved 67% of the performance of the monolingual baseline.
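The concept-based translation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how queries are mapped to concepts, so the greedy longest-match step, the function names `map_to_concepts` and `translate_query`, and the toy `CROSS_LINGUAL_LINKS` table (standing in for Wikipedia's cross-lingual links) are all assumptions made for illustration.

```python
from typing import Dict, List

# Hypothetical toy data: Dutch Wikipedia article titles mapped to the English
# title they would reach through a cross-lingual link. A real system would
# read these links from Wikipedia itself.
CROSS_LINGUAL_LINKS: Dict[str, str] = {
    "fiets": "Bicycle",
    "tweede wereldoorlog": "World War II",
    "kaas": "Cheese",
}


def map_to_concepts(query: str, links: Dict[str, str]) -> List[str]:
    """Map a query to known concept titles using greedy longest match.

    Assumption: longest-match over word n-grams is only one simple way to
    find concepts; the paper's actual mapping step may differ.
    """
    words = query.lower().split()
    concepts: List[str] = []
    i = 0
    while i < len(words):
        # Try the longest span starting at position i first.
        for j in range(len(words), i, -1):
            candidate = " ".join(words[i:j])
            if candidate in links:
                concepts.append(candidate)
                i = j
                break
        else:
            # No concept starts at this word; skip it.
            i += 1
    return concepts


def translate_query(query: str, links: Dict[str, str]) -> List[str]:
    """Translate a query by following the cross-lingual link of each concept."""
    return [links[concept] for concept in map_to_concepts(query, links)]


if __name__ == "__main__":
    print(translate_query("geschiedenis tweede wereldoorlog fiets", CROSS_LINGUAL_LINKS))
```

In this sketch, words that match no concept are dropped from the translated query; whether and how unmatched terms should be handled is a separate design decision that the abstract does not address.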
Original language | English |
---|---|
Title of host publication | Evaluating Systems for Multilingual and Multimodal Information Access |
Subtitle of host publication | 9th Workshop of the Cross-Language Evaluation Forum, CLEF 2008, Aarhus, Denmark, September 17-19, 2008, Revised Selected Papers |
Editors | Carol Peters, Thomas Deselaers, Nicola Ferro, Julio Gonzalo |
Place of Publication | Berlin |
Publisher | Springer |
Pages | 58-65 |
Number of pages | 8 |
ISBN (Print) | 978-3-642-04446-5 |
Publication status | Published - 2009 |
Event | 9th Workshop of the Cross-Language Evaluation Forum, CLEF 2008, Aarhus, Denmark. Duration: 17 Sept 2008 → 19 Sept 2008. Conference number: 9 |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Publisher | Springer |
Volume | 5706 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Workshop
Workshop | 9th Workshop of the Cross-Language Evaluation Forum, CLEF 2008 |
---|---|
Abbreviated title | CLEF |
Country/Territory | Denmark |
City | Aarhus |
Period | 17/09/08 → 19/09/08 |
Keywords
- Cross-lingual information retrieval
- Query translation
- Word sense disambiguation
- Wikipedia
- Comparable corpus