Abstract
This paper gives an overview of the tools and methods for Cross-Language Information Retrieval (CLIR) that were developed within the Twenty-One project. The tools and methods are evaluated with the TREC CLIR task document collection using Dutch queries on the English document base. The main issue addressed here is an evaluation of two approaches to disambiguation. The underlying question is whether a lot of effort should be put in finding the correct translation for each query term before searching, or whether searching with more than one possible translation leads to better results? The experimental study suggests that in terms of average precision, searching with ambiguities leads to better retrieval performance than searching with disambiguated queries.
Original language | English |
---|---|
Title of host publication | Language Technology in Multimedia Information Retrieval |
Subtitle of host publication | Proceedings of the Fourteenth Twente Workshop on Language Technology TWLT-14 |
Place of Publication | Enschede |
Publisher | University of Twente |
Pages | 19-26 |
Number of pages | 8 |
Publication status | Published - 1998 |
Event | Twente Workshop on Language Technology, TWLT 14: Language Technology in Multimedia Information Retrieval - University of Twente, Enschede, Netherlands Duration: 7 Dec 1998 → 8 Dec 1998 Conference number: 14 |
Publication series
Name | Twente Workshop on Language Technology |
---|---|
Publisher | Neslia Paniculata |
Number | 14 |
ISSN (Print) | 0929-0672 |
Workshop
Workshop | Twente Workshop on Language Technology, TWLT 14 |
---|---|
Abbreviated title | TWLT |
Country/Territory | Netherlands |
City | Enschede |
Period | 7/12/98 → 8/12/98 |
Keywords
- EWI-9487
- CR-I.2.7
- Statistical Machine Translation
- IR-66994
- Cross-Language Information Retrieval
- METIS-119181