Disambiguation strategies for cross-language information retrieval

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

32 Citations (Scopus)
66 Downloads (Pure)

Abstract

This paper gives an overview of tools and methods for Cross-Language Information Retrieval (CLIR) that are developed within the Twenty-One project. The tools and methods are evaluated with the TREC CLIR task document collection using Dutch queries on the English document base. The main issue addressed here is an evaluation of two approaches to disambiguation. The underlying question is whether a lot of effort should be put in finding the correct translation for each query term before searching, or whether searching with more than one possible translation leads to better results? The experimental study suggests that the quality of search methods is more important than the quality of disambiguation methods. Good retrieval methods are able to disambiguate translated queries implicitly during searching.
Original languageUndefined
Title of host publicationResearch and Advanced Technology for Digital Libraries
Place of PublicationBerlin
PublisherSpringer
Pages274-293
Number of pages20
ISBN (Print)3-540-66558-7
Publication statusPublished - 1999

Publication series

NameLecture Notes in Computer Science
PublisherSpringer Verlag
Volume1696
ISSN (Print)0302-9743

Keywords

  • METIS-119719
  • CR-H.3.3
  • IR-66981
  • EWI-9423

Cite this

Hiemstra, D., & de Jong, F. M. G. (1999). Disambiguation strategies for cross-language information retrieval. In Research and Advanced Technology for Digital Libraries (pp. 274-293). (Lecture Notes in Computer Science; Vol. 1696). Berlin: Springer.
Hiemstra, Djoerd ; de Jong, Franciska M.G. / Disambiguation strategies for cross-language information retrieval. Research and Advanced Technology for Digital Libraries. Berlin : Springer, 1999. pp. 274-293 (Lecture Notes in Computer Science).
@inproceedings{da9cb2219948400b971c2af25f7e7339,
title = "Disambiguation strategies for cross-language information retrieval",
abstract = "This paper gives an overview of tools and methods for Cross-Language Information Retrieval (CLIR) that are developed within the Twenty-One project. The tools and methods are evaluated with the TREC CLIR task document collection using Dutch queries on the English document base. The main issue addressed here is an evaluation of two approaches to disambiguation. The underlying question is whether a lot of effort should be put in finding the correct translation for each query term before searching, or whether searching with more than one possible translation leads to better results? The experimental study suggests that the quality of search methods is more important than the quality of disambiguation methods. Good retrieval methods are able to disambiguate translated queries implicitly during searching.",
keywords = "METIS-119719, CR-H.3.3, IR-66981, EWI-9423",
author = "Djoerd Hiemstra and {de Jong}, {Franciska M.G.}",
note = "Awarded by the British Computer Society's Informer-Springer competition for the best student paper in Information Retrieval in 1999",
year = "1999",
language = "Undefined",
isbn = "3-540-66558-7",
series = "Lecture Notes in Computer Science",
publisher = "Springer",
pages = "274--293",
booktitle = "Research and Advanced Technology for Digital Libraries",

}

Hiemstra, D & de Jong, FMG 1999, Disambiguation strategies for cross-language information retrieval. in Research and Advanced Technology for Digital Libraries. Lecture Notes in Computer Science, vol. 1696, Springer, Berlin, pp. 274-293.

Disambiguation strategies for cross-language information retrieval. / Hiemstra, Djoerd; de Jong, Franciska M.G.

Research and Advanced Technology for Digital Libraries. Berlin : Springer, 1999. p. 274-293 (Lecture Notes in Computer Science; Vol. 1696).

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

TY - GEN

T1 - Disambiguation strategies for cross-language information retrieval

AU - Hiemstra, Djoerd

AU - de Jong, Franciska M.G.

N1 - Awarded by the British Computer Society's Informer-Springer competition for the best student paper in Information Retrieval in 1999

PY - 1999

Y1 - 1999

N2 - This paper gives an overview of tools and methods for Cross-Language Information Retrieval (CLIR) that are developed within the Twenty-One project. The tools and methods are evaluated with the TREC CLIR task document collection using Dutch queries on the English document base. The main issue addressed here is an evaluation of two approaches to disambiguation. The underlying question is whether a lot of effort should be put in finding the correct translation for each query term before searching, or whether searching with more than one possible translation leads to better results? The experimental study suggests that the quality of search methods is more important than the quality of disambiguation methods. Good retrieval methods are able to disambiguate translated queries implicitly during searching.

AB - This paper gives an overview of tools and methods for Cross-Language Information Retrieval (CLIR) that are developed within the Twenty-One project. The tools and methods are evaluated with the TREC CLIR task document collection using Dutch queries on the English document base. The main issue addressed here is an evaluation of two approaches to disambiguation. The underlying question is whether a lot of effort should be put in finding the correct translation for each query term before searching, or whether searching with more than one possible translation leads to better results? The experimental study suggests that the quality of search methods is more important than the quality of disambiguation methods. Good retrieval methods are able to disambiguate translated queries implicitly during searching.

KW - METIS-119719

KW - CR-H.3.3

KW - IR-66981

KW - EWI-9423

M3 - Conference contribution

SN - 3-540-66558-7

T3 - Lecture Notes in Computer Science

SP - 274

EP - 293

BT - Research and Advanced Technology for Digital Libraries

PB - Springer

CY - Berlin

ER -

Hiemstra D, de Jong FMG. Disambiguation strategies for cross-language information retrieval. In Research and Advanced Technology for Digital Libraries. Berlin: Springer. 1999. p. 274-293. (Lecture Notes in Computer Science).