Creating a Dutch testbed to evaluate the retrieval from textual databases

Djoerd Hiemstra, David A. van Leeuwen

Research output: Book/ReportReportAcademic

11 Downloads (Pure)

Abstract

This paper describes the first large-scale evaluation of information retrieval systems using Dutch documents and queries. We describe in detail the characteristics of the Dutch test data, which is part of the official CLEF multilingual texttual database, and give an overview of the experimental results of companies and research institutions that participated in the first official Dutch CLEF experiments. Judging from these experiments, the handling of language-specific issues of Dutch, like for instance simple morphology and compound nouns, significantly improves the performance of information retrieval systems in many cases. Careful examination of the test collection shows that it serves as a reliable tool for the evaluation of information retrieval systems in the future.
Original languageUndefined
PublisherUniversity of Twente
Number of pages14
Publication statusPublished - Feb 2002

Publication series

NameCTIT Technical Report Series
No.01-xx

Keywords

  • EWI-5885
  • IR-38058
  • METIS-208509

Cite this

Hiemstra, D., & van Leeuwen, D. A. (2002). Creating a Dutch testbed to evaluate the retrieval from textual databases. (CTIT Technical Report Series; No. 01-xx). University of Twente.
Hiemstra, Djoerd ; van Leeuwen, David A. / Creating a Dutch testbed to evaluate the retrieval from textual databases. University of Twente, 2002. 14 p. (CTIT Technical Report Series; 01-xx).
@book{8aa5ffee24874cda8c644cd6680a260c,
title = "Creating a Dutch testbed to evaluate the retrieval from textual databases",
abstract = "This paper describes the first large-scale evaluation of information retrieval systems using Dutch documents and queries. We describe in detail the characteristics of the Dutch test data, which is part of the official CLEF multilingual texttual database, and give an overview of the experimental results of companies and research institutions that participated in the first official Dutch CLEF experiments. Judging from these experiments, the handling of language-specific issues of Dutch, like for instance simple morphology and compound nouns, significantly improves the performance of information retrieval systems in many cases. Careful examination of the test collection shows that it serves as a reliable tool for the evaluation of information retrieval systems in the future.",
keywords = "EWI-5885, IR-38058, METIS-208509",
author = "Djoerd Hiemstra and {van Leeuwen}, {David A.}",
note = "Imported from CTIT",
year = "2002",
month = "2",
language = "Undefined",
series = "CTIT Technical Report Series",
publisher = "University of Twente",
number = "01-xx",
address = "Netherlands",

}

Hiemstra, D & van Leeuwen, DA 2002, Creating a Dutch testbed to evaluate the retrieval from textual databases. CTIT Technical Report Series, no. 01-xx, University of Twente.

Creating a Dutch testbed to evaluate the retrieval from textual databases. / Hiemstra, Djoerd; van Leeuwen, David A.

University of Twente, 2002. 14 p. (CTIT Technical Report Series; No. 01-xx).

Research output: Book/ReportReportAcademic

TY - BOOK

T1 - Creating a Dutch testbed to evaluate the retrieval from textual databases

AU - Hiemstra, Djoerd

AU - van Leeuwen, David A.

N1 - Imported from CTIT

PY - 2002/2

Y1 - 2002/2

N2 - This paper describes the first large-scale evaluation of information retrieval systems using Dutch documents and queries. We describe in detail the characteristics of the Dutch test data, which is part of the official CLEF multilingual texttual database, and give an overview of the experimental results of companies and research institutions that participated in the first official Dutch CLEF experiments. Judging from these experiments, the handling of language-specific issues of Dutch, like for instance simple morphology and compound nouns, significantly improves the performance of information retrieval systems in many cases. Careful examination of the test collection shows that it serves as a reliable tool for the evaluation of information retrieval systems in the future.

AB - This paper describes the first large-scale evaluation of information retrieval systems using Dutch documents and queries. We describe in detail the characteristics of the Dutch test data, which is part of the official CLEF multilingual texttual database, and give an overview of the experimental results of companies and research institutions that participated in the first official Dutch CLEF experiments. Judging from these experiments, the handling of language-specific issues of Dutch, like for instance simple morphology and compound nouns, significantly improves the performance of information retrieval systems in many cases. Careful examination of the test collection shows that it serves as a reliable tool for the evaluation of information retrieval systems in the future.

KW - EWI-5885

KW - IR-38058

KW - METIS-208509

M3 - Report

T3 - CTIT Technical Report Series

BT - Creating a Dutch testbed to evaluate the retrieval from textual databases

PB - University of Twente

ER -

Hiemstra D, van Leeuwen DA. Creating a Dutch testbed to evaluate the retrieval from textual databases. University of Twente, 2002. 14 p. (CTIT Technical Report Series; 01-xx).