A Cross-lingual Framework for Monolingual Biomedical Information Retrieval

Rudolf Berend Trieschnigg, Djoerd Hiemstra, Franciska M.G. de Jong, Wessel Kraaij

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

16 Citations (Scopus)
73 Downloads (Pure)

Abstract

An important challenge for biomedical information retrieval (IR) is dealing with the complex, inconsistent and ambiguous biomedical terminology. Frequently, a concept-based representation defined in terms of a domain-specific terminological resource is employed to deal with this challenge. In this paper, we approach the incorporation of a concept-based representation in monolingual biomedical IR from a cross-lingual perspective. In the proposed framework, this is realised by translating and matching between text and concept-based representations. The approach allows for deployment of a rich set of techniques proposed and evaluated in traditional cross-lingual IR (CLIR). We compare six translation models and measure their effectiveness in the biomedical domain. We demonstrate that the approach can result in significant improvements in retrieval effectiveness over word-based retrieval. Moreover, we demonstrate increased effectiveness of a CLIR framework for monolingual biomedical IR if basic translation models are combined.
Original language: Undefined
Title of host publication: Proceedings of the 19th ACM Conference on Information and Knowledge Management (CIKM '10)
Editors: J. Huang, N. Koudas, G. Jones, Xindong Wu, K. Collins-Thompson, Aijun An
Place of Publication: New York
Publisher: Association for Computing Machinery
Pages: 169-178
Number of pages: 10
ISBN (Print): 978-1-4503-0099-5
Publication status: Published - 2010
Event: 19th ACM International Conference on Information and Knowledge Management, CIKM 2010 - Toronto, Canada
Duration: 26 Oct 2010 → 30 Oct 2010
Conference number: 19

Publication series

Name
Publisher: ACM

Conference

Conference: 19th ACM International Conference on Information and Knowledge Management, CIKM 2010
Abbreviated title: CIKM
Country/Territory: Canada
City: Toronto
Period: 26/10/10 → 30/10/10

Keywords

  • IR-72470
  • METIS-270940
  • Biomedical information retrieval
  • CR-H.3.3
  • Language modeling
  • TREC Genomics
  • EWI-18213
  • CLIR framework
