Structured Document Retrieval, Multimedia Retrieval, and Entity Ranking Using PF/Tijah

Theodora Tsikrika, Pavel Serdyukov, H. Rode, T.H.W. Westerveld, Robin Aly, Djoerd Hiemstra, A.P. de Vries

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

21 Citations (Scopus)
138 Downloads (Pure)

Abstract

CWI and University of Twente used PF/Tijah, a flexible XML retrieval system, to evaluate structured document retrieval, multimedia retrieval, and entity ranking tasks in the context of INEX 2007. For the retrieval of textual and multimedia elements in the Wikipedia data, we investigated various length priors and found that biasing towards longer elements than the ones retrieved by our language modelling approach can be useful. For retrieving images in isolation, we found that their associated text is a very good source of evidence in the Wikipedia collection. For the entity ranking task, we used random walks to model multi-step relevance propagation from the articles describing entities to all related entities and further, and obtained promising results.
Original languageUndefined
Title of host publicationProceedings of the 6th Initiative on the Evaluation of XML Retrieval (INEX 2007)
Place of PublicationLondon
PublisherSpringer
Pages306-320
Number of pages15
ISBN (Print)978-3-540-85901-7
DOIs
Publication statusPublished - 11 Mar 2008
Event6th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2007 - Dagstuhl, Germany
Duration: 17 Dec 200719 Dec 2007
Conference number: 6

Publication series

NameLecture Notes in Computer Science
PublisherSpringer Verlag
Number1
Volume4862

Workshop

Workshop6th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2007
Abbreviated titleINEX
Country/TerritoryGermany
CityDagstuhl
Period17/12/0719/12/07

Keywords

  • DB-IR: INFORMATION RETRIEVAL
  • METIS-250968
  • EWI-12318
  • IR-64734

Cite this