Score Region Algebra: Building a Transparent XML-IR Database

V. Mihajlovic, H.E. Blok, Djoerd Hiemstra, Peter M.G. Apers

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

7 Citations (Scopus)
159 Downloads (Pure)

Abstract

A unified database framework that enables a better understanding of ranked XML retrieval remains a challenge in the XML database field. We propose a logical algebra, named score region algebra, that enables transparent specification of information retrieval (IR) models for XML databases. Transparency is achieved by instantiating various retrieval models through abstract score functions within the algebra operators, while the logical query plan and the operator definitions remain unchanged. Our algebra operators model three important aspects of XML retrieval: element relevance score computation, element score propagation, and element score combination. To illustrate the usefulness of our algebra, we instantiate four different, well-known IR scoring models and combine them with different score propagation and combination functions. We implemented the algebra operators in a prototype system on top of a low-level database kernel. The system is evaluated on a collection of IEEE articles in XML format provided by INEX. We argue that state-of-the-art XML IR models can be transparently implemented using our score region algebra framework on top of any low-level physical database engine or existing RDBMS, allowing a more systematic investigation of retrieval model behavior.
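
The separation described in the abstract, fixed operators with pluggable score functions, can be illustrated with a minimal Python sketch. This is not the paper's operator set or notation: the `Region` type, the operator names `score_elements`, `combine`, and `propagate`, and the toy score functions (raw term frequency, product, sum, max) are all hypothetical and chosen only to show how the three aspects can be swapped independently.

```python
from dataclasses import dataclass, replace
from typing import Callable, List, Sequence


@dataclass(frozen=True)
class Region:
    """A text span [start, end] with a name and a relevance score (hypothetical type)."""
    start: int
    end: int
    name: str
    score: float = 1.0


def contains(outer: Region, inner: Region) -> bool:
    """True if `inner` lies within the span of `outer`."""
    return outer.start <= inner.start and inner.end <= outer.end


def score_elements(elements: Sequence[Region], term_hits: Sequence[Region],
                   f_score: Callable[[Region, List[Region]], float]) -> List[Region]:
    """Score computation: rescore each element from the term hits it contains."""
    return [replace(e, score=e.score * f_score(e, [t for t in term_hits if contains(e, t)]))
            for e in elements]


def combine(left: Sequence[Region], right: Sequence[Region],
            f_comb: Callable[[float, float], float]) -> List[Region]:
    """Score combination: merge the scores of regions that share the same span."""
    by_span = {(r.start, r.end): r.score for r in right}
    return [replace(l, score=f_comb(l.score, by_span[(l.start, l.end)]))
            for l in left if (l.start, l.end) in by_span]


def propagate(ancestors: Sequence[Region], descendants: Sequence[Region],
              f_prop: Callable[[List[float]], float]) -> List[Region]:
    """Score propagation: derive each ancestor's score from its contained descendants."""
    return [replace(a, score=f_prop([d.score for d in descendants if contains(a, d)]))
            for a in ancestors]


# Toy data (assumed): one article holding two sections, plus term occurrences.
article = Region(0, 19, "article")
sections = [Region(1, 9, "sec"), Region(10, 18, "sec")]
xml_hits = [Region(p, p, "xml") for p in (3, 5, 12)]
ir_hits = [Region(p, p, "retrieval") for p in (4, 15)]


def tf(element: Region, hits: List[Region]) -> float:
    """Toy score function: raw term frequency inside the element."""
    return float(len(hits))


# The plan below stays the same; only the function arguments change.
by_xml = score_elements(sections, xml_hits, tf)
by_ir = score_elements(sections, ir_hits, tf)
both = combine(by_xml, by_ir, lambda a, b: a * b)
ranked_sum = propagate([article], both, sum)
ranked_max = propagate([article], both, max)
```

Replacing `sum` with `max`, or `tf` with another scoring function, changes the ranking behavior without touching the operators or the order in which they are applied, which is the kind of transparency the abstract refers to.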
Original language: Undefined
Title of host publication: Proceedings of the 14th ACM international conference on Information and knowledge management (CIKM 2005)
Editors: A. Chowdhury, N. Fuhr, M. Ronthaler, H-J. Schek, W. Teiken
Place of Publication: New York, NY, USA
Publisher: ACM Press
Pages: 12-19
Number of pages: 8
ISBN (Print): 1-59593-140-6
Publication status: Published - Oct 2005
Event: 14th ACM International Conference on Information and Knowledge Management, CIKM 2005 - Bremen, Germany
Duration: 31 Oct 2005 to 5 Nov 2005
Conference number: 14

Publication series

Publisher: ACM

Conference

Conference: 14th ACM International Conference on Information and Knowledge Management, CIKM 2005
Abbreviated title: CIKM
Country/Territory: Germany
City: Bremen
Period: 31/10/05 to 5/11/05

Keywords

  • METIS-225956
  • DB-XMLIR: XML INFORMATION RETRIEVAL
  • IR-53340
  • EWI-7297
