MonetDB/XQuery: a fast XQuery processor powered by a relational engine

P. Boncz, T. Grust, Maurice van Keulen, S. Manegold, J. Rittinger, J. Teubner

  • 171 Citations

Abstract

Relational XQuery systems try to re-use mature relational data management infrastructures to create fast and scalable XML database technology. This paper describes the main features, key contributions, and lessons learned while implementing such a system. Its architecture consists of (i) a range-based encoding of XML documents into relational tables, (ii) a compilation technique that translates XQuery into a basic relational algebra, (iii) a restricted (order) property-aware peephole relational query optimization strategy, and (iv) a mapping from XML update statements into relational updates. Thus, this system implements all essential XML database functionalities (rather than a single feature) such that we can learn from the full consequences of our architectural decisions. While implementing this system, we had to extend the state-of-the-art with a number of new technical contributions, such as loop-lifted staircase join and efficient relational query evaluation strategies for XQuery theta-joins with existential semantics. These contributions as well as the architectural lessons learned are also deemed valuable for other relational back-end engines. The performance and scalability of the resulting system is evaluated on the XMark benchmark up to data sizes of 11GB. The performance section also provides an extensive benchmark comparison of all major XMark results published previously, which confirm that the goal of purely relational XQuery processing, namely speed and scalability, was met.
Original languageUndefined
Title of host publicationProceedings of the 2006 ACM SIGMOD international conference on Management of data
Place of PublicationNew York, NY, USA
PublisherACM Press
Pages479-490
Number of pages12
ISBN (Print)1-59593-434-0
DOIs
StatePublished - Jun 2006

Publication series

Name
PublisherACM Press
Number2

Fingerprint

XML
Scalability
Information management
Algebra
Semantics
Engines

Keywords

  • DB-XMLDB: XML DATABASES
  • EWI-7427
  • IR-66482
  • DB-PRJPF: PATHFINDER
  • METIS-238207

Cite this

Boncz, P., Grust, T., van Keulen, M., Manegold, S., Rittinger, J., & Teubner, J. (2006). MonetDB/XQuery: a fast XQuery processor powered by a relational engine. In Proceedings of the 2006 ACM SIGMOD international conference on Management of data (pp. 479-490). [10.1145/1142473.1142527] New York, NY, USA: ACM Press. DOI: 10.1145/1142473.1142527

Boncz, P.; Grust, T.; van Keulen, Maurice; Manegold, S.; Rittinger, J.; Teubner, J. / MonetDB/XQuery: a fast XQuery processor powered by a relational engine.

Proceedings of the 2006 ACM SIGMOD international conference on Management of data. New York, NY, USA : ACM Press, 2006. p. 479-490 10.1145/1142473.1142527.

Research output: Scientific - peer-reviewConference contribution

@inbook{d0c90b044d8043f1a9101fcf4f796384,
title = "MonetDB/XQuery: a fast XQuery processor powered by a relational engine",
abstract = "Relational XQuery systems try to re-use mature relational data management infrastructures to create fast and scalable XML database technology. This paper describes the main features, key contributions, and lessons learned while implementing such a system. Its architecture consists of (i) a range-based encoding of XML documents into relational tables, (ii) a compilation technique that translates XQuery into a basic relational algebra, (iii) a restricted (order) property-aware peephole relational query optimization strategy, and (iv) a mapping from XML update statements into relational updates. Thus, this system implements all essential XML database functionalities (rather than a single feature) such that we can learn from the full consequences of our architectural decisions. While implementing this system, we had to extend the state-of-the-art with a number of new technical contributions, such as loop-lifted staircase join and efficient relational query evaluation strategies for XQuery theta-joins with existential semantics. These contributions as well as the architectural lessons learned are also deemed valuable for other relational back-end engines. The performance and scalability of the resulting system is evaluated on the XMark benchmark up to data sizes of 11GB. The performance section also provides an extensive benchmark comparison of all major XMark results published previously, which confirm that the goal of purely relational XQuery processing, namely speed and scalability, was met.",
keywords = "DB-XMLDB: XML DATABASES, EWI-7427, IR-66482, DB-PRJPF: PATHFINDER, METIS-238207",
author = "P. Boncz and T. Grust and {van Keulen}, Maurice and S. Manegold and J. Rittinger and J. Teubner",
note = "Imported from EWI/DB PMS [db-utwente:inpr:0000003717]",
year = "2006",
month = "6",
doi = "10.1145/1142473.1142527",
isbn = "1-59593-434-0",
publisher = "ACM Press",
number = "2",
pages = "479--490",
booktitle = "Proceedings of the 2006 ACM SIGMOD international conference on Management of data",

}

Boncz, P, Grust, T, van Keulen, M, Manegold, S, Rittinger, J & Teubner, J 2006, MonetDB/XQuery: a fast XQuery processor powered by a relational engine. in Proceedings of the 2006 ACM SIGMOD international conference on Management of data., 10.1145/1142473.1142527, ACM Press, New York, NY, USA, pp. 479-490. DOI: 10.1145/1142473.1142527

MonetDB/XQuery: a fast XQuery processor powered by a relational engine. / Boncz, P.; Grust, T.; van Keulen, Maurice; Manegold, S.; Rittinger, J.; Teubner, J.

Proceedings of the 2006 ACM SIGMOD international conference on Management of data. New York, NY, USA : ACM Press, 2006. p. 479-490 10.1145/1142473.1142527.

Research output: Scientific - peer-reviewConference contribution

TY - CHAP

T1 - MonetDB/XQuery: a fast XQuery processor powered by a relational engine

AU - Boncz,P.

AU - Grust,T.

AU - van Keulen,Maurice

AU - Manegold,S.

AU - Rittinger,J.

AU - Teubner,J.

N1 - Imported from EWI/DB PMS [db-utwente:inpr:0000003717]

PY - 2006/6

Y1 - 2006/6

N2 - Relational XQuery systems try to re-use mature relational data management infrastructures to create fast and scalable XML database technology. This paper describes the main features, key contributions, and lessons learned while implementing such a system. Its architecture consists of (i) a range-based encoding of XML documents into relational tables, (ii) a compilation technique that translates XQuery into a basic relational algebra, (iii) a restricted (order) property-aware peephole relational query optimization strategy, and (iv) a mapping from XML update statements into relational updates. Thus, this system implements all essential XML database functionalities (rather than a single feature) such that we can learn from the full consequences of our architectural decisions. While implementing this system, we had to extend the state-of-the-art with a number of new technical contributions, such as loop-lifted staircase join and efficient relational query evaluation strategies for XQuery theta-joins with existential semantics. These contributions as well as the architectural lessons learned are also deemed valuable for other relational back-end engines. The performance and scalability of the resulting system is evaluated on the XMark benchmark up to data sizes of 11GB. The performance section also provides an extensive benchmark comparison of all major XMark results published previously, which confirm that the goal of purely relational XQuery processing, namely speed and scalability, was met.

AB - Relational XQuery systems try to re-use mature relational data management infrastructures to create fast and scalable XML database technology. This paper describes the main features, key contributions, and lessons learned while implementing such a system. Its architecture consists of (i) a range-based encoding of XML documents into relational tables, (ii) a compilation technique that translates XQuery into a basic relational algebra, (iii) a restricted (order) property-aware peephole relational query optimization strategy, and (iv) a mapping from XML update statements into relational updates. Thus, this system implements all essential XML database functionalities (rather than a single feature) such that we can learn from the full consequences of our architectural decisions. While implementing this system, we had to extend the state-of-the-art with a number of new technical contributions, such as loop-lifted staircase join and efficient relational query evaluation strategies for XQuery theta-joins with existential semantics. These contributions as well as the architectural lessons learned are also deemed valuable for other relational back-end engines. The performance and scalability of the resulting system is evaluated on the XMark benchmark up to data sizes of 11GB. The performance section also provides an extensive benchmark comparison of all major XMark results published previously, which confirm that the goal of purely relational XQuery processing, namely speed and scalability, was met.

KW - DB-XMLDB: XML DATABASES

KW - EWI-7427

KW - IR-66482

KW - DB-PRJPF: PATHFINDER

KW - METIS-238207

U2 - 10.1145/1142473.1142527

DO - 10.1145/1142473.1142527

M3 - Conference contribution

SN - 1-59593-434-0

SP - 479

EP - 490

BT - Proceedings of the 2006 ACM SIGMOD international conference on Management of data

PB - ACM Press

ER -

Boncz P, Grust T, van Keulen M, Manegold S, Rittinger J, Teubner J. MonetDB/XQuery: a fast XQuery processor powered by a relational engine. In Proceedings of the 2006 ACM SIGMOD international conference on Management of data. New York, NY, USA: ACM Press. 2006. p. 479-490. 10.1145/1142473.1142527. Available from, DOI: 10.1145/1142473.1142527