TY - GEN
T1 - MonetDB/XQuery: a fast XQuery processor powered by a relational engine
AU - Boncz, P.
AU - Grust, T.
AU - van Keulen, Maurice
AU - Manegold, S.
AU - Rittinger, J.
AU - Teubner, J.
N1 - Imported from EWI/DB PMS [db-utwente:inpr:0000003717]
PY - 2006/6
Y1 - 2006/6
N2 - Relational XQuery systems try to re-use mature relational data management infrastructures to create fast and scalable XML database technology. This paper describes the main features, key contributions, and lessons learned while implementing such a system. Its architecture consists of (i) a range-based encoding of XML documents into relational tables, (ii) a compilation technique that translates XQuery into a basic relational algebra, (iii) a restricted (order) property-aware peephole relational query optimization strategy, and (iv) a mapping from XML update statements into relational updates. Thus, this system implements all essential XML database functionalities (rather than a single feature) such that we can learn from the full consequences of our architectural decisions. While implementing this system, we had to extend the state-of-the-art with a number of new technical contributions, such as loop-lifted staircase join and efficient relational query evaluation strategies for XQuery theta-joins with existential semantics. These contributions as well as the architectural lessons learned are also deemed valuable for other relational back-end engines. The performance and scalability of the resulting system is evaluated on the XMark benchmark up to data sizes of 11GB. The performance section also provides an extensive benchmark comparison of all major XMark results published previously, which confirm that the goal of purely relational XQuery processing, namely speed and scalability, was met.
AB - Relational XQuery systems try to re-use mature relational data management infrastructures to create fast and scalable XML database technology. This paper describes the main features, key contributions, and lessons learned while implementing such a system. Its architecture consists of (i) a range-based encoding of XML documents into relational tables, (ii) a compilation technique that translates XQuery into a basic relational algebra, (iii) a restricted (order) property-aware peephole relational query optimization strategy, and (iv) a mapping from XML update statements into relational updates. Thus, this system implements all essential XML database functionalities (rather than a single feature) such that we can learn from the full consequences of our architectural decisions. While implementing this system, we had to extend the state-of-the-art with a number of new technical contributions, such as loop-lifted staircase join and efficient relational query evaluation strategies for XQuery theta-joins with existential semantics. These contributions as well as the architectural lessons learned are also deemed valuable for other relational back-end engines. The performance and scalability of the resulting system is evaluated on the XMark benchmark up to data sizes of 11GB. The performance section also provides an extensive benchmark comparison of all major XMark results published previously, which confirm that the goal of purely relational XQuery processing, namely speed and scalability, was met.
KW - DB-XMLDB: XML DATABASES
KW - EWI-7427
KW - IR-66482
KW - DB-PRJPF: PATHFINDER
KW - METIS-238207
U2 - 10.1145/1142473.1142527
DO - 10.1145/1142473.1142527
M3 - Conference contribution
SN - 1-59593-434-0
SP - 479
EP - 490
BT - Proceedings of the 2006 ACM SIGMOD international conference on Management of data
PB - ACM Press
CY - New York, NY, USA
T2 - 2006 ACM SIGMOD international conference on Management of data, Chicago, IL, USA
Y2 - 1 June 2006
ER -