A probabilistic XML approach to data integration

Maurice van Keulen, Ander de Keijzer, Wouter Alink

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

89 Citations (Scopus)
67 Downloads (Pure)

Abstract

In mobile and ambient environments, devices need to become autonomous, managing and resolving problems without interference from a user. The database of a (mobile) device can be seen as its knowledge about objects in the ýreal worldý. Data exchange between small and/or large computing devices can be used to supplement and update this knowledge whenever a connection gets established. In many situations, however, data from different data sources referring to the same real world objects, may conflict. It is the task of the data management system of the device to resolve such conflicts without interference from a user. In this paper, we take a first step in the development of a probabilistic XML DBMS. The main idea is to drop the assumption that data in the database should be certain: subtrees in XML documents may denote possible views on the real world. We formally define the notion of probabilistic XML tree and several operations thereon. We also present an approach for determining a logical semantics for queries on probabilistic XML data. Finally, we introduce an approach for XML data integration where conflicts are resolved by the introduction of possibilities in the database.
Original languageUndefined
Title of host publicationProceedings of the 21st International Conference on Data Engineering (ICDE'05)
Place of PublicationWashington, DC, USA
PublisherIEEE Computer Society
Pages459-470
Number of pages12
ISBN (Print)0-7695-2285-8
DOIs
Publication statusPublished - Apr 2005
Event21st International Conference on Data Engineering, ICDE 2005 - Tokyo, Japan
Duration: 5 Apr 20058 Apr 2005
Conference number: 21

Publication series

NameIEEE Conference Proceedings
PublisherIEEE Computer Society
ISSN (Print)1084-4627

Conference

Conference21st International Conference on Data Engineering, ICDE 2005
Abbreviated titleICDE
CountryJapan
CityTokyo
Period5/04/058/04/05

Keywords

  • METIS-225759
  • IR-53251
  • EWI-7273
  • DB-SDI: SCHEMA AND DATA INTEGRATION

Cite this

van Keulen, M., de Keijzer, A., & Alink, W. (2005). A probabilistic XML approach to data integration. In Proceedings of the 21st International Conference on Data Engineering (ICDE'05) (pp. 459-470). (IEEE Conference Proceedings). Washington, DC, USA: IEEE Computer Society. https://doi.org/10.1109/ICDE.2005.11
van Keulen, Maurice ; de Keijzer, Ander ; Alink, Wouter. / A probabilistic XML approach to data integration. Proceedings of the 21st International Conference on Data Engineering (ICDE'05). Washington, DC, USA : IEEE Computer Society, 2005. pp. 459-470 (IEEE Conference Proceedings).
@inproceedings{05d67ec0e0f84a4297b737db92ba754e,
title = "A probabilistic XML approach to data integration",
abstract = "In mobile and ambient environments, devices need to become autonomous, managing and resolving problems without interference from a user. The database of a (mobile) device can be seen as its knowledge about objects in the {\'y}real world{\'y}. Data exchange between small and/or large computing devices can be used to supplement and update this knowledge whenever a connection gets established. In many situations, however, data from different data sources referring to the same real world objects, may conflict. It is the task of the data management system of the device to resolve such conflicts without interference from a user. In this paper, we take a first step in the development of a probabilistic XML DBMS. The main idea is to drop the assumption that data in the database should be certain: subtrees in XML documents may denote possible views on the real world. We formally define the notion of probabilistic XML tree and several operations thereon. We also present an approach for determining a logical semantics for queries on probabilistic XML data. Finally, we introduce an approach for XML data integration where conflicts are resolved by the introduction of possibilities in the database.",
keywords = "METIS-225759, IR-53251, EWI-7273, DB-SDI: SCHEMA AND DATA INTEGRATION",
author = "{van Keulen}, Maurice and {de Keijzer}, Ander and Wouter Alink",
note = "Imported from EWI/DB PMS [db-utwente:inpr:0000003600]",
year = "2005",
month = "4",
doi = "10.1109/ICDE.2005.11",
language = "Undefined",
isbn = "0-7695-2285-8",
series = "IEEE Conference Proceedings",
publisher = "IEEE Computer Society",
pages = "459--470",
booktitle = "Proceedings of the 21st International Conference on Data Engineering (ICDE'05)",
address = "United States",

}

van Keulen, M, de Keijzer, A & Alink, W 2005, A probabilistic XML approach to data integration. in Proceedings of the 21st International Conference on Data Engineering (ICDE'05). IEEE Conference Proceedings, IEEE Computer Society, Washington, DC, USA, pp. 459-470, 21st International Conference on Data Engineering, ICDE 2005, Tokyo, Japan, 5/04/05. https://doi.org/10.1109/ICDE.2005.11

A probabilistic XML approach to data integration. / van Keulen, Maurice; de Keijzer, Ander; Alink, Wouter.

Proceedings of the 21st International Conference on Data Engineering (ICDE'05). Washington, DC, USA : IEEE Computer Society, 2005. p. 459-470 (IEEE Conference Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

TY - GEN

T1 - A probabilistic XML approach to data integration

AU - van Keulen, Maurice

AU - de Keijzer, Ander

AU - Alink, Wouter

N1 - Imported from EWI/DB PMS [db-utwente:inpr:0000003600]

PY - 2005/4

Y1 - 2005/4

N2 - In mobile and ambient environments, devices need to become autonomous, managing and resolving problems without interference from a user. The database of a (mobile) device can be seen as its knowledge about objects in the ýreal worldý. Data exchange between small and/or large computing devices can be used to supplement and update this knowledge whenever a connection gets established. In many situations, however, data from different data sources referring to the same real world objects, may conflict. It is the task of the data management system of the device to resolve such conflicts without interference from a user. In this paper, we take a first step in the development of a probabilistic XML DBMS. The main idea is to drop the assumption that data in the database should be certain: subtrees in XML documents may denote possible views on the real world. We formally define the notion of probabilistic XML tree and several operations thereon. We also present an approach for determining a logical semantics for queries on probabilistic XML data. Finally, we introduce an approach for XML data integration where conflicts are resolved by the introduction of possibilities in the database.

AB - In mobile and ambient environments, devices need to become autonomous, managing and resolving problems without interference from a user. The database of a (mobile) device can be seen as its knowledge about objects in the ýreal worldý. Data exchange between small and/or large computing devices can be used to supplement and update this knowledge whenever a connection gets established. In many situations, however, data from different data sources referring to the same real world objects, may conflict. It is the task of the data management system of the device to resolve such conflicts without interference from a user. In this paper, we take a first step in the development of a probabilistic XML DBMS. The main idea is to drop the assumption that data in the database should be certain: subtrees in XML documents may denote possible views on the real world. We formally define the notion of probabilistic XML tree and several operations thereon. We also present an approach for determining a logical semantics for queries on probabilistic XML data. Finally, we introduce an approach for XML data integration where conflicts are resolved by the introduction of possibilities in the database.

KW - METIS-225759

KW - IR-53251

KW - EWI-7273

KW - DB-SDI: SCHEMA AND DATA INTEGRATION

U2 - 10.1109/ICDE.2005.11

DO - 10.1109/ICDE.2005.11

M3 - Conference contribution

SN - 0-7695-2285-8

T3 - IEEE Conference Proceedings

SP - 459

EP - 470

BT - Proceedings of the 21st International Conference on Data Engineering (ICDE'05)

PB - IEEE Computer Society

CY - Washington, DC, USA

ER -

van Keulen M, de Keijzer A, Alink W. A probabilistic XML approach to data integration. In Proceedings of the 21st International Conference on Data Engineering (ICDE'05). Washington, DC, USA: IEEE Computer Society. 2005. p. 459-470. (IEEE Conference Proceedings). https://doi.org/10.1109/ICDE.2005.11