Taming Data Explosion in Probabilistic Information Integration

Ander de Keijzer, Maurice van Keulen, Yiping Li

Research output: Book/ReportReportProfessional

291 Downloads (Pure)

Abstract

Data integration has been a challenging problem for decades. In an ambient environment, where many autonomous devices have their own information sources and network connectivity is ad hoc and peer-to-peer, it even becomes a serious bottleneck. To enable devices to exchange information without the need for interaction with a user at data integration time and without the need for extensive semantic annotations, a probabilistic approach seems rather promising. It simply teaches the device how to cope with the uncertainty occurring during data integration. Unfortunately, without any kind of world knowledge, almost everything becomes uncertain, hence maintaining all possibilities produces huge integrated information sources. In this paper, we claim that only very simple and generic rules are enough world knowledge to drastically reduce the amount of uncertainty, hence to tame the data explosion to a manageable size.
Original languageEnglish
Place of PublicationEnschede
PublisherCentre for Telematics and Information Technology (CTIT)
Number of pages13
Publication statusPublished - Feb 2006

Publication series

NameCTIT Technical Report Series
PublisherUniversity of Twente, Centre for Telematics and Information Technology
No.06-05
ISSN (Print)1381-3625

Keywords

  • DB-SDI: SCHEMA AND DATA INTEGRATION

Fingerprint

Dive into the research topics of 'Taming Data Explosion in Probabilistic Information Integration'. Together they form a unique fingerprint.

Cite this