Taming Data Explosion in Probabilistic Information Integration

Ander de Keijzer, Maurice van Keulen, Yiping Li

Research output: Book/Report > Report > Professional


Abstract

Data integration has been a challenging problem for decades. In an ambient environment, where many autonomous devices have their own information sources and network connectivity is ad hoc and peer-to-peer, it becomes a serious bottleneck. A probabilistic approach seems promising for enabling devices to exchange information without user interaction at data integration time and without extensive semantic annotations: it teaches the device how to cope with the uncertainty that arises during data integration. Unfortunately, without any kind of world knowledge, almost everything becomes uncertain, so maintaining all possibilities produces huge integrated information sources. In this paper, we claim that very simple and generic rules provide enough world knowledge to drastically reduce the amount of uncertainty and thereby tame the data explosion to a manageable size.
Original language: Undefined
Place of Publication: Enschede
Publisher: Centrum voor Telematica en Informatie Technologie
Number of pages: 13
Publication status: Published - Feb 2006

Publication series

Name: CTIT Technical Report Series
Publisher: University of Twente, Centre for Telematics and Information Technology
No.: 06-05
ISSN (Print): 1381-3625

Keywords

  • DB-SDI: SCHEMA AND DATA INTEGRATION
  • IR-66507
  • EWI-7534
  • METIS-238691

Cite this

de Keijzer, A., van Keulen, M., & Li, Y. (2006). Taming Data Explosion in Probabilistic Information Integration. (CTIT Technical Report Series; No. 06-05). Enschede: Centrum voor Telematica en Informatie Technologie.