Data integration has been a challenging problem for decades. In an ambient environment, where many autonomous devices have their own information sources and network connectivity is ad hoc and peer-to-peer, it becomes a serious bottleneck. To enable devices to exchange information without user interaction at data integration time and without extensive semantic annotations, a probabilistic approach seems promising: it teaches the device how to cope with the uncertainty that arises during data integration. Unfortunately, without any kind of world knowledge, almost everything becomes uncertain, so maintaining all possibilities produces huge integrated information sources. In this paper, we claim that very simple and generic rules alone provide enough world knowledge to drastically reduce the amount of uncertainty, and hence to tame the data explosion to a manageable size.
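The effect described in the abstract can be illustrated with a small counting sketch. It is not the method of the report: the record names, the function `count_worlds`, and both rules are hypothetical, chosen only to show how generic rules shrink the number of possibilities that would otherwise have to be maintained.

```python
from itertools import product


def count_worlds(source_a, source_b, rules=()):
    """Count the possible ways to pair records of source_a with source_b.

    A "world" assigns every record of source_a either the index of a
    partner in source_b or None (no counterpart). Each rule is a
    predicate that rejects worlds it considers impossible.
    """
    options = [None] + list(range(len(source_b)))
    count = 0
    for assignment in product(options, repeat=len(source_a)):
        if all(rule(source_a, source_b, assignment) for rule in rules):
            count += 1
    return count


def no_shared_partner(source_a, source_b, assignment):
    # Hypothetical generic rule: a source contains no duplicates, so two
    # records of source_a cannot refer to the same record of source_b.
    partners = [j for j in assignment if j is not None]
    return len(partners) == len(set(partners))


def same_first_letter(source_a, source_b, assignment):
    # Hypothetical generic rule: matched names share their first letter.
    return all(
        j is None or source_a[i][0].lower() == source_b[j][0].lower()
        for i, j in enumerate(assignment)
    )


# Made-up example data, for illustration only.
source_a = ["John", "Mary", "Mark"]
source_b = ["Jon", "Marie", "Paul"]

print(count_worlds(source_a, source_b))                       # 64
print(count_worlds(source_a, source_b, [no_shared_partner]))  # 34
print(count_worlds(source_a, source_b,
                   [no_shared_partner, same_first_letter]))   # 6
```

With three records per source, the unconstrained count is already 64; the two illustrative rules bring it down to 6. The gap widens quickly as the sources grow, which is the kind of data explosion the abstract refers to.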
|Field|Value|
|---|---|
|Place of Publication|Enschede|
|Publisher|Centrum voor Telematica en Informatie Technologie|
|Number of pages|13|
|Publication status|Published - Feb 2006|
|Name|CTIT Technical Report Series|
|Publisher|University of Twente, Centre for Telematics and Information Technology|
- DB-SDI: SCHEMA AND DATA INTEGRATION
de Keijzer, A., van Keulen, M., & Li, Y. (2006). Taming Data Explosion in Probabilistic Information Integration. (CTIT Technical Report Series; No. 06-05). Enschede: Centrum voor Telematica en Informatie Technologie.