Taming Data Explosion in Probabilistic Information Integration

Ander de Keijzer, Maurice van Keulen, Yiping Li

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

17 Downloads (Pure)

Abstract

Data integration has been a challenging problem for decades. In autonomous data integration, i.e., without a user to solve semantic uncertainty and conflicts between data sources, it even becomes a serious bottleneck. A probabilistic approach seems promising as it does not require extensive semantic annotations nor user interaction at integration time. It simply teaches the application how to generically cope with uncertainty. Unfortunately, without any world knowledge, uncertainty abounds as almost everything becomes (theoretically) possible and maintaining all possibilities produces huge volumes of data. In this paper, we claim that simple and generic knowledge rules are sufficient to drastically reduce uncertainty, hence tame data explosion to a manageable size.
Original languageUndefined
Title of host publicationPre-Proceedings of the International Workshop on Inconsistency and Incompleteness in Databases (IIDB 2006)
PublisherUniversity of Mons-Hainaut, Belgium
Pages82-86
Number of pages5
ISBN (Print)not assigned
Publication statusPublished - 26 Mar 2006

Publication series

Name
PublisherUniversity of Mons-Hainaut, Belgium

Keywords

  • IR-66509
  • METIS-238693
  • DB-SDI: SCHEMA AND DATA INTEGRATION
  • EWI-7537

Cite this

de Keijzer, A., van Keulen, M., & Li, Y. (2006). Taming Data Explosion in Probabilistic Information Integration. In Pre-Proceedings of the International Workshop on Inconsistency and Incompleteness in Databases (IIDB 2006) (pp. 82-86). University of Mons-Hainaut, Belgium.