Taming Data Explosion in Probabilistic Information Integration

Ander de Keijzer, Maurice van Keulen, Yiping Li

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

23 Downloads (Pure)


Data integration has been a challenging problem for decades. In autonomous data integration, i.e., without a user to solve semantic uncertainty and conflicts between data sources, it even becomes a serious bottleneck. A probabilistic approach seems promising as it does not require extensive semantic annotations nor user interaction at integration time. It simply teaches the application how to generically cope with uncertainty. Unfortunately, without any world knowledge, uncertainty abounds as almost everything becomes (theoretically) possible and maintaining all possibilities produces huge volumes of data. In this paper, we claim that simple and generic knowledge rules are sufficient to drastically reduce uncertainty, hence tame data explosion to a manageable size.
Original languageUndefined
Title of host publicationPre-Proceedings of the International Workshop on Inconsistency and Incompleteness in Databases (IIDB 2006)
PublisherUniversity of Mons-Hainaut, Belgium
Number of pages5
ISBN (Print)not assigned
Publication statusPublished - 26 Mar 2006

Publication series

PublisherUniversity of Mons-Hainaut, Belgium


  • IR-66509
  • METIS-238693
  • EWI-7537

Cite this