TY - GEN
T1 - Taming Data Explosion in Probabilistic Information Integration
AU - de Keijzer, Ander
AU - van Keulen, Maurice
AU - Li, Yiping
N1 - Position paper. Pre-proceedings can be obtained from the workshop website (http://ssi.umh.ac.be/iidb) or from Jef Wijsen, Institut d'Informatique, Université de Mons-Hainaut, B-7000 Mons, Belgium.
PY - 2006/3/26
Y1 - 2006/3/26
AB - Data integration has been a challenging problem for decades. In autonomous data integration, i.e., without a user to resolve semantic uncertainty and conflicts between data sources, it even becomes a serious bottleneck. A probabilistic approach seems promising, as it requires neither extensive semantic annotations nor user interaction at integration time. It simply teaches the application how to cope generically with uncertainty. Unfortunately, without any world knowledge, uncertainty abounds, as almost everything becomes (theoretically) possible, and maintaining all possibilities produces huge volumes of data. In this paper, we claim that simple and generic knowledge rules are sufficient to drastically reduce uncertainty and hence tame the data explosion to a manageable size.
KW - DB-SDI: SCHEMA AND DATA INTEGRATION
M3 - Conference contribution
SP - 82
EP - 86
BT - Pre-Proceedings of the International Workshop on Inconsistency and Incompleteness in Databases (IIDB 2006)
PB - University of Mons-Hainaut, Belgium
T2 - Pre-International Workshop on Inconsistency and Incompleteness in Databases (IIDB 2006)
Y2 - 26 March 2006 through 26 March 2006
ER -