Activities per year
Abstract
Data interoperability is a major issue in data management for data science and big data analytics. Probabilistic data integration (PDI) is a specific kind of data integration where extraction and integration problems such as inconsistency and uncertainty are handled by means of a probabilistic data representation. This allows a data integration process with two phases: (1) a quick partial integration where data quality problems are represented as uncertainty in the resulting integrated data, and (2) using the uncertain data and continuously improving its quality as more evidence is gathered. The main contribution of this paper is an iterative approach for incorporating evidence of users in the probabilistically integrated data. Evidence can be specified as hard or soft rules (i.e., rules that are uncertain themselves).
Original language | English |
---|---|
Title of host publication | Scalable Uncertainty Management |
Subtitle of host publication | 12th International Conference, SUM 2018, Milan, Italy, October 3-5, 2018, Proceedings |
Editors | Davide Ciucci, Gabriella Pasi, Barbara Vantaggi |
Place of Publication | Cham |
Publisher | Springer |
Pages | 290-305 |
Number of pages | 16 |
ISBN (Electronic) | 978-3-030-00461-3 |
ISBN (Print) | 978-3-030-00460-6 |
DOIs | |
Publication status | Published - 1 Jan 2018 |
Event | 12th International Conference on Scalable Uncertainty Management 2018 - Milan, Italy Duration: 3 Oct 2018 → 5 Oct 2018 Conference number: 12 http://www.ir.disco.unimib.it/sum2018/ |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Publisher | Springer |
Volume | 11142 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 12th International Conference on Scalable Uncertainty Management 2018 |
---|---|
Abbreviated title | SUM 2018 |
Country/Territory | Italy |
City | Milan |
Period | 3/10/18 → 5/10/18 |
Internet address |
Keywords
- Data cleaning
- Data integration
- Information extraction
- Probabilistic databases
- Probabilistic programming
Fingerprint
Dive into the research topics of 'Rule-based conditioning of probabilistic data'. Together they form a unique fingerprint.Activities
- 1 Oral presentation
-
Rule-based Conditioning of Probabilistic Data
van Keulen, M. (Speaker)
4 Oct 2018Activity: Talk or presentation › Oral presentation
Prizes
-
Beste paper award
van Keulen, M. (Recipient), Kaminski, B. (Recipient), Matheja, C. (Recipient) & Katoen, J. P. (Recipient), 4 Oct 2018
Prize