Abstract
Probabilistic approaches for data integration have much potential. We view data integration as an iterative process where data understanding gradually increases as the data scientist continuously refines his view on how to deal with learned intricacies like data conflicts. This paper presents a probabilistic approach for integrating data on groupings. We focus on a bio-informatics use case concerning homology. A bio-informatician has a large number of homology data sources to choose from. To enable querying combined knowledge contained in these sources, they need to be integrated. We validate our approach by integrating three real-world biological databases on homology in three iterations.
Original language | English |
---|---|
Title of host publication | Proceedings of the 26th International Conference on Database and Expert Systems Applications, DEXA 2015 |
Place of Publication | Switzerland |
Publisher | Springer |
Pages | 236-250 |
Number of pages | 15 |
ISBN (Print) | 978-3-319-22848-8 |
DOIs | |
Publication status | Published - 1 Sept 2015 |
Event | 26th International Conference on Database and Expert Systems Applications, DEXA 2015 - Valencia, Spain Duration: 1 Sept 2015 → 4 Sept 2015 Conference number: 26 http://www.dexa.org/previous/dexa2015/ |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Publisher | Springer International Publishing |
Volume | 9261 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 26th International Conference on Database and Expert Systems Applications, DEXA 2015 |
---|---|
Abbreviated title | DEXA |
Country/Territory | Spain |
City | Valencia |
Period | 1/09/15 → 4/09/15 |
Internet address |
Keywords
- Grouping
- Probabilistic databases