Often data processing is not implemented by a workflow system or an integration application but is performed manually by humans along the lines of a more or less specified procedure. Collecting provenance information in semi-structured processes can not be automated. Further, manual collection of provenance information is error prone and time consuming. Therefore, we propose to infer provenance information based on the file read and write access of users. The derived provenance information is complete, but has a low precision. Therefore, we propose further to introducing organizational guidelines in order to improve the precision of the inferred provenance information.
|Title of host publication||Data-driven process discovery and analysis : second IFIP WG 2.6, 2.12 International Symposium, SIMPDA 2012, Campione d'Italia, Italy, June 18-20, 2012 : revised selected papers|
|Editors||Philippe Cudré-Mauroux, Paolo Ceravolo, Dragan Gasevic|
|Place of Publication||Berlin|
|Number of pages||16|
|Publication status||Published - 2013|
|Name||Lecture notes in business information processing|
Wombacher, A., & Huq, M. R. (2013). Towards automatic capturing of semi-structured process provenance. In P. Cudré-Mauroux, P. Ceravolo, & D. Gasevic (Eds.), Data-driven process discovery and analysis : second IFIP WG 2.6, 2.12 International Symposium, SIMPDA 2012, Campione d'Italia, Italy, June 18-20, 2012 : revised selected papers (pp. 84-99). (Lecture notes in business information processing; Vol. 162, No. 162). Berlin: Springer. https://doi.org/10.1007/978-3-642-40919-6_5