Fine-Grained Provenance Inference for a Large Processing Chain with Non-materialized Intermediate Views

M.R. Huq, Peter M.G. Apers, Andreas Wombacher

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

4 Citations (Scopus)
22 Downloads (Pure)

Abstract

Many applications facilitate a data processing chain, i.e. a workflow, to process data. Results of intermediate processing steps may not be persistent since reproducing these results are not costly and these are hardly re-usable. However, in stream data processing where data arrives continuously, documenting fine-grained provenance explicitly for a processing chain to reproduce results is not a feasible solution since the provenance data may become a multiple of the actual sensor data. In this paper, we propose the multi-step provenance inference technique that infers provenance data for the entire workflow with non-materialized intermediate views. Our solution provides high quality provenance graph.
Original languageUndefined
Title of host publicationProceedings of the 24th International Conference of Scientific and Statistical Database Management (SSDBM 2012)
EditorsAnastasia Ailamaki, Shawn Bowers
Place of PublicationBerlin
PublisherSpringer
Pages397-405
Number of pages9
ISBN (Print)978-3-642-31234-2
DOIs
Publication statusPublished - Jun 2012

Publication series

NameLecture Notes in Computer Science
PublisherSpringer Verlag
Volume7338
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Keywords

  • METIS-287957
  • IR-81213
  • EWI-22111
  • Data Provenance
  • Inference

Cite this

Huq, M. R., Apers, P. M. G., & Wombacher, A. (2012). Fine-Grained Provenance Inference for a Large Processing Chain with Non-materialized Intermediate Views. In A. Ailamaki, & S. Bowers (Eds.), Proceedings of the 24th International Conference of Scientific and Statistical Database Management (SSDBM 2012) (pp. 397-405). (Lecture Notes in Computer Science; Vol. 7338). Berlin: Springer. https://doi.org/10.1007/978-3-642-31235-9_26
Huq, M.R. ; Apers, Peter M.G. ; Wombacher, Andreas. / Fine-Grained Provenance Inference for a Large Processing Chain with Non-materialized Intermediate Views. Proceedings of the 24th International Conference of Scientific and Statistical Database Management (SSDBM 2012). editor / Anastasia Ailamaki ; Shawn Bowers. Berlin : Springer, 2012. pp. 397-405 (Lecture Notes in Computer Science).
@inproceedings{b38fb5edb1ae4bb98a46ca6ca95e4aa9,
title = "Fine-Grained Provenance Inference for a Large Processing Chain with Non-materialized Intermediate Views",
abstract = "Many applications facilitate a data processing chain, i.e. a workflow, to process data. Results of intermediate processing steps may not be persistent since reproducing these results are not costly and these are hardly re-usable. However, in stream data processing where data arrives continuously, documenting fine-grained provenance explicitly for a processing chain to reproduce results is not a feasible solution since the provenance data may become a multiple of the actual sensor data. In this paper, we propose the multi-step provenance inference technique that infers provenance data for the entire workflow with non-materialized intermediate views. Our solution provides high quality provenance graph.",
keywords = "METIS-287957, IR-81213, EWI-22111, Data Provenance, Inference",
author = "M.R. Huq and Apers, {Peter M.G.} and Andreas Wombacher",
note = "10.1007/978-3-642-31235-9_26",
year = "2012",
month = "6",
doi = "10.1007/978-3-642-31235-9_26",
language = "Undefined",
isbn = "978-3-642-31234-2",
series = "Lecture Notes in Computer Science",
publisher = "Springer",
pages = "397--405",
editor = "Anastasia Ailamaki and Shawn Bowers",
booktitle = "Proceedings of the 24th International Conference of Scientific and Statistical Database Management (SSDBM 2012)",

}

Huq, MR, Apers, PMG & Wombacher, A 2012, Fine-Grained Provenance Inference for a Large Processing Chain with Non-materialized Intermediate Views. in A Ailamaki & S Bowers (eds), Proceedings of the 24th International Conference of Scientific and Statistical Database Management (SSDBM 2012). Lecture Notes in Computer Science, vol. 7338, Springer, Berlin, pp. 397-405. https://doi.org/10.1007/978-3-642-31235-9_26

Fine-Grained Provenance Inference for a Large Processing Chain with Non-materialized Intermediate Views. / Huq, M.R.; Apers, Peter M.G.; Wombacher, Andreas.

Proceedings of the 24th International Conference of Scientific and Statistical Database Management (SSDBM 2012). ed. / Anastasia Ailamaki; Shawn Bowers. Berlin : Springer, 2012. p. 397-405 (Lecture Notes in Computer Science; Vol. 7338).

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

TY - GEN

T1 - Fine-Grained Provenance Inference for a Large Processing Chain with Non-materialized Intermediate Views

AU - Huq, M.R.

AU - Apers, Peter M.G.

AU - Wombacher, Andreas

N1 - 10.1007/978-3-642-31235-9_26

PY - 2012/6

Y1 - 2012/6

N2 - Many applications facilitate a data processing chain, i.e. a workflow, to process data. Results of intermediate processing steps may not be persistent since reproducing these results are not costly and these are hardly re-usable. However, in stream data processing where data arrives continuously, documenting fine-grained provenance explicitly for a processing chain to reproduce results is not a feasible solution since the provenance data may become a multiple of the actual sensor data. In this paper, we propose the multi-step provenance inference technique that infers provenance data for the entire workflow with non-materialized intermediate views. Our solution provides high quality provenance graph.

AB - Many applications facilitate a data processing chain, i.e. a workflow, to process data. Results of intermediate processing steps may not be persistent since reproducing these results are not costly and these are hardly re-usable. However, in stream data processing where data arrives continuously, documenting fine-grained provenance explicitly for a processing chain to reproduce results is not a feasible solution since the provenance data may become a multiple of the actual sensor data. In this paper, we propose the multi-step provenance inference technique that infers provenance data for the entire workflow with non-materialized intermediate views. Our solution provides high quality provenance graph.

KW - METIS-287957

KW - IR-81213

KW - EWI-22111

KW - Data Provenance

KW - Inference

U2 - 10.1007/978-3-642-31235-9_26

DO - 10.1007/978-3-642-31235-9_26

M3 - Conference contribution

SN - 978-3-642-31234-2

T3 - Lecture Notes in Computer Science

SP - 397

EP - 405

BT - Proceedings of the 24th International Conference of Scientific and Statistical Database Management (SSDBM 2012)

A2 - Ailamaki, Anastasia

A2 - Bowers, Shawn

PB - Springer

CY - Berlin

ER -

Huq MR, Apers PMG, Wombacher A. Fine-Grained Provenance Inference for a Large Processing Chain with Non-materialized Intermediate Views. In Ailamaki A, Bowers S, editors, Proceedings of the 24th International Conference of Scientific and Statistical Database Management (SSDBM 2012). Berlin: Springer. 2012. p. 397-405. (Lecture Notes in Computer Science). https://doi.org/10.1007/978-3-642-31235-9_26