Process prediction in noisy data sets: a case study in a Dutch hospital

Research output: Chapter in Book/Report/Conference proceedingChapterAcademicpeer-review

19 Citations (Scopus)
7 Downloads (Pure)

Abstract

Predicting the amount of money that can be claimed is critical to the effective running of an Hospital. In this paper we describe a case study of a Dutch Hospital where we use process mining to predict the cash flow of the Hospital. In order to predict the cost of a treatment, we use different data mining techniques to predict the sequence of treatments administered, the duration and the final ‿care product‿ or diagnosis of the patient. While performing the data analysis we encountered three specific kinds of noise that we call sequence noise, human noise and duration noise. Studies in the past have discussed ways to reduce the noise in process data. However, it is not very clear what effect the noise has to different kinds of process analysis. In this paper we describe the combined effect of sequence noise, human noise and duration noise on the analysis of process data, by comparing the performance of several mining techniques on the data.
Original languageEnglish
Title of host publicationData-Driven Process Discovery and Analysis
Subtitle of host publicationSecond IFIP WG 2.6, 2.12 International Symposium, SIMPDA 2012, Campione d’Italia, Italy, June 18-20, 2012, Revised Selected Papers
EditorsPhilippe Cudre-Mauroux, Paolo Ceravolo, Dragan Gašević
Place of PublicationBerlin
PublisherSpringer
Pages60-83
Number of pages24
ISBN (Electronic)978-3-642-40919-6
ISBN (Print)978-364240918-9
DOIs
Publication statusPublished - 2013
Event2nd IFIP WG 2.6, 2.12 International Symposium on Data-Driven Process Discovery and Analysis, SIMPDA 2012 - Campione d’Italia, Italy
Duration: 18 Jun 201220 Jun 2012
Conference number: 2

Publication series

NameLecture Notes in Business Information Processing
PublisherSpringer
Volume162
ISSN (Print)1865-1348
ISSN (Electronic)1865-1356

Conference

Conference2nd IFIP WG 2.6, 2.12 International Symposium on Data-Driven Process Discovery and Analysis, SIMPDA 2012
Abbreviated titleSIMPDA
CountryItaly
CityCampione d’Italia
Period18/06/1220/06/12

Fingerprint

Data mining
Costs

Keywords

  • IR-87303
  • METIS-297828
  • EWI-23700

Cite this

van der Spoel, S., van Keulen, M., & Amrit, C. A. (2013). Process prediction in noisy data sets: a case study in a Dutch hospital. In P. Cudre-Mauroux, P. Ceravolo, & D. Gašević (Eds.), Data-Driven Process Discovery and Analysis: Second IFIP WG 2.6, 2.12 International Symposium, SIMPDA 2012, Campione d’Italia, Italy, June 18-20, 2012, Revised Selected Papers (pp. 60-83). (Lecture Notes in Business Information Processing; Vol. 162). Berlin: Springer. https://doi.org/10.1007/978-3-642-40919-6_4
van der Spoel, Sjoerd ; van Keulen, Maurice ; Amrit, Chintan Amrit. / Process prediction in noisy data sets: a case study in a Dutch hospital. Data-Driven Process Discovery and Analysis: Second IFIP WG 2.6, 2.12 International Symposium, SIMPDA 2012, Campione d’Italia, Italy, June 18-20, 2012, Revised Selected Papers. editor / Philippe Cudre-Mauroux ; Paolo Ceravolo ; Dragan Gašević. Berlin : Springer, 2013. pp. 60-83 (Lecture Notes in Business Information Processing).
@inbook{60803126c0c140ddbd8dbcfa6ff4f098,
title = "Process prediction in noisy data sets: a case study in a Dutch hospital",
abstract = "Predicting the amount of money that can be claimed is critical to the effective running of an Hospital. In this paper we describe a case study of a Dutch Hospital where we use process mining to predict the cash flow of the Hospital. In order to predict the cost of a treatment, we use different data mining techniques to predict the sequence of treatments administered, the duration and the final ‿care product‿ or diagnosis of the patient. While performing the data analysis we encountered three specific kinds of noise that we call sequence noise, human noise and duration noise. Studies in the past have discussed ways to reduce the noise in process data. However, it is not very clear what effect the noise has to different kinds of process analysis. In this paper we describe the combined effect of sequence noise, human noise and duration noise on the analysis of process data, by comparing the performance of several mining techniques on the data.",
keywords = "IR-87303, METIS-297828, EWI-23700",
author = "{van der Spoel}, Sjoerd and {van Keulen}, Maurice and Amrit, {Chintan Amrit}",
note = "eemcs-eprint-23700",
year = "2013",
doi = "10.1007/978-3-642-40919-6_4",
language = "English",
isbn = "978-364240918-9",
series = "Lecture Notes in Business Information Processing",
publisher = "Springer",
pages = "60--83",
editor = "Philippe Cudre-Mauroux and Paolo Ceravolo and Dragan Gašević",
booktitle = "Data-Driven Process Discovery and Analysis",

}

van der Spoel, S, van Keulen, M & Amrit, CA 2013, Process prediction in noisy data sets: a case study in a Dutch hospital. in P Cudre-Mauroux, P Ceravolo & D Gašević (eds), Data-Driven Process Discovery and Analysis: Second IFIP WG 2.6, 2.12 International Symposium, SIMPDA 2012, Campione d’Italia, Italy, June 18-20, 2012, Revised Selected Papers. Lecture Notes in Business Information Processing, vol. 162, Springer, Berlin, pp. 60-83, 2nd IFIP WG 2.6, 2.12 International Symposium on Data-Driven Process Discovery and Analysis, SIMPDA 2012, Campione d’Italia, Italy, 18/06/12. https://doi.org/10.1007/978-3-642-40919-6_4

Process prediction in noisy data sets: a case study in a Dutch hospital. / van der Spoel, Sjoerd; van Keulen, Maurice; Amrit, Chintan Amrit.

Data-Driven Process Discovery and Analysis: Second IFIP WG 2.6, 2.12 International Symposium, SIMPDA 2012, Campione d’Italia, Italy, June 18-20, 2012, Revised Selected Papers. ed. / Philippe Cudre-Mauroux; Paolo Ceravolo; Dragan Gašević. Berlin : Springer, 2013. p. 60-83 (Lecture Notes in Business Information Processing; Vol. 162).

Research output: Chapter in Book/Report/Conference proceedingChapterAcademicpeer-review

TY - CHAP

T1 - Process prediction in noisy data sets: a case study in a Dutch hospital

AU - van der Spoel, Sjoerd

AU - van Keulen, Maurice

AU - Amrit, Chintan Amrit

N1 - eemcs-eprint-23700

PY - 2013

Y1 - 2013

N2 - Predicting the amount of money that can be claimed is critical to the effective running of an Hospital. In this paper we describe a case study of a Dutch Hospital where we use process mining to predict the cash flow of the Hospital. In order to predict the cost of a treatment, we use different data mining techniques to predict the sequence of treatments administered, the duration and the final ‿care product‿ or diagnosis of the patient. While performing the data analysis we encountered three specific kinds of noise that we call sequence noise, human noise and duration noise. Studies in the past have discussed ways to reduce the noise in process data. However, it is not very clear what effect the noise has to different kinds of process analysis. In this paper we describe the combined effect of sequence noise, human noise and duration noise on the analysis of process data, by comparing the performance of several mining techniques on the data.

AB - Predicting the amount of money that can be claimed is critical to the effective running of an Hospital. In this paper we describe a case study of a Dutch Hospital where we use process mining to predict the cash flow of the Hospital. In order to predict the cost of a treatment, we use different data mining techniques to predict the sequence of treatments administered, the duration and the final ‿care product‿ or diagnosis of the patient. While performing the data analysis we encountered three specific kinds of noise that we call sequence noise, human noise and duration noise. Studies in the past have discussed ways to reduce the noise in process data. However, it is not very clear what effect the noise has to different kinds of process analysis. In this paper we describe the combined effect of sequence noise, human noise and duration noise on the analysis of process data, by comparing the performance of several mining techniques on the data.

KW - IR-87303

KW - METIS-297828

KW - EWI-23700

U2 - 10.1007/978-3-642-40919-6_4

DO - 10.1007/978-3-642-40919-6_4

M3 - Chapter

SN - 978-364240918-9

T3 - Lecture Notes in Business Information Processing

SP - 60

EP - 83

BT - Data-Driven Process Discovery and Analysis

A2 - Cudre-Mauroux, Philippe

A2 - Ceravolo, Paolo

A2 - Gašević, Dragan

PB - Springer

CY - Berlin

ER -

van der Spoel S, van Keulen M, Amrit CA. Process prediction in noisy data sets: a case study in a Dutch hospital. In Cudre-Mauroux P, Ceravolo P, Gašević D, editors, Data-Driven Process Discovery and Analysis: Second IFIP WG 2.6, 2.12 International Symposium, SIMPDA 2012, Campione d’Italia, Italy, June 18-20, 2012, Revised Selected Papers. Berlin: Springer. 2013. p. 60-83. (Lecture Notes in Business Information Processing). https://doi.org/10.1007/978-3-642-40919-6_4