Automatic Process Comparison for Subpopulations: Application in Cancer Care

Francesca Marazza, Faiza Allah Bukhsh, Jeroen Geerdink, Onno Vijlbrief, Shreyasi Pathak, Maurice van Keulen, Christin Seifert

Research output: Contribution to journalArticleAcademicpeer-review

5 Citations (Scopus)
81 Downloads (Pure)


Processes in organisations, such as hospitals, may deviate from the intended standard processes, due to unforeseeable events and the complexity of the organisation. For hospitals, the knowledge of actual patient streams for patient populations (e.g., severe or non-severe cases) is important for quality control and improvement. Process discovery from event data in electronic health records can shed light on the patient flows, but their comparison for different populations is cumbersome and time-consuming. In this paper, we present an approach for the automatic comparison of process models that were extracted from events in electronic health records. Concretely, we propose comparing processes for different patient populations by cross-log conformance checking, and standard graph similarity measures obtained from the directed graph underlying the process model. We perform a user study with 20 participants in order to obtain a ground truth for similarity of process models. We evaluate our approach on two data sets, the publicly available MIMIC database with the focus on different cancer patients in intensive care, and a database on breast cancer patients from a Dutch hospital. In our experiments, we found average fitness to be a good indicator for visual similarity in the ZGT use case, while the average precision and graph edit distance are strongly correlated with visual impression for cancer process models on MIMIC. These results are a call for further research and evaluation for determining which similarity or combination of similarities is needed in which type of process model comparison.
Original languageEnglish
Article number5707
Pages (from-to)1-23
Number of pages23
JournalInternational journal of environmental research and public health
Issue number16
Publication statusPublished - 7 Aug 2020


  • Breast cancer care
  • Cancer types
  • MIMIC database
  • Process comparison
  • Process mining
  • Quality control


Dive into the research topics of 'Automatic Process Comparison for Subpopulations: Application in Cancer Care'. Together they form a unique fingerprint.

Cite this