TY - JOUR
T1 - A study on interoperability between two Personal Health Train infrastructures in leukodystrophy data analysis
AU - Welten, Sascha
AU - de Arruda Botelho Herr, Marius
AU - Hempel, Lars
AU - Hieber, David
AU - Placzek, Peter
AU - Graf, Michael
AU - Weber, Sven
AU - Neumann, Laurenz
AU - Jugl, Maximilian
AU - Tirpitz, Liam
AU - Kindermann, Karl
AU - Geisler, Sandra
AU - Bonino da Silva Santos, Luiz Olavo
AU - Decker, Stefan
AU - Pfeifer, Nico
AU - Kohlbacher, Oliver
AU - Kirsten, Toralf
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024/6/22
Y1 - 2024/6/22
N2 - The development of platforms for distributed analytics has been driven by a growing need to comply with various governance-related or legal constraints. Among these platforms, the so-called Personal Health Train (PHT) is one representative that has emerged over the recent years. However, in projects that require data from sites featuring different PHT infrastructures, institutions are facing challenges emerging from the combination of multiple PHT ecosystems, including data governance, regulatory compliance, or the modification of existing workflows. In these scenarios, the interoperability of the platforms is preferable. In this work, we introduce a conceptual framework for the technical interoperability of the PHT covering five essential requirements: Data integration, unified station identifiers, mutual metadata, aligned security protocols, and business logic. We evaluated our concept in a feasibility study that involves two distinct PHT infrastructures: PHT-meDIC and PADME. We analyzed data on leukodystrophy from patients in the University Hospitals of Tübingen and Leipzig, and patients with differential diagnoses at the University Hospital Aachen. The results of our study demonstrate the technical interoperability between these two PHT infrastructures, allowing researchers to perform analyses across the participating institutions. Our method is more space-efficient compared to the multi-homing strategy, and it shows only a minimal time overhead.
AB - The development of platforms for distributed analytics has been driven by a growing need to comply with various governance-related or legal constraints. Among these platforms, the so-called Personal Health Train (PHT) is one representative that has emerged over the recent years. However, in projects that require data from sites featuring different PHT infrastructures, institutions are facing challenges emerging from the combination of multiple PHT ecosystems, including data governance, regulatory compliance, or the modification of existing workflows. In these scenarios, the interoperability of the platforms is preferable. In this work, we introduce a conceptual framework for the technical interoperability of the PHT covering five essential requirements: Data integration, unified station identifiers, mutual metadata, aligned security protocols, and business logic. We evaluated our concept in a feasibility study that involves two distinct PHT infrastructures: PHT-meDIC and PADME. We analyzed data on leukodystrophy from patients in the University Hospitals of Tübingen and Leipzig, and patients with differential diagnoses at the University Hospital Aachen. The results of our study demonstrate the technical interoperability between these two PHT infrastructures, allowing researchers to perform analyses across the participating institutions. Our method is more space-efficient compared to the multi-homing strategy, and it shows only a minimal time overhead.
UR - http://www.scopus.com/inward/record.url?scp=85196650607&partnerID=8YFLogxK
U2 - 10.1038/s41597-024-03450-6
DO - 10.1038/s41597-024-03450-6
M3 - Article
C2 - 38909050
AN - SCOPUS:85196650607
SN - 2052-4463
VL - 11
JO - Scientific Data
JF - Scientific Data
IS - 1
M1 - 663
ER -