TY - JOUR
T1 - Personal Health Train Architecture with Dynamic Cloud Staging
AU - Bonino da Silva Santos, Luiz Olavo
AU - Ferreira Pires, Luís
AU - Graciano Martinez, Virginia
AU - Rebelo Moreira, João Luiz
AU - Silva Souza Guizzardi, Renata
N1 - Funding Information:
This study has been partially carried out in the context of the following projects: C4yourself (funder: Health Holland—Top Sector Life Sciences and Health, grant number: LSHM 21044_C4YOURSELF) and Personal Genetic Locker (funder: Nederlandse Organisatie voor Wetenschappelijk Onderzoek, grant number: 628.011.022). LOBSS work is partially supported by the funding from the European Union’s Horizon 2020 research and innovation programme under the EJP RD COFUND-EJP N 825575, particularly in providing input for the design of the FDP-based federated infrastructure of the EJP RD Virtual Platform.
Publisher Copyright:
© 2022, The Author(s).
PY - 2023/1
Y1 - 2023/1
N2 - Scientific advances, especially in the healthcare domain, can be accelerated by making data available for analysis. However, in traditional data analysis systems, data need to be moved to a central processing unit that performs analyses, which may be undesirable, e.g. due to privacy regulations in case these data contain personal information. This paper discusses the Personal Health Train (PHT) approach in which data processing is brought to the (personal health) data rather than the other way around, allowing (private) data accessed to be controlled, and to observe ethical and legal concerns. This paper introduces the PHT architecture and discusses the data staging solution that allows processing to be delegated to components spawned in a private cloud environment in case the (health) organisation hosting the data has limited resources to execute the required processing. This paper shows the feasibility and suitability of the solution with a relatively simple, yet representative, case study of data analysis of Covid-19 infections, which is performed by components that are created on demand and run in the Amazon Web Services platform. This paper also shows that the performance of our solution is acceptable, and that our solution is scalable. This paper demonstrates that the PHT approach enables data analysis with controlled access, preserving privacy and complying with regulations such as GDPR, while the solution is deployed in a private cloud environment.
AB - Scientific advances, especially in the healthcare domain, can be accelerated by making data available for analysis. However, in traditional data analysis systems, data need to be moved to a central processing unit that performs analyses, which may be undesirable, e.g. due to privacy regulations in case these data contain personal information. This paper discusses the Personal Health Train (PHT) approach in which data processing is brought to the (personal health) data rather than the other way around, allowing (private) data accessed to be controlled, and to observe ethical and legal concerns. This paper introduces the PHT architecture and discusses the data staging solution that allows processing to be delegated to components spawned in a private cloud environment in case the (health) organisation hosting the data has limited resources to execute the required processing. This paper shows the feasibility and suitability of the solution with a relatively simple, yet representative, case study of data analysis of Covid-19 infections, which is performed by components that are created on demand and run in the Amazon Web Services platform. This paper also shows that the performance of our solution is acceptable, and that our solution is scalable. This paper demonstrates that the PHT approach enables data analysis with controlled access, preserving privacy and complying with regulations such as GDPR, while the solution is deployed in a private cloud environment.
KW - Cloud
KW - Data station
KW - Personal health train
KW - Privacy preservation
KW - Staging station
KW - UT-Hybrid-D
UR - http://www.scopus.com/inward/record.url?scp=85139993273&partnerID=8YFLogxK
U2 - 10.1007/s42979-022-01422-4
DO - 10.1007/s42979-022-01422-4
M3 - Article
AN - SCOPUS:85139993273
SN - 2662-995X
VL - 4
JO - SN Computer Science
JF - SN Computer Science
IS - 1
M1 - 14
ER -