Abstract
In this paper, we revisit the data of the San Antonio Heart Study, and employ machine learning to predict the future development of type-2 diabetes. To build the prediction model, we use the support vector machines and ten features that are wellknown in the literature as strong predictors of future diabetes. Due to the unbalanced nature of the dataset in terms of the class labels, we use 10-fold cross-validation to train the model and a hold-out set to validate it. The results of this study show a validation accuracy of 84.1% with a recall rate of 81.1% averaged over 100 iterations. The outcomes of this study can help in identifying the population that is at high risk of developing type-2 diabetes in the future.
Original language | English |
---|---|
Title of host publication | 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems, CBMS 2019 |
Publisher | IEEE |
Pages | 567-570 |
Number of pages | 4 |
ISBN (Electronic) | 9781728122861 |
DOIs | |
Publication status | Published - 5 Aug 2019 |
Event | 32nd IEEE International Symposium on Computer-Based Medical Systems, CBMS 2019 - Instituto Maimónides de Investigación Biomédica de Córdoba, Cordoba, Spain Duration: 5 Jun 2019 → 7 Jun 2019 Conference number: 32 |
Conference
Conference | 32nd IEEE International Symposium on Computer-Based Medical Systems, CBMS 2019 |
---|---|
Abbreviated title | CBMS 2019 |
Country/Territory | Spain |
City | Cordoba |
Period | 5/06/19 → 7/06/19 |
Keywords
- Disease Prediction
- Support vector machine
- Type 2 diabetes