Comparison of Machine Learning Methods for the Prediction of Type 2 Diabetes in Primary Care Setting Using EHR Data

Olwendo, Amos Otieno; Ochieng, George; Rucha, Kenneth

Comparison of Machine Learning Methods for the Prediction of Type 2 Diabetes in Primary Care Setting Using EHR Data

dc.contributor.author	Olwendo, Amos Otieno
dc.contributor.author	Ochieng, George
dc.contributor.author	Rucha, Kenneth
dc.date.accessioned	2024-01-16T06:16:39Z
dc.date.available	2024-01-16T06:16:39Z
dc.date.issued	2023-10
dc.description	article	en_US
dc.description.abstract	ABSTRACT Diabetes remains a major global public health challenge, thus the need for better methods for managing diabetes. Machine learning could provide reliable solutions to the need for early detection and management of diabetes. This study conducted experiments to compare a number of selected machine learning approaches to determine their suitability for early detection of diabetes in the primary care setting. A retrospective study was conducted using EHR dataset of confirmed cases of diabetes collected during routine care at Nairobi Hospital. Institutional ethical approvals were obtained, and data were retrieved from the database through stratified sampling based on gender. Diagnoses were confirmed using the ICD-10 codes. Records with 5% or so of missing values were excluded from this analysis. Data were processed by correction of errors and replacement of missing values using measures of central tendency. The data were transformed through normalization using the decimal-scaling method. Data analysis was conducted using selected supervised and unsupervised learning algorithms. Model performances were validated using metrics for the evaluation of classification and clustering results, respectively. Random Forest had the highest accuracy (0.95) and error rate (0.05), while Gradient Boosting and Multilayer Perceptron (MLP) with 3 hidden layers obtained accuracy (0.94) and error rate (0.06), respectively. The process of selecting machine learning algorithms needs to explore both supervised and unsupervised learning techniques. In addition, an appropriate architectural desig	en_US
dc.identifier.citation	Olwendo, A. O., Ochieng, G., & Rucha, K. (2024). Comparison of machine learning methods for the prediction of type 2 diabetes in primary care setting using EHR data. Journal of Agriculture, Science and Technology, 23(1), 24-36.	en_US
dc.identifier.issn	1561-7645
dc.identifier.other	doi: 10.4314/jagst.v23i1.3
dc.identifier.uri	https://ir-library.ku.ac.ke/handle/123456789/27278
dc.language.iso	en_US	en_US
dc.publisher	JAGST	en_US
dc.subject	Comparison	en_US
dc.subject	machine learning	en_US
dc.subject	classification	en_US
dc.subject	clustering	en_US
dc.subject	type 2 diabetes	en_US
dc.title	Comparison of Machine Learning Methods for the Prediction of Type 2 Diabetes in Primary Care Setting Using EHR Data	en_US
dc.type	Article	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Comparison of machine learning methods for the prediction of type 2 diabetes in primary.pdf
Size:: 1.38 MB
Format:: Adobe Portable Document Format
Description:: Full text article

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

RP-Department of Health Management & Informatics