Electronic Health Record (EHR) System Development for Study on EHR Data-based Early Prediction of Diabetes Using Machine Learning Algorithms

Q3 Computer Science Open Bioinformatics Journal Pub Date : 2023-10-05 DOI:10.2174/18750362-v16-e230906-2023-15
Jagadamba G, Shashidhar R, Gururaj H L, Vinayakumar Ravi, Meshari Almeshari, Yasser Alzamil
{"title":"Electronic Health Record (EHR) System Development for Study on EHR Data-based Early Prediction of Diabetes Using Machine Learning Algorithms","authors":"Jagadamba G, Shashidhar R, Gururaj H L, Vinayakumar Ravi, Meshari Almeshari, Yasser Alzamil","doi":"10.2174/18750362-v16-e230906-2023-15","DOIUrl":null,"url":null,"abstract":"Aims: This research work aims to develop an interoperable electronic health record (EHR) system to aid the early detection of diabetes by the use of Machine Learning (ML) algorithms. A decision support system developed using many ML algorithms results in optimizing the decision in preventive care in the health information system. Methods: The proposed system consisted of two models. The first model included interoperable EHR system development using a precise database structure. The second module comprised of data extraction from the EHR system, data cleaning, and data processing and prediction. For testing and training, about 1080 patients’ health record was considered. Among 1080, 1000 records were from the Kaggle dataset, and 80 records were demographic information from patients who visited our health center of Siddaganga organization for a regular checkup or during emergencies. The demographic information was collected from the proposed EHR system. Results: The proposed system was tested for the interoperability nature of the EHR system and accuracy in diabetic disease prediction using the proposed decision support system. The proposed EHR system development was tested for interoperability by random updations from various systems maintained in the laboratory. Each system acted like the admin system of different hospitals. The EHR system was tested for handling the load and interoperability by considering user view status, system matching with the real world, consistency in data updations, security etc . However, in the prediction phase, diabetes prediction was concentrated. The features considered were not randomly chosen; however, the features were those prescribed by a doctor who insisted that the features were sufficient for initial prediction. The reports collected from the doctors revealed several features they considered before giving the test details. The proposed system dataset was split into test and train datasets with eight proper features taken as input and one set as a target variable where the result was present. After this, the model was imported using standard “sklearn” libraries, and it fit with the required number of estimators, that is, the number of decision trees. The features included pregnancies, glucose level, blood pressure, skin thickness, insulin level, bone marrow index, diabetic pedigree function, age, weight, etc . At the outset, the research work concentrated on developing an interoperable EHR system, identifying the expectation of diabetic and non-diabetic conditions and demonstrating the accuracy of the system. Conclusion: In this study, the first aim was to design an interoperable EHR system that could help in accumulating, storing, and sharing patients' timely health records over a lifetime. The second aim was to use EHR data for early prediction of diabetes in the user. To confirm the accuracy of the system, the system was tested regarding interoperability to support early prediction through a decision support system.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Open Bioinformatics Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/18750362-v16-e230906-2023-15","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0

Abstract

Aims: This research work aims to develop an interoperable electronic health record (EHR) system to aid the early detection of diabetes by the use of Machine Learning (ML) algorithms. A decision support system developed using many ML algorithms results in optimizing the decision in preventive care in the health information system. Methods: The proposed system consisted of two models. The first model included interoperable EHR system development using a precise database structure. The second module comprised of data extraction from the EHR system, data cleaning, and data processing and prediction. For testing and training, about 1080 patients’ health record was considered. Among 1080, 1000 records were from the Kaggle dataset, and 80 records were demographic information from patients who visited our health center of Siddaganga organization for a regular checkup or during emergencies. The demographic information was collected from the proposed EHR system. Results: The proposed system was tested for the interoperability nature of the EHR system and accuracy in diabetic disease prediction using the proposed decision support system. The proposed EHR system development was tested for interoperability by random updations from various systems maintained in the laboratory. Each system acted like the admin system of different hospitals. The EHR system was tested for handling the load and interoperability by considering user view status, system matching with the real world, consistency in data updations, security etc . However, in the prediction phase, diabetes prediction was concentrated. The features considered were not randomly chosen; however, the features were those prescribed by a doctor who insisted that the features were sufficient for initial prediction. The reports collected from the doctors revealed several features they considered before giving the test details. The proposed system dataset was split into test and train datasets with eight proper features taken as input and one set as a target variable where the result was present. After this, the model was imported using standard “sklearn” libraries, and it fit with the required number of estimators, that is, the number of decision trees. The features included pregnancies, glucose level, blood pressure, skin thickness, insulin level, bone marrow index, diabetic pedigree function, age, weight, etc . At the outset, the research work concentrated on developing an interoperable EHR system, identifying the expectation of diabetic and non-diabetic conditions and demonstrating the accuracy of the system. Conclusion: In this study, the first aim was to design an interoperable EHR system that could help in accumulating, storing, and sharing patients' timely health records over a lifetime. The second aim was to use EHR data for early prediction of diabetes in the user. To confirm the accuracy of the system, the system was tested regarding interoperability to support early prediction through a decision support system.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于EHR数据的糖尿病早期预测机器学习算法的电子健康记录(EHR)系统开发
目的:本研究工作旨在开发一个可互操作的电子健康记录(EHR)系统,通过使用机器学习(ML)算法来帮助早期发现糖尿病。一个决策支持系统开发使用许多机器学习算法的结果在卫生信息系统中的预防保健决策优化。方法:该系统由两个模型组成。第一个模型包括使用精确的数据库结构开发可互操作的EHR系统。第二个模块包括从EHR系统中提取数据、清理数据、处理和预测数据。为了测试和培训,大约1080名患者的健康记录被考虑在内。在1080条记录中,有1000条记录来自Kaggle数据集,80条记录是访问Siddaganga组织健康中心进行定期检查或紧急情况的患者的人口统计信息。从拟议的电子病历系统中收集人口统计信息。结果:所提出的系统被用于测试EHR系统的互操作性和使用所提出的决策支持系统预测糖尿病疾病的准确性。通过实验室维护的各种系统的随机更新,对拟议的EHR系统开发进行了互操作性测试。每个系统就像不同医院的管理系统。从用户视图状态、系统与现实世界的匹配、数据更新的一致性、安全性等方面对EHR系统的负载处理和互操作性进行了测试。但在预测阶段,糖尿病预测较为集中。所考虑的特征不是随机选择的;然而,这些特征是由医生规定的,医生坚持认为这些特征足以进行初步预测。从医生那里收集的报告揭示了他们在给出测试细节之前考虑的几个特征。提出的系统数据集被分为测试和训练数据集,其中八个适当的特征作为输入,一个集作为目标变量,其中结果存在。在此之后,使用标准的“sklearn”库导入模型,它与所需的估计器数量(即决策树的数量)相匹配。这些特征包括怀孕、血糖水平、血压、皮肤厚度、胰岛素水平、骨髓指数、糖尿病谱系功能、年龄、体重等。一开始,研究工作集中于开发一个可互操作的电子病历系统,确定糖尿病和非糖尿病疾病的预期,并证明该系统的准确性。结论:在本研究中,第一个目标是设计一个可互操作的电子病历系统,以帮助积累、存储和共享患者一生中及时的健康记录。第二个目的是利用电子病历数据对用户的糖尿病进行早期预测。为了确认系统的准确性,对系统进行了互操作性测试,以通过决策支持系统支持早期预测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Open Bioinformatics Journal
Open Bioinformatics Journal Computer Science-Computer Science (miscellaneous)
CiteScore
2.40
自引率
0.00%
发文量
4
期刊介绍: The Open Bioinformatics Journal is an Open Access online journal, which publishes research articles, reviews/mini-reviews, letters, clinical trial studies and guest edited single topic issues in all areas of bioinformatics and computational biology. The coverage includes biomedicine, focusing on large data acquisition, analysis and curation, computational and statistical methods for the modeling and analysis of biological data, and descriptions of new algorithms and databases. The Open Bioinformatics Journal, a peer reviewed journal, is an important and reliable source of current information on the developments in the field. The emphasis will be on publishing quality articles rapidly and freely available worldwide.
期刊最新文献
Decision-making Support System for Predicting and Eliminating Malnutrition and Anemia Immunoinformatics Approach for the Design of Chimeric Vaccine Against Whitmore Disease A New Deep Learning Model based on Neuroimaging for Predicting Alzheimer's Disease Early Prediction of Covid-19 Samples from Chest X-ray Images using Deep Learning Approach Electronic Health Record (EHR) System Development for Study on EHR Data-based Early Prediction of Diabetes Using Machine Learning Algorithms
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1