Detection and prediction of diabetes using effective biomarkers

Mohammad Ehsan Farnoodian, Mohammad Karimi Moridani, Hanieh Mokhber
{"title":"Detection and prediction of diabetes using effective biomarkers","authors":"Mohammad Ehsan Farnoodian, Mohammad Karimi Moridani, Hanieh Mokhber","doi":"10.1080/21681163.2023.2264937","DOIUrl":null,"url":null,"abstract":"ABSTRACTDiabetes is a prevalent and costly condition, with early diagnosis pivotal in mitigating ‎its progression and complications. The diagnostic process often contends with data ‎ambiguity and decision uncertainty, adding complexity to achieving definitive ‎outcomes. This study addresses the diabetes diagnostic challenge through data mining ‎and machine learning techniques. It involves training various machine learning ‎algorithms and conducting statistical analysis on a dataset comprising 520 patients, ‎encompassing both normal and diabetic cases, to discern influential features.‎ Incorporating 17 features as classifier inputs, this research evaluates the diagnostic ‎performance using four reputable techniques: support vector machine (SVM), random ‎forest (RF), multi-layer perceptron (MLP), and k-nearest neighbor (kNN). The outcomes ‎underscore the SVM model's superior performance, boasting accuracy, specificity, and ‎sensitivity values of 98.78±1.96%, 99.28±1.63%, and 97.32±2.45%, ‎respectively, across 50 iterations. The findings establish SVM as the preferred method ‎for diabetes diagnosis.‎ This study highlights the efficacy of data mining and machine learning models in ‎diabetes diagnosis. While these methods exhibit respectable predictive accuracy, their ‎integration with a physician's assessment promises even better patient outcomes.‎KEYWORDS: Data miningdiabetesSVMdetectionprediction Abbreviations ANN=Artificial Neural NetworkAUC=Area under CurveCDC=Centers for Disease ControlCPCSSN=Canadian Primary Care Sentinel Surveillance NetworkDT=Decision TreeFN=False NegativeFP=False PositivekNN=k Nearest NeighborLDA=Linear Discrimination AnalysisLR=Logistic RegressionML=Machine LearningMLP=Multi-Layer PerceptronNB=Naive BayesianPIDD=Pima Indians Diabetes DatasetRF=Random ForestROC=Receiver Operating CharacteristicSVM=Support Vector MachineTN=True NegativeTP=True PositiveUKPDS=UK Prospective Diabetes StudyDisclosure statementNo potential conflict of interest was reported by the author(s)Authors’ contributionsAll authors evenly contributed to the whole work. All authors read and approved the final manuscript.Availability of data and materialsThe data used in this paper is cited throughout the paper.Ethical approvalThis article does not contain any studies with human participants performed by any of the authors.Additional informationFundingNo source of funding for this work.Notes on contributorsMohammad Ehsan FarnoodianMohammad Ehsan Farnoodian received a B.S. degree in biomedical engineering-‎‎bioelectric from Tehran Medical Science, Islamic Azad University, Tehran, Iran, ‎and earned his M.S. degree in biomedical engineering-bioelectric from Science and ‎Research branch, Islamic Azad University, Tehran, Iran, in 2023. He is passionately ‎dedicated to the examination and interpretation of biomedical data, particularly in ‎the context of disease prediction and detection. His academic pursuits involve in-‎depth exploration of biomedical data analysis intricacies, with a specific focus on ‎employing data-driven approaches for disease anticipation and identification.‎Mohammad Karimi MoridaniMohammad Karimi Moridani received a BS degree in electrical engineering-‎Electronic from 2006, and he obtained MS and Ph.D. degrees in biomedical ‎engineering-bioelectric in 2008 and 2015, respectively. Currently, he serves as an ‎assistant professor in the biomedical engineering department at Tehran Medical ‎Science, Islamic Azad University in Tehran, Iran. His research focuses on ‎biomedical signal and image processing, nonlinear time series analysis, and ‎cognitive science, with specific applications ranging from ECG, HRV, and EEG ‎signal processing for disease detection and prediction to epileptic seizure ‎prediction, pattern recognition, image processing for facial and beauty recognition, ‎watermarking, and more. He is driven by a passion to contribute meaningfully to ‎the scientific community and employs data-driven methodologies to address ‎critical challenges in healthcare and related fields.‎Hanieh MokhberHanieh Mokhber received a B.S. degree in biomedical engineering-bioelectric ‎from Islamic Azad University of Tehran Medical science. Her scholarly endeavors ‎involve a meticulous exploration of the complexities of biomedical data analysis, ‎with a specific and unwavering emphasis on harnessing data-driven methodologies ‎to anticipate and identify various diseases.‎","PeriodicalId":51800,"journal":{"name":"Computer Methods in Biomechanics and Biomedical Engineering-Imaging and Visualization","volume":"2 1","pages":"0"},"PeriodicalIF":1.3000,"publicationDate":"2023-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Methods in Biomechanics and Biomedical Engineering-Imaging and Visualization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/21681163.2023.2264937","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

Abstract

ABSTRACTDiabetes is a prevalent and costly condition, with early diagnosis pivotal in mitigating ‎its progression and complications. The diagnostic process often contends with data ‎ambiguity and decision uncertainty, adding complexity to achieving definitive ‎outcomes. This study addresses the diabetes diagnostic challenge through data mining ‎and machine learning techniques. It involves training various machine learning ‎algorithms and conducting statistical analysis on a dataset comprising 520 patients, ‎encompassing both normal and diabetic cases, to discern influential features.‎ Incorporating 17 features as classifier inputs, this research evaluates the diagnostic ‎performance using four reputable techniques: support vector machine (SVM), random ‎forest (RF), multi-layer perceptron (MLP), and k-nearest neighbor (kNN). The outcomes ‎underscore the SVM model's superior performance, boasting accuracy, specificity, and ‎sensitivity values of 98.78±1.96%, 99.28±1.63%, and 97.32±2.45%, ‎respectively, across 50 iterations. The findings establish SVM as the preferred method ‎for diabetes diagnosis.‎ This study highlights the efficacy of data mining and machine learning models in ‎diabetes diagnosis. While these methods exhibit respectable predictive accuracy, their ‎integration with a physician's assessment promises even better patient outcomes.‎KEYWORDS: Data miningdiabetesSVMdetectionprediction Abbreviations ANN=Artificial Neural NetworkAUC=Area under CurveCDC=Centers for Disease ControlCPCSSN=Canadian Primary Care Sentinel Surveillance NetworkDT=Decision TreeFN=False NegativeFP=False PositivekNN=k Nearest NeighborLDA=Linear Discrimination AnalysisLR=Logistic RegressionML=Machine LearningMLP=Multi-Layer PerceptronNB=Naive BayesianPIDD=Pima Indians Diabetes DatasetRF=Random ForestROC=Receiver Operating CharacteristicSVM=Support Vector MachineTN=True NegativeTP=True PositiveUKPDS=UK Prospective Diabetes StudyDisclosure statementNo potential conflict of interest was reported by the author(s)Authors’ contributionsAll authors evenly contributed to the whole work. All authors read and approved the final manuscript.Availability of data and materialsThe data used in this paper is cited throughout the paper.Ethical approvalThis article does not contain any studies with human participants performed by any of the authors.Additional informationFundingNo source of funding for this work.Notes on contributorsMohammad Ehsan FarnoodianMohammad Ehsan Farnoodian received a B.S. degree in biomedical engineering-‎‎bioelectric from Tehran Medical Science, Islamic Azad University, Tehran, Iran, ‎and earned his M.S. degree in biomedical engineering-bioelectric from Science and ‎Research branch, Islamic Azad University, Tehran, Iran, in 2023. He is passionately ‎dedicated to the examination and interpretation of biomedical data, particularly in ‎the context of disease prediction and detection. His academic pursuits involve in-‎depth exploration of biomedical data analysis intricacies, with a specific focus on ‎employing data-driven approaches for disease anticipation and identification.‎Mohammad Karimi MoridaniMohammad Karimi Moridani received a BS degree in electrical engineering-‎Electronic from 2006, and he obtained MS and Ph.D. degrees in biomedical ‎engineering-bioelectric in 2008 and 2015, respectively. Currently, he serves as an ‎assistant professor in the biomedical engineering department at Tehran Medical ‎Science, Islamic Azad University in Tehran, Iran. His research focuses on ‎biomedical signal and image processing, nonlinear time series analysis, and ‎cognitive science, with specific applications ranging from ECG, HRV, and EEG ‎signal processing for disease detection and prediction to epileptic seizure ‎prediction, pattern recognition, image processing for facial and beauty recognition, ‎watermarking, and more. He is driven by a passion to contribute meaningfully to ‎the scientific community and employs data-driven methodologies to address ‎critical challenges in healthcare and related fields.‎Hanieh MokhberHanieh Mokhber received a B.S. degree in biomedical engineering-bioelectric ‎from Islamic Azad University of Tehran Medical science. Her scholarly endeavors ‎involve a meticulous exploration of the complexities of biomedical data analysis, ‎with a specific and unwavering emphasis on harnessing data-driven methodologies ‎to anticipate and identify various diseases.‎
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用有效的生物标志物检测和预测糖尿病
糖尿病是一种普遍且昂贵的疾病,早期诊断对于减轻其进展和并发症至关重要。诊断过程经常与数据模糊和决策不确定性相冲突,增加了获得明确结果的复杂性。本研究通过数据挖掘和机器学习技术解决了糖尿病诊断的挑战。它包括训练各种机器学习算法,并对包含520名患者(包括正常和糖尿病病例)的数据集进行统计分析,以识别有影响的特征。将17个特征作为分类器输入,本研究使用四种知名技术评估诊断性能:支持向量机(SVM)、随机森林(RF)、多层感知器(MLP)和k-近邻(kNN)。结果表明,SVM模型在50次迭代中,准确率、特异性和灵敏度分别为98.78±1.96%、99.28±1.63%和97.32±2.45%。研究结果表明支持向量机是糖尿病诊断的首选方法。这项研究强调了数据挖掘和机器学习模型在糖尿病诊断中的功效。虽然这些方法表现出可观的预测准确性,但它们与医生的评估相结合,有望为患者带来更好的结果。‎关键词:数据挖掘diabetessvmdetection prediction缩写ANN=人工神经网络auc =曲率下面积ecdc =疾病控制中心cpcsn =加拿大初级保健哨点监测网络dt =决策树efn =假阴性fp =假阳性knn =k最近邻lda =线性判别分析lr =逻辑回归ml =机器学习mlp =多层感知器nb =朴素贝叶斯pidd =皮马印第安人糖尿病数据etrf =随机森林c =Receiver Operating feature svm =支持向量MachineTN=真阴性tp =真阳性ukpds =英国前瞻性糖尿病研究披露声明作者未报告潜在的利益冲突作者的贡献所有作者平均贡献了全部工作。所有作者都阅读并批准了最终的手稿。数据和材料的可用性本文中使用的数据在全文中被引用。伦理批准:本文不包含任何作者进行的任何人类参与者的研究。其他信息资金来源本工作没有资金来源。mohammad Ehsan Farnoodian获得伊朗德黑兰伊斯兰阿扎德大学德黑兰医学科学生物医学工程-生物电学士学位,并于2023年在伊朗德黑兰伊斯兰阿扎德大学科学与研究分部获得生物医学工程-生物电硕士学位。他热情地致力于检查和解释生物医学数据,特别是在疾病预测和检测的背景下。他的学术追求涉及生物医学数据分析复杂性的深入探索,特别侧重于采用数据驱动的方法进行疾病预测和识别。Mohammad Karimi Moridani于2006年获得电气工程-电子学士学位,并分别于2008年和2015年获得生物医学工程-生物电学硕士和博士学位。目前,他是伊朗德黑兰伊斯兰阿扎德大学(Islamic Azad University)德黑兰医学科学生物医学工程系的助理教授。他的研究重点是生物医学信号和图像处理、非线性时间序列分析和认知科学,具体应用范围从用于疾病检测和预测的ECG、HRV和EEG信号处理到癫痫发作预测、模式识别、面部和美丽识别的图像处理、水印等。他热衷于为科学界做出有意义的贡献,并采用数据驱动的方法来解决医疗保健和相关领域的关键挑战。Hanieh MokhberHanieh Mokhber获得德黑兰伊斯兰阿扎德大学医学科学生物医学工程-生物电学士学位。她的学术努力涉及对生物医学数据分析复杂性的细致探索,特别强调利用数据驱动的方法来预测和识别各种疾病
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
2.80
自引率
6.20%
发文量
102
期刊介绍: Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization is an international journal whose main goals are to promote solutions of excellence for both imaging and visualization of biomedical data, and establish links among researchers, clinicians, the medical technology sector and end-users. The journal provides a comprehensive forum for discussion of the current state-of-the-art in the scientific fields related to imaging and visualization, including, but not limited to: Applications of Imaging and Visualization Computational Bio- imaging and Visualization Computer Aided Diagnosis, Surgery, Therapy and Treatment Data Processing and Analysis Devices for Imaging and Visualization Grid and High Performance Computing for Imaging and Visualization Human Perception in Imaging and Visualization Image Processing and Analysis Image-based Geometric Modelling Imaging and Visualization in Biomechanics Imaging and Visualization in Biomedical Engineering Medical Clinics Medical Imaging and Visualization Multi-modal Imaging and Visualization Multiscale Imaging and Visualization Scientific Visualization Software Development for Imaging and Visualization Telemedicine Systems and Applications Virtual Reality Visual Data Mining and Knowledge Discovery.
期刊最新文献
Optimization of deep neural networks for multiclassification of dental X-rays using transfer learning A prototype smartphone jaw tracking application to quantitatively model tooth contact Computer-aided diagnosis of Canine Hip Dysplasia using deep learning approach in a novel X-ray image dataset Decorrelation stretch for enhancing colour fundus photographs affected by cataracts Genetic algorithm for feature selection in mammograms for breast masses classification
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1