利用机器学习诊断丙型肝炎程度的有效因素

M. Sayadi, Vijayakumar Varadarajan, Elahe Gozali, M. Sadeghi
{"title":"利用机器学习诊断丙型肝炎程度的有效因素","authors":"M. Sayadi, Vijayakumar Varadarajan, Elahe Gozali, M. Sadeghi","doi":"10.30699/fhi.v12i0.440","DOIUrl":null,"url":null,"abstract":"Introduction: Hepatitis C virus (HCV) is a major public health threat, which can be treated if diagnosed early, but unfortunately, many people with chronic diseases are not diagnosed until the final stages. Machine learning and its techniques can be very helpful in diagnosis. This study examines the factors affecting hepatitis C diagnosis using machine learning.Material and Methods: A total of 27 features were used with a dataset containing 1385 records of patients with different grades of HCV. The dataset was clean and preprocessed to ensure accuracy and consistency. To reduce the dimension of the dataset and determine the effective features three feature selection, Pearson Correlation, ANOVA, and Random Forest, were applied. Among all the algorithms, KNN, random forests, and Deep Neural Networks were selected to be utilized, and then their evaluation metrics, such as Accuracy and Recall. To create prediction models, fifteen features were selected for the mentioned machine learning algorithms.Results: Performance evaluation of these models based on accuracy showed that Deep Learning with Accuracy = 92.067 had the highest performance. KNN and Random Forest had almost the same performance after Deep Learning. This performance was achieved on dataset containing features that were selected by ANOVA feature selection.Conclusion: Machine learning has been very effective in solving many challenges in the field of health. This study showed that using data-mining algorithms also can be useful for HCV diagnosing. The proposed model in this study can help physicians diagnose the degree of HCV at an affordable and with high accuracy.","PeriodicalId":154611,"journal":{"name":"Frontiers in Health Informatics","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Effective Factors in Diagnosing the Degree of Hepatitis C Using Machine Learning\",\"authors\":\"M. Sayadi, Vijayakumar Varadarajan, Elahe Gozali, M. Sadeghi\",\"doi\":\"10.30699/fhi.v12i0.440\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction: Hepatitis C virus (HCV) is a major public health threat, which can be treated if diagnosed early, but unfortunately, many people with chronic diseases are not diagnosed until the final stages. Machine learning and its techniques can be very helpful in diagnosis. This study examines the factors affecting hepatitis C diagnosis using machine learning.Material and Methods: A total of 27 features were used with a dataset containing 1385 records of patients with different grades of HCV. The dataset was clean and preprocessed to ensure accuracy and consistency. To reduce the dimension of the dataset and determine the effective features three feature selection, Pearson Correlation, ANOVA, and Random Forest, were applied. Among all the algorithms, KNN, random forests, and Deep Neural Networks were selected to be utilized, and then their evaluation metrics, such as Accuracy and Recall. To create prediction models, fifteen features were selected for the mentioned machine learning algorithms.Results: Performance evaluation of these models based on accuracy showed that Deep Learning with Accuracy = 92.067 had the highest performance. KNN and Random Forest had almost the same performance after Deep Learning. This performance was achieved on dataset containing features that were selected by ANOVA feature selection.Conclusion: Machine learning has been very effective in solving many challenges in the field of health. This study showed that using data-mining algorithms also can be useful for HCV diagnosing. The proposed model in this study can help physicians diagnose the degree of HCV at an affordable and with high accuracy.\",\"PeriodicalId\":154611,\"journal\":{\"name\":\"Frontiers in Health Informatics\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Health Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.30699/fhi.v12i0.440\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Health Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30699/fhi.v12i0.440","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

简介:丙型肝炎病毒(HCV)是一种主要的公共卫生威胁,如果早期诊断可以治疗,但不幸的是,许多患有慢性疾病的人直到最后阶段才被诊断出来。机器学习及其技术对诊断非常有帮助。本研究使用机器学习检查影响丙型肝炎诊断的因素。材料和方法:在包含1385例不同级别HCV患者记录的数据集中,共使用了27个特征。数据集经过清理和预处理,以确保准确性和一致性。为了降低数据集的维数并确定有效特征,应用了三种特征选择,Pearson相关,ANOVA和Random Forest。在所有算法中,选择了KNN、随机森林和深度神经网络,然后对它们的评价指标,如准确率和召回率进行了研究。为了创建预测模型,我们为上述机器学习算法选择了15个特征。结果:基于准确率对这些模型进行性能评价,准确率为92.067的深度学习表现最好。经过深度学习后,KNN和Random Forest的性能几乎相同。这种性能是在包含由方差分析特征选择选择的特征的数据集上实现的。结论:机器学习在解决健康领域的许多挑战方面非常有效。这项研究表明,使用数据挖掘算法也可以用于HCV诊断。本研究提出的模型可以帮助医生以负担得起的价格和较高的准确性诊断HCV的程度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Effective Factors in Diagnosing the Degree of Hepatitis C Using Machine Learning
Introduction: Hepatitis C virus (HCV) is a major public health threat, which can be treated if diagnosed early, but unfortunately, many people with chronic diseases are not diagnosed until the final stages. Machine learning and its techniques can be very helpful in diagnosis. This study examines the factors affecting hepatitis C diagnosis using machine learning.Material and Methods: A total of 27 features were used with a dataset containing 1385 records of patients with different grades of HCV. The dataset was clean and preprocessed to ensure accuracy and consistency. To reduce the dimension of the dataset and determine the effective features three feature selection, Pearson Correlation, ANOVA, and Random Forest, were applied. Among all the algorithms, KNN, random forests, and Deep Neural Networks were selected to be utilized, and then their evaluation metrics, such as Accuracy and Recall. To create prediction models, fifteen features were selected for the mentioned machine learning algorithms.Results: Performance evaluation of these models based on accuracy showed that Deep Learning with Accuracy = 92.067 had the highest performance. KNN and Random Forest had almost the same performance after Deep Learning. This performance was achieved on dataset containing features that were selected by ANOVA feature selection.Conclusion: Machine learning has been very effective in solving many challenges in the field of health. This study showed that using data-mining algorithms also can be useful for HCV diagnosing. The proposed model in this study can help physicians diagnose the degree of HCV at an affordable and with high accuracy.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
1.20
自引率
0.00%
发文量
0
期刊最新文献
Self-Care Application for Rheumatoid Arthritis: Identifying Key Data Elements Effective use of electronic health records system for healthcare delivery in Ghana Predictive Modeling of COVID-19 Hospitalization Using Twenty Machine Learning Classification Algorithms on Cohort Data Development and Usability Evaluation of a Web-Based Health Information Technology Dashboard of Quality and Economic Indicators Potentially Highly Effective Drugs for COVID-19: Virtual Screening and Molecular Docking Study Through PyRx-Vina Approach
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1