M. Sayadi, Vijayakumar Varadarajan, Elahe Gozali, M. Sadeghi
{"title":"利用机器学习诊断丙型肝炎程度的有效因素","authors":"M. Sayadi, Vijayakumar Varadarajan, Elahe Gozali, M. Sadeghi","doi":"10.30699/fhi.v12i0.440","DOIUrl":null,"url":null,"abstract":"Introduction: Hepatitis C virus (HCV) is a major public health threat, which can be treated if diagnosed early, but unfortunately, many people with chronic diseases are not diagnosed until the final stages. Machine learning and its techniques can be very helpful in diagnosis. This study examines the factors affecting hepatitis C diagnosis using machine learning.Material and Methods: A total of 27 features were used with a dataset containing 1385 records of patients with different grades of HCV. The dataset was clean and preprocessed to ensure accuracy and consistency. To reduce the dimension of the dataset and determine the effective features three feature selection, Pearson Correlation, ANOVA, and Random Forest, were applied. Among all the algorithms, KNN, random forests, and Deep Neural Networks were selected to be utilized, and then their evaluation metrics, such as Accuracy and Recall. To create prediction models, fifteen features were selected for the mentioned machine learning algorithms.Results: Performance evaluation of these models based on accuracy showed that Deep Learning with Accuracy = 92.067 had the highest performance. KNN and Random Forest had almost the same performance after Deep Learning. This performance was achieved on dataset containing features that were selected by ANOVA feature selection.Conclusion: Machine learning has been very effective in solving many challenges in the field of health. This study showed that using data-mining algorithms also can be useful for HCV diagnosing. The proposed model in this study can help physicians diagnose the degree of HCV at an affordable and with high accuracy.","PeriodicalId":154611,"journal":{"name":"Frontiers in Health Informatics","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Effective Factors in Diagnosing the Degree of Hepatitis C Using Machine Learning\",\"authors\":\"M. Sayadi, Vijayakumar Varadarajan, Elahe Gozali, M. Sadeghi\",\"doi\":\"10.30699/fhi.v12i0.440\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction: Hepatitis C virus (HCV) is a major public health threat, which can be treated if diagnosed early, but unfortunately, many people with chronic diseases are not diagnosed until the final stages. Machine learning and its techniques can be very helpful in diagnosis. This study examines the factors affecting hepatitis C diagnosis using machine learning.Material and Methods: A total of 27 features were used with a dataset containing 1385 records of patients with different grades of HCV. The dataset was clean and preprocessed to ensure accuracy and consistency. To reduce the dimension of the dataset and determine the effective features three feature selection, Pearson Correlation, ANOVA, and Random Forest, were applied. Among all the algorithms, KNN, random forests, and Deep Neural Networks were selected to be utilized, and then their evaluation metrics, such as Accuracy and Recall. To create prediction models, fifteen features were selected for the mentioned machine learning algorithms.Results: Performance evaluation of these models based on accuracy showed that Deep Learning with Accuracy = 92.067 had the highest performance. KNN and Random Forest had almost the same performance after Deep Learning. This performance was achieved on dataset containing features that were selected by ANOVA feature selection.Conclusion: Machine learning has been very effective in solving many challenges in the field of health. This study showed that using data-mining algorithms also can be useful for HCV diagnosing. The proposed model in this study can help physicians diagnose the degree of HCV at an affordable and with high accuracy.\",\"PeriodicalId\":154611,\"journal\":{\"name\":\"Frontiers in Health Informatics\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Health Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.30699/fhi.v12i0.440\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Health Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30699/fhi.v12i0.440","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Effective Factors in Diagnosing the Degree of Hepatitis C Using Machine Learning
Introduction: Hepatitis C virus (HCV) is a major public health threat, which can be treated if diagnosed early, but unfortunately, many people with chronic diseases are not diagnosed until the final stages. Machine learning and its techniques can be very helpful in diagnosis. This study examines the factors affecting hepatitis C diagnosis using machine learning.Material and Methods: A total of 27 features were used with a dataset containing 1385 records of patients with different grades of HCV. The dataset was clean and preprocessed to ensure accuracy and consistency. To reduce the dimension of the dataset and determine the effective features three feature selection, Pearson Correlation, ANOVA, and Random Forest, were applied. Among all the algorithms, KNN, random forests, and Deep Neural Networks were selected to be utilized, and then their evaluation metrics, such as Accuracy and Recall. To create prediction models, fifteen features were selected for the mentioned machine learning algorithms.Results: Performance evaluation of these models based on accuracy showed that Deep Learning with Accuracy = 92.067 had the highest performance. KNN and Random Forest had almost the same performance after Deep Learning. This performance was achieved on dataset containing features that were selected by ANOVA feature selection.Conclusion: Machine learning has been very effective in solving many challenges in the field of health. This study showed that using data-mining algorithms also can be useful for HCV diagnosing. The proposed model in this study can help physicians diagnose the degree of HCV at an affordable and with high accuracy.