利用机器和迁移学习在计算机断层扫描肺部图像中检测Covid-19

IF 3.3 4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Informatica Pub Date : 2023-09-28 DOI:10.31449/inf.v47i8.4258
Dalila Cherifi, Abderraouf Djaber, Mohammed-Elfateh Guedouar, Amine Feghoul, Zahia Zineb Chelbi, Amazigh Ait Ouakli
{"title":"利用机器和迁移学习在计算机断层扫描肺部图像中检测Covid-19","authors":"Dalila Cherifi, Abderraouf Djaber, Mohammed-Elfateh Guedouar, Amine Feghoul, Zahia Zineb Chelbi, Amazigh Ait Ouakli","doi":"10.31449/inf.v47i8.4258","DOIUrl":null,"url":null,"abstract":"Coronavirus disease 2019 (COVID-19) is a fast-spreading disease infectious that causes lung pneumonia which killed millions of lives around the world and has a significant impact on public healthcare. The diagnostic approach of the infection is principally divided into two broad categories, a laboratory-based and chest radiography approach where the CT imaging tests showed some advantages in the prediction over the other methods. Due to the restricted medical capability and the impressive raise of the suspected cases, the need for finding an immediate, accurate and automated method to alleviate the overcapacity of radiologists’ efforts for diagnosis has emerged . In order to accomplish this objective, our work is based on developing machine and deep learning algorithms to classify chest CT scans into Covid or non-Covid classes. To obtain a good performance, the accuracy of the classifier should be high so the patients may have a clear idea about their state. For this purpose, there are many hyper parameters that can be changed in order to advance the performance of the artificial models that are used for the identification of such illnesses. We have worked on two non-similar datasets from different sources, a small one of 746 images and a larger one with 14486 images. In the other hand, we have proposed various machine learning models starting by an SVM which contains different kernel types, KNN model with changing the distance measurements and an RF model with two different number of trees. Moreover, two CNN based approaches have been developed considering one convolution layer followed by a pooling layer then two consecutive convolution layers followed by a single pooling layer each time. The machine learning models showed better performance comparing to the CNN on the small dataset. While on the large dataset, CNN outperforms these algorithms. In order to improve performance of the models, transfer learning also have been used in this project where we trained the pre-trained InceptionV3 and ResNet50V2 on the same datasets. Among all the examined classifiers, the ResNet50V2 achieved the best scores with 86.67% accuracy, 93.94% sensitivity, 81% specificity and 86% F1-score on the small dataset while the respective scores on the large dataset were 97.52%, 97.28%, 97.77% and 98%. Experimental interpretation advise the potential applicability of ResNet50V2 transfer learning approach in real diagnostic scenarios, which might be of very high usefulness in terms of achieving fast testing for COVID19.","PeriodicalId":56292,"journal":{"name":"Informatica","volume":"12 1","pages":"0"},"PeriodicalIF":3.3000,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Covid-19 Detecting in Computed Tomography Lungs Images using Machine and transfer Learning\",\"authors\":\"Dalila Cherifi, Abderraouf Djaber, Mohammed-Elfateh Guedouar, Amine Feghoul, Zahia Zineb Chelbi, Amazigh Ait Ouakli\",\"doi\":\"10.31449/inf.v47i8.4258\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Coronavirus disease 2019 (COVID-19) is a fast-spreading disease infectious that causes lung pneumonia which killed millions of lives around the world and has a significant impact on public healthcare. The diagnostic approach of the infection is principally divided into two broad categories, a laboratory-based and chest radiography approach where the CT imaging tests showed some advantages in the prediction over the other methods. Due to the restricted medical capability and the impressive raise of the suspected cases, the need for finding an immediate, accurate and automated method to alleviate the overcapacity of radiologists’ efforts for diagnosis has emerged . In order to accomplish this objective, our work is based on developing machine and deep learning algorithms to classify chest CT scans into Covid or non-Covid classes. To obtain a good performance, the accuracy of the classifier should be high so the patients may have a clear idea about their state. For this purpose, there are many hyper parameters that can be changed in order to advance the performance of the artificial models that are used for the identification of such illnesses. We have worked on two non-similar datasets from different sources, a small one of 746 images and a larger one with 14486 images. In the other hand, we have proposed various machine learning models starting by an SVM which contains different kernel types, KNN model with changing the distance measurements and an RF model with two different number of trees. Moreover, two CNN based approaches have been developed considering one convolution layer followed by a pooling layer then two consecutive convolution layers followed by a single pooling layer each time. The machine learning models showed better performance comparing to the CNN on the small dataset. While on the large dataset, CNN outperforms these algorithms. In order to improve performance of the models, transfer learning also have been used in this project where we trained the pre-trained InceptionV3 and ResNet50V2 on the same datasets. Among all the examined classifiers, the ResNet50V2 achieved the best scores with 86.67% accuracy, 93.94% sensitivity, 81% specificity and 86% F1-score on the small dataset while the respective scores on the large dataset were 97.52%, 97.28%, 97.77% and 98%. Experimental interpretation advise the potential applicability of ResNet50V2 transfer learning approach in real diagnostic scenarios, which might be of very high usefulness in terms of achieving fast testing for COVID19.\",\"PeriodicalId\":56292,\"journal\":{\"name\":\"Informatica\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2023-09-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Informatica\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.31449/inf.v47i8.4258\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatica","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31449/inf.v47i8.4258","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

2019冠状病毒病(COVID-19)是一种快速传播的传染性疾病,可导致肺部肺炎,导致全球数百万人死亡,并对公共医疗保健产生重大影响。感染的诊断方法主要分为两大类,以实验室为基础的方法和胸部x线检查方法,其中CT成像检查在预测方面比其他方法有一些优势。由于医疗能力的限制和疑似病例的急剧增加,迫切需要找到一种即时、准确和自动化的方法来缓解放射科医生的诊断能力过剩。为了实现这一目标,我们的工作是基于开发机器和深度学习算法,将胸部CT扫描分为Covid或非Covid类。为了获得良好的性能,分类器的准确率应该很高,这样患者才能对自己的状态有一个清晰的认识。为此,有许多超参数可以改变,以提高用于识别此类疾病的人工模型的性能。我们研究了来自不同来源的两个不相似的数据集,一个小的有746张图像,一个大的有14486张图像。另一方面,我们提出了各种机器学习模型,从包含不同核类型的支持向量机开始,改变距离测量的KNN模型和具有两种不同数量树的RF模型。此外,还开发了两种基于CNN的方法,每次考虑一个卷积层后面跟着一个池化层,然后两个连续的卷积层后面跟着一个池化层。机器学习模型在小数据集上表现出比CNN更好的性能。而在大数据集上,CNN的表现优于这些算法。为了提高模型的性能,在这个项目中也使用了迁移学习,我们在相同的数据集上训练了预训练的InceptionV3和ResNet50V2。在所有被检测的分类器中,ResNet50V2在小数据集上的准确率为86.67%,灵敏度为93.94%,特异性为81%,f1评分为86%,在大数据集上的评分分别为97.52%,97.28%,97.77%和98%。实验解释表明,ResNet50V2迁移学习方法在真实诊断场景中的潜在适用性,在实现covid - 19快速检测方面可能非常有用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Covid-19 Detecting in Computed Tomography Lungs Images using Machine and transfer Learning
Coronavirus disease 2019 (COVID-19) is a fast-spreading disease infectious that causes lung pneumonia which killed millions of lives around the world and has a significant impact on public healthcare. The diagnostic approach of the infection is principally divided into two broad categories, a laboratory-based and chest radiography approach where the CT imaging tests showed some advantages in the prediction over the other methods. Due to the restricted medical capability and the impressive raise of the suspected cases, the need for finding an immediate, accurate and automated method to alleviate the overcapacity of radiologists’ efforts for diagnosis has emerged . In order to accomplish this objective, our work is based on developing machine and deep learning algorithms to classify chest CT scans into Covid or non-Covid classes. To obtain a good performance, the accuracy of the classifier should be high so the patients may have a clear idea about their state. For this purpose, there are many hyper parameters that can be changed in order to advance the performance of the artificial models that are used for the identification of such illnesses. We have worked on two non-similar datasets from different sources, a small one of 746 images and a larger one with 14486 images. In the other hand, we have proposed various machine learning models starting by an SVM which contains different kernel types, KNN model with changing the distance measurements and an RF model with two different number of trees. Moreover, two CNN based approaches have been developed considering one convolution layer followed by a pooling layer then two consecutive convolution layers followed by a single pooling layer each time. The machine learning models showed better performance comparing to the CNN on the small dataset. While on the large dataset, CNN outperforms these algorithms. In order to improve performance of the models, transfer learning also have been used in this project where we trained the pre-trained InceptionV3 and ResNet50V2 on the same datasets. Among all the examined classifiers, the ResNet50V2 achieved the best scores with 86.67% accuracy, 93.94% sensitivity, 81% specificity and 86% F1-score on the small dataset while the respective scores on the large dataset were 97.52%, 97.28%, 97.77% and 98%. Experimental interpretation advise the potential applicability of ResNet50V2 transfer learning approach in real diagnostic scenarios, which might be of very high usefulness in terms of achieving fast testing for COVID19.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Informatica
Informatica 工程技术-计算机:信息系统
CiteScore
5.90
自引率
6.90%
发文量
19
审稿时长
12 months
期刊介绍: The quarterly journal Informatica provides an international forum for high-quality original research and publishes papers on mathematical simulation and optimization, recognition and control, programming theory and systems, automation systems and elements. Informatica provides a multidisciplinary forum for scientists and engineers involved in research and design including experts who implement and manage information systems applications.
期刊最新文献
Beyond Quasi-Adjoint Graphs: On Polynomial-Time Solvable Cases of the Hamiltonian Cycle and Path Problems Confidential Transaction Balance Verification by the Net Using Non-Interactive Zero-Knowledge Proofs An Improved Algorithm for Extracting Frequent Gradual Patterns Offloaded Data Processing Energy Efficiency Evaluation Demystifying the Stability and the Performance Aspects of CoCoSo Ranking Method under Uncertain Preferences
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1