Covid-19 Chest Radiography Images Analysis Based on Integration of Image Preprocess, Guided Grad-CAM, Machine Learning and Risk Management

Proceedings of the 4th International Conference on Medical and Health Informatics Pub Date : 2020-08-14 DOI:10.1145/3418094.3418096

Tsung-Chieh Lin, Hsi-Chieh Lee

{"title":"Covid-19 Chest Radiography Images Analysis Based on Integration of Image Preprocess, Guided Grad-CAM, Machine Learning and Risk Management","authors":"Tsung-Chieh Lin, Hsi-Chieh Lee","doi":"10.1145/3418094.3418096","DOIUrl":null,"url":null,"abstract":"COVID19 coronavirus has widely infected more than 10 million people and killed more than 500,000 globally till July 1, 2020. In this paper, we describe a potential methodology, integration of image preprocess, Guided Grad-CAM, machine learning and risk management based on chest radiography images, as one of workable alarm and analysis systems to support clinicians against COVID-19 outbreak threat. We leverage pre-trained CNN models as backbone with further transfer learning to analyze public open datasets composed of 5851 chest radiography images for 4 classes classification, and 15478 images from COVIDx dataset for 3 classes classification, facilitated with steps of ROI and mask, and CNN layer visualization of guided grad-CAM to help CNN focused on critical infection focus in qualitative perspective. In quantitative perspective of 4 classes classification result, accuracy, average sensitivity, average precision, and COVID19 sensitivity of single ResNet50 and our second bagging ensemble model are (77.2%/78.8%/81.9%/100%) and (81.5%/81.4%,86.8%/100%) respectively. Ensemble way of several CNNs and other machine learning methods used here is to contribute about 4% accuracy improvement on top of best single CNN (ResNet50). In our 3 classes classification, those metrics of ensemble model and benchmark are (93.1%/90.1%/89.7%/83%) and (90%/85.9%, 82.4%/77%). We conclude ensemble approach would facilitate weaker classifier more. Beside to accuracy-oriented analysis, a cost minimization approach is suggested here to provide clinicians options of different risk consideration flexibility by trade off among different categories and performance rates.","PeriodicalId":192804,"journal":{"name":"Proceedings of the 4th International Conference on Medical and Health Informatics","volume":"87 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Conference on Medical and Health Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3418094.3418096","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 16

Abstract

COVID19 coronavirus has widely infected more than 10 million people and killed more than 500,000 globally till July 1, 2020. In this paper, we describe a potential methodology, integration of image preprocess, Guided Grad-CAM, machine learning and risk management based on chest radiography images, as one of workable alarm and analysis systems to support clinicians against COVID-19 outbreak threat. We leverage pre-trained CNN models as backbone with further transfer learning to analyze public open datasets composed of 5851 chest radiography images for 4 classes classification, and 15478 images from COVIDx dataset for 3 classes classification, facilitated with steps of ROI and mask, and CNN layer visualization of guided grad-CAM to help CNN focused on critical infection focus in qualitative perspective. In quantitative perspective of 4 classes classification result, accuracy, average sensitivity, average precision, and COVID19 sensitivity of single ResNet50 and our second bagging ensemble model are (77.2%/78.8%/81.9%/100%) and (81.5%/81.4%,86.8%/100%) respectively. Ensemble way of several CNNs and other machine learning methods used here is to contribute about 4% accuracy improvement on top of best single CNN (ResNet50). In our 3 classes classification, those metrics of ensemble model and benchmark are (93.1%/90.1%/89.7%/83%) and (90%/85.9%, 82.4%/77%). We conclude ensemble approach would facilitate weaker classifier more. Beside to accuracy-oriented analysis, a cost minimization approach is suggested here to provide clinicians options of different risk consideration flexibility by trade off among different categories and performance rates.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于图像预处理、Guided Grad-CAM、机器学习和风险管理集成的Covid-19胸片图像分析

截至2020年7月1日，covid - 19冠状病毒已在全球范围内广泛感染了1000多万人，造成50多万人死亡。在本文中，我们描述了一种潜在的方法，将图像预处理、Guided Grad-CAM、机器学习和基于胸部x线图像的风险管理集成为一种可行的报警和分析系统，以支持临床医生应对COVID-19爆发威胁。我们以预训练的CNN模型为骨干，通过进一步的迁移学习，对5851张胸片图像组成的公共开放数据集进行4类分类，对来自covid数据集的15478张图像进行3类分类，并借助ROI和mask的步骤，以及引导的grad-CAM的CNN层可视化，帮助CNN从定性的角度关注关键感染焦点。从4类分类结果的定量角度来看，单个ResNet50和我们的第二bagging集成模型的准确率、平均灵敏度、平均精度和COVID19灵敏度分别为(77.2%/78.8%/81.9%/100%)和(81.5%/81.4%、86.8%/100%)。本文使用的几种CNN和其他机器学习方法的集成方法在最佳单一CNN (ResNet50)的基础上贡献了约4%的准确率提高。在我们的3类分类中，集合模型和基准的度量分别为(93.1%/90.1%/89.7%/83%)和(90%/85.9%，82.4%/77%)。我们得出结论，集成方法更有利于弱分类器。除了以准确性为导向的分析外，本文还建议采用成本最小化的方法，通过在不同类别和绩效率之间进行权衡，为临床医生提供不同风险考虑灵活性的选择。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the 4th International Conference on Medical and Health Informatics

自引率

0.00%

发文量