Natalia de Sousa Freire, Pedro Paulo de Souza Leo, Leonardo Albuquerque Tiago, Alberto de Almeida Campos Gonalves, Rafael Albuquerque Pinto, Eulanda Miranda dos Santos, Eduardo Souto
{"title":"Analysis of generalizability on predicting COVID-19 from chest X-ray images using pre-trained deep models","authors":"Natalia de Sousa Freire, Pedro Paulo de Souza Leo, Leonardo Albuquerque Tiago, Alberto de Almeida Campos Gonalves, Rafael Albuquerque Pinto, Eulanda Miranda dos Santos, Eduardo Souto","doi":"10.1080/21681163.2023.2264408","DOIUrl":null,"url":null,"abstract":"ABSTRACTMachine learning methods have been extensively employed to predict COVID-19 using chest X-ray images in numerous studies. However, a machine learning model must exhibit robustness and provide reliable predictions for diverse populations, beyond those used in its training data, to be truly valuable. Unfortunately, the assessment of model generalisability is frequently overlooked in current literature. In this study, we investigate the generalisability of three classification models – ResNet50v2, MobileNetv2, and Swin Transformer – for predicting COVID-19 using chest X-ray images. We adopt three concurrent approaches for evaluation: the internal-and-external validation procedure, lung region cropping, and image enhancement. The results show that the combined approaches allow deep models to achieve similar internal and external generalisation capability.KEYWORDS: COVID-19X-raymachine learning Disclosure statementNo potential conflict of interest was reported by the author(s).Notes1. https://github.com/dirtmaxim/lungs-finder2. https://keras.io/examples/vision/swin_transformers/3. https://www.kaggle.com/c/rsna-pneumonia-detection-challenge4. https://github.com/agchung/Actualmed-COVID-chestxray-dataset5. Figure 1-COVID-chestxray-datasethttps://github.com/agchung/Figure 1-COVID-chestxray-datasetAdditional informationFundingThe present work is the result of the Research and Development (R&D) project 001/2020, signed with Federal University of Amazonas and FAEPI, Brazil, which has funding from Samsung, using resources from the Informatics Law for the Western Amazon (Federal Law no 8.387/1991), and its disclosure is in accordance with article 39 of Decree No. 10.521/2020.Notes on contributorsNatalia de Sousa FreireNatalia de Sousa Freire is currently a Software Engineering student at the Federal University of Amazonas (UFAM). His main research interests include the areas of machine learning and computer vision.Pedro Paulo de Souza LeoPedro Paulo de Souza Leão obtained his Bachelor's degree in Software Engineering from the Federal University of Amazonas (Brazil) in 2023. His main research interest is machine learning.Leonardo Albuquerque TiagoLeonardo de Albuquerque Tiago is currently pursuing a Bachelor's degree in Software Engineering at Federal University of Amazonas (Brazil). His main research interests are machine learning and software testing.Alberto de Almeida Campos GonalvesAlberto de Almeida Campos Gonçalves received his B.S. degree in Computer Science from the Federal University of Amazonas in 2022. His research interests include the areas of machine learning and computer vision.Rafael Albuquerque PintoRafael Albuquerque Pinto received his B.S. degree in Computer Science from the Federal University of Roraima (UFRR) in 2017 and his M.Sc. degree in Informatics from the Federal University of Amazonas (UFAM) in 2022. He is currently pursuing a Ph.D. degree in Informatics at UFAM, focusing his research on biosignals using machine learning techniques.Eulanda Miranda dos SantosEulanda Miranda dos Santos is an Associate Professor in the Institute of Computing (IComp) of the Federal University of Amazonas. She received a B.Sc. degree in Informatics from Federal University of Para (Brazil), a M.Sc. degree in Informatics from Federal University of Paraiba (Brazil) and a Ph.D. degree in Engineering from École de Technologie Supérieure, University of Quebec (Canada) in 1999, 2002 and 2008, respectively. Her research interests include pattern recognition, machine learning and computer vision.Eduardo SoutoEduardo Souto received the Ph.D. degree in computer science from the Federal University of Pernambuco (UFPE), Recife, Brazil, in 2007. He is currently an Associate Professor with the Institute of Computing, Federal University of Amazonas (UFAM). He is also the head of the Emerging Technologies and System Security (ETSS) Research Group. His research interests include the areas of applied machine learning, internet of things, and network security.","PeriodicalId":51800,"journal":{"name":"Computer Methods in Biomechanics and Biomedical Engineering-Imaging and Visualization","volume":"17 1","pages":"0"},"PeriodicalIF":1.3000,"publicationDate":"2023-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Methods in Biomechanics and Biomedical Engineering-Imaging and Visualization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/21681163.2023.2264408","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
ABSTRACTMachine learning methods have been extensively employed to predict COVID-19 using chest X-ray images in numerous studies. However, a machine learning model must exhibit robustness and provide reliable predictions for diverse populations, beyond those used in its training data, to be truly valuable. Unfortunately, the assessment of model generalisability is frequently overlooked in current literature. In this study, we investigate the generalisability of three classification models – ResNet50v2, MobileNetv2, and Swin Transformer – for predicting COVID-19 using chest X-ray images. We adopt three concurrent approaches for evaluation: the internal-and-external validation procedure, lung region cropping, and image enhancement. The results show that the combined approaches allow deep models to achieve similar internal and external generalisation capability.KEYWORDS: COVID-19X-raymachine learning Disclosure statementNo potential conflict of interest was reported by the author(s).Notes1. https://github.com/dirtmaxim/lungs-finder2. https://keras.io/examples/vision/swin_transformers/3. https://www.kaggle.com/c/rsna-pneumonia-detection-challenge4. https://github.com/agchung/Actualmed-COVID-chestxray-dataset5. Figure 1-COVID-chestxray-datasethttps://github.com/agchung/Figure 1-COVID-chestxray-datasetAdditional informationFundingThe present work is the result of the Research and Development (R&D) project 001/2020, signed with Federal University of Amazonas and FAEPI, Brazil, which has funding from Samsung, using resources from the Informatics Law for the Western Amazon (Federal Law no 8.387/1991), and its disclosure is in accordance with article 39 of Decree No. 10.521/2020.Notes on contributorsNatalia de Sousa FreireNatalia de Sousa Freire is currently a Software Engineering student at the Federal University of Amazonas (UFAM). His main research interests include the areas of machine learning and computer vision.Pedro Paulo de Souza LeoPedro Paulo de Souza Leão obtained his Bachelor's degree in Software Engineering from the Federal University of Amazonas (Brazil) in 2023. His main research interest is machine learning.Leonardo Albuquerque TiagoLeonardo de Albuquerque Tiago is currently pursuing a Bachelor's degree in Software Engineering at Federal University of Amazonas (Brazil). His main research interests are machine learning and software testing.Alberto de Almeida Campos GonalvesAlberto de Almeida Campos Gonçalves received his B.S. degree in Computer Science from the Federal University of Amazonas in 2022. His research interests include the areas of machine learning and computer vision.Rafael Albuquerque PintoRafael Albuquerque Pinto received his B.S. degree in Computer Science from the Federal University of Roraima (UFRR) in 2017 and his M.Sc. degree in Informatics from the Federal University of Amazonas (UFAM) in 2022. He is currently pursuing a Ph.D. degree in Informatics at UFAM, focusing his research on biosignals using machine learning techniques.Eulanda Miranda dos SantosEulanda Miranda dos Santos is an Associate Professor in the Institute of Computing (IComp) of the Federal University of Amazonas. She received a B.Sc. degree in Informatics from Federal University of Para (Brazil), a M.Sc. degree in Informatics from Federal University of Paraiba (Brazil) and a Ph.D. degree in Engineering from École de Technologie Supérieure, University of Quebec (Canada) in 1999, 2002 and 2008, respectively. Her research interests include pattern recognition, machine learning and computer vision.Eduardo SoutoEduardo Souto received the Ph.D. degree in computer science from the Federal University of Pernambuco (UFPE), Recife, Brazil, in 2007. He is currently an Associate Professor with the Institute of Computing, Federal University of Amazonas (UFAM). He is also the head of the Emerging Technologies and System Security (ETSS) Research Group. His research interests include the areas of applied machine learning, internet of things, and network security.
摘要在许多研究中,机器学习方法被广泛用于利用胸部x线图像预测COVID-19。然而,机器学习模型必须表现出鲁棒性,并为不同的人群提供可靠的预测,而不仅仅是在训练数据中使用的预测,才能真正有价值。不幸的是,在目前的文献中,对模型通用性的评估经常被忽视。在这项研究中,我们研究了三种分类模型——ResNet50v2、MobileNetv2和Swin Transformer——用于使用胸部x线图像预测COVID-19的通用性。我们采用三种并行的方法进行评估:内部和外部验证程序,肺区域裁剪和图像增强。结果表明,两种方法的结合可以使深度模型获得相似的内部和外部泛化能力。关键词:covid -19 x射线机器学习披露声明作者未报告潜在利益冲突。https://github.com/dirtmaxim/lungs-finder2。https://keras.io/examples/vision/swin_transformers/3。https://www.kaggle.com/c/rsna-pneumonia-detection-challenge4。https://github.com/agchung/Actualmed-COVID-chestxray-dataset5。本工作是研究与开发(R&D)项目001/2020的成果,该项目与亚马逊联邦大学和巴西FAEPI签署,该项目由三星资助,使用了西部亚马逊信息学法(第8.387/1991号联邦法)的资源,其披露符合第10.521/2020号法令第39条。关于贡献者的说明natalia de Sousa Freire renalia de Sousa Freire目前是亚马逊联邦大学(UFAM)软件工程专业的学生。他的主要研究兴趣包括机器学习和计算机视觉。Pedro Paulo de Souza le,于2023年在巴西亚马逊联邦大学获得软件工程学士学位。他的主要研究兴趣是机器学习。Leonardo Albuquerque Tiago目前正在亚马逊联邦大学(巴西)攻读软件工程学士学位。他的主要研究兴趣是机器学习和软件测试。Alberto de Almeida Campos gonalalves于2022年获得亚马逊联邦大学计算机科学学士学位。他的研究兴趣包括机器学习和计算机视觉领域。Rafael Albuquerque Pinto于2017年获得罗赖马联邦大学(UFRR)的计算机科学学士学位,并于2022年获得亚马逊联邦大学(UFAM)的信息学硕士学位。他目前正在UFAM攻读信息学博士学位,他的研究重点是使用机器学习技术的生物信号。ulanda Miranda dos Santos是亚马逊联邦大学计算机研究所(IComp)的副教授。她于1999年、2002年和2008年分别获得巴西帕拉联邦大学信息学学士学位、巴西帕拉伊巴联邦大学信息学硕士学位和加拿大魁北克大学École de Technologie supsamrieure工程博士学位。她的研究兴趣包括模式识别、机器学习和计算机视觉。Eduardo SoutoEduardo Souto于2007年获得巴西累西腓伯南布哥联邦大学(UFPE)计算机科学博士学位。他目前是亚马逊联邦大学计算机研究所的副教授。他也是新兴技术和系统安全(ETSS)研究小组的负责人。他的研究兴趣包括应用机器学习、物联网和网络安全领域。
期刊介绍:
Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization is an international journal whose main goals are to promote solutions of excellence for both imaging and visualization of biomedical data, and establish links among researchers, clinicians, the medical technology sector and end-users. The journal provides a comprehensive forum for discussion of the current state-of-the-art in the scientific fields related to imaging and visualization, including, but not limited to: Applications of Imaging and Visualization Computational Bio- imaging and Visualization Computer Aided Diagnosis, Surgery, Therapy and Treatment Data Processing and Analysis Devices for Imaging and Visualization Grid and High Performance Computing for Imaging and Visualization Human Perception in Imaging and Visualization Image Processing and Analysis Image-based Geometric Modelling Imaging and Visualization in Biomechanics Imaging and Visualization in Biomedical Engineering Medical Clinics Medical Imaging and Visualization Multi-modal Imaging and Visualization Multiscale Imaging and Visualization Scientific Visualization Software Development for Imaging and Visualization Telemedicine Systems and Applications Virtual Reality Visual Data Mining and Knowledge Discovery.