Analyzing the Impact of Resampling Approaches on Chest X-Ray Images for COVID-19 Identification in a Local Hierarchical Classification Scenario

F. K. H. D. Barros, André L. Jeller Selleti, Vinicius Queiroz, R. M. Pereira, C. Silla
Published in: 2021 IEEE 21st International Conference on Bioinformatics and Bioengineering (BIBE)
Publication date: 2021-10-25
DOI: 10.1109/BIBE52308.2021.9635433 (https://doi.org/10.1109/BIBE52308.2021.9635433)

Abstract

Researchers dealing with real-world data, such as in the healthcare domain, tend to face class imbalance issues. More specifically, publicly available datasets containing Chest X-Ray (CXR) images of pneumonia diseases (including COVID-19) usually have an imbalanced class distribution. This imbalance causes automatic diagnosis systems to classify majority classes much more accurately than minority ones. Several resampling algorithms have been proposed to deal with the class imbalance issue. Hierarchical classifiers have also been proposed to increase predictive performance, but there is little research in the literature verifying whether combining existing resampling algorithms with hierarchical classifiers is a good alternative for improving classification performance. This work proposes an experimental classification schema to investigate the effectiveness of using resampling algorithms in the identification of COVID-19 and other types of pneumonia through CXR images. The proposed schema uses resampling algorithms to rebalance the class distribution in a Local Hierarchical Classification scenario. The experimental evaluation, which is supported by inferential statistical analysis, showed that using specific resampling algorithms with Local Hierarchical Classifiers brings a statistically significant increase in the macro-averaged F1-Score and improves the predictive performance for the minority classes.
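
The idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the label hierarchy, the class counts, and the helper names (PARENT, child_on_path, local_training_sets, random_oversample, macro_f1) are all assumptions made for illustration, and plain random oversampling stands in for whichever resampling algorithms the paper actually evaluates. The sketch builds one training set per parent node of the hierarchy, rebalances each set independently, and computes a macro-averaged F1-Score, which weights every class equally regardless of its size.

```python
import random
from collections import Counter, defaultdict

# Hypothetical label hierarchy (an assumption, not taken from the paper):
# every non-root node maps to its parent.
PARENT = {
    "Normal": "Root",
    "Pneumonia": "Root",
    "COVID-19": "Pneumonia",
    "Bacterial": "Pneumonia",
}


def child_on_path(label, node):
    """Return the child of `node` on the path from the root to `label`, or None."""
    current = label
    while current in PARENT:
        if PARENT[current] == node:
            return current
        current = PARENT[current]
    return None


def local_training_sets(samples, labels):
    """Build one training set per parent node, relabelled with that node's children."""
    sets = defaultdict(list)
    for node in set(PARENT.values()):
        for x, y in zip(samples, labels):
            child = child_on_path(y, node)
            if child is not None:
                sets[node].append((x, child))
    return dict(sets)


def random_oversample(pairs, rng):
    """Duplicate minority-class examples at random until all classes match the largest."""
    by_class = defaultdict(list)
    for x, y in pairs:
        by_class[y].append((x, y))
    target = max(len(items) for items in by_class.values())
    balanced = []
    for items in by_class.values():
        balanced.extend(items)
        balanced.extend(rng.choice(items) for _ in range(target - len(items)))
    return balanced


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1, so small classes count as much as large ones."""
    classes = sorted(set(y_true) | set(y_pred))
    scores = []
    for c in classes:
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Made-up imbalanced label distribution; integers stand in for image features.
labels = ["Normal"] * 60 + ["Bacterial"] * 30 + ["COVID-19"] * 10
samples = list(range(len(labels)))
rng = random.Random(0)

balanced = {
    node: random_oversample(pairs, rng)
    for node, pairs in local_training_sets(samples, labels).items()
}
for node, pairs in sorted(balanced.items()):
    print(node, dict(Counter(y for _, y in pairs)))
```

In this sketch each parent node sees only the examples that pass through it, so the imbalance is corrected locally: the root classifier balances "Normal" against "Pneumonia", while the "Pneumonia" classifier balances its own children. In a real experiment, resampling would typically be applied to the training split only, so that the evaluation data keeps its original distribution.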