Multiclass blood cancer classification using deep CNN with optimized features

IF 2.3 Q2 COMPUTER SCIENCE, THEORY & METHODS Array Pub Date : 2023-07-01 DOI:10.1016/j.array.2023.100292
Wahidur Rahman , Mohammad Gazi Golam Faruque , Kaniz Roksana , A H M Saifullah Sadi , Mohammad Motiur Rahman , Mir Mohammad Azad
{"title":"Multiclass blood cancer classification using deep CNN with optimized features","authors":"Wahidur Rahman ,&nbsp;Mohammad Gazi Golam Faruque ,&nbsp;Kaniz Roksana ,&nbsp;A H M Saifullah Sadi ,&nbsp;Mohammad Motiur Rahman ,&nbsp;Mir Mohammad Azad","doi":"10.1016/j.array.2023.100292","DOIUrl":null,"url":null,"abstract":"<div><p>Breast cancer, lung cancer, skin cancer, and blood malignancies such as leukemia and lymphoma are just a few instances of cancer, which is a collection of cells that proliferate uncontrollably within the body. Acute lymphoblastic leukemia is of one the significant form of malignancy. The hematologists frequently makes an oversight while determining a blood cancer diagnosis, which requires an excessive amount of time. Thus, this research reflects on a novel method for the grouping of the leukemia with the aid of the modern technologies like Machine Learning and Deep Learning. The proposed research pipeline is occupied into some interconnected parts like dataset building, feature extraction with pre-trained Convolutional Neural Network (CNN) architectures from each individual images of blood cells, and classification with the conventional classifiers. The dataset for this study is divided into two identical categories, Benign and Malignant, and then reshaped into four significant classes, each with three subtypes of malignant, namely, Benign, Early Pre-B, Pre-B, and Pro-B. The research first extracts the features from the individual images with CNN models and then transfers the extracted features to the features selections such as Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), and SVC Feature Selectors along with two nature inspired algorithms like Particle Swarm Optimization (PSO) and Cat Swarm Optimization (CSO). After that, research has applied the seven Machine Learning classifiers to accomplish the multi-class malignant classification. To assess the efficacy of the proposed architecture a set of experimental data have been enumerated and interpreted accordingly. The study discovered a maximum accuracy of 98.43% when solely using pre-trained CNN and classifiers. Nevertheless, after incorporating PSO and CSO, the proposed model achieved the highest accuracy of 99.84% by integrating the ResNet50 CNN architecture, SVC feature selector, and LR classifiers. Although the model has a higher accuracy rate, it does have some drawbacks. However, the proposed model may also be helpful for real-world blood cancer classification.</p></div>","PeriodicalId":8417,"journal":{"name":"Array","volume":null,"pages":null},"PeriodicalIF":2.3000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Array","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590005623000176","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 3

Abstract

Breast cancer, lung cancer, skin cancer, and blood malignancies such as leukemia and lymphoma are just a few instances of cancer, which is a collection of cells that proliferate uncontrollably within the body. Acute lymphoblastic leukemia is of one the significant form of malignancy. The hematologists frequently makes an oversight while determining a blood cancer diagnosis, which requires an excessive amount of time. Thus, this research reflects on a novel method for the grouping of the leukemia with the aid of the modern technologies like Machine Learning and Deep Learning. The proposed research pipeline is occupied into some interconnected parts like dataset building, feature extraction with pre-trained Convolutional Neural Network (CNN) architectures from each individual images of blood cells, and classification with the conventional classifiers. The dataset for this study is divided into two identical categories, Benign and Malignant, and then reshaped into four significant classes, each with three subtypes of malignant, namely, Benign, Early Pre-B, Pre-B, and Pro-B. The research first extracts the features from the individual images with CNN models and then transfers the extracted features to the features selections such as Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), and SVC Feature Selectors along with two nature inspired algorithms like Particle Swarm Optimization (PSO) and Cat Swarm Optimization (CSO). After that, research has applied the seven Machine Learning classifiers to accomplish the multi-class malignant classification. To assess the efficacy of the proposed architecture a set of experimental data have been enumerated and interpreted accordingly. The study discovered a maximum accuracy of 98.43% when solely using pre-trained CNN and classifiers. Nevertheless, after incorporating PSO and CSO, the proposed model achieved the highest accuracy of 99.84% by integrating the ResNet50 CNN architecture, SVC feature selector, and LR classifiers. Although the model has a higher accuracy rate, it does have some drawbacks. However, the proposed model may also be helpful for real-world blood cancer classification.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用具有优化特征的深度CNN对多类别血液癌症进行分类
乳腺癌、肺癌、皮肤癌和血液恶性肿瘤如白血病和淋巴瘤只是癌症的几个例子,癌症是一种在体内不受控制地增殖的细胞的集合。急性淋巴细胞白血病是恶性肿瘤的重要形式之一。血液学家在诊断血癌时经常会出现疏忽,这需要大量的时间。因此,本研究反思了一种借助机器学习和深度学习等现代技术对白血病进行分组的新方法。所提出的研究管道分为几个相互关联的部分,如数据集构建,使用预训练的卷积神经网络(CNN)架构从每个单独的血细胞图像中提取特征,以及使用常规分类器进行分类。本研究的数据集被分为两个相同的类别,Benign和Malignant,然后重塑为四个重要的类别,每个类别有三个恶性亚型,即Benign, Early Pre-B, Pre-B和Pro-B。该研究首先利用CNN模型对单个图像进行特征提取,然后结合粒子群优化(PSO)和Cat群优化(CSO)两种自然启发算法,将提取的特征转移到主成分分析(PCA)、线性判别分析(LDA)和SVC特征选择器等特征选择中。之后,研究应用了7种机器学习分类器完成了多类恶性分类。为了评估所提出的体系结构的有效性,我们列举了一组实验数据并对其进行了相应的解释。研究发现,单独使用预训练的CNN和分类器时,准确率最高可达98.43%。然而,在结合PSO和CSO之后,通过集成ResNet50 CNN架构、SVC特征选择器和LR分类器,所提出的模型达到了99.84%的最高准确率。尽管该模型具有较高的准确率,但它也存在一些缺点。然而,所提出的模型也可能有助于现实世界的血癌分类。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Array
Array Computer Science-General Computer Science
CiteScore
4.40
自引率
0.00%
发文量
93
审稿时长
45 days
期刊最新文献
DART: A Solution for decentralized federated learning model robustness analysis Autonomous UAV navigation using deep learning-based computer vision frameworks: A systematic literature review Threat intelligence named entity recognition techniques based on few-shot learning Reimagining otitis media diagnosis: A fusion of nested U-Net segmentation with graph theory-inspired feature set Modeling and supporting adaptive Complex Data-Intensive Web Systems via XML and the O-O paradigm: The OO-XAHM model
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1