Parametric optimization and comparative study of machine learning and deep learning algorithms for breast cancer diagnosis.

Breast disease Pub Date : 2024-01-01 DOI:10.3233/BD-240018
Parul Jain, Shalini Aggarwal, Sufiyan Adam, Mohsin Imam
{"title":"Parametric optimization and comparative study of machine learning and deep learning algorithms for breast cancer diagnosis.","authors":"Parul Jain, Shalini Aggarwal, Sufiyan Adam, Mohsin Imam","doi":"10.3233/BD-240018","DOIUrl":null,"url":null,"abstract":"<p><p>Breast Cancer is the leading form of cancer found in women and a major cause of increased mortality rates among them. However, manual diagnosis of the disease is time-consuming and often limited by the availability of screening systems. Thus, there is a pressing need for an automatic diagnosis system that can quickly detect cancer in its early stages. Data mining and machine learning techniques have emerged as valuable tools in developing such a system. In this study we investigated the performance of several machine learning models on the Wisconsin Breast Cancer (original) dataset with a particular emphasis on finding which models perform the best for breast cancer diagnosis. The study also explores the contrast between the proposed ANN methodology and conventional machine learning techniques. The comparison between the methods employed in the current study and those utilized in earlier research on the Wisconsin Breast Cancer dataset is also compared. The findings of this study are in line with those of previous studies which also highlighted the efficacy of SVM, Decision Tree, CART, ANN, and ELM ANN for breast cancer detection. Several classifiers achieved high accuracy, precision and F1 scores for benign and malignant tumours, respectively. It is also found that models with hyperparameter adjustment performed better than those without and boosting methods like as XGBoost, Adaboost, and Gradient Boost consistently performed well across benign and malignant tumours. The study emphasizes the significance of hyperparameter tuning and the efficacy of boosting algorithms in addressing the complexity and nonlinearity of data. Using the Wisconsin Breast Cancer (original) dataset, a detailed summary of the current status of research on breast cancer diagnosis is provided.</p>","PeriodicalId":9224,"journal":{"name":"Breast disease","volume":"43 1","pages":"257-270"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11492030/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Breast disease","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/BD-240018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Breast Cancer is the leading form of cancer found in women and a major cause of increased mortality rates among them. However, manual diagnosis of the disease is time-consuming and often limited by the availability of screening systems. Thus, there is a pressing need for an automatic diagnosis system that can quickly detect cancer in its early stages. Data mining and machine learning techniques have emerged as valuable tools in developing such a system. In this study we investigated the performance of several machine learning models on the Wisconsin Breast Cancer (original) dataset with a particular emphasis on finding which models perform the best for breast cancer diagnosis. The study also explores the contrast between the proposed ANN methodology and conventional machine learning techniques. The comparison between the methods employed in the current study and those utilized in earlier research on the Wisconsin Breast Cancer dataset is also compared. The findings of this study are in line with those of previous studies which also highlighted the efficacy of SVM, Decision Tree, CART, ANN, and ELM ANN for breast cancer detection. Several classifiers achieved high accuracy, precision and F1 scores for benign and malignant tumours, respectively. It is also found that models with hyperparameter adjustment performed better than those without and boosting methods like as XGBoost, Adaboost, and Gradient Boost consistently performed well across benign and malignant tumours. The study emphasizes the significance of hyperparameter tuning and the efficacy of boosting algorithms in addressing the complexity and nonlinearity of data. Using the Wisconsin Breast Cancer (original) dataset, a detailed summary of the current status of research on breast cancer diagnosis is provided.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于乳腺癌诊断的机器学习和深度学习算法的参数优化和比较研究。
乳腺癌是女性最常见的癌症,也是导致女性死亡率上升的主要原因。然而,人工诊断这种疾病非常耗时,而且往往受到筛查系统的限制。因此,迫切需要一种能够在癌症早期阶段快速检测出癌症的自动诊断系统。数据挖掘和机器学习技术已成为开发此类系统的重要工具。在这项研究中,我们对威斯康星乳腺癌(原始)数据集上的几种机器学习模型的性能进行了调查,重点是找出哪些模型在乳腺癌诊断中表现最佳。这项研究还探讨了拟议的 ANN 方法与传统机器学习技术之间的对比。本研究中采用的方法与早期研究中在威斯康星乳腺癌数据集上采用的方法也进行了比较。本研究的结果与之前的研究结果一致,之前的研究也强调了 SVM、决策树、CART、ANN 和 ELM ANN 在乳腺癌检测方面的功效。几种分类器对良性和恶性肿瘤分别达到了较高的准确度、精确度和 F1 分数。研究还发现,有超参数调整的模型比没有超参数调整的模型表现更好,而 XGBoost、Adaboost 和 Gradient Boost 等增强方法在良性肿瘤和恶性肿瘤中的表现也一直很好。这项研究强调了超参数调整的重要性以及提升算法在解决数据的复杂性和非线性方面的功效。利用威斯康星乳腺癌(原始)数据集,详细总结了乳腺癌诊断的研究现状。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Breast disease
Breast disease Medicine-Oncology
CiteScore
1.80
自引率
0.00%
发文量
59
期刊介绍: The recent expansion of work in the field of breast cancer inevitably will hasten discoveries that will have impact on patient outcome. The breadth of this research that spans basic science, clinical medicine, epidemiology, and public policy poses difficulties for investigators. Not only is it necessary to be facile in comprehending ideas from many disciplines, but also important to understand the public implications of these discoveries. Breast Disease publishes review issues devoted to an in-depth analysis of the scientific and public implications of recent research on a specific problem in breast cancer. Thus, the reviews will not only discuss recent discoveries but will also reflect on their impact in breast cancer research or clinical management.
期刊最新文献
Sentinel node in breast cancer as an indicator of quality in medical care: Evaluation of statistics in Colombia. Telangiectasias induced by combination tucatinib and ado-trastuzumab emtansine in a patient with metastatic breast cancer. Clinicopathological analysis of 38 male patients diagnosed with breast cancer. Impact of the COVID-19 pandemic on breast cancer pathological stage at diagnosis in Tunisian patients. Use of axillary ultrasound to guide breast cancer management in the genomic assay era.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1