Machine Learning Approaches for Early Diagnosis of Breast Cancer: A Comparative Study of Performance Evaluation

Rabia Emhamed Al Mamlook, Sujeet Shresth, Tasnim Gharaibeh, A. Almuflih, Wassnaa Al-Mawee, H. Bzizi
{"title":"Machine Learning Approaches for Early Diagnosis of Breast Cancer: A Comparative Study of Performance Evaluation","authors":"Rabia Emhamed Al Mamlook, Sujeet Shresth, Tasnim Gharaibeh, A. Almuflih, Wassnaa Al-Mawee, H. Bzizi","doi":"10.1109/eIT57321.2023.10187257","DOIUrl":null,"url":null,"abstract":"Breast cancer is a leading cause of death among women worldwide. Early detection and diagnosis are crucial to improving the chances of survival. This paper presents a study on the diagnosis of breast cancer using various machine-learning approaches. The study includes the performance evaluation of nine different techniques using confusion matrix accuracy for Sensitivity, Specificity, Precision, PME, PPV, NPV, and Model Accuracy. AdaBoost is found to have the highest Sensitivity and PME, while Random Forest and MLP gave the best Specificity and Precision. Logistic Regression is found to be the best model for accuracy with 97.8%, followed by SVM with 96.49%, Random Forest with 95.61%, and KNN & Decision Forest with 94.73%. The proposed approach is found to have the highest accuracy of 97.80% compared to other approaches studied. Our results indicate that the proposed approach using discretization can significantly improve the signal-to-noise ratio in the diagnosis of breast cancer. This approach can accurately predict and diagnose breast cancer using a subset of features.","PeriodicalId":113717,"journal":{"name":"2023 IEEE International Conference on Electro Information Technology (eIT)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Electro Information Technology (eIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/eIT57321.2023.10187257","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Breast cancer is a leading cause of death among women worldwide. Early detection and diagnosis are crucial to improving the chances of survival. This paper presents a study on the diagnosis of breast cancer using various machine-learning approaches. The study includes the performance evaluation of nine different techniques using confusion matrix accuracy for Sensitivity, Specificity, Precision, PME, PPV, NPV, and Model Accuracy. AdaBoost is found to have the highest Sensitivity and PME, while Random Forest and MLP gave the best Specificity and Precision. Logistic Regression is found to be the best model for accuracy with 97.8%, followed by SVM with 96.49%, Random Forest with 95.61%, and KNN & Decision Forest with 94.73%. The proposed approach is found to have the highest accuracy of 97.80% compared to other approaches studied. Our results indicate that the proposed approach using discretization can significantly improve the signal-to-noise ratio in the diagnosis of breast cancer. This approach can accurately predict and diagnose breast cancer using a subset of features.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于乳腺癌早期诊断的机器学习方法:性能评估的比较研究
乳腺癌是全世界妇女死亡的主要原因。早期发现和诊断对于提高生存机会至关重要。本文介绍了一项使用各种机器学习方法诊断乳腺癌的研究。该研究包括使用混淆矩阵准确度对灵敏度、特异性、精密度、PME、PPV、NPV和模型精度进行九种不同技术的性能评估。AdaBoost具有最高的灵敏度和PME,而Random Forest和MLP具有最佳的特异性和精度。Logistic回归模型的准确率为97.8%,SVM为96.49%,Random Forest为95.61%,KNN & Decision Forest为94.73%。与其他研究方法相比,该方法具有97.80%的最高准确率。我们的研究结果表明,采用离散化方法可以显著提高乳腺癌诊断的信噪比。这种方法可以使用一组特征准确地预测和诊断乳腺癌。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Correlation of Egg counts, Micro-nutrients, and NDVI Distribution for Accurate Tracking of SCN Population Density Detection Supervised Deep Learning Models for Detecting GPS Spoofing Attacks on Unmanned Aerial Vehicles ChatGPT: A Threat Against the CIA Triad of Cyber Security Smart UX-design for Rescue Operations Wearable - A Knowledge Graph Informed Visualization Approach for Information Retrieval in Emergency Situations Comparative Study of Deep Learning LSTM and 1D-CNN Models for Real-time Flood Prediction in Red River of the North, USA
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1