A Machine Learning Approach to Predict Breast Cancer Using Boosting Classifiers

Md. Mijanur Rahman, Zannatul Ferdousi, Puja Saha, R. Mayuri
{"title":"A Machine Learning Approach to Predict Breast Cancer Using Boosting Classifiers","authors":"Md. Mijanur Rahman, Zannatul Ferdousi, Puja Saha, R. Mayuri","doi":"10.21817/indjcse/2023/v14i3/231403009","DOIUrl":null,"url":null,"abstract":"Breast cancer is a prevalent disease, with the second highest incidence rate among all types of cancer. The risk of death from breast cancer is increasing due to rapid population growth, and a dependable and quick diagnostic system can assist medical professionals in disease diagnosis and lower the mortality rate. In this study, various machine-learning algorithms are examined for predicting the stages of breast cancer, and most especially in the medical field, where those methods are widely used in diagnosis and analysis for decision-making. We focused on boosting classification models and evaluated the performance of XGBoost, AdaBoost, and Gradient Boosting. Our goal is to achieve higher accuracy by using boosting classifiers with hyperparameter tuning for the prediction of breast cancer stages, precisely the distinction between \"Benign\" and \"Malignant\" types of breast cancer. The Wisconsin breast cancer dataset is employed from the UCI machine learning database. The performance of our model was evaluated using metrics such as accuracy, sensitivity, precision, specificity, AUC, and ROC curves for various strategies. After implementing the model, this study achieved the best model accuracy, and 98.60% was achieved on AdaBoost.","PeriodicalId":52250,"journal":{"name":"Indian Journal of Computer Science and Engineering","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Indian Journal of Computer Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21817/indjcse/2023/v14i3/231403009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 0

Abstract

Breast cancer is a prevalent disease, with the second highest incidence rate among all types of cancer. The risk of death from breast cancer is increasing due to rapid population growth, and a dependable and quick diagnostic system can assist medical professionals in disease diagnosis and lower the mortality rate. In this study, various machine-learning algorithms are examined for predicting the stages of breast cancer, and most especially in the medical field, where those methods are widely used in diagnosis and analysis for decision-making. We focused on boosting classification models and evaluated the performance of XGBoost, AdaBoost, and Gradient Boosting. Our goal is to achieve higher accuracy by using boosting classifiers with hyperparameter tuning for the prediction of breast cancer stages, precisely the distinction between "Benign" and "Malignant" types of breast cancer. The Wisconsin breast cancer dataset is employed from the UCI machine learning database. The performance of our model was evaluated using metrics such as accuracy, sensitivity, precision, specificity, AUC, and ROC curves for various strategies. After implementing the model, this study achieved the best model accuracy, and 98.60% was achieved on AdaBoost.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用增强分类器预测乳腺癌的机器学习方法
乳腺癌是一种流行疾病,在所有类型的癌症中发病率第二高。由于人口的快速增长,乳腺癌的死亡风险正在增加,一个可靠、快速的诊断系统可以帮助医疗专业人员进行疾病诊断,降低死亡率。在这项研究中,研究了各种机器学习算法来预测乳腺癌的阶段,尤其是在医学领域,这些方法被广泛用于诊断和决策分析。我们专注于增强分类模型,并评估了XGBoost、AdaBoost和Gradient boosting的性能。我们的目标是通过使用带有超参数调整的增强分类器来预测乳腺癌的分期,精确地区分“良性”和“恶性”类型的乳腺癌,从而达到更高的准确性。威斯康星乳腺癌数据集来自UCI机器学习数据库。使用各种策略的准确度、灵敏度、精密度、特异性、AUC和ROC曲线等指标来评估我们模型的性能。模型实现后,本研究达到了最好的模型准确率,在AdaBoost上达到了98.60%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Indian Journal of Computer Science and Engineering
Indian Journal of Computer Science and Engineering Engineering-Engineering (miscellaneous)
自引率
0.00%
发文量
146
期刊最新文献
FEATURE SELECTION AND CLASSIFICATION FOR MICROARRAY DATA USING UPGRADE CHI-SQUARE TEST ENHANCING PERFORMANCE OF TRAFFIC CLASSIFICATION WITH FEATURE SELECTION METHODS MACHINE LEARNING MODELS FOR HEART DISEASE PREDICTION-A REVIEW A DEEP LEARNING MODEL IMPLEMENTATION OF TABNET FOR PREDICTING PEPTIDE-PROTEIN INTERACTION IN CANCER EXPLORING MALARIA DISEASE BIOMARKERS: AN INTEGRATED APPROACH USING NETWORK TOPOLOGY AND ENRICHMENT ANALYSIS
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1