Prognostic prediction of breast cancer patients using machine learning models: a retrospective analysis.

IF 1.5 3区 医学 Q3 SURGERY Gland surgery Pub Date : 2024-09-30 Epub Date: 2024-09-27 DOI:10.21037/gs-24-106
Xuchun Song, Jiebin Chu, Zijie Guo, Qun Wei, Qingchuan Wang, Wenxian Hu, Linbo Wang, Wenhe Zhao, Heming Zheng, Xudong Lu, Jichun Zhou
{"title":"Prognostic prediction of breast cancer patients using machine learning models: a retrospective analysis.","authors":"Xuchun Song, Jiebin Chu, Zijie Guo, Qun Wei, Qingchuan Wang, Wenxian Hu, Linbo Wang, Wenhe Zhao, Heming Zheng, Xudong Lu, Jichun Zhou","doi":"10.21037/gs-24-106","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Breast cancer is a common and complex disease, with various clinical features affecting prognosis. Accurate prediction of prognosis is essential for guiding personalized treatment strategies. This study aimed to develop machine learning models for predicting prognosis in breast cancer patients using retrospective data.</p><p><strong>Methods: </strong>A total of 6,477 patients from Affiliated Sir Run Run Shaw Hospital were included, and their electronic medical records (EMRs) were thoroughly examined to identify 15 clinical features significantly associated with breast cancer survival. We employed eight different machine learning algorithms, including Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), and Extreme Gradient Boosting (XGBoost), to develop and evaluate the predictive performance of the models. In addition, to investigate the sensitivity of different training/testing set radio to model performance, we examined five sets of ratios: 50:50, 60:40, 70:30, 80:20, 90:10.</p><p><strong>Results: </strong>Among these models, XGBoost demonstrated the highest performance with receiver operating characteristic (ROC) area under the curve (AUC) of 0.813, accuracy of 0.739, sensitivity of 0.815, and specificity of 0.735. Further statistical analysis identified several significant predictors of prognosis, including age, tumor size, lymph node status, and hormone receptor status. The XGBoost model was found to exhibit superior predictive power compared to established prognostic models such as the Nottingham Prognostic Index (NPI) and Predict Breast. Based on the successful performance of the XGBoost model, we developed a prognosis prediction tool specifically designed for breast cancer, providing valuable insights to clinicians, and aiding them in making informed treatment decisions tailored to individual patients.</p><p><strong>Conclusions: </strong>Our study highlights the potential of machine learning models in accurately predicting prognosis for breast cancer patients, ultimately facilitating personalized treatment strategies. Further research and validation are warranted to fully integrate these models into clinical practice.</p>","PeriodicalId":12760,"journal":{"name":"Gland surgery","volume":null,"pages":null},"PeriodicalIF":1.5000,"publicationDate":"2024-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11480873/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Gland surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/gs-24-106","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/9/27 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"SURGERY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Breast cancer is a common and complex disease, with various clinical features affecting prognosis. Accurate prediction of prognosis is essential for guiding personalized treatment strategies. This study aimed to develop machine learning models for predicting prognosis in breast cancer patients using retrospective data.

Methods: A total of 6,477 patients from Affiliated Sir Run Run Shaw Hospital were included, and their electronic medical records (EMRs) were thoroughly examined to identify 15 clinical features significantly associated with breast cancer survival. We employed eight different machine learning algorithms, including Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), and Extreme Gradient Boosting (XGBoost), to develop and evaluate the predictive performance of the models. In addition, to investigate the sensitivity of different training/testing set radio to model performance, we examined five sets of ratios: 50:50, 60:40, 70:30, 80:20, 90:10.

Results: Among these models, XGBoost demonstrated the highest performance with receiver operating characteristic (ROC) area under the curve (AUC) of 0.813, accuracy of 0.739, sensitivity of 0.815, and specificity of 0.735. Further statistical analysis identified several significant predictors of prognosis, including age, tumor size, lymph node status, and hormone receptor status. The XGBoost model was found to exhibit superior predictive power compared to established prognostic models such as the Nottingham Prognostic Index (NPI) and Predict Breast. Based on the successful performance of the XGBoost model, we developed a prognosis prediction tool specifically designed for breast cancer, providing valuable insights to clinicians, and aiding them in making informed treatment decisions tailored to individual patients.

Conclusions: Our study highlights the potential of machine learning models in accurately predicting prognosis for breast cancer patients, ultimately facilitating personalized treatment strategies. Further research and validation are warranted to fully integrate these models into clinical practice.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用机器学习模型预测乳腺癌患者的预后:一项回顾性分析。
背景:乳腺癌是一种常见而复杂的疾病,各种临床特征都会影响预后。准确预测预后对于指导个性化治疗策略至关重要。本研究旨在利用回顾性数据开发预测乳腺癌患者预后的机器学习模型:方法:我们纳入了邵逸夫附属医院的6477名患者,并对他们的电子病历(EMR)进行了全面检查,以确定与乳腺癌生存率显著相关的15个临床特征。我们采用了八种不同的机器学习算法,包括逻辑回归(Logistic Regression,LR)、支持向量机(Support Vector Machine,SVM)、随机森林(Random Forest,RF)和极端梯度提升(Extreme Gradient Boosting,XGBoost),来开发和评估模型的预测性能。此外,为了研究不同训练/测试集无线电对模型性能的敏感性,我们研究了五组比例:结果:在这些模型中,XGBoost 的性能最高,接收器操作特征曲线下面积 (ROC) 为 0.813,准确率为 0.739,灵敏度为 0.815,特异性为 0.735。进一步的统计分析发现了几个重要的预后预测因素,包括年龄、肿瘤大小、淋巴结状态和激素受体状态。与诺丁汉预后指数 (NPI) 和乳腺预测 (Predict Breast) 等成熟的预后模型相比,XGBoost 模型的预测能力更胜一筹。基于 XGBoost 模型的成功表现,我们开发了一款专为乳腺癌设计的预后预测工具,为临床医生提供有价值的见解,帮助他们根据患者的具体情况做出明智的治疗决策:我们的研究凸显了机器学习模型在准确预测乳腺癌患者预后方面的潜力,最终促进了个性化治疗策略的制定。要将这些模型完全融入临床实践,还需要进一步的研究和验证。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Gland surgery
Gland surgery Medicine-Surgery
CiteScore
3.60
自引率
0.00%
发文量
113
期刊介绍: Gland Surgery (Gland Surg; GS, Print ISSN 2227-684X; Online ISSN 2227-8575) being indexed by PubMed/PubMed Central, is an open access, peer-review journal launched at May of 2012, published bio-monthly since February 2015.
期刊最新文献
Cardiovascular and fracture events analysis and intervention strategies in patients undergoing parathyroidectomy with secondary hyperparathyroidism. Centralization of adrenal surgeries and improved surgeon volume outcomes. Computed tomography-based radiomics and body composition analysis for predicting clinically relevant postoperative pancreatic fistula after pancreaticoduodenectomy. Early detection of concomitant pancreatic cancer during intraductal papillary mucinous neoplasms surveillance. Impact of location and size of minimal extrathyroidal extension on lymph node metastasis in papillary thyroid cancer: a retrospective analysis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1