Machine learning predictive models and risk factors for lymph node metastasis in non-small cell lung cancer.

IF 2.6 3区 医学 Q2 RESPIRATORY SYSTEM BMC Pulmonary Medicine Pub Date : 2024-10-22 DOI:10.1186/s12890-024-03345-7
Bo Wu, Yihui Zhu, Zhuozheng Hu, Jiajun Wu, Weijun Zhou, Maoyan Si, Xiying Cao, Zhicheng Wu, Wenxiong Zhang
{"title":"Machine learning predictive models and risk factors for lymph node metastasis in non-small cell lung cancer.","authors":"Bo Wu, Yihui Zhu, Zhuozheng Hu, Jiajun Wu, Weijun Zhou, Maoyan Si, Xiying Cao, Zhicheng Wu, Wenxiong Zhang","doi":"10.1186/s12890-024-03345-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The prognosis of non-small cell lung cancer (NSCLC) is substantially affected by lymph node metastasis (LNM), but there are no noninvasive, inexpensive methods of relatively high accuracy available to predict LNM in NSCLC patients.</p><p><strong>Methods: </strong>Clinical data on NSCLC patients were obtained from the Surveillance, Epidemiology, and End Results (SEER) database. Risk factors for LNM were recognized LASSO and multivariate logistic regression. Six predictive models were constructed with machine learning based on risk factors. The area under the receiver operating characteristic curve (AUC) was used to assess the performance of the model. Subgroup analysis with different T-stages was performed on an optimal model. A webpage LNM risk calculator for optimal model was built using the Shinyapps.io platform.</p><p><strong>Results: </strong>We enrolled 64,012 NSCLC patients, of whom 26,611 (41.57%) had LNM. Using multivariate logistic regression, we finally identified 10 independent risk factors for LNM: age, sex, race, histology, primary site, grade, T stage, M stage, tumor size, and bone metastases. GLM is the optimal model among all six machine learning models in both the training and validation cohorts. Subgroup analyses revealed that GLM has good predictability for populations with different T staging. A webpage LNM risk calculator based on GLM was posted on the shinyapps.io platform ( https://wubopredict.shinyapps.io/dynnomapp/ ).</p><p><strong>Conclusion: </strong>The predictive model based on GLM can be used to precisely predict the probability of LNM in NSCLC patients, which was proven effective in all subgroup analyses according to T staging.</p>","PeriodicalId":9148,"journal":{"name":"BMC Pulmonary Medicine","volume":"24 1","pages":"526"},"PeriodicalIF":2.6000,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11515794/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Pulmonary Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12890-024-03345-7","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RESPIRATORY SYSTEM","Score":null,"Total":0}
引用次数: 0

Abstract

Background: The prognosis of non-small cell lung cancer (NSCLC) is substantially affected by lymph node metastasis (LNM), but there are no noninvasive, inexpensive methods of relatively high accuracy available to predict LNM in NSCLC patients.

Methods: Clinical data on NSCLC patients were obtained from the Surveillance, Epidemiology, and End Results (SEER) database. Risk factors for LNM were recognized LASSO and multivariate logistic regression. Six predictive models were constructed with machine learning based on risk factors. The area under the receiver operating characteristic curve (AUC) was used to assess the performance of the model. Subgroup analysis with different T-stages was performed on an optimal model. A webpage LNM risk calculator for optimal model was built using the Shinyapps.io platform.

Results: We enrolled 64,012 NSCLC patients, of whom 26,611 (41.57%) had LNM. Using multivariate logistic regression, we finally identified 10 independent risk factors for LNM: age, sex, race, histology, primary site, grade, T stage, M stage, tumor size, and bone metastases. GLM is the optimal model among all six machine learning models in both the training and validation cohorts. Subgroup analyses revealed that GLM has good predictability for populations with different T staging. A webpage LNM risk calculator based on GLM was posted on the shinyapps.io platform ( https://wubopredict.shinyapps.io/dynnomapp/ ).

Conclusion: The predictive model based on GLM can be used to precisely predict the probability of LNM in NSCLC patients, which was proven effective in all subgroup analyses according to T staging.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
非小细胞肺癌淋巴结转移的机器学习预测模型和风险因素。
背景:非小细胞肺癌(NSCLC)的预后受到淋巴结转移(LNM)的严重影响,但目前还没有准确性相对较高的无创、廉价方法来预测 NSCLC 患者的淋巴结转移:方法:从监测、流行病学和最终结果(SEER)数据库中获取 NSCLC 患者的临床数据。对LNM的风险因素进行了LASSO和多变量逻辑回归识别。根据风险因素通过机器学习构建了六个预测模型。接受者操作特征曲线下面积(AUC)用于评估模型的性能。根据最佳模型对不同 T 分期进行了分组分析。我们使用 Shinyapps.io 平台建立了一个网页版 LNM 风险计算器:我们招募了64012名NSCLC患者,其中26611人(41.57%)患有LNM。通过多变量逻辑回归,我们最终确定了LNM的10个独立风险因素:年龄、性别、种族、组织学、原发部位、分级、T期、M期、肿瘤大小和骨转移。在训练组和验证组中,GLM 是所有六个机器学习模型中的最佳模型。亚组分析显示,GLM 对不同 T 分期的人群具有良好的预测能力。基于GLM的LNM风险计算器网页已发布在shinyapps.io平台上( https://wubopredict.shinyapps.io/dynnomapp/ )。结论:结论:基于GLM的预测模型可用于精确预测NSCLC患者发生LNM的概率,在根据T分期进行的所有亚组分析中均被证明有效。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
BMC Pulmonary Medicine
BMC Pulmonary Medicine RESPIRATORY SYSTEM-
CiteScore
4.40
自引率
3.20%
发文量
423
审稿时长
6-12 weeks
期刊介绍: BMC Pulmonary Medicine is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of pulmonary and associated disorders, as well as related molecular genetics, pathophysiology, and epidemiology.
期刊最新文献
Predicting solitary pulmonary lesions in breast cancer patients using 18fluorodeoxyglucose-positron emission tomography/computed tomography combined with clinicopathological characteristics. Assessment of novel cardiovascular biomarkers in chronic obstructive pulmonary disease. Comparison of efficacy and safety of neoadjuvant immunochemotherapy in young and elderly patients with IIA-IIIB non-small-cell lung cancer in real-world practice. Development of a predictive nomogram for early identification of pulmonary embolism in hospitalized patients: a retrospective cohort study. A nomogram for predicting lymphovascular invasion in lung adenocarcinoma: a retrospective study.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1