Lung Cancer Detection Using Bayesian Networks: A Retrospective Development and Validation Study on a Danish Population of High-Risk Individuals

IF 3.1 2区 医学 Q2 ONCOLOGY Cancer Medicine Pub Date : 2025-01-31 DOI:10.1002/cam4.70458
Margrethe Bang Henriksen, Florian Van Daalen, Leonard Wee, Torben Frøstrup Hansen, Lars Henrik Jensen, Claus Lohman Brasen, Ole Hilberg, Inigo Bermejo
{"title":"Lung Cancer Detection Using Bayesian Networks: A Retrospective Development and Validation Study on a Danish Population of High-Risk Individuals","authors":"Margrethe Bang Henriksen,&nbsp;Florian Van Daalen,&nbsp;Leonard Wee,&nbsp;Torben Frøstrup Hansen,&nbsp;Lars Henrik Jensen,&nbsp;Claus Lohman Brasen,&nbsp;Ole Hilberg,&nbsp;Inigo Bermejo","doi":"10.1002/cam4.70458","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>Lung cancer (LC) is the top cause of cancer deaths globally, prompting many countries to adopt LC screening programs. While screening typically relies on age and smoking intensity, more efficient risk models exist. We devised a Bayesian network (BN) for LC detection, testing its resilience with varying degrees of missing data and comparing it to a prior machine learning (ML) model.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>We analyzed data from 9940 patients referred for LC assessment in Southern Denmark from 2009 to 2018. Variables included age, sex, smoking, and lab results. Our experiments varied missing data (0%–30%), BN structure (expert-based vs. data-driven), and discretization method (standard vs. data-driven).</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>Across all missing data levels, area under the curve (AUC) remained steady, ranging from 0.737 to 0.757, compared to the ML model's AUC of 0.77. BN structure and discretization method had minimal impact on performance. BNs were well calibrated overall, with a net benefit in decision curve analysis when predicted risk exceeded 5%.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>BN models showed resilience with up to 30% missing values. Moreover, these BNs exhibited similar performance, calibration, and clinical utility compared to the machine learning model developed using the same dataset. Considering their effectiveness in handling missing data, BNs emerge as a relevant method for the development of future lung cancer detection models.</p>\n </section>\n </div>","PeriodicalId":139,"journal":{"name":"Cancer Medicine","volume":"14 3","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2025-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11783238/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Medicine","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cam4.70458","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background

Lung cancer (LC) is the top cause of cancer deaths globally, prompting many countries to adopt LC screening programs. While screening typically relies on age and smoking intensity, more efficient risk models exist. We devised a Bayesian network (BN) for LC detection, testing its resilience with varying degrees of missing data and comparing it to a prior machine learning (ML) model.

Methods

We analyzed data from 9940 patients referred for LC assessment in Southern Denmark from 2009 to 2018. Variables included age, sex, smoking, and lab results. Our experiments varied missing data (0%–30%), BN structure (expert-based vs. data-driven), and discretization method (standard vs. data-driven).

Results

Across all missing data levels, area under the curve (AUC) remained steady, ranging from 0.737 to 0.757, compared to the ML model's AUC of 0.77. BN structure and discretization method had minimal impact on performance. BNs were well calibrated overall, with a net benefit in decision curve analysis when predicted risk exceeded 5%.

Conclusion

BN models showed resilience with up to 30% missing values. Moreover, these BNs exhibited similar performance, calibration, and clinical utility compared to the machine learning model developed using the same dataset. Considering their effectiveness in handling missing data, BNs emerge as a relevant method for the development of future lung cancer detection models.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
肺癌检测使用贝叶斯网络:对丹麦高危人群的回顾性发展和验证研究。
背景:肺癌(LC)是全球癌症死亡的首要原因,促使许多国家采用LC筛查计划。虽然筛查通常依赖于年龄和吸烟强度,但存在更有效的风险模型。我们设计了一个贝叶斯网络(BN)用于LC检测,用不同程度的缺失数据测试其弹性,并将其与先前的机器学习(ML)模型进行比较。方法:我们分析了2009年至2018年丹麦南部9940例LC评估患者的数据。变量包括年龄、性别、吸烟和实验室结果。我们的实验改变了缺失数据(0%-30%)、BN结构(基于专家vs数据驱动)和离散化方法(标准vs数据驱动)。结果:在所有缺失的数据水平上,曲线下面积(AUC)保持稳定,范围从0.737到0.757,而ML模型的AUC为0.77。BN结构和离散化方法对性能影响最小。bn总体上校准得很好,当预测风险超过5%时,在决策曲线分析中具有净收益。结论:BN模型显示弹性,缺失值高达30%。此外,与使用相同数据集开发的机器学习模型相比,这些bp网络表现出相似的性能、校准和临床实用性。考虑到它们在处理缺失数据方面的有效性,BNs成为开发未来肺癌检测模型的相关方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Cancer Medicine
Cancer Medicine ONCOLOGY-
CiteScore
5.50
自引率
2.50%
发文量
907
审稿时长
19 weeks
期刊介绍: Cancer Medicine is a peer-reviewed, open access, interdisciplinary journal providing rapid publication of research from global biomedical researchers across the cancer sciences. The journal will consider submissions from all oncologic specialties, including, but not limited to, the following areas: Clinical Cancer Research Translational research ∙ clinical trials ∙ chemotherapy ∙ radiation therapy ∙ surgical therapy ∙ clinical observations ∙ clinical guidelines ∙ genetic consultation ∙ ethical considerations Cancer Biology: Molecular biology ∙ cellular biology ∙ molecular genetics ∙ genomics ∙ immunology ∙ epigenetics ∙ metabolic studies ∙ proteomics ∙ cytopathology ∙ carcinogenesis ∙ drug discovery and delivery. Cancer Prevention: Behavioral science ∙ psychosocial studies ∙ screening ∙ nutrition ∙ epidemiology and prevention ∙ community outreach. Bioinformatics: Gene expressions profiles ∙ gene regulation networks ∙ genome bioinformatics ∙ pathwayanalysis ∙ prognostic biomarkers. Cancer Medicine publishes original research articles, systematic reviews, meta-analyses, and research methods papers, along with invited editorials and commentaries. Original research papers must report well-conducted research with conclusions supported by the data presented in the paper.
期刊最新文献
Umbrella Reviews Conducted in an Oncology Healthcare Context Focusing on Supportive Care, Systems, and Models of Care: A Review of Umbrella Reviews. Initial Site of Metastasis Influences Prognosis in Pancreatic Ductal Adenocarcinoma. Association Between Nasopharyngeal Carcinoma and Chinese Medicine Constitution: A Meta-Analysis. A Butyrate Metabolism-Related Gene Signature Predicts Prognosis, Immune Landscape, and Immunotherapy Efficacy in Breast Cancer. The Mediating Effect of Socioeconomic Status and Persistent Poverty on Racial and Ethnic Disparities in Pediatric Cancer Mortality in the United States.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1