Machine learning-derived peripheral blood transcriptomic biomarkers for early lung cancer diagnosis: Unveiling tumor-immune interaction mechanisms.

IF 5 3区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY BioFactors Pub Date : 2024-10-16 DOI:10.1002/biof.2129
Xiaohua Li, Xuebing Li, Jiangyue Qin, Lei Lei, Hua Guo, Xi Zheng, Xuefeng Zeng
{"title":"Machine learning-derived peripheral blood transcriptomic biomarkers for early lung cancer diagnosis: Unveiling tumor-immune interaction mechanisms.","authors":"Xiaohua Li, Xuebing Li, Jiangyue Qin, Lei Lei, Hua Guo, Xi Zheng, Xuefeng Zeng","doi":"10.1002/biof.2129","DOIUrl":null,"url":null,"abstract":"<p><p>Lung cancer continues to be the leading cause of cancer-related mortality worldwide. Early detection and a comprehensive understanding of tumor-immune interactions are crucial for improving patient outcomes. This study aimed to develop a novel biomarker panel utilizing peripheral blood transcriptomics and machine learning algorithms for early lung cancer diagnosis, while simultaneously providing insights into tumor-immune crosstalk mechanisms. Leveraging a training cohort (GSE135304), we employed multiple machine learning algorithms to formulate a Lung Cancer Diagnostic Score (LCDS) based on peripheral blood transcriptomic features. The LCDS model's performance was evaluated using the area under the receiver operating characteristic (ROC) curve (AUC) in multiple validation cohorts (GSE42834, GSE157086, and an in-house dataset). Peripheral blood samples were obtained from 20 lung cancer patients and 10 healthy control subjects, representing an in-house cohort recruited at the Sixth People's Hospital of Chengdu. We employed advanced bioinformatics techniques to explore tumor-immune interactions through comprehensive immune infiltration and pathway enrichment analyses. Initial screening identified 844 differentially expressed genes, which were subsequently refined to 87 genes using the Boruta feature selection algorithm. The random forest (RF) algorithm demonstrated the highest accuracy in constructing the LCDS model, yielding a mean AUC of 0.938. Lower LCDS values were significantly associated with elevated immune scores and increased CD4+ and CD8+ T-cell infiltration, indicative of enhanced antitumor-immune responses. Higher LCDS scores correlated with activation of hypoxia, peroxisome proliferator-activated receptor (PPAR), and Toll-like receptor (TLR) signaling pathways, as well as reduced DNA damage repair pathway scores. Our study presents a novel, machine learning-derived peripheral blood transcriptomic biomarker panel with potential applications in early lung cancer diagnosis. The LCDS model not only demonstrates high accuracy in distinguishing lung cancer patients from healthy individuals but also offers valuable insights into tumor-immune interactions and underlying cancer biology. This approach may facilitate early lung cancer detection and contribute to a deeper understanding of the molecular and cellular mechanisms underlying tumor-immune crosstalk. Furthermore, our findings on the relationship between LCDS and immune infiltration patterns may have implications for future research on therapeutic strategies targeting the immune system in lung cancer.</p>","PeriodicalId":8923,"journal":{"name":"BioFactors","volume":" ","pages":""},"PeriodicalIF":5.0000,"publicationDate":"2024-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BioFactors","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/biof.2129","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Lung cancer continues to be the leading cause of cancer-related mortality worldwide. Early detection and a comprehensive understanding of tumor-immune interactions are crucial for improving patient outcomes. This study aimed to develop a novel biomarker panel utilizing peripheral blood transcriptomics and machine learning algorithms for early lung cancer diagnosis, while simultaneously providing insights into tumor-immune crosstalk mechanisms. Leveraging a training cohort (GSE135304), we employed multiple machine learning algorithms to formulate a Lung Cancer Diagnostic Score (LCDS) based on peripheral blood transcriptomic features. The LCDS model's performance was evaluated using the area under the receiver operating characteristic (ROC) curve (AUC) in multiple validation cohorts (GSE42834, GSE157086, and an in-house dataset). Peripheral blood samples were obtained from 20 lung cancer patients and 10 healthy control subjects, representing an in-house cohort recruited at the Sixth People's Hospital of Chengdu. We employed advanced bioinformatics techniques to explore tumor-immune interactions through comprehensive immune infiltration and pathway enrichment analyses. Initial screening identified 844 differentially expressed genes, which were subsequently refined to 87 genes using the Boruta feature selection algorithm. The random forest (RF) algorithm demonstrated the highest accuracy in constructing the LCDS model, yielding a mean AUC of 0.938. Lower LCDS values were significantly associated with elevated immune scores and increased CD4+ and CD8+ T-cell infiltration, indicative of enhanced antitumor-immune responses. Higher LCDS scores correlated with activation of hypoxia, peroxisome proliferator-activated receptor (PPAR), and Toll-like receptor (TLR) signaling pathways, as well as reduced DNA damage repair pathway scores. Our study presents a novel, machine learning-derived peripheral blood transcriptomic biomarker panel with potential applications in early lung cancer diagnosis. The LCDS model not only demonstrates high accuracy in distinguishing lung cancer patients from healthy individuals but also offers valuable insights into tumor-immune interactions and underlying cancer biology. This approach may facilitate early lung cancer detection and contribute to a deeper understanding of the molecular and cellular mechanisms underlying tumor-immune crosstalk. Furthermore, our findings on the relationship between LCDS and immune infiltration patterns may have implications for future research on therapeutic strategies targeting the immune system in lung cancer.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于早期肺癌诊断的机器学习衍生外周血转录组生物标记物:揭示肿瘤-免疫相互作用机制
肺癌仍然是全球癌症相关死亡的主要原因。早期检测和全面了解肿瘤与免疫的相互作用对改善患者预后至关重要。本研究旨在利用外周血转录组学和机器学习算法开发一种新型生物标记物面板,用于早期肺癌诊断,同时深入了解肿瘤-免疫串扰机制。利用训练队列(GSE135304),我们采用了多种机器学习算法,根据外周血转录组特征制定了肺癌诊断评分(LCDS)。LCDS 模型的性能通过多个验证队列(GSE42834、GSE157086 和一个内部数据集)中的接收者操作特征曲线(ROC)下面积(AUC)进行评估。外周血样本来自成都市第六人民医院招募的 20 名肺癌患者和 10 名健康对照受试者。我们采用先进的生物信息学技术,通过全面的免疫浸润和通路富集分析来探索肿瘤与免疫之间的相互作用。初步筛选确定了 844 个差异表达基因,随后使用 Boruta 特征选择算法将其细化为 87 个基因。随机森林(RF)算法在构建 LCDS 模型时表现出最高的准确性,平均 AUC 为 0.938。较低的 LCDS 值与免疫评分升高、CD4+ 和 CD8+ T 细胞浸润增加明显相关,表明抗肿瘤免疫反应增强。较高的LCDS评分与缺氧、过氧化物酶体增殖激活受体(PPAR)和Toll样受体(TLR)信号通路的激活以及DNA损伤修复通路评分的降低相关。我们的研究提出了一种新颖的、由机器学习衍生的外周血转录组生物标记物面板,有望应用于早期肺癌诊断。LCDS 模型不仅在区分肺癌患者和健康人方面表现出很高的准确性,而且还为肿瘤-免疫相互作用和潜在的癌症生物学提供了有价值的见解。这种方法可能有助于早期肺癌检测,并有助于加深对肿瘤-免疫串扰的分子和细胞机制的理解。此外,我们关于 LCDS 与免疫浸润模式之间关系的研究结果可能会对未来针对肺癌免疫系统的治疗策略研究产生影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
BioFactors
BioFactors 生物-内分泌学与代谢
CiteScore
11.50
自引率
3.30%
发文量
96
审稿时长
6-12 weeks
期刊介绍: BioFactors, a journal of the International Union of Biochemistry and Molecular Biology, is devoted to the rapid publication of highly significant original research articles and reviews in experimental biology in health and disease. The word “biofactors” refers to the many compounds that regulate biological functions. Biological factors comprise many molecules produced or modified by living organisms, and present in many essential systems like the blood, the nervous or immunological systems. A non-exhaustive list of biological factors includes neurotransmitters, cytokines, chemokines, hormones, coagulation factors, transcription factors, signaling molecules, receptor ligands and many more. In the group of biofactors we can accommodate several classical molecules not synthetized in the body such as vitamins, micronutrients or essential trace elements. In keeping with this unified view of biochemistry, BioFactors publishes research dealing with the identification of new substances and the elucidation of their functions at the biophysical, biochemical, cellular and human level as well as studies revealing novel functions of already known biofactors. The journal encourages the submission of studies that use biochemistry, biophysics, cell and molecular biology and/or cell signaling approaches.
期刊最新文献
Insights into an indolicidin-derived low-toxic anti-microbial peptide's efficacy against bacterial cells while preserving eukaryotic cell viability. Investigating the dysregulation of genes associated with glucose and lipid metabolism in gastric cancer and their influence on immunity and prognosis. Construction of lysosome-related prognostic signature to predict the survival outcomes and selecting suitable drugs for patients with HNSCC. Navigating the immune landscape with plasma cells: A pan-cancer signature for precision immunotherapy. Machine learning models reveal ARHGAP11A's impact on lymph node metastasis and stemness in NSCLC.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1