Machine learning-driven estimation of mutational burden highlights DNAH5 as a prognostic marker in colorectal cancer.

IF 5.7 2区 生物学 Q1 BIOLOGY Biology Direct Pub Date : 2024-11-14 DOI:10.1186/s13062-024-00564-0
Yangyang Fang, Tianmei Fu, Qian Zhang, Ziqing Xiong, Kuai Yu, Aiping Le
{"title":"Machine learning-driven estimation of mutational burden highlights DNAH5 as a prognostic marker in colorectal cancer.","authors":"Yangyang Fang, Tianmei Fu, Qian Zhang, Ziqing Xiong, Kuai Yu, Aiping Le","doi":"10.1186/s13062-024-00564-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Tumor Mutational Burden (TMB) have emerged as pivotal predictive biomarkers in determining prognosis and response to immunotherapy in colorectal cancer (CRC) patients. While Whole Exome Sequencing (WES) stands as the gold standard for TMB assessment, carry substantial costs and demand considerable time commitments. Additionally, the heterogeneity among high-TMB patients remains poorly characterized.</p><p><strong>Methods: </strong>We employed eight advanced machine learning algorithms to develop gene-panel-based models for TMB estimation. To rigorously compare and validate these TMB estimation models, four external cohorts, involving 1,956 patients, were used. Furthermore, we computed the Pearson correlation coefficient between the estimated TMB and tumor neoantigen levels to elucidate their association. CD8<sup>+</sup> tumor-infiltrating lymphocyte (TIL) density was assessed via immunohistochemistry.</p><p><strong>Results: </strong>The TMB estimation model based on the Lasso algorithm, incorporating 20 genes, exhibiting satisfactory performance across multiple independent cohorts (R<sup>2</sup> ≥ 0.859). This 20-gene TMB model proved to be an independent prognostic indicator for the progression-free survival (PFS) of CRC patients (p = 0.001). DNAH5 mutations were associated with a more favorable prognosis in high-TMB CRC patients, and correlated strongly with tumor neoantigen levels and CD8<sup>+</sup> TIL density.</p><p><strong>Conclusions: </strong>The 20-gene model offers a cost-efficient approach to precisely estimating TMB, providing prognosis in patients with CRC. Incorporating DNAH5 within this model further refines the categorization of patients with elevated TMB. Utilizing the 20-gene model facilitates the stratification of patients with CRC, enabling more precise treatment planning.</p>","PeriodicalId":9164,"journal":{"name":"Biology Direct","volume":"19 1","pages":"116"},"PeriodicalIF":5.7000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11566893/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biology Direct","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13062-024-00564-0","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Tumor Mutational Burden (TMB) have emerged as pivotal predictive biomarkers in determining prognosis and response to immunotherapy in colorectal cancer (CRC) patients. While Whole Exome Sequencing (WES) stands as the gold standard for TMB assessment, carry substantial costs and demand considerable time commitments. Additionally, the heterogeneity among high-TMB patients remains poorly characterized.

Methods: We employed eight advanced machine learning algorithms to develop gene-panel-based models for TMB estimation. To rigorously compare and validate these TMB estimation models, four external cohorts, involving 1,956 patients, were used. Furthermore, we computed the Pearson correlation coefficient between the estimated TMB and tumor neoantigen levels to elucidate their association. CD8+ tumor-infiltrating lymphocyte (TIL) density was assessed via immunohistochemistry.

Results: The TMB estimation model based on the Lasso algorithm, incorporating 20 genes, exhibiting satisfactory performance across multiple independent cohorts (R2 ≥ 0.859). This 20-gene TMB model proved to be an independent prognostic indicator for the progression-free survival (PFS) of CRC patients (p = 0.001). DNAH5 mutations were associated with a more favorable prognosis in high-TMB CRC patients, and correlated strongly with tumor neoantigen levels and CD8+ TIL density.

Conclusions: The 20-gene model offers a cost-efficient approach to precisely estimating TMB, providing prognosis in patients with CRC. Incorporating DNAH5 within this model further refines the categorization of patients with elevated TMB. Utilizing the 20-gene model facilitates the stratification of patients with CRC, enabling more precise treatment planning.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
机器学习驱动的突变负荷估算突出了 DNAH5 作为结直肠癌预后标志物的作用。
背景:肿瘤突变负荷(TMB)已成为决定结直肠癌(CRC)患者预后和对免疫疗法反应的关键性预测生物标志物。虽然全外显子组测序(WES)是评估 TMB 的黄金标准,但其成本高昂,需要投入大量时间。此外,高TMB患者的异质性特征仍不明显:我们采用了八种先进的机器学习算法来开发基于基因组的 TMB 估算模型。为了严格比较和验证这些 TMB 估算模型,我们使用了四个外部队列,涉及 1,956 名患者。此外,我们还计算了估计的TMB与肿瘤新抗原水平之间的皮尔逊相关系数,以阐明两者之间的关联。CD8+肿瘤浸润淋巴细胞(TIL)密度通过免疫组化进行评估:基于Lasso算法的TMB估计模型包含20个基因,在多个独立队列中表现出令人满意的性能(R2≥0.859)。该 20 基因 TMB 模型被证明是 CRC 患者无进展生存期(PFS)的独立预后指标(p = 0.001)。DNAH5突变与高TMB CRC患者更有利的预后相关,并与肿瘤新抗原水平和CD8+ TIL密度密切相关:20基因模型为精确估算TMB提供了一种经济有效的方法,可为CRC患者提供预后信息。将 DNAH5 纳入该模型可进一步完善对 TMB 升高患者的分类。利用 20 基因模型有助于对 CRC 患者进行分层,从而制定更精确的治疗计划。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Biology Direct
Biology Direct 生物-生物学
CiteScore
6.40
自引率
10.90%
发文量
32
审稿时长
7 months
期刊介绍: Biology Direct serves the life science research community as an open access, peer-reviewed online journal, providing authors and readers with an alternative to the traditional model of peer review. Biology Direct considers original research articles, hypotheses, comments, discovery notes and reviews in subject areas currently identified as those most conducive to the open review approach, primarily those with a significant non-experimental component.
期刊最新文献
A glutamine metabolish-associated prognostic model to predict prognosis and therapeutic responses of hepatocellular carcinoma. miPEP31 alleviates sepsis development by regulating Chi3l1-dependent macrophage polarization. Machine learning-driven estimation of mutational burden highlights DNAH5 as a prognostic marker in colorectal cancer. NID1 promotes laryngeal cancer stemness via activating WNT pathway. Active enhancers: recent research advances and insights into disease.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1