Bio-primed machine learning to enhance discovery of relevant biomarkers.

IF 6.8 1区 医学 Q1 ONCOLOGY NPJ Precision Oncology Pub Date : 2025-02-06 DOI:10.1038/s41698-025-00825-9
David M Henke, Alexander Renwick, Joseph R Zoeller, Jitendra K Meena, Nicholas J Neill, Elizabeth A Bowling, Kristen L Meerbrey, Thomas F Westbrook, Lukas M Simon
{"title":"Bio-primed machine learning to enhance discovery of relevant biomarkers.","authors":"David M Henke, Alexander Renwick, Joseph R Zoeller, Jitendra K Meena, Nicholas J Neill, Elizabeth A Bowling, Kristen L Meerbrey, Thomas F Westbrook, Lukas M Simon","doi":"10.1038/s41698-025-00825-9","DOIUrl":null,"url":null,"abstract":"<p><p>Precision medicine relies on identifying reliable biomarkers for gene dependencies to tailor individualized therapeutic strategies. The advent of high-throughput technologies presents unprecedented opportunities to explore molecular disease mechanisms but also challenges due to high dimensionality and collinearity among features. Traditional statistical methods often fall short in this context, necessitating novel computational approaches that harness the full potential of big data in bioinformatics. Here, we introduce a novel machine learning approach extending the Least Absolute Shrinkage and Selection Operator (LASSO) regression framework to incorporate biological knowledge, such as protein-protein interaction databases, into the regularization process. This bio-primed approach prioritizes variables that are both statistically significant and biologically relevant. Applying our method to multiple dependency datasets, we identified biomarkers which traditional methods overlooked. Our biologically informed LASSO method effectively identifies relevant biomarkers from high-dimensional collinear data, bridging the gap between statistical rigor and biological insight. This method holds promise for advancing personalized medicine by uncovering novel therapeutic targets and understanding the complex interplay of genetic and molecular factors in disease.</p>","PeriodicalId":19433,"journal":{"name":"NPJ Precision Oncology","volume":"9 1","pages":"39"},"PeriodicalIF":6.8000,"publicationDate":"2025-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11802771/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NPJ Precision Oncology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1038/s41698-025-00825-9","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Precision medicine relies on identifying reliable biomarkers for gene dependencies to tailor individualized therapeutic strategies. The advent of high-throughput technologies presents unprecedented opportunities to explore molecular disease mechanisms but also challenges due to high dimensionality and collinearity among features. Traditional statistical methods often fall short in this context, necessitating novel computational approaches that harness the full potential of big data in bioinformatics. Here, we introduce a novel machine learning approach extending the Least Absolute Shrinkage and Selection Operator (LASSO) regression framework to incorporate biological knowledge, such as protein-protein interaction databases, into the regularization process. This bio-primed approach prioritizes variables that are both statistically significant and biologically relevant. Applying our method to multiple dependency datasets, we identified biomarkers which traditional methods overlooked. Our biologically informed LASSO method effectively identifies relevant biomarkers from high-dimensional collinear data, bridging the gap between statistical rigor and biological insight. This method holds promise for advancing personalized medicine by uncovering novel therapeutic targets and understanding the complex interplay of genetic and molecular factors in disease.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
生物启动机器学习,以加强相关生物标志物的发现。
精准医学依赖于确定基因依赖性的可靠生物标志物来定制个性化的治疗策略。高通量技术的出现为探索分子疾病机制提供了前所未有的机会,但由于特征之间的高维性和共线性,也带来了挑战。在这种情况下,传统的统计方法往往不足,需要新的计算方法来利用生物信息学中大数据的全部潜力。在这里,我们引入了一种新的机器学习方法,扩展了最小绝对收缩和选择算子(LASSO)回归框架,将生物知识(如蛋白质-蛋白质相互作用数据库)纳入正则化过程。这种生物启动的方法优先考虑具有统计学意义和生物学相关性的变量。将我们的方法应用于多个依赖数据集,我们确定了传统方法忽略的生物标志物。我们的生物信息LASSO方法有效地从高维共线数据中识别相关生物标志物,弥合了统计严谨性和生物学洞察力之间的差距。这种方法有望通过发现新的治疗靶点和了解疾病中遗传和分子因素的复杂相互作用来推进个性化医疗。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
9.90
自引率
1.30%
发文量
87
审稿时长
18 weeks
期刊介绍: Online-only and open access, npj Precision Oncology is an international, peer-reviewed journal dedicated to showcasing cutting-edge scientific research in all facets of precision oncology, spanning from fundamental science to translational applications and clinical medicine.
期刊最新文献
A first-in-human phase 1 study of the SHP2 inhibitor BBP-398 in patients with advanced solid tumors. Pembrolizumab in advanced malignant peripheral nerve sheath tumors: a single-arm phase 2 trial. Clinical and molecular characteristics of Class II and III BRAF mutations in colorectal cancer. FLT3-SYK inhibitor and Ixazomib combination impact HOXA and oxidative stress control by β-catenin, SQSTM1 and NRF2 in AML. Genomic landscape and clinical impact of BRCA1/2 pathogenic variants in metastatic castration-resistant prostate cancer.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1