Development of a Diagnostic Model for Focal Segmental Glomerulosclerosis: Integrating Machine Learning on Activated Pathways and Clinical Validation.

IF 2 4区 医学 Q2 MEDICINE, GENERAL & INTERNAL International Journal of General Medicine Pub Date : 2025-02-26 eCollection Date: 2025-01-01 DOI:10.2147/IJGM.S498407
Yating Ge, Xueqi Liu, Jinlian Shu, Xiao Jiang, Yonggui Wu
{"title":"Development of a Diagnostic Model for Focal Segmental Glomerulosclerosis: Integrating Machine Learning on Activated Pathways and Clinical Validation.","authors":"Yating Ge, Xueqi Liu, Jinlian Shu, Xiao Jiang, Yonggui Wu","doi":"10.2147/IJGM.S498407","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Focal segmental glomerulosclerosis (FSGS) represents a major global health challenge, with its incidence rising in parallel with advances in diagnostic techniques and the growing prevalence of chronic diseases. This study seeks to enhance the diagnostic accuracy of FSGS by integrating machine learning approaches to identify activated pathways, complemented by robust clinical validation.</p><p><strong>Methods: </strong>We analyzed data from 163 FSGS patients and 42 living donors across multiple GEO cohorts via the ComBat algorithm to address batch effects and ensure the comparability of gene expression profiles. Gene set enrichment analysis (GSEA) identified key signaling pathways involved in FSGS pathogenesis. We then developed a highly accurate diagnostic model by integrating nine machine learning algorithms into 101 combinations, achieving near-perfect AUC values across training, validation, and external cohorts. The model identified six genes as potential biomarkers for FSGS. Additionally, immune cell infiltration patterns, particularly those involving natural killer (NK) cells, were explored, revealing the complex interplay between genetics and the immune response in FSGS patients. Immunohistochemical analysis validated the expression of the key markers CD99 and OAZ2 and confirmed the association between NK cells and FSGS.</p><p><strong>Results: </strong>The glmBoost+Ridge model exhibited exceptional diagnostic accuracy, achieving an AUC of 0.998 using just six genes: BANF1, TUSC2, SMAD3, TGFB1, CD99, and OAZ2. The prediction score was calculated as follows: score = (0.3997×BANF1) + (0.5543×TUSC2) + (0.5279×SMAD3) + (0.4118×TGFB1) + (0.8665×CD99) + (0.5996×OAZ2). Immunohistochemical analysis confirmed significantly elevated expression levels of CD99 and OAZ2 in the glomeruli and tubulointerstitial tissues of FSGS patients compared with those of controls.</p><p><strong>Conclusion: </strong>This study demonstrates a highly accurate machine learning model for FSGS diagnosis. Immunohistochemical validation confirmed elevated expression of CD99 and OAZ2, offering valuable insights into FSGS pathogenesis and potential biomarkers for clinical application.</p>","PeriodicalId":14131,"journal":{"name":"International Journal of General Medicine","volume":"18 ","pages":"1127-1142"},"PeriodicalIF":2.0000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11872063/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of General Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/IJGM.S498407","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Focal segmental glomerulosclerosis (FSGS) represents a major global health challenge, with its incidence rising in parallel with advances in diagnostic techniques and the growing prevalence of chronic diseases. This study seeks to enhance the diagnostic accuracy of FSGS by integrating machine learning approaches to identify activated pathways, complemented by robust clinical validation.

Methods: We analyzed data from 163 FSGS patients and 42 living donors across multiple GEO cohorts via the ComBat algorithm to address batch effects and ensure the comparability of gene expression profiles. Gene set enrichment analysis (GSEA) identified key signaling pathways involved in FSGS pathogenesis. We then developed a highly accurate diagnostic model by integrating nine machine learning algorithms into 101 combinations, achieving near-perfect AUC values across training, validation, and external cohorts. The model identified six genes as potential biomarkers for FSGS. Additionally, immune cell infiltration patterns, particularly those involving natural killer (NK) cells, were explored, revealing the complex interplay between genetics and the immune response in FSGS patients. Immunohistochemical analysis validated the expression of the key markers CD99 and OAZ2 and confirmed the association between NK cells and FSGS.

Results: The glmBoost+Ridge model exhibited exceptional diagnostic accuracy, achieving an AUC of 0.998 using just six genes: BANF1, TUSC2, SMAD3, TGFB1, CD99, and OAZ2. The prediction score was calculated as follows: score = (0.3997×BANF1) + (0.5543×TUSC2) + (0.5279×SMAD3) + (0.4118×TGFB1) + (0.8665×CD99) + (0.5996×OAZ2). Immunohistochemical analysis confirmed significantly elevated expression levels of CD99 and OAZ2 in the glomeruli and tubulointerstitial tissues of FSGS patients compared with those of controls.

Conclusion: This study demonstrates a highly accurate machine learning model for FSGS diagnosis. Immunohistochemical validation confirmed elevated expression of CD99 and OAZ2, offering valuable insights into FSGS pathogenesis and potential biomarkers for clinical application.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
局灶节段性肾小球硬化症诊断模型的建立:将机器学习与激活通路和临床验证相结合。
背景:局灶节段性肾小球硬化(FSGS)是一项重大的全球健康挑战,其发病率随着诊断技术的进步和慢性病患病率的增加而上升。本研究旨在通过整合机器学习方法来识别激活通路,并辅以强大的临床验证,从而提高FSGS的诊断准确性。方法:通过ComBat算法分析来自多个GEO队列的163例FSGS患者和42例活体供体的数据,以解决批次效应并确保基因表达谱的可比性。基因集富集分析(GSEA)确定了FSGS发病机制的关键信号通路。然后,我们通过将9种机器学习算法集成到101种组合中,开发了一个高度准确的诊断模型,在训练、验证和外部队列中获得了近乎完美的AUC值。该模型确定了6个基因作为FSGS的潜在生物标志物。此外,研究人员还探索了免疫细胞浸润模式,特别是涉及自然杀伤(NK)细胞的浸润模式,揭示了FSGS患者遗传与免疫反应之间复杂的相互作用。免疫组化分析证实了关键标志物CD99和OAZ2的表达,证实了NK细胞与FSGS之间的关联。结果:glmBoost+Ridge模型显示出卓越的诊断准确性,仅使用六个基因:BANF1, TUSC2, SMAD3, TGFB1, CD99和OAZ2, AUC就达到了0.998。预测分数的计算公式为:score = (0.3997×BANF1) + (0.5543×TUSC2) + (0.5279×SMAD3) + (0.4118×TGFB1) + (0.8665×CD99) + (0.5996×OAZ2)。免疫组化分析证实,与对照组相比,FSGS患者肾小球和小管间质组织中CD99和OAZ2的表达水平显著升高。结论:本研究为FSGS诊断提供了一个高精度的机器学习模型。免疫组织化学验证证实CD99和OAZ2的表达升高,为FSGS的发病机制和潜在的临床应用生物标志物提供了有价值的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
International Journal of General Medicine
International Journal of General Medicine Medicine-General Medicine
自引率
0.00%
发文量
1113
审稿时长
16 weeks
期刊介绍: The International Journal of General Medicine is an international, peer-reviewed, open access journal that focuses on general and internal medicine, pathogenesis, epidemiology, diagnosis, monitoring and treatment protocols. The journal is characterized by the rapid reporting of reviews, original research and clinical studies across all disease areas. A key focus of the journal is the elucidation of disease processes and management protocols resulting in improved outcomes for the patient. Patient perspectives such as satisfaction, quality of life, health literacy and communication and their role in developing new healthcare programs and optimizing clinical outcomes are major areas of interest for the journal. As of 1st April 2019, the International Journal of General Medicine will no longer consider meta-analyses for publication.
期刊最新文献
Refractive Errors and Ocular Biometric Parameters in Children with Autism Spectrum Disorder in Handan: A Cross-Sectional Study. Echocardiographic Risk Factors for Atrial Fibrillation Recurrence Following Radiofrequency Catheter Ablation: A Retrospective Analysis. Interleukin-Mediated Inflammatory Pathways and Nerve Injury in Lumbar Disc Herniation: A Narrative Review. Prognosis Impact of Betel Nut on Oral Squamous Cell Carcinoma: A Population-Based Study. Difference in Clinical Features and Risk Factors of Ischemic Stroke Between Young and Elderly Adults: A Retrospective Observation from an Island Population.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1