{"title":"Development of a Diagnostic Model for Focal Segmental Glomerulosclerosis: Integrating Machine Learning on Activated Pathways and Clinical Validation.","authors":"Yating Ge, Xueqi Liu, Jinlian Shu, Xiao Jiang, Yonggui Wu","doi":"10.2147/IJGM.S498407","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Focal segmental glomerulosclerosis (FSGS) represents a major global health challenge, with its incidence rising in parallel with advances in diagnostic techniques and the growing prevalence of chronic diseases. This study seeks to enhance the diagnostic accuracy of FSGS by integrating machine learning approaches to identify activated pathways, complemented by robust clinical validation.</p><p><strong>Methods: </strong>We analyzed data from 163 FSGS patients and 42 living donors across multiple GEO cohorts via the ComBat algorithm to address batch effects and ensure the comparability of gene expression profiles. Gene set enrichment analysis (GSEA) identified key signaling pathways involved in FSGS pathogenesis. We then developed a highly accurate diagnostic model by integrating nine machine learning algorithms into 101 combinations, achieving near-perfect AUC values across training, validation, and external cohorts. The model identified six genes as potential biomarkers for FSGS. Additionally, immune cell infiltration patterns, particularly those involving natural killer (NK) cells, were explored, revealing the complex interplay between genetics and the immune response in FSGS patients. Immunohistochemical analysis validated the expression of the key markers CD99 and OAZ2 and confirmed the association between NK cells and FSGS.</p><p><strong>Results: </strong>The glmBoost+Ridge model exhibited exceptional diagnostic accuracy, achieving an AUC of 0.998 using just six genes: BANF1, TUSC2, SMAD3, TGFB1, CD99, and OAZ2. The prediction score was calculated as follows: score = (0.3997×BANF1) + (0.5543×TUSC2) + (0.5279×SMAD3) + (0.4118×TGFB1) + (0.8665×CD99) + (0.5996×OAZ2). Immunohistochemical analysis confirmed significantly elevated expression levels of CD99 and OAZ2 in the glomeruli and tubulointerstitial tissues of FSGS patients compared with those of controls.</p><p><strong>Conclusion: </strong>This study demonstrates a highly accurate machine learning model for FSGS diagnosis. Immunohistochemical validation confirmed elevated expression of CD99 and OAZ2, offering valuable insights into FSGS pathogenesis and potential biomarkers for clinical application.</p>","PeriodicalId":14131,"journal":{"name":"International Journal of General Medicine","volume":"18 ","pages":"1127-1142"},"PeriodicalIF":2.1000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11872063/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of General Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/IJGM.S498407","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Focal segmental glomerulosclerosis (FSGS) represents a major global health challenge, with its incidence rising in parallel with advances in diagnostic techniques and the growing prevalence of chronic diseases. This study seeks to enhance the diagnostic accuracy of FSGS by integrating machine learning approaches to identify activated pathways, complemented by robust clinical validation.
Methods: We analyzed data from 163 FSGS patients and 42 living donors across multiple GEO cohorts via the ComBat algorithm to address batch effects and ensure the comparability of gene expression profiles. Gene set enrichment analysis (GSEA) identified key signaling pathways involved in FSGS pathogenesis. We then developed a highly accurate diagnostic model by integrating nine machine learning algorithms into 101 combinations, achieving near-perfect AUC values across training, validation, and external cohorts. The model identified six genes as potential biomarkers for FSGS. Additionally, immune cell infiltration patterns, particularly those involving natural killer (NK) cells, were explored, revealing the complex interplay between genetics and the immune response in FSGS patients. Immunohistochemical analysis validated the expression of the key markers CD99 and OAZ2 and confirmed the association between NK cells and FSGS.
Results: The glmBoost+Ridge model exhibited exceptional diagnostic accuracy, achieving an AUC of 0.998 using just six genes: BANF1, TUSC2, SMAD3, TGFB1, CD99, and OAZ2. The prediction score was calculated as follows: score = (0.3997×BANF1) + (0.5543×TUSC2) + (0.5279×SMAD3) + (0.4118×TGFB1) + (0.8665×CD99) + (0.5996×OAZ2). Immunohistochemical analysis confirmed significantly elevated expression levels of CD99 and OAZ2 in the glomeruli and tubulointerstitial tissues of FSGS patients compared with those of controls.
Conclusion: This study demonstrates a highly accurate machine learning model for FSGS diagnosis. Immunohistochemical validation confirmed elevated expression of CD99 and OAZ2, offering valuable insights into FSGS pathogenesis and potential biomarkers for clinical application.
期刊介绍:
The International Journal of General Medicine is an international, peer-reviewed, open access journal that focuses on general and internal medicine, pathogenesis, epidemiology, diagnosis, monitoring and treatment protocols. The journal is characterized by the rapid reporting of reviews, original research and clinical studies across all disease areas.
A key focus of the journal is the elucidation of disease processes and management protocols resulting in improved outcomes for the patient. Patient perspectives such as satisfaction, quality of life, health literacy and communication and their role in developing new healthcare programs and optimizing clinical outcomes are major areas of interest for the journal.
As of 1st April 2019, the International Journal of General Medicine will no longer consider meta-analyses for publication.