A diagnostic model for sepsis using an integrated machine learning framework approach and its therapeutic drug discovery.

IF 3 3区 医学 Q2 INFECTIOUS DISEASES BMC Infectious Diseases Pub Date : 2025-02-14 DOI:10.1186/s12879-025-10616-z
Wuping Zhang, Hanping Shi, Jie Peng
{"title":"A diagnostic model for sepsis using an integrated machine learning framework approach and its therapeutic drug discovery.","authors":"Wuping Zhang, Hanping Shi, Jie Peng","doi":"10.1186/s12879-025-10616-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Sepsis remains a life-threatening condition in intensive care units (ICU) with high morbidity and mortality rates. Some biomarkers commonly used in clinic do not have the characteristics of rapid and specific growth and rapid decline after effective treatment. Machine learning has shown great potential in early diagnosis, subtype analysis, accurate treatment and prognosis evaluation of sepsis.</p><p><strong>Methods: </strong>Gene expression matrices from GSE13904 and GSE26440 were combined into a training model after quality control and standardization. Then, the intersection genes were obtained by crossing the screened differentially expressed genes (DEGs) and the module genes with the strongest correlation obtained by WGCNA analysis. 113 combined machine learning algorithms to build a diagnosis model. Then the CIBERSORT algorithm is used to analyze the relationship between the change of core gene expression and immune response in sepsis. Construct nomogram, DCA and CIC to further verify the reliability of the diagnosis model. The potential molecular compounds interacting with key genes were searched from the Traditional Chinese Medicine Active Compound Library (TCMACL).</p><p><strong>Results: </strong>We screened 405 DEGs, including 334 up-regulated and 71 down-regulated genes. The 308 potential genes were obtained by intersection of MEturquoise module genes in WGCNA analysis and DEGs for subsequent machine learning analysis. GO and KEGG enrichment analysis showed that sepsis was mainly related to immune response and bacterial infection. Then 113 combined machine learning algorithms are applied to construct a diagnosis model to screen 22 hub genes. Four four key genes (CD177, GNLY, ANKRD22, and IFIT1) are obtained through further analysis of PPI network constructed by 22 hub genes. Subsequently, the diagnostic model is proved to have good predictive value by nomogram, DCA and CIC. Finally, molecular compounds (Dieckol, Grosvenorine and Tellimagrandin II) were screened out as potential drugs.</p><p><strong>Conclusion: </strong>113 combinated machine learning algorithms screened out four key genes that can distinguish sepsis patients. At the same time, potential therapeutic molecular compounds interacting with key genes genes were screened out by molecular docking.</p>","PeriodicalId":8981,"journal":{"name":"BMC Infectious Diseases","volume":"25 1","pages":"219"},"PeriodicalIF":3.0000,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11827343/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Infectious Diseases","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12879-025-10616-z","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INFECTIOUS DISEASES","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Sepsis remains a life-threatening condition in intensive care units (ICU) with high morbidity and mortality rates. Some biomarkers commonly used in clinic do not have the characteristics of rapid and specific growth and rapid decline after effective treatment. Machine learning has shown great potential in early diagnosis, subtype analysis, accurate treatment and prognosis evaluation of sepsis.

Methods: Gene expression matrices from GSE13904 and GSE26440 were combined into a training model after quality control and standardization. Then, the intersection genes were obtained by crossing the screened differentially expressed genes (DEGs) and the module genes with the strongest correlation obtained by WGCNA analysis. 113 combined machine learning algorithms to build a diagnosis model. Then the CIBERSORT algorithm is used to analyze the relationship between the change of core gene expression and immune response in sepsis. Construct nomogram, DCA and CIC to further verify the reliability of the diagnosis model. The potential molecular compounds interacting with key genes were searched from the Traditional Chinese Medicine Active Compound Library (TCMACL).

Results: We screened 405 DEGs, including 334 up-regulated and 71 down-regulated genes. The 308 potential genes were obtained by intersection of MEturquoise module genes in WGCNA analysis and DEGs for subsequent machine learning analysis. GO and KEGG enrichment analysis showed that sepsis was mainly related to immune response and bacterial infection. Then 113 combined machine learning algorithms are applied to construct a diagnosis model to screen 22 hub genes. Four four key genes (CD177, GNLY, ANKRD22, and IFIT1) are obtained through further analysis of PPI network constructed by 22 hub genes. Subsequently, the diagnostic model is proved to have good predictive value by nomogram, DCA and CIC. Finally, molecular compounds (Dieckol, Grosvenorine and Tellimagrandin II) were screened out as potential drugs.

Conclusion: 113 combinated machine learning algorithms screened out four key genes that can distinguish sepsis patients. At the same time, potential therapeutic molecular compounds interacting with key genes genes were screened out by molecular docking.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用集成机器学习框架方法的败血症诊断模型及其治疗药物发现。
背景:脓毒症仍然是重症监护病房(ICU)中一种危及生命的疾病,具有很高的发病率和死亡率。临床上常用的一些生物标志物在有效治疗后不具有快速特异性生长和快速下降的特点。机器学习在脓毒症的早期诊断、亚型分析、准确治疗和预后评估等方面显示出巨大的潜力。方法:将GSE13904和GSE26440基因表达矩阵经质量控制和标准化后合并为训练模型。然后,将筛选的差异表达基因(deg)与WGCNA分析得到的相关性最强的模块基因进行杂交,得到交叉基因。113结合机器学习算法构建诊断模型。然后利用CIBERSORT算法分析败血症中核心基因表达变化与免疫反应的关系。构建nomogram、DCA和CIC,进一步验证诊断模型的可靠性。从中药活性化合物文库(TCMACL)中寻找可能与关键基因相互作用的分子化合物。结果:共筛选到405个基因,其中上调基因334个,下调基因71个。将WGCNA分析中的MEturquoise模块基因与DEGs交叉得到308个潜在基因,用于后续的机器学习分析。GO和KEGG富集分析显示脓毒症主要与免疫反应和细菌感染有关。然后应用113种组合机器学习算法构建诊断模型,筛选22个轮毂基因。通过对22个枢纽基因构建的PPI网络进行进一步分析,得到4个关键基因(CD177、GNLY、ANKRD22、IFIT1)。随后,通过nomogram、DCA和CIC分析,证明了该诊断模型具有较好的预测价值。最后筛选出分子化合物(Dieckol、Grosvenorine和Tellimagrandin II)作为潜在药物。结论:113种联合机器学习算法筛选出4个能够区分脓毒症患者的关键基因。同时,通过分子对接筛选出与关键基因相互作用的潜在治疗性分子化合物。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
BMC Infectious Diseases
BMC Infectious Diseases 医学-传染病学
CiteScore
6.50
自引率
0.00%
发文量
860
审稿时长
3.3 months
期刊介绍: BMC Infectious Diseases is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of infectious and sexually transmitted diseases in humans, as well as related molecular genetics, pathophysiology, and epidemiology.
期刊最新文献
Viral respiratory infections in the pediatric intensive care unit: a comparative study of clinical outcomes in Syrian refugees and Turkish children. Correction: Retrospective analysis of sexually transmitted infections among people living with HIV and pre-exposure prophylaxis users in Spain. Modeling the COVID-19 epidemic in kurdistan province using the SEIR model and time series analysis of hospitalized patients during the first wave. Incidence of Post-cesarean wound infections, bacterial profiles, resistant patterns, and associated costs among women delivering at public hospitals in Eastern Ethiopia: multi center study. Group-based trajectory modeling of blood pressure reactivity and mortality in septic patients: a retrospective study of the MIMIC-IV database.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1