Machine learning-based identification and validation of immune-related biomarkers for early diagnosis and targeted therapy in diabetic retinopathy

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS ACS Applied Bio Materials Pub Date : 2024-10-18 DOI:10.1016/j.gene.2024.149015
Yulin Tao , Minqi Xiong , Yirui Peng , Lili Yao , Haibo Zhu , Qiong Zhou , Jun Ouyang
{"title":"Machine learning-based identification and validation of immune-related biomarkers for early diagnosis and targeted therapy in diabetic retinopathy","authors":"Yulin Tao ,&nbsp;Minqi Xiong ,&nbsp;Yirui Peng ,&nbsp;Lili Yao ,&nbsp;Haibo Zhu ,&nbsp;Qiong Zhou ,&nbsp;Jun Ouyang","doi":"10.1016/j.gene.2024.149015","DOIUrl":null,"url":null,"abstract":"<div><div>The early diagnosis of diabetic retinopathy (DR) is challenging, highlighting the urgent need to identify new biomarkers. Immune responses play a crucial role in DR, yet there are currently no reports of machine learning (ML) algorithms being utilized for the development of immune-related molecular markers in DR. Based on the datasets GSE102485 and GSE160306, differentially expressed genes (DEGs) were screened using Weighted Gene Co-expression Network Analysis (WGCNA). Five ML algorithms including Bayesian, Learning Vector Quantization (LVQ), Wrapper (Boruta), Random Forest (RF), and Logistic Regression were employed to select immune-related genes associated with DR (DR.Sig). Seven ML algorithms including Naive Bayes (NB), RF, Support Vector Machine (SVM), AdaBoost Classification Trees (AdaBoost), Boosted Logistic Regressions (LogitBoost), K-Nearest Neighbors (KNN), and Cancerclass were utilized to construct a predictive model for DR. The relationship between DR.Sig genes and immune cells was analyzed using single-sample Gene Set Enrichment Analysis (ssGSEA). Additionally, drug sensitivity prediction of DR.Sig genes and molecular docking were performed. Through the utilization of 5 ML algorithms, 6 immune-related biomarkers closely related to the occurrence of DR were identified, including FCGR2B, CSRP1, EDNRA, SDC2, TEK, and CIITA. The DR predictive model constructed based on these 6 DR.Sig genes using the Cancerclass algorithm demonstrated superior predictive performance compared to 4 previously published DR-related biomarkers. In vivo and in vitro experiments also provided strong validation of the expression of the 6 genes in DR. Positive correlations were observed between these genes and 22 types of immune cells. Molecular docking results revealed that CSRP1, EDNRA, and TEK exhibited the highest affinities with the small molecule compounds etoposide, FR-139317, and camptothecin, respectively. The models constructed based on various ML algorithms can effectively predict the occurrence of DR events and hold potential for targeted drug therapies, providing a basis for the early diagnosis and targeted treatment of DR.</div></div>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378111924008965","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0

Abstract

The early diagnosis of diabetic retinopathy (DR) is challenging, highlighting the urgent need to identify new biomarkers. Immune responses play a crucial role in DR, yet there are currently no reports of machine learning (ML) algorithms being utilized for the development of immune-related molecular markers in DR. Based on the datasets GSE102485 and GSE160306, differentially expressed genes (DEGs) were screened using Weighted Gene Co-expression Network Analysis (WGCNA). Five ML algorithms including Bayesian, Learning Vector Quantization (LVQ), Wrapper (Boruta), Random Forest (RF), and Logistic Regression were employed to select immune-related genes associated with DR (DR.Sig). Seven ML algorithms including Naive Bayes (NB), RF, Support Vector Machine (SVM), AdaBoost Classification Trees (AdaBoost), Boosted Logistic Regressions (LogitBoost), K-Nearest Neighbors (KNN), and Cancerclass were utilized to construct a predictive model for DR. The relationship between DR.Sig genes and immune cells was analyzed using single-sample Gene Set Enrichment Analysis (ssGSEA). Additionally, drug sensitivity prediction of DR.Sig genes and molecular docking were performed. Through the utilization of 5 ML algorithms, 6 immune-related biomarkers closely related to the occurrence of DR were identified, including FCGR2B, CSRP1, EDNRA, SDC2, TEK, and CIITA. The DR predictive model constructed based on these 6 DR.Sig genes using the Cancerclass algorithm demonstrated superior predictive performance compared to 4 previously published DR-related biomarkers. In vivo and in vitro experiments also provided strong validation of the expression of the 6 genes in DR. Positive correlations were observed between these genes and 22 types of immune cells. Molecular docking results revealed that CSRP1, EDNRA, and TEK exhibited the highest affinities with the small molecule compounds etoposide, FR-139317, and camptothecin, respectively. The models constructed based on various ML algorithms can effectively predict the occurrence of DR events and hold potential for targeted drug therapies, providing a basis for the early diagnosis and targeted treatment of DR.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于机器学习的糖尿病视网膜病变早期诊断和靶向治疗免疫相关生物标记物的识别与验证。
糖尿病视网膜病变(DR)的早期诊断具有挑战性,因此迫切需要确定新的生物标记物。免疫反应在糖尿病视网膜病变中起着至关重要的作用,但目前还没有关于利用机器学习(ML)算法开发糖尿病视网膜病变免疫相关分子标记物的报道。基于 GSE102485 和 GSE160306 数据集,利用加权基因共表达网络分析(WGCNA)筛选了差异表达基因(DEGs)。采用贝叶斯算法、学习矢量量化算法(LVQ)、Wrapper算法(Boruta)、随机森林算法(RF)和逻辑回归算法等五种 ML 算法筛选出与 DR 相关的免疫相关基因(DR.Sig)。利用 Naive Bayes (NB)、RF、支持向量机 (SVM)、AdaBoost 分类树 (AdaBoost)、Boosted Logistic Regressions (LogitBoost)、K-Nearest Neighbors (KNN) 和 Cancerclass 等七种 ML 算法构建了 DR 预测模型。利用单样本基因组富集分析(ssGSEA)分析了 DR.Sig 基因与免疫细胞之间的关系。此外,还对 DR.Sig 基因进行了药物敏感性预测和分子对接。通过使用 5 种 ML 算法,确定了 6 个与 DR 发生密切相关的免疫相关生物标志物,包括 FCGR2B、CSRP1、EDNRA、SDC2、TEK 和 CIITA。基于这6个DR.Sig基因使用Cancerclass算法构建的DR预测模型与之前公布的4个DR相关生物标记物相比,显示出更优越的预测性能。体内和体外实验也有力地验证了这 6 个基因在 DR 中的表达。在这些基因和 22 种免疫细胞之间观察到了正相关性。分子对接结果显示,CSRP1、EDNRA 和 TEK 分别与小分子化合物依托泊苷、FR-139317 和喜树碱具有最高的亲和力。基于多种 ML 算法构建的模型能有效预测 DR 事件的发生,并具有靶向药物治疗的潜力,为 DR 的早期诊断和靶向治疗提供了依据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
ACS Applied Bio Materials
ACS Applied Bio Materials Chemistry-Chemistry (all)
CiteScore
9.40
自引率
2.10%
发文量
464
期刊最新文献
A Systematic Review of Sleep Disturbance in Idiopathic Intracranial Hypertension. Advancing Patient Education in Idiopathic Intracranial Hypertension: The Promise of Large Language Models. Anti-Myelin-Associated Glycoprotein Neuropathy: Recent Developments. Approach to Managing the Initial Presentation of Multiple Sclerosis: A Worldwide Practice Survey. Association Between LACE+ Index Risk Category and 90-Day Mortality After Stroke.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1