Identification of Potential Key Genes for the Comorbidity of Myasthenia Gravis With Thymoma by Integrated Bioinformatics Analysis and Machine Learning.

IF 2.3 Q3 BIOCHEMICAL RESEARCH METHODS Bioinformatics and Biology Insights Pub Date : 2024-09-26 eCollection Date: 2024-01-01 DOI:10.1177/11779322241281652
Hui Liu, Geyu Liu, Rongjing Guo, Shuang Li, Ting Chang
{"title":"Identification of Potential Key Genes for the Comorbidity of Myasthenia Gravis With Thymoma by Integrated Bioinformatics Analysis and Machine Learning.","authors":"Hui Liu, Geyu Liu, Rongjing Guo, Shuang Li, Ting Chang","doi":"10.1177/11779322241281652","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Thymoma is a key risk factor for myasthenia gravis (MG). The purpose of our study was to investigate the potential key genes responsible for MG patients with thymoma.</p><p><strong>Methods: </strong>We obtained MG and thymoma dataset from GEO database. Differentially expressed genes (DEGs) were determined and functional enrichment analyses were conducted by R packages. Weighted gene co-expression network analysis (WGCNA) was used to screen out the crucial module genes related to thymoma. Candidate genes were obtained by integrating DEGs of MG and module genes. Subsequently, we identified several candidate key genes by machine learning for diagnosing MG patients with thymoma. The nomogram and receiver operating characteristics (ROC) curves were applied to assess the diagnostic value of candidate key genes. Finally, we investigated the infiltration of immunocytes and analyzed the relationship among key genes and immune cells.</p><p><strong>Results: </strong>We obtained 337 DEGs in MG dataset and 2150 DEGs in thymoma dataset. Biological function analyses indicated that DEGs of MG and thymoma were enriched in many common pathways. Black module (containing 207 genes) analyzed by WGCNA was considered as the most correlated with thymoma. Then, 12 candidate genes were identified by intersecting with MG DEGs and thymoma module genes as potential causes of thymoma-associated MG pathogenesis. Furthermore, five candidate key genes (<i>JAM3</i>, <i>MS4A4A</i>, <i>MS4A6A</i>, <i>EGR1</i>, and <i>FOS</i>) were screened out through integrating least absolute shrinkage and selection operator (LASSO) regression and Random forest (RF). The nomogram and ROC curves (area under the curve from 0.833 to 0.929) suggested all five candidate key genes had high diagnostic values. Finally, we found that five key genes and immune cell infiltrations presented varying degrees of correlation.</p><p><strong>Conclusions: </strong>Our study identified five key potential pathogenic genes that predisposed thymoma to the development of MG, which provided potential diagnostic biomarkers and promising therapeutic targets for MG patients with thymoma.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":"18 ","pages":"11779322241281652"},"PeriodicalIF":2.3000,"publicationDate":"2024-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11437577/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics and Biology Insights","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/11779322241281652","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Thymoma is a key risk factor for myasthenia gravis (MG). The purpose of our study was to investigate the potential key genes responsible for MG patients with thymoma.

Methods: We obtained MG and thymoma dataset from GEO database. Differentially expressed genes (DEGs) were determined and functional enrichment analyses were conducted by R packages. Weighted gene co-expression network analysis (WGCNA) was used to screen out the crucial module genes related to thymoma. Candidate genes were obtained by integrating DEGs of MG and module genes. Subsequently, we identified several candidate key genes by machine learning for diagnosing MG patients with thymoma. The nomogram and receiver operating characteristics (ROC) curves were applied to assess the diagnostic value of candidate key genes. Finally, we investigated the infiltration of immunocytes and analyzed the relationship among key genes and immune cells.

Results: We obtained 337 DEGs in MG dataset and 2150 DEGs in thymoma dataset. Biological function analyses indicated that DEGs of MG and thymoma were enriched in many common pathways. Black module (containing 207 genes) analyzed by WGCNA was considered as the most correlated with thymoma. Then, 12 candidate genes were identified by intersecting with MG DEGs and thymoma module genes as potential causes of thymoma-associated MG pathogenesis. Furthermore, five candidate key genes (JAM3, MS4A4A, MS4A6A, EGR1, and FOS) were screened out through integrating least absolute shrinkage and selection operator (LASSO) regression and Random forest (RF). The nomogram and ROC curves (area under the curve from 0.833 to 0.929) suggested all five candidate key genes had high diagnostic values. Finally, we found that five key genes and immune cell infiltrations presented varying degrees of correlation.

Conclusions: Our study identified five key potential pathogenic genes that predisposed thymoma to the development of MG, which provided potential diagnostic biomarkers and promising therapeutic targets for MG patients with thymoma.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过综合生物信息学分析和机器学习鉴定胸腺瘤合并肌无力症的潜在关键基因
背景:胸腺瘤是导致重症肌无力(MG)的一个关键风险因素。我们的研究旨在调查可能导致胸腺瘤患者的关键基因:我们从 GEO 数据库中获得了 MG 和胸腺瘤数据集。方法:我们从 GEO 数据库中获得了 MG 和胸腺瘤数据集,利用 R 软件包确定了差异表达基因(DEGs)并进行了功能富集分析。利用加权基因共表达网络分析(WGCNA)筛选出与胸腺瘤相关的关键模块基因。候选基因是通过整合 MG 和模块基因的 DEGs 获得的。随后,我们通过机器学习确定了几个候选关键基因,用于诊断患有胸腺瘤的 MG 患者。应用提名图和接收者操作特征曲线(ROC)来评估候选关键基因的诊断价值。最后,我们研究了免疫细胞的浸润情况,并分析了关键基因与免疫细胞之间的关系:我们在 MG 数据集中获得了 337 个 DEGs,在胸腺瘤数据集中获得了 2150 个 DEGs。生物功能分析表明,MG 和胸腺瘤的 DEGs 富集在许多共同的通路中。WGCNA分析的黑色模块(包含207个基因)被认为与胸腺瘤的相关性最高。然后,通过与 MG DEGs 和胸腺瘤模块基因的交叉,确定了 12 个候选基因,作为胸腺瘤相关 MG 发病的潜在原因。此外,通过整合最小绝对收缩和选择算子(LASSO)回归和随机森林(RF)筛选出了五个候选关键基因(JAM3、MS4A4A、MS4A6A、EGR1和FOS)。提名图和 ROC 曲线(曲线下面积从 0.833 到 0.929)表明,所有五个候选关键基因都具有很高的诊断价值。最后,我们发现五个关键基因与免疫细胞浸润呈现出不同程度的相关性:我们的研究发现了胸腺瘤易发展为MG的五个关键潜在致病基因,这为胸腺瘤患者提供了潜在的诊断生物标志物和有希望的治疗靶点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Bioinformatics and Biology Insights
Bioinformatics and Biology Insights BIOCHEMICAL RESEARCH METHODS-
CiteScore
6.80
自引率
1.70%
发文量
36
审稿时长
8 weeks
期刊介绍: Bioinformatics and Biology Insights is an open access, peer-reviewed journal that considers articles on bioinformatics methods and their applications which must pertain to biological insights. All papers should be easily amenable to biologists and as such help bridge the gap between theories and applications.
期刊最新文献
Cliotide U1, a Novel Antimicrobial Peptide Isolated From Urtica Dioica Leaves. Resistify: A Novel NLR Classifier That Reveals Helitron-Associated NLR Expansion in Solanaceae. Advancing Regulatory Genomics With Machine Learning. Bioinformatic Annotation of Transposon DNA Processing Genes on the Long-Read Genome Assembly of Caenorhabditis elegans. Comprehensive Analysis of CRISPR-Cas Systems and Their Influence on Antibiotic Resistance in Salmonella enterica Strains.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1