Bioinformatic Insights and XGBoost Identify Shared Genetics in Chronic Obstructive Pulmonary Disease and Type 2 Diabetes

IF 1.9 4区 医学 Q3 RESPIRATORY SYSTEM Clinical Respiratory Journal Pub Date : 2025-03-05 DOI:10.1111/crj.70057
Qianqian Ji, Yaxian Meng, Xiaojie Han, Chao Yi, Xiaoliang Chen, Yiqiang Zhan
{"title":"Bioinformatic Insights and XGBoost Identify Shared Genetics in Chronic Obstructive Pulmonary Disease and Type 2 Diabetes","authors":"Qianqian Ji,&nbsp;Yaxian Meng,&nbsp;Xiaojie Han,&nbsp;Chao Yi,&nbsp;Xiaoliang Chen,&nbsp;Yiqiang Zhan","doi":"10.1111/crj.70057","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>The correlation between chronic obstructive pulmonary disease (COPD) and Type 2 diabetes mellitus (T2DM) has long been recognized, but their shared molecular underpinnings remain elusive. This study aims to uncover common genetic markers and pathways in COPD and T2DM, providing insights into their molecular crosstalk.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>Utilizing the Gene Expression Omnibus (GEO) database, we analyzed gene expression datasets from six COPD and five T2DM studies. A multifaceted bioinformatics approach, encompassing the limma R package, unified matrix analysis, and weighted gene co-expression network analysis (WGCNA), was deployed to identify differentially expressed genes (DEGs) and hub genes. Functional enrichment and protein–protein interaction (PPI) analyses were conducted, followed by cross-species validation in <i>Mus musculus</i> models. Machine learning techniques, including random forest and LASSO regression, were applied for further validation, culminating in the development of a prognostic model using XGBoost.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>Our analysis revealed shared DEGs such as <i>KIF1C</i>, <i>CSTA</i>, <i>GMNN</i>, and <i>PHGDH</i> in both COPD and T2DM. Cross-species comparison identified common genes including <i>PON1</i> and <i>CD14</i>, exhibiting varying expression patterns. The random forest and LASSO regression identified six critical genes, with our XGBoost model demonstrating significant predictive accuracy (AUC = 0.996 for COPD).</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>This study identifies key genetic markers shared between COPD and T2DM, providing new insights into their molecular pathways. Our XGBoost model exhibited high predictive accuracy for COPD, highlighting the potential utility of these markers. These findings offer promising biomarkers for early detection and enhance our understanding of the diseases' interplay. Further validation in larger cohorts is recommended.</p>\n </section>\n </div>","PeriodicalId":55247,"journal":{"name":"Clinical Respiratory Journal","volume":"19 3","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/crj.70057","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Respiratory Journal","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/crj.70057","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RESPIRATORY SYSTEM","Score":null,"Total":0}
引用次数: 0

Abstract

Background

The correlation between chronic obstructive pulmonary disease (COPD) and Type 2 diabetes mellitus (T2DM) has long been recognized, but their shared molecular underpinnings remain elusive. This study aims to uncover common genetic markers and pathways in COPD and T2DM, providing insights into their molecular crosstalk.

Methods

Utilizing the Gene Expression Omnibus (GEO) database, we analyzed gene expression datasets from six COPD and five T2DM studies. A multifaceted bioinformatics approach, encompassing the limma R package, unified matrix analysis, and weighted gene co-expression network analysis (WGCNA), was deployed to identify differentially expressed genes (DEGs) and hub genes. Functional enrichment and protein–protein interaction (PPI) analyses were conducted, followed by cross-species validation in Mus musculus models. Machine learning techniques, including random forest and LASSO regression, were applied for further validation, culminating in the development of a prognostic model using XGBoost.

Results

Our analysis revealed shared DEGs such as KIF1C, CSTA, GMNN, and PHGDH in both COPD and T2DM. Cross-species comparison identified common genes including PON1 and CD14, exhibiting varying expression patterns. The random forest and LASSO regression identified six critical genes, with our XGBoost model demonstrating significant predictive accuracy (AUC = 0.996 for COPD).

Conclusions

This study identifies key genetic markers shared between COPD and T2DM, providing new insights into their molecular pathways. Our XGBoost model exhibited high predictive accuracy for COPD, highlighting the potential utility of these markers. These findings offer promising biomarkers for early detection and enhance our understanding of the diseases' interplay. Further validation in larger cohorts is recommended.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
求助全文
约1分钟内获得全文 去求助
来源期刊
Clinical Respiratory Journal
Clinical Respiratory Journal 医学-呼吸系统
CiteScore
3.70
自引率
0.00%
发文量
104
审稿时长
>12 weeks
期刊介绍: Overview Effective with the 2016 volume, this journal will be published in an online-only format. Aims and Scope The Clinical Respiratory Journal (CRJ) provides a forum for clinical research in all areas of respiratory medicine from clinical lung disease to basic research relevant to the clinic. We publish original research, review articles, case studies, editorials and book reviews in all areas of clinical lung disease including: Asthma Allergy COPD Non-invasive ventilation Sleep related breathing disorders Interstitial lung diseases Lung cancer Clinical genetics Rhinitis Airway and lung infection Epidemiology Pediatrics CRJ provides a fast-track service for selected Phase II and Phase III trial studies. Keywords Clinical Respiratory Journal, respiratory, pulmonary, medicine, clinical, lung disease, Abstracting and Indexing Information Academic Search (EBSCO Publishing) Academic Search Alumni Edition (EBSCO Publishing) Embase (Elsevier) Health & Medical Collection (ProQuest) Health Research Premium Collection (ProQuest) HEED: Health Economic Evaluations Database (Wiley-Blackwell) Hospital Premium Collection (ProQuest) Journal Citation Reports/Science Edition (Clarivate Analytics) MEDLINE/PubMed (NLM) ProQuest Central (ProQuest) Science Citation Index Expanded (Clarivate Analytics) SCOPUS (Elsevier)
期刊最新文献
Bacterial Colonization of Silver-Additive Ventilator Circuit in Patients Receiving Mechanical Ventilation: A Randomized Controlled Trial Bioinformatic Insights and XGBoost Identify Shared Genetics in Chronic Obstructive Pulmonary Disease and Type 2 Diabetes A Nomogram for Predicting Pulmonary Embolism in Silicosis Patients Changes in Retinal Nerve Fiber Layer Thickness in Patients With Chronic Obstructive Pulmonary Disease: A Systematic Review and Meta-Analysis Successful Rescue of Massive Hemoptysis Caused by Vascular-Bronchial Fistula
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1