{"title":"使用WGCNA和机器学习算法识别特发性扩张型心肌病引起的心力衰竭相关生物标志物","authors":"Mengyi Sun, Linping Li","doi":"10.1155/2023/2250772","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The genetic factors and pathogenesis of idiopathic dilated cardiomyopathy-induced heart failure (IDCM-HF) have not been understood thoroughly; there is a lack of specific diagnostic markers and treatment methods for the disease. Hence, we aimed to identify the mechanisms of action at the molecular level and potential molecular markers for this disease.</p><p><strong>Methods: </strong>Gene expression profiles of IDCM-HF and non-heart failure (NF) specimens were acquired from the database of Gene Expression Omnibus (GEO). We then identified the differentially expressed genes (DEGs) and analyzed their functions and related pathways by using \"Metascape\". Weighted gene co-expression network analysis (WGCNA) was utilized to search for key module genes. Candidate genes were identified by intersecting the key module genes identified via WGCNA with DEGs and further screened via the support vector machine-recursive feature elimination (SVM-RFE) method and the least absolute shrinkage and selection operator (LASSO) algorithm. At last, the biomarkers were validated and evaluated the diagnostic efficacy by the area under curve (AUC) value and further confirmed the differential expression in the IDCM-HF and NF groups using an external database.</p><p><strong>Results: </strong>We detected 490 genes exhibiting differential expression between IDCM-HF and NF specimens from the GSE57338 dataset, with most of them being concentrated in the extracellular matrix (ECM) of cells related to biological processes and pathways. After screening, 13 candidate genes were identified. Aquaporin 3 (AQP3) and cytochrome P450 2J2 (CYP2J2) showed high diagnostic efficacy in the GSE57338 and GSE6406 datasets, respectively. In comparison to the NF group, AQP3 was significantly down-regulated in the IDCM-HF group, while CYP2J2 was significantly up-regulated.</p><p><strong>Conclusion: </strong>As far as we know, this is the first study that combines WGCNA and machine learning algorithms to screen for potential biomarkers of IDCM-HF. Our findings suggest that AQP3 and CYP2J2 could be used as novel diagnostic markers and treatment targets of IDCM-HF.</p>","PeriodicalId":13988,"journal":{"name":"International Journal of Genomics","volume":"2023 ","pages":"2250772"},"PeriodicalIF":2.6000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10154102/pdf/","citationCount":"2","resultStr":"{\"title\":\"Identification of Biomarkers Associated with Heart Failure Caused by Idiopathic Dilated Cardiomyopathy Using WGCNA and Machine Learning Algorithms.\",\"authors\":\"Mengyi Sun, Linping Li\",\"doi\":\"10.1155/2023/2250772\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The genetic factors and pathogenesis of idiopathic dilated cardiomyopathy-induced heart failure (IDCM-HF) have not been understood thoroughly; there is a lack of specific diagnostic markers and treatment methods for the disease. Hence, we aimed to identify the mechanisms of action at the molecular level and potential molecular markers for this disease.</p><p><strong>Methods: </strong>Gene expression profiles of IDCM-HF and non-heart failure (NF) specimens were acquired from the database of Gene Expression Omnibus (GEO). We then identified the differentially expressed genes (DEGs) and analyzed their functions and related pathways by using \\\"Metascape\\\". Weighted gene co-expression network analysis (WGCNA) was utilized to search for key module genes. Candidate genes were identified by intersecting the key module genes identified via WGCNA with DEGs and further screened via the support vector machine-recursive feature elimination (SVM-RFE) method and the least absolute shrinkage and selection operator (LASSO) algorithm. At last, the biomarkers were validated and evaluated the diagnostic efficacy by the area under curve (AUC) value and further confirmed the differential expression in the IDCM-HF and NF groups using an external database.</p><p><strong>Results: </strong>We detected 490 genes exhibiting differential expression between IDCM-HF and NF specimens from the GSE57338 dataset, with most of them being concentrated in the extracellular matrix (ECM) of cells related to biological processes and pathways. After screening, 13 candidate genes were identified. Aquaporin 3 (AQP3) and cytochrome P450 2J2 (CYP2J2) showed high diagnostic efficacy in the GSE57338 and GSE6406 datasets, respectively. In comparison to the NF group, AQP3 was significantly down-regulated in the IDCM-HF group, while CYP2J2 was significantly up-regulated.</p><p><strong>Conclusion: </strong>As far as we know, this is the first study that combines WGCNA and machine learning algorithms to screen for potential biomarkers of IDCM-HF. Our findings suggest that AQP3 and CYP2J2 could be used as novel diagnostic markers and treatment targets of IDCM-HF.</p>\",\"PeriodicalId\":13988,\"journal\":{\"name\":\"International Journal of Genomics\",\"volume\":\"2023 \",\"pages\":\"2250772\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10154102/pdf/\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Genomics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1155/2023/2250772\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1155/2023/2250772","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 2
摘要
背景:特发性扩张型心肌病诱发心力衰竭(IDCM-HF)的遗传因素和发病机制尚未完全了解;目前缺乏特异性的诊断标记物和治疗方法。因此,我们的目的是在分子水平上确定作用机制和潜在的分子标记。方法:从Gene expression Omnibus (GEO)数据库中获取IDCM-HF和非心力衰竭(NF)标本的基因表达谱。利用“meta - scape”软件对差异表达基因(differential expression genes, deg)进行鉴定,分析其功能和相关途径。采用加权基因共表达网络分析(Weighted gene co-expression network analysis, WGCNA)搜索关键模块基因。将WGCNA识别出的关键模块基因与deg相交,确定候选基因,并通过支持向量机递归特征消除(SVM-RFE)方法和最小绝对收缩选择算子(LASSO)算法进行筛选。最后,通过曲线下面积(area under curve, AUC)值对生物标志物进行验证和诊断效能评估,并利用外部数据库进一步确认IDCM-HF组和NF组的差异表达。结果:我们从GSE57338数据集中检测到490个基因在IDCM-HF和NF样本中表现出差异表达,其中大多数基因集中在与生物过程和途径相关的细胞外基质(ECM)中。经筛选,共鉴定出13个候选基因。水通道蛋白3 (AQP3)和细胞色素P450 2J2 (CYP2J2)分别在GSE57338和GSE6406数据集中显示出较高的诊断效能。与NF组比较,IDCM-HF组AQP3显著下调,CYP2J2显著上调。结论:据我们所知,这是第一个结合WGCNA和机器学习算法筛选IDCM-HF潜在生物标志物的研究。提示AQP3和CYP2J2可作为IDCM-HF新的诊断标记物和治疗靶点。
Identification of Biomarkers Associated with Heart Failure Caused by Idiopathic Dilated Cardiomyopathy Using WGCNA and Machine Learning Algorithms.
Background: The genetic factors and pathogenesis of idiopathic dilated cardiomyopathy-induced heart failure (IDCM-HF) have not been understood thoroughly; there is a lack of specific diagnostic markers and treatment methods for the disease. Hence, we aimed to identify the mechanisms of action at the molecular level and potential molecular markers for this disease.
Methods: Gene expression profiles of IDCM-HF and non-heart failure (NF) specimens were acquired from the database of Gene Expression Omnibus (GEO). We then identified the differentially expressed genes (DEGs) and analyzed their functions and related pathways by using "Metascape". Weighted gene co-expression network analysis (WGCNA) was utilized to search for key module genes. Candidate genes were identified by intersecting the key module genes identified via WGCNA with DEGs and further screened via the support vector machine-recursive feature elimination (SVM-RFE) method and the least absolute shrinkage and selection operator (LASSO) algorithm. At last, the biomarkers were validated and evaluated the diagnostic efficacy by the area under curve (AUC) value and further confirmed the differential expression in the IDCM-HF and NF groups using an external database.
Results: We detected 490 genes exhibiting differential expression between IDCM-HF and NF specimens from the GSE57338 dataset, with most of them being concentrated in the extracellular matrix (ECM) of cells related to biological processes and pathways. After screening, 13 candidate genes were identified. Aquaporin 3 (AQP3) and cytochrome P450 2J2 (CYP2J2) showed high diagnostic efficacy in the GSE57338 and GSE6406 datasets, respectively. In comparison to the NF group, AQP3 was significantly down-regulated in the IDCM-HF group, while CYP2J2 was significantly up-regulated.
Conclusion: As far as we know, this is the first study that combines WGCNA and machine learning algorithms to screen for potential biomarkers of IDCM-HF. Our findings suggest that AQP3 and CYP2J2 could be used as novel diagnostic markers and treatment targets of IDCM-HF.
期刊介绍:
International Journal of Genomics is a peer-reviewed, Open Access journal that publishes research articles as well as review articles in all areas of genome-scale analysis. Topics covered by the journal include, but are not limited to: bioinformatics, clinical genomics, disease genomics, epigenomics, evolutionary genomics, functional genomics, genome engineering, and synthetic genomics.