Bioinformatics analysis of diagnostic biomarkers for Alzheimer's disease in peripheral blood based on sex differences and support vector machine algorithm.
{"title":"Bioinformatics analysis of diagnostic biomarkers for Alzheimer's disease in peripheral blood based on sex differences and support vector machine algorithm.","authors":"Wencan Ji, Ke An, Canjun Wang, Shaohua Wang","doi":"10.1186/s41065-022-00252-x","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The prevalence of Alzheimer's disease (AD) varies based on gender. Due to the lack of early stage biomarkers, most of them are diagnosed at the terminal stage. This study aimed to explore sex-specific signaling pathways and identify diagnostic biomarkers of AD.</p><p><strong>Methods: </strong>Microarray dataset for blood was obtained from the Gene Expression Omnibus (GEO) database of GSE63060 to conduct differentially expressed genes (DEGs) analysis by R software limma. Gene Ontology (GO) analysis, Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis and Gene set enrichment analysis (GSEA) were conducted. Immune checkpoint gene expression was compared between females and males. Using CytoHubba, we identified hub genes in a protein-protein interaction network (PPI). Then, we evaluated their distinct effectiveness using unsupervised hierarchical clustering. Support vector machine (SVM) and ten-fold cross-validation were used to further verify these biomarkers. Lastly, we confirmed our findings by using another independent dataset.</p><p><strong>Results: </strong>A total of 37 female-specific DEGs and 27 male-specific DEGs were identified from GSE63060 datasets. Analyses of enrichment showed that female-specific DEGs primarily focused on energy metabolism, while male-specific DEGs mostly involved in immune regulation. Three immune-checkpoint-relevant genes dysregulated in males. In females, however, these eight genes were not differentially expressed. SNRPG, RPS27A, COX7A2, ATP5PO, LSM3, COX7C, PFDN5, HINT1, PSMA6, RPS3A and RPL31 were regarded as hub genes for females, while SNRPG, RPL31, COX7C, RPS27A, RPL35A, RPS3A, RPS20 and PFDN5 were regarded as hub genes for males. Thirteen hub genes mentioned above was significantly lower in both AD and mild cognitive impairment (MCI). The diagnostic model of 15-marker panel (13 hub genes with sex and age) was developed. Both the training dataset and the independent validation dataset have area under the curve (AUC) with a high value (0.919, 95%CI 0.901-0.929 and 0.803, 95%CI 0.789-0.826). Based on GSEA for hub genes, they were associated with some aspects of AD pathogenesis.</p><p><strong>Conclusion: </strong>DEGs in males and females contribute differently to AD pathogenesis. Algorithms combining blood-based biomarkers may improve AD diagnostic accuracy, but large validation studies are needed.</p>","PeriodicalId":12862,"journal":{"name":"Hereditas","volume":" ","pages":"38"},"PeriodicalIF":2.7000,"publicationDate":"2022-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9531459/pdf/","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Hereditas","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s41065-022-00252-x","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Background: The prevalence of Alzheimer's disease (AD) varies based on gender. Due to the lack of early stage biomarkers, most of them are diagnosed at the terminal stage. This study aimed to explore sex-specific signaling pathways and identify diagnostic biomarkers of AD.
Methods: Microarray dataset for blood was obtained from the Gene Expression Omnibus (GEO) database of GSE63060 to conduct differentially expressed genes (DEGs) analysis by R software limma. Gene Ontology (GO) analysis, Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis and Gene set enrichment analysis (GSEA) were conducted. Immune checkpoint gene expression was compared between females and males. Using CytoHubba, we identified hub genes in a protein-protein interaction network (PPI). Then, we evaluated their distinct effectiveness using unsupervised hierarchical clustering. Support vector machine (SVM) and ten-fold cross-validation were used to further verify these biomarkers. Lastly, we confirmed our findings by using another independent dataset.
Results: A total of 37 female-specific DEGs and 27 male-specific DEGs were identified from GSE63060 datasets. Analyses of enrichment showed that female-specific DEGs primarily focused on energy metabolism, while male-specific DEGs mostly involved in immune regulation. Three immune-checkpoint-relevant genes dysregulated in males. In females, however, these eight genes were not differentially expressed. SNRPG, RPS27A, COX7A2, ATP5PO, LSM3, COX7C, PFDN5, HINT1, PSMA6, RPS3A and RPL31 were regarded as hub genes for females, while SNRPG, RPL31, COX7C, RPS27A, RPL35A, RPS3A, RPS20 and PFDN5 were regarded as hub genes for males. Thirteen hub genes mentioned above was significantly lower in both AD and mild cognitive impairment (MCI). The diagnostic model of 15-marker panel (13 hub genes with sex and age) was developed. Both the training dataset and the independent validation dataset have area under the curve (AUC) with a high value (0.919, 95%CI 0.901-0.929 and 0.803, 95%CI 0.789-0.826). Based on GSEA for hub genes, they were associated with some aspects of AD pathogenesis.
Conclusion: DEGs in males and females contribute differently to AD pathogenesis. Algorithms combining blood-based biomarkers may improve AD diagnostic accuracy, but large validation studies are needed.
HereditasBiochemistry, Genetics and Molecular Biology-Genetics
CiteScore
3.80
自引率
3.70%
发文量
0
期刊介绍:
For almost a century, Hereditas has published original cutting-edge research and reviews. As the Official journal of the Mendelian Society of Lund, the journal welcomes research from across all areas of genetics and genomics. Topics of interest include human and medical genetics, animal and plant genetics, microbial genetics, agriculture and bioinformatics.