Comparison of light gradient boosting and logistic regression for interactomic hub genes in Porphyromonas gingivalis and Fusobacterium nucleatum-induced periodontitis with Alzheimer's disease.
Pradeep Kumar Yadalam, Shubhangini Chatterjee, Prabhu Manickam Natarajan, Carlos M Ardila
{"title":"Comparison of light gradient boosting and logistic regression for interactomic hub genes in <i>Porphyromonas gingivalis</i> and <i>Fusobacterium nucleatum</i>-induced periodontitis with Alzheimer's disease.","authors":"Pradeep Kumar Yadalam, Shubhangini Chatterjee, Prabhu Manickam Natarajan, Carlos M Ardila","doi":"10.3389/froh.2025.1463458","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong><i>Porphyromonas gingivalis</i> and Treponema species have been found to invade the central nervous system through virulence factors, causing inflammation and influencing the host immune response. <i>P. gingivalis</i> interacts with astrocytes, microglia, and neurons, leading to neuroinflammation. <i>Aggregatibacter actinomycetemcomitans</i> and <i>Fusobacterium nucleatum</i> may also play a role in the development of Alzheimer's disease. Interactomic hub genes, central to protein-protein interaction networks, are vulnerable to perturbations, leading to diseases such as cancer, neurodegenerative disorders, and cardiovascular diseases. Machine learning can identify differentially expressed hub genes in specific conditions or diseases, providing insights into disease mechanisms and developing new therapeutic approaches. This study compares the performance of light gradient boosting and logistic regression in identifying interactomic hub genes in <i>P. gingivalis</i> and <i>F. nucleatum</i>-induced periodontitis with those in Alzheimer's disease.</p><p><strong>Methods: </strong>Using the GSE222136 dataset, we analyzed differential gene expression in periodontitis and Alzheimer's disease. The GEO2R tool was used to identify differentially expressed genes under different conditions, providing insights into molecular mechanisms. Bioinformatics tools such as Cytoscape and CytoHubba were employed to create gene expression networks to identify hub genes. Logistic regression and light gradient boosting were used to predict interactomic hub genes, with outliers removed and machine learning algorithms applied.</p><p><strong>Results: </strong>The data were cross-validated and divided into training and testing segments. The top hub genes identified were TNFRSF9, LZIC, TNFRSF8, SLC45A1, GPR157, and SLC25A33, which are induced by <i>P. gingivalis</i> and <i>F. nucleatum</i> and are responsible for endothelial dysfunction in brain cells. The accuracy of logistic regression and light gradient boosting was 67% and 60%, respectively.</p><p><strong>Discussion: </strong>The logistic regression model demonstrated superior accuracy and balance compared to the light gradient boosting model, indicating its potential for future improvements in predicting hub genes in periodontal and Alzheimer's diseases.</p>","PeriodicalId":94016,"journal":{"name":"Frontiers in oral health","volume":"6 ","pages":"1463458"},"PeriodicalIF":3.0000,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11914103/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in oral health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/froh.2025.1463458","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Porphyromonas gingivalis and Treponema species have been found to invade the central nervous system through virulence factors, causing inflammation and influencing the host immune response. P. gingivalis interacts with astrocytes, microglia, and neurons, leading to neuroinflammation. Aggregatibacter actinomycetemcomitans and Fusobacterium nucleatum may also play a role in the development of Alzheimer's disease. Interactomic hub genes, central to protein-protein interaction networks, are vulnerable to perturbations, leading to diseases such as cancer, neurodegenerative disorders, and cardiovascular diseases. Machine learning can identify differentially expressed hub genes in specific conditions or diseases, providing insights into disease mechanisms and developing new therapeutic approaches. This study compares the performance of light gradient boosting and logistic regression in identifying interactomic hub genes in P. gingivalis and F. nucleatum-induced periodontitis with those in Alzheimer's disease.
Methods: Using the GSE222136 dataset, we analyzed differential gene expression in periodontitis and Alzheimer's disease. The GEO2R tool was used to identify differentially expressed genes under different conditions, providing insights into molecular mechanisms. Bioinformatics tools such as Cytoscape and CytoHubba were employed to create gene expression networks to identify hub genes. Logistic regression and light gradient boosting were used to predict interactomic hub genes, with outliers removed and machine learning algorithms applied.
Results: The data were cross-validated and divided into training and testing segments. The top hub genes identified were TNFRSF9, LZIC, TNFRSF8, SLC45A1, GPR157, and SLC25A33, which are induced by P. gingivalis and F. nucleatum and are responsible for endothelial dysfunction in brain cells. The accuracy of logistic regression and light gradient boosting was 67% and 60%, respectively.
Discussion: The logistic regression model demonstrated superior accuracy and balance compared to the light gradient boosting model, indicating its potential for future improvements in predicting hub genes in periodontal and Alzheimer's diseases.