Weixiao Bu, Huaxia Mu, Mengyao Gao, Weiqiang Su, Fuyan Shi, Qinghua Wang, Suzhen Wang, Yujia Kong
{"title":"Data-driven Bayesian network learning analysis on the regulatory mechanism between carcinogenic genes and immune cells","authors":"Weixiao Bu, Huaxia Mu, Mengyao Gao, Weiqiang Su, Fuyan Shi, Qinghua Wang, Suzhen Wang, Yujia Kong","doi":"10.1002/ctd2.257","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>In recent years, it has become a research focus to accurately extract key genes influencing the occurrence and development of diseases from massive genomic data and study their regulatory mechanisms. Further exploration of these large databases is beneficial for us to identify key regulatory mechanisms in the occurrence and development of diseases, providing direction and theoretical basis for subsequent experimental design and research.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>By using data from The Cancer Genome Atlas (TCGA) for lung adenocarcinoma (LUAD), the immune cell content of patients was obtained through deconvolution calculations in CIBERSORTx. Combined with Bayesian network inference methods, the impact of key gene expression on the heterocellular network influencing lung cancer was analyzed.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>We found CD36 and ADRA1A genes were identified as two key mRNA genes influencing lung adenocarcinoma. The sensitivity analysis shows that the model performs well on both the testing set (error rate = 4%, AUC = 0.9804), the training set (error rate = 5.637%, AUC = 0.9746) and the verification set (error rate = 28.85%, AUC = 0.8689). The model has excellent predictive capabilities.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>We found that CD36 and ADRA1A genes may influence the development of lung cancer by affecting regulatory T cell and follicular helper T cell populations. Additionally, plasma cells may affect the expression of the CD36 gene. And the mutual information calculated by the model for these two genes is also the highest, indicating their potential as tumour biomarkers. Combining the network model, it can be inferred that CD36 and ADRA1A are likely to influence the occurrence and development of the disease through follicular helper T cells and regulatory T cells.</p>\n </section>\n </div>","PeriodicalId":72605,"journal":{"name":"Clinical and translational discovery","volume":"3 6","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ctd2.257","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical and translational discovery","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ctd2.257","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background
In recent years, it has become a research focus to accurately extract key genes influencing the occurrence and development of diseases from massive genomic data and study their regulatory mechanisms. Further exploration of these large databases is beneficial for us to identify key regulatory mechanisms in the occurrence and development of diseases, providing direction and theoretical basis for subsequent experimental design and research.
Methods
By using data from The Cancer Genome Atlas (TCGA) for lung adenocarcinoma (LUAD), the immune cell content of patients was obtained through deconvolution calculations in CIBERSORTx. Combined with Bayesian network inference methods, the impact of key gene expression on the heterocellular network influencing lung cancer was analyzed.
Results
We found CD36 and ADRA1A genes were identified as two key mRNA genes influencing lung adenocarcinoma. The sensitivity analysis shows that the model performs well on both the testing set (error rate = 4%, AUC = 0.9804), the training set (error rate = 5.637%, AUC = 0.9746) and the verification set (error rate = 28.85%, AUC = 0.8689). The model has excellent predictive capabilities.
Conclusion
We found that CD36 and ADRA1A genes may influence the development of lung cancer by affecting regulatory T cell and follicular helper T cell populations. Additionally, plasma cells may affect the expression of the CD36 gene. And the mutual information calculated by the model for these two genes is also the highest, indicating their potential as tumour biomarkers. Combining the network model, it can be inferred that CD36 and ADRA1A are likely to influence the occurrence and development of the disease through follicular helper T cells and regulatory T cells.