{"title":"整合生物信息学和机器学习方法,分析 HBV 诱导的肝细胞癌的诊断生物标志物。","authors":"Anyin Yang, Jianping Liu, Mengru Li, Hong Zhang, Xulei Zhang, Lianping Wu","doi":"10.1186/s13000-024-01528-8","DOIUrl":null,"url":null,"abstract":"<p><p>Hepatocellular carcinoma (HCC) is a malignant tumor. It is estimated that approximately 50-80% of HCC cases worldwide are caused by hepatitis b virus (HBV) infection, and other pathogenic factors have been shown to promote the development of HCC when coexisting with HBV. Understanding the molecular mechanisms of HBV-induced hepatocellular carcinoma (HBV-HCC) is crucial for the prevention, diagnosis, and treatment of the disease. In this study, we analyzed the molecular mechanisms of HBV-induced HCC by combining bioinformatics and deep learning methods. Firstly, we collected a gene set related to HBV-HCC from the GEO database, performed differential analysis and WGCNA analysis to identify genes with abnormal expression in tumors and high relevance to tumors. We used three deep learning methods, Lasso, random forest, and SVM, to identify key genes RACGAP1, ECT2, and NDC80. By establishing a diagnostic model, we determined the accuracy of key genes in diagnosing HBV-HCC. In the training set, RACGAP1(AUC:0.976), ECT2(AUC:0.969), and NDC80 (AUC: 0.976) showed high accuracy. They also exhibited good accuracy in the validation set: RACGAP1(AUC:0.878), ECT2(AUC:0.731), and NDC80(AUC:0.915). The key genes were found to be highly expressed in liver cancer tissues compared to normal liver tissues, and survival analysis indicated that high expression of key genes was associated with poor prognosis in liver cancer patients. This suggests a close relationship between key genes RACGAP1, ECT2, and NDC80 and the occurrence and progression of HBV-HCC. Molecular docking results showed that the key genes could spontaneously bind to the anti-hepatocellular carcinoma drugs Lenvatinib, Regorafenib, and Sorafenib with strong binding activity. Therefore, ECT2, NDC80, and RACGAP1 may serve as potential biomarkers for the diagnosis of HBV-HCC and as targets for the development of targeted therapeutic drugs.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11295615/pdf/","citationCount":"0","resultStr":"{\"title\":\"Integrating bioinformatics and machine learning methods to analyze diagnostic biomarkers for HBV-induced hepatocellular carcinoma.\",\"authors\":\"Anyin Yang, Jianping Liu, Mengru Li, Hong Zhang, Xulei Zhang, Lianping Wu\",\"doi\":\"10.1186/s13000-024-01528-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Hepatocellular carcinoma (HCC) is a malignant tumor. It is estimated that approximately 50-80% of HCC cases worldwide are caused by hepatitis b virus (HBV) infection, and other pathogenic factors have been shown to promote the development of HCC when coexisting with HBV. Understanding the molecular mechanisms of HBV-induced hepatocellular carcinoma (HBV-HCC) is crucial for the prevention, diagnosis, and treatment of the disease. In this study, we analyzed the molecular mechanisms of HBV-induced HCC by combining bioinformatics and deep learning methods. Firstly, we collected a gene set related to HBV-HCC from the GEO database, performed differential analysis and WGCNA analysis to identify genes with abnormal expression in tumors and high relevance to tumors. We used three deep learning methods, Lasso, random forest, and SVM, to identify key genes RACGAP1, ECT2, and NDC80. By establishing a diagnostic model, we determined the accuracy of key genes in diagnosing HBV-HCC. In the training set, RACGAP1(AUC:0.976), ECT2(AUC:0.969), and NDC80 (AUC: 0.976) showed high accuracy. They also exhibited good accuracy in the validation set: RACGAP1(AUC:0.878), ECT2(AUC:0.731), and NDC80(AUC:0.915). The key genes were found to be highly expressed in liver cancer tissues compared to normal liver tissues, and survival analysis indicated that high expression of key genes was associated with poor prognosis in liver cancer patients. This suggests a close relationship between key genes RACGAP1, ECT2, and NDC80 and the occurrence and progression of HBV-HCC. Molecular docking results showed that the key genes could spontaneously bind to the anti-hepatocellular carcinoma drugs Lenvatinib, Regorafenib, and Sorafenib with strong binding activity. Therefore, ECT2, NDC80, and RACGAP1 may serve as potential biomarkers for the diagnosis of HBV-HCC and as targets for the development of targeted therapeutic drugs.</p>\",\"PeriodicalId\":2,\"journal\":{\"name\":\"ACS Applied Bio Materials\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-08-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11295615/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Applied Bio Materials\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s13000-024-01528-8\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATERIALS SCIENCE, BIOMATERIALS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s13000-024-01528-8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
Integrating bioinformatics and machine learning methods to analyze diagnostic biomarkers for HBV-induced hepatocellular carcinoma.
Hepatocellular carcinoma (HCC) is a malignant tumor. It is estimated that approximately 50-80% of HCC cases worldwide are caused by hepatitis b virus (HBV) infection, and other pathogenic factors have been shown to promote the development of HCC when coexisting with HBV. Understanding the molecular mechanisms of HBV-induced hepatocellular carcinoma (HBV-HCC) is crucial for the prevention, diagnosis, and treatment of the disease. In this study, we analyzed the molecular mechanisms of HBV-induced HCC by combining bioinformatics and deep learning methods. Firstly, we collected a gene set related to HBV-HCC from the GEO database, performed differential analysis and WGCNA analysis to identify genes with abnormal expression in tumors and high relevance to tumors. We used three deep learning methods, Lasso, random forest, and SVM, to identify key genes RACGAP1, ECT2, and NDC80. By establishing a diagnostic model, we determined the accuracy of key genes in diagnosing HBV-HCC. In the training set, RACGAP1(AUC:0.976), ECT2(AUC:0.969), and NDC80 (AUC: 0.976) showed high accuracy. They also exhibited good accuracy in the validation set: RACGAP1(AUC:0.878), ECT2(AUC:0.731), and NDC80(AUC:0.915). The key genes were found to be highly expressed in liver cancer tissues compared to normal liver tissues, and survival analysis indicated that high expression of key genes was associated with poor prognosis in liver cancer patients. This suggests a close relationship between key genes RACGAP1, ECT2, and NDC80 and the occurrence and progression of HBV-HCC. Molecular docking results showed that the key genes could spontaneously bind to the anti-hepatocellular carcinoma drugs Lenvatinib, Regorafenib, and Sorafenib with strong binding activity. Therefore, ECT2, NDC80, and RACGAP1 may serve as potential biomarkers for the diagnosis of HBV-HCC and as targets for the development of targeted therapeutic drugs.