Integrating bioinformatics and machine learning methods to analyze diagnostic biomarkers for HBV-induced hepatocellular carcinoma.

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS ACS Applied Bio Materials Pub Date : 2024-08-02 DOI:10.1186/s13000-024-01528-8
Anyin Yang, Jianping Liu, Mengru Li, Hong Zhang, Xulei Zhang, Lianping Wu
{"title":"Integrating bioinformatics and machine learning methods to analyze diagnostic biomarkers for HBV-induced hepatocellular carcinoma.","authors":"Anyin Yang, Jianping Liu, Mengru Li, Hong Zhang, Xulei Zhang, Lianping Wu","doi":"10.1186/s13000-024-01528-8","DOIUrl":null,"url":null,"abstract":"<p><p>Hepatocellular carcinoma (HCC) is a malignant tumor. It is estimated that approximately 50-80% of HCC cases worldwide are caused by hepatitis b virus (HBV) infection, and other pathogenic factors have been shown to promote the development of HCC when coexisting with HBV. Understanding the molecular mechanisms of HBV-induced hepatocellular carcinoma (HBV-HCC) is crucial for the prevention, diagnosis, and treatment of the disease. In this study, we analyzed the molecular mechanisms of HBV-induced HCC by combining bioinformatics and deep learning methods. Firstly, we collected a gene set related to HBV-HCC from the GEO database, performed differential analysis and WGCNA analysis to identify genes with abnormal expression in tumors and high relevance to tumors. We used three deep learning methods, Lasso, random forest, and SVM, to identify key genes RACGAP1, ECT2, and NDC80. By establishing a diagnostic model, we determined the accuracy of key genes in diagnosing HBV-HCC. In the training set, RACGAP1(AUC:0.976), ECT2(AUC:0.969), and NDC80 (AUC: 0.976) showed high accuracy. They also exhibited good accuracy in the validation set: RACGAP1(AUC:0.878), ECT2(AUC:0.731), and NDC80(AUC:0.915). The key genes were found to be highly expressed in liver cancer tissues compared to normal liver tissues, and survival analysis indicated that high expression of key genes was associated with poor prognosis in liver cancer patients. This suggests a close relationship between key genes RACGAP1, ECT2, and NDC80 and the occurrence and progression of HBV-HCC. Molecular docking results showed that the key genes could spontaneously bind to the anti-hepatocellular carcinoma drugs Lenvatinib, Regorafenib, and Sorafenib with strong binding activity. Therefore, ECT2, NDC80, and RACGAP1 may serve as potential biomarkers for the diagnosis of HBV-HCC and as targets for the development of targeted therapeutic drugs.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11295615/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s13000-024-01528-8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0

Abstract

Hepatocellular carcinoma (HCC) is a malignant tumor. It is estimated that approximately 50-80% of HCC cases worldwide are caused by hepatitis b virus (HBV) infection, and other pathogenic factors have been shown to promote the development of HCC when coexisting with HBV. Understanding the molecular mechanisms of HBV-induced hepatocellular carcinoma (HBV-HCC) is crucial for the prevention, diagnosis, and treatment of the disease. In this study, we analyzed the molecular mechanisms of HBV-induced HCC by combining bioinformatics and deep learning methods. Firstly, we collected a gene set related to HBV-HCC from the GEO database, performed differential analysis and WGCNA analysis to identify genes with abnormal expression in tumors and high relevance to tumors. We used three deep learning methods, Lasso, random forest, and SVM, to identify key genes RACGAP1, ECT2, and NDC80. By establishing a diagnostic model, we determined the accuracy of key genes in diagnosing HBV-HCC. In the training set, RACGAP1(AUC:0.976), ECT2(AUC:0.969), and NDC80 (AUC: 0.976) showed high accuracy. They also exhibited good accuracy in the validation set: RACGAP1(AUC:0.878), ECT2(AUC:0.731), and NDC80(AUC:0.915). The key genes were found to be highly expressed in liver cancer tissues compared to normal liver tissues, and survival analysis indicated that high expression of key genes was associated with poor prognosis in liver cancer patients. This suggests a close relationship between key genes RACGAP1, ECT2, and NDC80 and the occurrence and progression of HBV-HCC. Molecular docking results showed that the key genes could spontaneously bind to the anti-hepatocellular carcinoma drugs Lenvatinib, Regorafenib, and Sorafenib with strong binding activity. Therefore, ECT2, NDC80, and RACGAP1 may serve as potential biomarkers for the diagnosis of HBV-HCC and as targets for the development of targeted therapeutic drugs.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
整合生物信息学和机器学习方法,分析 HBV 诱导的肝细胞癌的诊断生物标志物。
肝细胞癌(HCC)是一种恶性肿瘤。据估计,全球约有 50%-80% 的 HCC 病例是由乙型肝炎病毒(HBV)感染引起的,其他致病因素与 HBV 共存时也会促进 HCC 的发展。了解 HBV 诱导的肝细胞癌(HBV-HCC)的分子机制对于该疾病的预防、诊断和治疗至关重要。在本研究中,我们结合生物信息学和深度学习方法分析了 HBV 诱导 HCC 的分子机制。首先,我们从 GEO 数据库中收集了与 HBV-HCC 相关的基因集,并进行了差异分析和 WGCNA 分析,以确定在肿瘤中异常表达且与肿瘤高度相关的基因。我们使用 Lasso、随机森林和 SVM 三种深度学习方法识别了关键基因 RACGAP1、ECT2 和 NDC80。通过建立诊断模型,我们确定了关键基因诊断 HBV-HCC 的准确性。在训练集中,RACGAP1(AUC:0.976)、ECT2(AUC:0.969)和 NDC80(AUC:0.976)表现出很高的准确性。它们在验证集中也表现出良好的准确性:RACGAP1(AUC:0.878)、ECT2(AUC:0.731)和 NDC80(AUC:0.915)。研究发现,与正常肝组织相比,关键基因在肝癌组织中高表达,而生存分析表明,关键基因的高表达与肝癌患者的不良预后有关。这表明关键基因 RACGAP1、ECT2 和 NDC80 与 HBV-HCC 的发生和发展有密切关系。分子对接结果表明,这些关键基因能自发地与抗肝癌药物伦伐替尼、瑞戈非尼和索拉非尼结合,并具有很强的结合活性。因此,ECT2、NDC80和RACGAP1可作为诊断HBV-HCC的潜在生物标志物和开发靶向治疗药物的目标。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
ACS Applied Bio Materials
ACS Applied Bio Materials Chemistry-Chemistry (all)
CiteScore
9.40
自引率
2.10%
发文量
464
期刊最新文献
A Systematic Review of Sleep Disturbance in Idiopathic Intracranial Hypertension. Advancing Patient Education in Idiopathic Intracranial Hypertension: The Promise of Large Language Models. Anti-Myelin-Associated Glycoprotein Neuropathy: Recent Developments. Approach to Managing the Initial Presentation of Multiple Sclerosis: A Worldwide Practice Survey. Association Between LACE+ Index Risk Category and 90-Day Mortality After Stroke.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1