MiRNA-gene network embedding for predicting cancer driver genes.

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS ACS Applied Bio Materials Pub Date : 2023-07-17 DOI:10.1093/bfgp/elac059
Wei Peng, Rong Wu, Wei Dai, Yu Ning, Xiaodong Fu, Li Liu, Lijun Liu
{"title":"MiRNA-gene network embedding for predicting cancer driver genes.","authors":"Wei Peng,&nbsp;Rong Wu,&nbsp;Wei Dai,&nbsp;Yu Ning,&nbsp;Xiaodong Fu,&nbsp;Li Liu,&nbsp;Lijun Liu","doi":"10.1093/bfgp/elac059","DOIUrl":null,"url":null,"abstract":"<p><p>The development and progression of cancer arise due to the accumulation of mutations in driver genes. Correctly identifying the driver genes that lead to cancer development can significantly assist the drug design, cancer diagnosis and treatment. Most computer methods detect cancer drivers based on gene-gene networks by assuming that driver genes tend to work together, form protein complexes and enrich pathways. However, they ignore that microribonucleic acid (RNAs; miRNAs) regulate the expressions of their targeted genes and are related to human diseases. In this work, we propose a graph convolution network (GCN) approach called GM-GCN to identify the cancer driver genes based on a gene-miRNA network. First, we constructed a gene-miRNA network, where the nodes are miRNAs and their targeted genes. The edges connecting miRNA and genes indicate the regulatory relationship between miRNAs and genes. We prepared initial attributes for miRNA and genes according to their biological properties and used a GCN model to learn the gene feature representations in the network by aggregating the features of their neighboring miRNA nodes. And then, the learned features were passed through a 1D convolution module for feature dimensionality change. We employed the learned and original gene features to optimize model parameters. Finally, the gene features learned from the network and the initial input gene features were fed into a logistic regression model to predict whether a gene is a driver gene. We applied our model and state-of-the-art methods to predict cancer drivers for pan-cancer and individual cancer types. Experimental results show that our model performs well in terms of the area under the receiver operating characteristic curve and the area under the precision-recall curve compared to state-of-the-art methods that work on gene networks. The GM-GCN is freely available via https://github.com/weiba/GM-GCN.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2023-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/bfgp/elac059","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 1

Abstract

The development and progression of cancer arise due to the accumulation of mutations in driver genes. Correctly identifying the driver genes that lead to cancer development can significantly assist the drug design, cancer diagnosis and treatment. Most computer methods detect cancer drivers based on gene-gene networks by assuming that driver genes tend to work together, form protein complexes and enrich pathways. However, they ignore that microribonucleic acid (RNAs; miRNAs) regulate the expressions of their targeted genes and are related to human diseases. In this work, we propose a graph convolution network (GCN) approach called GM-GCN to identify the cancer driver genes based on a gene-miRNA network. First, we constructed a gene-miRNA network, where the nodes are miRNAs and their targeted genes. The edges connecting miRNA and genes indicate the regulatory relationship between miRNAs and genes. We prepared initial attributes for miRNA and genes according to their biological properties and used a GCN model to learn the gene feature representations in the network by aggregating the features of their neighboring miRNA nodes. And then, the learned features were passed through a 1D convolution module for feature dimensionality change. We employed the learned and original gene features to optimize model parameters. Finally, the gene features learned from the network and the initial input gene features were fed into a logistic regression model to predict whether a gene is a driver gene. We applied our model and state-of-the-art methods to predict cancer drivers for pan-cancer and individual cancer types. Experimental results show that our model performs well in terms of the area under the receiver operating characteristic curve and the area under the precision-recall curve compared to state-of-the-art methods that work on gene networks. The GM-GCN is freely available via https://github.com/weiba/GM-GCN.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
mirna -基因网络嵌入预测癌症驱动基因。
癌症的发生和发展是由于驱动基因突变的积累引起的。正确识别导致癌症发展的驱动基因对药物设计、癌症诊断和治疗具有重要的辅助作用。大多数计算机方法检测基于基因-基因网络的癌症驱动因素,假设驱动基因倾向于协同工作,形成蛋白质复合物并丰富途径。然而,他们忽略了微核糖核酸(RNAs;mirna)调节其靶基因的表达,并与人类疾病有关。在这项工作中,我们提出了一种称为GM-GCN的图卷积网络(GCN)方法来识别基于基因- mirna网络的癌症驱动基因。首先,我们构建了一个基因- mirna网络,其中节点是mirna及其靶基因。连接miRNA和基因的边缘表示miRNA和基因之间的调控关系。我们根据miRNA和基因的生物学特性为其准备初始属性,并使用GCN模型通过聚合其相邻miRNA节点的特征来学习网络中基因的特征表示。然后,将学习到的特征通过一维卷积模块进行特征维数变化。我们利用学习到的和原始的基因特征来优化模型参数。最后,将从网络中学习到的基因特征和初始输入基因特征输入到逻辑回归模型中,以预测该基因是否为驱动基因。我们应用我们的模型和最先进的方法来预测泛癌症和个体癌症类型的癌症驱动因素。实验结果表明,与最先进的基因网络方法相比,我们的模型在接收者工作特征曲线下的面积和精确召回曲线下的面积方面表现良好。GM-GCN可通过https://github.com/weiba/GM-GCN免费获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
ACS Applied Bio Materials
ACS Applied Bio Materials Chemistry-Chemistry (all)
CiteScore
9.40
自引率
2.10%
发文量
464
期刊最新文献
A Systematic Review of Sleep Disturbance in Idiopathic Intracranial Hypertension. Advancing Patient Education in Idiopathic Intracranial Hypertension: The Promise of Large Language Models. Anti-Myelin-Associated Glycoprotein Neuropathy: Recent Developments. Approach to Managing the Initial Presentation of Multiple Sclerosis: A Worldwide Practice Survey. Association Between LACE+ Index Risk Category and 90-Day Mortality After Stroke.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1