DMFVAE: miRNA-disease associations prediction based on deep matrix factorization method with variational autoencoder

IF 3.4 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Frontiers of Computer Science Pub Date : 2024-07-12 DOI:10.1007/s11704-023-3610-y

Pijing Wei, Qianqian Wang, Zhen Gao, Ruifen Cao, Chunhou Zheng

{"title":"DMFVAE: miRNA-disease associations prediction based on deep matrix factorization method with variational autoencoder","authors":"Pijing Wei, Qianqian Wang, Zhen Gao, Ruifen Cao, Chunhou Zheng","doi":"10.1007/s11704-023-3610-y","DOIUrl":null,"url":null,"abstract":"<p>MicroRNAs (miRNAs) are closely related to numerous complex human diseases, therefore, exploring miRNA-disease associations (MDAs) can help people gain a better understanding of complex disease mechanism. An increasing number of computational methods have been developed to predict MDAs. However, the sparsity of the MDAs may hinder the performance of many methods. In addition, many methods fail to capture the nonlinear relationships of miRNA-disease network and inadequately leverage the features of network and neighbor nodes. In this study, we propose a deep matrix factorization model with variational autoencoder (DMFVAE) to predict potential MDAs. DMFVAE first decomposes the original association matrix and the enhanced association matrix, in which the enhanced association matrix is enhanced by self-adjusting the nearest neighbor method, to obtain sparse vectors and dense vectors, respectively. Then, the variational encoder is employed to obtain the nonlinear latent vectors of miRNA and disease for the sparse vectors, and meanwhile, node2vec is used to obtain the network structure embedding vectors of miRNA and disease for the dense vectors. Finally, sample features are acquired by combining the latent vectors and network structure embedding vectors, and the final prediction is implemented by convolutional neural network with channel attention. To evaluate the performance of DMFVAE, we conduct five-fold cross validation on the HMDD v2.0 and HMDD v3.2 datasets and the results show that DMFVAE performs well. Furthermore, case studies on lung neoplasms, colon neoplasms, and esophageal neoplasms confirm the ability of DMFVAE in identifying potential miRNAs for human diseases.</p>","PeriodicalId":12640,"journal":{"name":"Frontiers of Computer Science","volume":"218 1","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers of Computer Science","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11704-023-3610-y","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

MicroRNAs (miRNAs) are closely related to numerous complex human diseases, therefore, exploring miRNA-disease associations (MDAs) can help people gain a better understanding of complex disease mechanism. An increasing number of computational methods have been developed to predict MDAs. However, the sparsity of the MDAs may hinder the performance of many methods. In addition, many methods fail to capture the nonlinear relationships of miRNA-disease network and inadequately leverage the features of network and neighbor nodes. In this study, we propose a deep matrix factorization model with variational autoencoder (DMFVAE) to predict potential MDAs. DMFVAE first decomposes the original association matrix and the enhanced association matrix, in which the enhanced association matrix is enhanced by self-adjusting the nearest neighbor method, to obtain sparse vectors and dense vectors, respectively. Then, the variational encoder is employed to obtain the nonlinear latent vectors of miRNA and disease for the sparse vectors, and meanwhile, node2vec is used to obtain the network structure embedding vectors of miRNA and disease for the dense vectors. Finally, sample features are acquired by combining the latent vectors and network structure embedding vectors, and the final prediction is implemented by convolutional neural network with channel attention. To evaluate the performance of DMFVAE, we conduct five-fold cross validation on the HMDD v2.0 and HMDD v3.2 datasets and the results show that DMFVAE performs well. Furthermore, case studies on lung neoplasms, colon neoplasms, and esophageal neoplasms confirm the ability of DMFVAE in identifying potential miRNAs for human diseases.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

DMFVAE：基于变异自动编码器的深度矩阵因式分解方法的 miRNA-疾病关联预测

微RNA（miRNA）与人类多种复杂疾病密切相关，因此，探索miRNA与疾病的关联（MDAs）有助于人们更好地了解复杂的疾病机制。目前已开发出越来越多的计算方法来预测 MDAs。然而，MDAs 的稀疏性可能会阻碍许多方法的性能。此外，许多方法未能捕捉到 miRNA-疾病网络的非线性关系，也未能充分利用网络和邻近节点的特征。在这项研究中，我们提出了一种带有变异自动编码器（DMFVAE）的深度矩阵因式分解模型来预测潜在的 MDAs。DMFVAE 首先分解原始关联系数矩阵和增强关联系数矩阵，其中增强关联系数矩阵通过自调整近邻法增强，分别得到稀疏向量和稠密向量。然后，利用变异编码器获取稀疏向量中 miRNA 和疾病的非线性潜向量，同时利用 node2vec 获取密集向量中 miRNA 和疾病的网络结构嵌入向量。最后，结合潜向量和网络结构嵌入向量获得样本特征，并通过具有通道注意的卷积神经网络实现最终预测。为了评估 DMFVAE 的性能，我们在 HMDD v2.0 和 HMDD v3.2 数据集上进行了五倍交叉验证，结果表明 DMFVAE 性能良好。此外，对肺部肿瘤、结肠肿瘤和食管肿瘤的案例研究也证实了 DMFVAE 识别人类疾病潜在 miRNA 的能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Frontiers of Computer Science COMPUTER SCIENCE, INFORMATION SYSTEMS-COMPUTER SCIENCE, SOFTWARE ENGINEERING

CiteScore

8.60

自引率

2.40%

发文量

799

审稿时长

6-12 weeks

期刊介绍： Frontiers of Computer Science aims to provide a forum for the publication of peer-reviewed papers to promote rapid communication and exchange between computer scientists. The journal publishes research papers and review articles in a wide range of topics, including: architecture, software, artificial intelligence, theoretical computer science, networks and communication, information systems, multimedia and graphics, information security, interdisciplinary, etc. The journal especially encourages papers from new emerging and multidisciplinary areas, as well as papers reflecting the international trends of research and development and on special topics reporting progress made by Chinese computer scientists.