iGATTLDA: Integrative graph attention and transformer-based model for predicting lncRNA-Disease associations

IF 1.9 4区 生物学 Q4 CELL BIOLOGY IET Systems Biology Pub Date : 2024-09-22 DOI:10.1049/syb2.12098
Biffon Manyura Momanyi, Sebu Aboma Temesgen, Tian-Yu Wang, Hui Gao, Ru Gao, Hua Tang, Li-Xia Tang
{"title":"iGATTLDA: Integrative graph attention and transformer-based model for predicting lncRNA-Disease associations","authors":"Biffon Manyura Momanyi,&nbsp;Sebu Aboma Temesgen,&nbsp;Tian-Yu Wang,&nbsp;Hui Gao,&nbsp;Ru Gao,&nbsp;Hua Tang,&nbsp;Li-Xia Tang","doi":"10.1049/syb2.12098","DOIUrl":null,"url":null,"abstract":"<p>Long non-coding RNAs (lncRNAs) have emerged as significant contributors to the regulation of various biological processes, and their dysregulation has been linked to a variety of human disorders. Accurate prediction of potential correlations between lncRNAs and diseases is crucial for advancing disease diagnostics and treatment procedures. The authors introduced a novel computational method, iGATTLDA, for the prediction of lncRNA-disease associations. The model utilised lncRNA and disease similarity matrices, with known associations represented in an adjacency matrix. A heterogeneous network was constructed, dissecting lncRNAs and diseases as nodes and their associations as edges. The Graph Attention Network (GAT) is employed to process initial features and corresponding adjacency information. GAT identified significant neighbouring nodes in the network, capturing intricate relationships between lncRNAs and diseases, and generating new feature representations. Subsequently, the transformer captures global dependencies and interactions across the entire sequence of features produced by the GAT. Consequently, iGATTLDA successfully captures complex relationships and interactions that conventional approaches may overlook. In evaluating iGATTLDA, it attained an area under the receiver operating characteristic (ROC) curve (AUC) of 0.95 and an area under the precision recall curve (AUPRC) of 0.96 with a two-layer multilayer perceptron (MLP) classifier. These results were notably higher compared to the majority of previously proposed models, further substantiating the model’s efficiency in predicting potential lncRNA-disease associations by incorporating both local and global interactions. The implementation details can be obtained from https://github.com/momanyibiffon/iGATTLDA.</p>","PeriodicalId":50379,"journal":{"name":"IET Systems Biology","volume":"18 5","pages":"172-182"},"PeriodicalIF":1.9000,"publicationDate":"2024-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/syb2.12098","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Systems Biology","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/syb2.12098","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Long non-coding RNAs (lncRNAs) have emerged as significant contributors to the regulation of various biological processes, and their dysregulation has been linked to a variety of human disorders. Accurate prediction of potential correlations between lncRNAs and diseases is crucial for advancing disease diagnostics and treatment procedures. The authors introduced a novel computational method, iGATTLDA, for the prediction of lncRNA-disease associations. The model utilised lncRNA and disease similarity matrices, with known associations represented in an adjacency matrix. A heterogeneous network was constructed, dissecting lncRNAs and diseases as nodes and their associations as edges. The Graph Attention Network (GAT) is employed to process initial features and corresponding adjacency information. GAT identified significant neighbouring nodes in the network, capturing intricate relationships between lncRNAs and diseases, and generating new feature representations. Subsequently, the transformer captures global dependencies and interactions across the entire sequence of features produced by the GAT. Consequently, iGATTLDA successfully captures complex relationships and interactions that conventional approaches may overlook. In evaluating iGATTLDA, it attained an area under the receiver operating characteristic (ROC) curve (AUC) of 0.95 and an area under the precision recall curve (AUPRC) of 0.96 with a two-layer multilayer perceptron (MLP) classifier. These results were notably higher compared to the majority of previously proposed models, further substantiating the model’s efficiency in predicting potential lncRNA-disease associations by incorporating both local and global interactions. The implementation details can be obtained from https://github.com/momanyibiffon/iGATTLDA.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
iGATTLDA:基于图注意和转换器的整合模型,用于预测 lncRNA 与疾病的关联。
长非编码 RNA(lncRNA)已成为调控各种生物过程的重要因素,它们的失调与多种人类疾病有关。准确预测lncRNA与疾病之间的潜在相关性对于推进疾病诊断和治疗程序至关重要。作者介绍了一种新的计算方法--iGATTLDA,用于预测lncRNA与疾病的关联。该模型利用 lncRNA 和疾病的相似性矩阵,并用邻接矩阵表示已知的关联。将 lncRNA 和疾病作为节点,将它们之间的关联作为边,构建了一个异构网络。采用图形注意网络(GAT)处理初始特征和相应的邻接信息。GAT 识别网络中重要的邻接节点,捕捉 lncRNA 与疾病之间错综复杂的关系,并生成新的特征表征。随后,转换器捕捉由 GAT 生成的整个特征序列中的全局依赖关系和相互作用。因此,iGATTLDA 成功捕捉到了传统方法可能忽略的复杂关系和相互作用。在对 iGATTLDA 进行评估时,通过使用双层多层感知器(MLP)分类器,它的接收器操作特征曲线(ROC)下面积(AUC)达到了 0.95,精确召回曲线(AUPRC)下面积(AUC)达到了 0.96。与之前提出的大多数模型相比,这些结果明显更高,进一步证实了该模型通过结合局部和全局相互作用预测潜在lncRNA-疾病关联的效率。具体实现细节请访问 https://github.com/momanyibiffon/iGATTLDA。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IET Systems Biology
IET Systems Biology 生物-数学与计算生物学
CiteScore
4.20
自引率
4.30%
发文量
17
审稿时长
>12 weeks
期刊介绍: IET Systems Biology covers intra- and inter-cellular dynamics, using systems- and signal-oriented approaches. Papers that analyse genomic data in order to identify variables and basic relationships between them are considered if the results provide a basis for mathematical modelling and simulation of cellular dynamics. Manuscripts on molecular and cell biological studies are encouraged if the aim is a systems approach to dynamic interactions within and between cells. The scope includes the following topics: Genomics, transcriptomics, proteomics, metabolomics, cells, tissue and the physiome; molecular and cellular interaction, gene, cell and protein function; networks and pathways; metabolism and cell signalling; dynamics, regulation and control; systems, signals, and information; experimental data analysis; mathematical modelling, simulation and theoretical analysis; biological modelling, simulation, prediction and control; methodologies, databases, tools and algorithms for modelling and simulation; modelling, analysis and control of biological networks; synthetic biology and bioengineering based on systems biology.
期刊最新文献
siRNAEfficacyDB: An experimentally supported small interfering RNA efficacy database. Deep-GB: A novel deep learning model for globular protein prediction using CNN-BiLSTM architecture and enhanced PSSM with trisection strategy. Developing a machine learning model with enhanced performance for predicting COVID-19 from patients presenting to the emergency room with acute respiratory symptoms. Mechanism of action of "cistanche deserticola-Polygala" in treating Alzheimer's disease based on network pharmacology methods and molecular docking analysis. Comprehensive transcriptome and scRNA-seq analyses uncover the expression and underlying mechanism of SYNJ2 in papillary thyroid carcinoma
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1