Injecting structure-aware insights for the learning of RNA sequence representations to identify m6A modification sites.

IF 2.4 3区 生物学 Q2 MULTIDISCIPLINARY SCIENCES PeerJ Pub Date : 2025-02-24 eCollection Date: 2025-01-01 DOI:10.7717/peerj.18878
Yue Yu, Shuang Xiang, Minghao Wu
{"title":"Injecting structure-aware insights for the learning of RNA sequence representations to identify m6A modification sites.","authors":"Yue Yu, Shuang Xiang, Minghao Wu","doi":"10.7717/peerj.18878","DOIUrl":null,"url":null,"abstract":"<p><p>N6-methyladenosine (m6A) represents one of the most prevalent methylation modifications in eukaryotes and it is crucial to accurately identify its modification sites on RNA sequences. Traditional machine learning based approaches to m6A modification site identification primarily focus on RNA sequence data but often incorporate additional biological domain knowledge and rely on manually crafted features. These methods typically overlook the structural insights inherent in RNA sequences. To address this limitation, we propose M6A-SAI, an advanced predictor for RNA m6A modifications. M6A-SAI leverages a transformer-based deep learning framework to integrate structure-aware insights into sequence representation learning, thereby enhancing the precision of m6A modification site identification. The core innovation of M6A-SAI lies in its ability to incorporate structural information through a multi-step process: initially, the model utilizes a Transformer encoder to learn RNA sequence representations. It then constructs a similarity graph based on Manhattan distance to capture sequence correlations. To address the limitations of the smooth similarity graph, M6A-SAI integrates a structure-aware optimization block, which refines the graph by defining anchor sets and generating an awareness graph through PageRank. Following this, M6A-SAI employs a self-correlation fusion graph convolution framework to merge information from both the similarity and awareness graphs, thus producing enriched sequence representations. Finally, a support vector machine is utilized for classifying these representations. Experimental results validate that M6A-SAI substantially improves the recognition of m6A modification sites by incorporating structure-aware insights, demonstrating its efficacy as a robust method for identifying RNA m6A modification sites.</p>","PeriodicalId":19799,"journal":{"name":"PeerJ","volume":"13 ","pages":"e18878"},"PeriodicalIF":2.4000,"publicationDate":"2025-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11867033/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PeerJ","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.7717/peerj.18878","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

N6-methyladenosine (m6A) represents one of the most prevalent methylation modifications in eukaryotes and it is crucial to accurately identify its modification sites on RNA sequences. Traditional machine learning based approaches to m6A modification site identification primarily focus on RNA sequence data but often incorporate additional biological domain knowledge and rely on manually crafted features. These methods typically overlook the structural insights inherent in RNA sequences. To address this limitation, we propose M6A-SAI, an advanced predictor for RNA m6A modifications. M6A-SAI leverages a transformer-based deep learning framework to integrate structure-aware insights into sequence representation learning, thereby enhancing the precision of m6A modification site identification. The core innovation of M6A-SAI lies in its ability to incorporate structural information through a multi-step process: initially, the model utilizes a Transformer encoder to learn RNA sequence representations. It then constructs a similarity graph based on Manhattan distance to capture sequence correlations. To address the limitations of the smooth similarity graph, M6A-SAI integrates a structure-aware optimization block, which refines the graph by defining anchor sets and generating an awareness graph through PageRank. Following this, M6A-SAI employs a self-correlation fusion graph convolution framework to merge information from both the similarity and awareness graphs, thus producing enriched sequence representations. Finally, a support vector machine is utilized for classifying these representations. Experimental results validate that M6A-SAI substantially improves the recognition of m6A modification sites by incorporating structure-aware insights, demonstrating its efficacy as a robust method for identifying RNA m6A modification sites.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
为RNA序列表征的学习注入结构感知的见解,以识别m6A修饰位点。
n6 -甲基腺苷(m6A)是真核生物中最常见的甲基化修饰之一,准确识别其在RNA序列上的修饰位点至关重要。传统的基于机器学习的m6A修饰位点识别方法主要关注RNA序列数据,但通常包含额外的生物领域知识,并依赖于手工制作的特征。这些方法通常忽略了RNA序列固有的结构见解。为了解决这一限制,我们提出了m6A - sai,一种RNA m6A修饰的高级预测因子。m6A - sai利用基于变压器的深度学习框架将结构感知洞察力集成到序列表示学习中,从而提高了m6A修饰位点识别的精度。M6A-SAI的核心创新在于它能够通过一个多步骤的过程来整合结构信息:最初,该模型利用Transformer编码器来学习RNA序列表示。然后构建一个基于曼哈顿距离的相似图来捕获序列相关性。为了解决光滑相似图的局限性,M6A-SAI集成了一个结构感知优化块,该优化块通过定义锚点集和通过PageRank生成感知图来细化图。在此之后,M6A-SAI采用自相关融合图卷积框架来合并相似性图和感知图的信息,从而产生丰富的序列表示。最后,利用支持向量机对这些表示进行分类。实验结果证实,m6A - sai通过结合结构感知的见解,大大提高了对m6A修饰位点的识别,证明了其作为一种识别RNA m6A修饰位点的强大方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
PeerJ
PeerJ MULTIDISCIPLINARY SCIENCES-
CiteScore
4.70
自引率
3.70%
发文量
1665
审稿时长
10 weeks
期刊介绍: PeerJ is an open access peer-reviewed scientific journal covering research in the biological and medical sciences. At PeerJ, authors take out a lifetime publication plan (for as little as $99) which allows them to publish articles in the journal for free, forever. PeerJ has 5 Nobel Prize Winners on the Board; they have won several industry and media awards; and they are widely recognized as being one of the most interesting recent developments in academic publishing.
期刊最新文献
Effective degradation of zearalenone by multiple microbial isolates. Kinesiophobia and alexithymia in knee osteoarthritis: association with radiological severity. Validity and reliability of Insomnia Severity Index among older adults in Indonesia. Predictive value of D4Z4 methylation levels for phenotypic heterogeneity and disease progression in Facioscapulohumeral Muscular Dystrophy with borderline D4Z4 repeat units: a retrospective cohort study. Comparison of primers for the pathogenicity factors vvhA and rpoS in Vibrio vulnificus environmental isolates from the Texas Coastal Bend region of the Gulf of Mexico.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1