基于域知识的原型网络预测植物lncrna编码肽与蛋白质之间的相互作用

Siyuan Zhao, Jun Meng, Yushi Luan
{"title":"基于域知识的原型网络预测植物lncrna编码肽与蛋白质之间的相互作用","authors":"Siyuan Zhao, Jun Meng, Yushi Luan","doi":"10.1109/BIBM55620.2022.9995175","DOIUrl":null,"url":null,"abstract":"Long noncoding RNA(lncRNA) has been reported to encode small peptides which play key roles in life activities through their functions by binding to proteins. It is crucial to predict the interactions between the lncRNA-encoded peptide and protein. However, no computational methods have been designed for predicting this type of interactions directly, owing to the few-shot problem causing poor generalization. Prototypical network (ProtoNet) is a classic learner for few-shot learning. However, how to obtain effective embedding and measure the distance between different prototypes accurately are the most important challenges. Although some improved prototypical networks have been proposed, they ignore the role of domain knowledge which is helpful for constructing models conforming to the domain mechanism In this study, we propose a novel method for interactions prediction between plant lncRNA-encoded peptide and protein using domain knowledge-based ProtoNet (IPLncPP-DKPN). Multiple features that imply domain knowledge are extracted, connected, and converted to avoid sparse and enhance information using a dual-routing parallel feature dimensionality reduction algorithm IProtoNet is an improved ProtoNet using capsule network-based embedding and Mahalanobis distance-based prototype. The converted features are fed into IProtoNet to realize the classification task. The experimental results manifest that IPLncPP-DKPN achieves better performance on the independent test set compared with classic machine learning models. To the best of our knowledge, IPLncPP-DKPN is the first computational method for the interactions prediction between lncRNA-encoded peptide and protein.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting the interactions between plant lncRNA-encoded peptide and protein using domain knowledge-based prototypical network\",\"authors\":\"Siyuan Zhao, Jun Meng, Yushi Luan\",\"doi\":\"10.1109/BIBM55620.2022.9995175\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Long noncoding RNA(lncRNA) has been reported to encode small peptides which play key roles in life activities through their functions by binding to proteins. It is crucial to predict the interactions between the lncRNA-encoded peptide and protein. However, no computational methods have been designed for predicting this type of interactions directly, owing to the few-shot problem causing poor generalization. Prototypical network (ProtoNet) is a classic learner for few-shot learning. However, how to obtain effective embedding and measure the distance between different prototypes accurately are the most important challenges. Although some improved prototypical networks have been proposed, they ignore the role of domain knowledge which is helpful for constructing models conforming to the domain mechanism In this study, we propose a novel method for interactions prediction between plant lncRNA-encoded peptide and protein using domain knowledge-based ProtoNet (IPLncPP-DKPN). Multiple features that imply domain knowledge are extracted, connected, and converted to avoid sparse and enhance information using a dual-routing parallel feature dimensionality reduction algorithm IProtoNet is an improved ProtoNet using capsule network-based embedding and Mahalanobis distance-based prototype. The converted features are fed into IProtoNet to realize the classification task. The experimental results manifest that IPLncPP-DKPN achieves better performance on the independent test set compared with classic machine learning models. To the best of our knowledge, IPLncPP-DKPN is the first computational method for the interactions prediction between lncRNA-encoded peptide and protein.\",\"PeriodicalId\":210337,\"journal\":{\"name\":\"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBM55620.2022.9995175\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM55620.2022.9995175","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

据报道,长链非编码RNA(lncRNA)编码小肽,这些小肽通过与蛋白质结合而发挥作用,在生命活动中起关键作用。预测lncrna编码肽与蛋白之间的相互作用至关重要。然而,目前还没有设计出直接预测这种类型的相互作用的计算方法,因为很少的问题导致较差的泛化。原型网络(Prototypical network, ProtoNet)是一种经典的少次学习算法。然而,如何获得有效的嵌入并准确测量不同原型之间的距离是最重要的挑战。虽然已经提出了一些改进的原型网络,但它们忽略了领域知识的作用,这有助于构建符合领域机制的模型。本研究提出了一种基于领域知识的植物lncrna编码肽与蛋白质相互作用预测方法。利用双路由并行特征降维算法,对隐含领域知识的多个特征进行提取、连接和转换,以避免稀疏和增强信息。IProtoNet是一种基于胶囊网络的嵌入和基于Mahalanobis距离的原型的改进ProtoNet。将转换后的特征输入到IProtoNet中实现分类任务。实验结果表明,与经典机器学习模型相比,iplncp - dkpn在独立测试集上取得了更好的性能。据我们所知,iplncp - dkpn是第一个预测lncrna编码肽与蛋白质相互作用的计算方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Predicting the interactions between plant lncRNA-encoded peptide and protein using domain knowledge-based prototypical network
Long noncoding RNA(lncRNA) has been reported to encode small peptides which play key roles in life activities through their functions by binding to proteins. It is crucial to predict the interactions between the lncRNA-encoded peptide and protein. However, no computational methods have been designed for predicting this type of interactions directly, owing to the few-shot problem causing poor generalization. Prototypical network (ProtoNet) is a classic learner for few-shot learning. However, how to obtain effective embedding and measure the distance between different prototypes accurately are the most important challenges. Although some improved prototypical networks have been proposed, they ignore the role of domain knowledge which is helpful for constructing models conforming to the domain mechanism In this study, we propose a novel method for interactions prediction between plant lncRNA-encoded peptide and protein using domain knowledge-based ProtoNet (IPLncPP-DKPN). Multiple features that imply domain knowledge are extracted, connected, and converted to avoid sparse and enhance information using a dual-routing parallel feature dimensionality reduction algorithm IProtoNet is an improved ProtoNet using capsule network-based embedding and Mahalanobis distance-based prototype. The converted features are fed into IProtoNet to realize the classification task. The experimental results manifest that IPLncPP-DKPN achieves better performance on the independent test set compared with classic machine learning models. To the best of our knowledge, IPLncPP-DKPN is the first computational method for the interactions prediction between lncRNA-encoded peptide and protein.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A framework for associating structural variants with cell-specific transcription factors and histone modifications in defect phenotypes Secure Password Using EEG-based BrainPrint System: Unlock Smartphone Password Using Brain-Computer Interface Technology On functional annotation with gene co-expression networks ST-ChIP: Accurate prediction of spatiotemporal ChIP-seq data with recurrent neural networks Discovering the Knowledge in Unstructured Early Drug Development Data Using NLP and Advanced Analytics
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1