{"title":"Predicting the interactions between plant lncRNA-encoded peptide and protein using domain knowledge-based prototypical network","authors":"Siyuan Zhao, Jun Meng, Yushi Luan","doi":"10.1109/BIBM55620.2022.9995175","DOIUrl":null,"url":null,"abstract":"Long noncoding RNA(lncRNA) has been reported to encode small peptides which play key roles in life activities through their functions by binding to proteins. It is crucial to predict the interactions between the lncRNA-encoded peptide and protein. However, no computational methods have been designed for predicting this type of interactions directly, owing to the few-shot problem causing poor generalization. Prototypical network (ProtoNet) is a classic learner for few-shot learning. However, how to obtain effective embedding and measure the distance between different prototypes accurately are the most important challenges. Although some improved prototypical networks have been proposed, they ignore the role of domain knowledge which is helpful for constructing models conforming to the domain mechanism In this study, we propose a novel method for interactions prediction between plant lncRNA-encoded peptide and protein using domain knowledge-based ProtoNet (IPLncPP-DKPN). Multiple features that imply domain knowledge are extracted, connected, and converted to avoid sparse and enhance information using a dual-routing parallel feature dimensionality reduction algorithm IProtoNet is an improved ProtoNet using capsule network-based embedding and Mahalanobis distance-based prototype. The converted features are fed into IProtoNet to realize the classification task. The experimental results manifest that IPLncPP-DKPN achieves better performance on the independent test set compared with classic machine learning models. To the best of our knowledge, IPLncPP-DKPN is the first computational method for the interactions prediction between lncRNA-encoded peptide and protein.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM55620.2022.9995175","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Long noncoding RNA(lncRNA) has been reported to encode small peptides which play key roles in life activities through their functions by binding to proteins. It is crucial to predict the interactions between the lncRNA-encoded peptide and protein. However, no computational methods have been designed for predicting this type of interactions directly, owing to the few-shot problem causing poor generalization. Prototypical network (ProtoNet) is a classic learner for few-shot learning. However, how to obtain effective embedding and measure the distance between different prototypes accurately are the most important challenges. Although some improved prototypical networks have been proposed, they ignore the role of domain knowledge which is helpful for constructing models conforming to the domain mechanism In this study, we propose a novel method for interactions prediction between plant lncRNA-encoded peptide and protein using domain knowledge-based ProtoNet (IPLncPP-DKPN). Multiple features that imply domain knowledge are extracted, connected, and converted to avoid sparse and enhance information using a dual-routing parallel feature dimensionality reduction algorithm IProtoNet is an improved ProtoNet using capsule network-based embedding and Mahalanobis distance-based prototype. The converted features are fed into IProtoNet to realize the classification task. The experimental results manifest that IPLncPP-DKPN achieves better performance on the independent test set compared with classic machine learning models. To the best of our knowledge, IPLncPP-DKPN is the first computational method for the interactions prediction between lncRNA-encoded peptide and protein.