N-GlycoPred: A hybrid deep learning model for accurate identification of N-glycosylation sites

IF 4.2 3区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Methods Pub Date : 2024-05-10 DOI:10.1016/j.ymeth.2024.05.002
Fengzhu Hu , Jie Gao , Jia Zheng , Cheekeong Kwoh , Cangzhi Jia
{"title":"N-GlycoPred: A hybrid deep learning model for accurate identification of N-glycosylation sites","authors":"Fengzhu Hu ,&nbsp;Jie Gao ,&nbsp;Jia Zheng ,&nbsp;Cheekeong Kwoh ,&nbsp;Cangzhi Jia","doi":"10.1016/j.ymeth.2024.05.002","DOIUrl":null,"url":null,"abstract":"<div><p>Studies have shown that protein glycosylation in cells reflects the real-time dynamics of biological processes, and the occurrence and development of many diseases are closely related to protein glycosylation. Abnormal protein glycosylation can be used as a potential diagnostic and prognostic marker of a disease, as well as a therapeutic target and a new breakthrough point for exploring pathogenesis. To address the issue of significant differences in the prediction results of previous models for different species, we constructed a hybrid deep learning model N-GlycoPred on the basis of dual-layer convolution, a paired attention mechanism and BiLSTM for accurate identification of N-glycosylation sites. By adopting one-hot encoding or the AAindex, we specifically selected the optimum combination of features and deep learning frameworks for human and mouse to refine the models. Based on six independent test datasets, our N-GlycoPred model achieved an average AUC of 0.9553, which is 0.23% higher than MusiteDeep. The comparison results indicate that our model can serve as a powerful tool for N-glycosylation site prescreening for biological researchers.</p></div>","PeriodicalId":390,"journal":{"name":"Methods","volume":"227 ","pages":"Pages 48-57"},"PeriodicalIF":4.2000,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1046202324001129","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Studies have shown that protein glycosylation in cells reflects the real-time dynamics of biological processes, and the occurrence and development of many diseases are closely related to protein glycosylation. Abnormal protein glycosylation can be used as a potential diagnostic and prognostic marker of a disease, as well as a therapeutic target and a new breakthrough point for exploring pathogenesis. To address the issue of significant differences in the prediction results of previous models for different species, we constructed a hybrid deep learning model N-GlycoPred on the basis of dual-layer convolution, a paired attention mechanism and BiLSTM for accurate identification of N-glycosylation sites. By adopting one-hot encoding or the AAindex, we specifically selected the optimum combination of features and deep learning frameworks for human and mouse to refine the models. Based on six independent test datasets, our N-GlycoPred model achieved an average AUC of 0.9553, which is 0.23% higher than MusiteDeep. The comparison results indicate that our model can serve as a powerful tool for N-glycosylation site prescreening for biological researchers.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
N-GlycoPred:用于准确识别 N-糖基化位点的混合深度学习模型。
研究表明,细胞中的蛋白质糖基化反映了生物过程的实时动态,许多疾病的发生和发展都与蛋白质糖基化密切相关。异常的蛋白质糖基化可作为潜在的疾病诊断和预后标志物,也可作为治疗靶点和探索发病机制的新突破点。针对以往模型对不同物种预测结果差异较大的问题,我们在双层卷积、配对注意机制和BiLSTM的基础上构建了混合深度学习模型N-GlycoPred,用于准确识别N-糖基化位点。通过采用单次编码或AAindex,我们有针对性地选择了人类和小鼠的最佳特征组合和深度学习框架,以完善模型。基于六个独立测试数据集,我们的N-GlycoPred模型的平均AUC达到0.9553,比MusiteDeep高0.23%。比较结果表明,我们的模型可以作为生物研究人员预筛选 N-糖基化位点的有力工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Methods
Methods 生物-生化研究方法
CiteScore
9.80
自引率
2.10%
发文量
222
审稿时长
11.3 weeks
期刊介绍: Methods focuses on rapidly developing techniques in the experimental biological and medical sciences. Each topical issue, organized by a guest editor who is an expert in the area covered, consists solely of invited quality articles by specialist authors, many of them reviews. Issues are devoted to specific technical approaches with emphasis on clear detailed descriptions of protocols that allow them to be reproduced easily. The background information provided enables researchers to understand the principles underlying the methods; other helpful sections include comparisons of alternative methods giving the advantages and disadvantages of particular methods, guidance on avoiding potential pitfalls, and suggestions for troubleshooting.
期刊最新文献
Optimizing retinal Imaging: Evaluation of ultrasmall TiO2 Nanoparticle- fluorescein conjugates for improved Fundus fluorescein angiography. Ab-Amy 2.0: Predicting light chain amyloidogenic risk of therapeutic antibodies based on antibody language model. Data preprocessing methods for selective sweep detection using convolutional neural networks. SITP: A single cell bioinformatics analysis flow captures proteasome markers in the development of breast cancer Exploring drug-target interaction prediction on cold-start scenarios via meta-learning-based graph transformer.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1