利用蛋白质语言模型进行稳健的抗菌肽检测。

IF 4.3 3区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Methods Pub Date : 2025-06-01 Epub Date: 2025-03-04 DOI:10.1016/j.ymeth.2025.03.002
Lichao Zhang , Shuwen Xiong , Lei Xu , Junwei Liang , Xuehua Zhao , Honglai Zhang , Xu Tan
{"title":"利用蛋白质语言模型进行稳健的抗菌肽检测。","authors":"Lichao Zhang ,&nbsp;Shuwen Xiong ,&nbsp;Lei Xu ,&nbsp;Junwei Liang ,&nbsp;Xuehua Zhao ,&nbsp;Honglai Zhang ,&nbsp;Xu Tan","doi":"10.1016/j.ymeth.2025.03.002","DOIUrl":null,"url":null,"abstract":"<div><div>Antimicrobial peptides (AMPs) are promising candidates for addressing the global challenge of antibiotic resistance due to their broad-spectrum antimicrobial properties. Traditional AMP identification methods, while effective, are labor-intensive and time-consuming. Recent advancements in deep learning and large language models (LLMs), especially protein language models (PLMs) present a transformative approach for AMP prediction. In this study, we propose PLAPD, a novel framework leveraging a pre-trained ESM2 protein language model for AMP classification. Besides, PLAPD combines local feature extraction via convolutional layers and global feature extraction with a residual Transformer module. We benchmarked PLAPD against state-of-the-art AMP prediction models using a dataset comprising 8,268 peptide sequences, achieving superior performance in Accuracy (0.87), Precision (0.9359), Specificity (0.9456), MCC (0.7486), and AUC (0.9225). The results highlight the potential of PLAPD as a high-throughput and accurate tool for AMP discovery.</div></div>","PeriodicalId":390,"journal":{"name":"Methods","volume":"238 ","pages":"Pages 19-26"},"PeriodicalIF":4.3000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Leveraging protein language models for robust antimicrobial peptide detection\",\"authors\":\"Lichao Zhang ,&nbsp;Shuwen Xiong ,&nbsp;Lei Xu ,&nbsp;Junwei Liang ,&nbsp;Xuehua Zhao ,&nbsp;Honglai Zhang ,&nbsp;Xu Tan\",\"doi\":\"10.1016/j.ymeth.2025.03.002\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Antimicrobial peptides (AMPs) are promising candidates for addressing the global challenge of antibiotic resistance due to their broad-spectrum antimicrobial properties. Traditional AMP identification methods, while effective, are labor-intensive and time-consuming. Recent advancements in deep learning and large language models (LLMs), especially protein language models (PLMs) present a transformative approach for AMP prediction. In this study, we propose PLAPD, a novel framework leveraging a pre-trained ESM2 protein language model for AMP classification. Besides, PLAPD combines local feature extraction via convolutional layers and global feature extraction with a residual Transformer module. We benchmarked PLAPD against state-of-the-art AMP prediction models using a dataset comprising 8,268 peptide sequences, achieving superior performance in Accuracy (0.87), Precision (0.9359), Specificity (0.9456), MCC (0.7486), and AUC (0.9225). The results highlight the potential of PLAPD as a high-throughput and accurate tool for AMP discovery.</div></div>\",\"PeriodicalId\":390,\"journal\":{\"name\":\"Methods\",\"volume\":\"238 \",\"pages\":\"Pages 19-26\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Methods\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1046202325000544\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/4 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1046202325000544","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/4 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

摘要

抗菌肽(AMPs)由于其广谱抗菌特性而成为解决抗生素耐药性全球挑战的有希望的候选者。传统的AMP识别方法虽然有效,但费时费力。深度学习和大型语言模型(llm),特别是蛋白质语言模型(plm)的最新进展为AMP预测提供了一种变变性的方法。在这项研究中,我们提出了PLAPD,这是一个利用预训练ESM2蛋白语言模型进行AMP分类的新框架。此外,PLAPD结合了卷积层的局部特征提取和残差Transformer模块的全局特征提取。我们使用包含8,268个肽序列的数据集将PLAPD与最先进的AMP预测模型进行基准测试,在准确度(0.87),精度(0.9359),特异性(0.9456),MCC(0.7486)和AUC(0.9225)方面取得了优异的表现。这些结果突出了PLAPD作为发现AMP的高通量和精确工具的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Leveraging protein language models for robust antimicrobial peptide detection
Antimicrobial peptides (AMPs) are promising candidates for addressing the global challenge of antibiotic resistance due to their broad-spectrum antimicrobial properties. Traditional AMP identification methods, while effective, are labor-intensive and time-consuming. Recent advancements in deep learning and large language models (LLMs), especially protein language models (PLMs) present a transformative approach for AMP prediction. In this study, we propose PLAPD, a novel framework leveraging a pre-trained ESM2 protein language model for AMP classification. Besides, PLAPD combines local feature extraction via convolutional layers and global feature extraction with a residual Transformer module. We benchmarked PLAPD against state-of-the-art AMP prediction models using a dataset comprising 8,268 peptide sequences, achieving superior performance in Accuracy (0.87), Precision (0.9359), Specificity (0.9456), MCC (0.7486), and AUC (0.9225). The results highlight the potential of PLAPD as a high-throughput and accurate tool for AMP discovery.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Methods
Methods 生物-生化研究方法
CiteScore
9.80
自引率
2.10%
发文量
222
审稿时长
11.3 weeks
期刊介绍: Methods focuses on rapidly developing techniques in the experimental biological and medical sciences. Each topical issue, organized by a guest editor who is an expert in the area covered, consists solely of invited quality articles by specialist authors, many of them reviews. Issues are devoted to specific technical approaches with emphasis on clear detailed descriptions of protocols that allow them to be reproduced easily. The background information provided enables researchers to understand the principles underlying the methods; other helpful sections include comparisons of alternative methods giving the advantages and disadvantages of particular methods, guidance on avoiding potential pitfalls, and suggestions for troubleshooting.
期刊最新文献
Detection of allergen-specific IgE in sera from pediatric patients with food allergy using AlphaCL Impact of thermal treatment on genomic DNA integrity and quantification bias in digital PCR TEAMSTER: End-to-end pipeline for generation, processing, and automated quantitative image analysis of 3D tumour spheroids for nanobiology Determining optimal sgRNA coverage and screening duration for pooled CRISPR screens: A quantitative framework Reflectance-based optical sensor for real-time detection and quantification of biofilms in complex matrices
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1