StackAMP: Stacking-Based Ensemble Classifier for Antimicrobial Peptide Identification

IEEE transactions on artificial intelligence Pub Date : 2024-07-02 DOI:10.1109/TAI.2024.3421176

Tasmin Karim;Md. Shazzad Hossain Shaon;Md. Mamun Ali;Kawsar Ahmed;Francis M. Bui;Li Chen

{"title":"StackAMP: Stacking-Based Ensemble Classifier for Antimicrobial Peptide Identification","authors":"Tasmin Karim;Md. Shazzad Hossain Shaon;Md. Mamun Ali;Kawsar Ahmed;Francis M. Bui;Li Chen","doi":"10.1109/TAI.2024.3421176","DOIUrl":null,"url":null,"abstract":"Antimicrobial peptides (AMPs) play a vital role in the immune defence systems of various organisms and have garnered significant attention for their potential applications in biotechnology and medicine. There are several approaches to identifying AMPs including clinical isolation and characterization, functional genomics, microbiology techniques, and others. However, these methods are mostly expensive, time-consuming, and require well-equipped labs. To overcome these challenges, machine learning models are a potential solution due to their robustness and high predictive capability with less time and cost. In this study, we explored the efficacy of stacking-based ensemble machine-learning techniques to identify AMPs with higher accuracy and precision. Five distinct feature extraction methods, namely amino acid composition, dipeptide composition, moran autocorrelation, geary autocorrelation, and pseudoamino acid composition, were employed to represent the sequence characteristics of peptides. To build robust predictive models, different traditional machine learning algorithms were applied. Additionally, we developed a novel stacking classifier, aptly named StackAMP, to harness the collective power of these algorithms. Our results demonstrated the exceptional performance of the proposed StackAMP ensemble method in AMP identification, achieving an accuracy of 99.97%, 99.93% specificity, and 100% sensitivity. This high accuracy underscores the effectiveness of our approach, which has promising outcomes for the rapid and accurate identification of AMPs in various biological contexts. This study not only contributes to the growing body of knowledge in the field of AMP recognition but also offers a practical tool with potential applications in drug discovery, biotechnology, and disease prevention.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 11","pages":"5666-5675"},"PeriodicalIF":0.0000,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10582786/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Antimicrobial peptides (AMPs) play a vital role in the immune defence systems of various organisms and have garnered significant attention for their potential applications in biotechnology and medicine. There are several approaches to identifying AMPs including clinical isolation and characterization, functional genomics, microbiology techniques, and others. However, these methods are mostly expensive, time-consuming, and require well-equipped labs. To overcome these challenges, machine learning models are a potential solution due to their robustness and high predictive capability with less time and cost. In this study, we explored the efficacy of stacking-based ensemble machine-learning techniques to identify AMPs with higher accuracy and precision. Five distinct feature extraction methods, namely amino acid composition, dipeptide composition, moran autocorrelation, geary autocorrelation, and pseudoamino acid composition, were employed to represent the sequence characteristics of peptides. To build robust predictive models, different traditional machine learning algorithms were applied. Additionally, we developed a novel stacking classifier, aptly named StackAMP, to harness the collective power of these algorithms. Our results demonstrated the exceptional performance of the proposed StackAMP ensemble method in AMP identification, achieving an accuracy of 99.97%, 99.93% specificity, and 100% sensitivity. This high accuracy underscores the effectiveness of our approach, which has promising outcomes for the rapid and accurate identification of AMPs in various biological contexts. This study not only contributes to the growing body of knowledge in the field of AMP recognition but also offers a practical tool with potential applications in drug discovery, biotechnology, and disease prevention.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

StackAMP：用于抗菌肽鉴定的基于堆叠的集合分类器

抗菌肽（AMPs）在各种生物的免疫防御系统中发挥着重要作用，并因其在生物技术和医学中的潜在应用而备受关注。目前有多种方法可用于鉴定 AMPs，包括临床分离和表征、功能基因组学、微生物学技术等。然而，这些方法大多昂贵、耗时，而且需要设备齐全的实验室。为了克服这些挑战，机器学习模型因其稳健性和高预测能力，以及更少的时间和成本，成为一种潜在的解决方案。在本研究中，我们探索了基于堆叠的集合机器学习技术的有效性，以更高的准确度和精确度识别 AMPs。我们采用了五种不同的特征提取方法，即氨基酸组成、二肽组成、莫伦自相关、吉利自相关和伪氨基酸组成，来表示肽的序列特征。为了建立稳健的预测模型，我们采用了不同的传统机器学习算法。此外，我们还开发了一种新型堆叠分类器，并将其命名为 StackAMP，以利用这些算法的集体力量。我们的研究结果表明，所提出的 StackAMP 组合方法在 AMP 识别方面表现出色，准确率达到 99.97%，特异性达到 99.93%，灵敏度达到 100%。这种高准确度凸显了我们的方法的有效性，有望在各种生物环境中快速准确地识别 AMPs。这项研究不仅丰富了 AMP 识别领域不断增长的知识，还为药物发现、生物技术和疾病预防提供了一种具有潜在应用价值的实用工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助