{"title":"StackAMP: Stacking-Based Ensemble Classifier for Antimicrobial Peptide Identification","authors":"Tasmin Karim;Md. Shazzad Hossain Shaon;Md. Mamun Ali;Kawsar Ahmed;Francis M. Bui;Li Chen","doi":"10.1109/TAI.2024.3421176","DOIUrl":null,"url":null,"abstract":"Antimicrobial peptides (AMPs) play a vital role in the immune defence systems of various organisms and have garnered significant attention for their potential applications in biotechnology and medicine. There are several approaches to identifying AMPs including clinical isolation and characterization, functional genomics, microbiology techniques, and others. However, these methods are mostly expensive, time-consuming, and require well-equipped labs. To overcome these challenges, machine learning models are a potential solution due to their robustness and high predictive capability with less time and cost. In this study, we explored the efficacy of stacking-based ensemble machine-learning techniques to identify AMPs with higher accuracy and precision. Five distinct feature extraction methods, namely amino acid composition, dipeptide composition, moran autocorrelation, geary autocorrelation, and pseudoamino acid composition, were employed to represent the sequence characteristics of peptides. To build robust predictive models, different traditional machine learning algorithms were applied. Additionally, we developed a novel stacking classifier, aptly named StackAMP, to harness the collective power of these algorithms. Our results demonstrated the exceptional performance of the proposed StackAMP ensemble method in AMP identification, achieving an accuracy of 99.97%, 99.93% specificity, and 100% sensitivity. This high accuracy underscores the effectiveness of our approach, which has promising outcomes for the rapid and accurate identification of AMPs in various biological contexts. This study not only contributes to the growing body of knowledge in the field of AMP recognition but also offers a practical tool with potential applications in drug discovery, biotechnology, and disease prevention.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 11","pages":"5666-5675"},"PeriodicalIF":0.0000,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10582786/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Antimicrobial peptides (AMPs) play a vital role in the immune defence systems of various organisms and have garnered significant attention for their potential applications in biotechnology and medicine. There are several approaches to identifying AMPs including clinical isolation and characterization, functional genomics, microbiology techniques, and others. However, these methods are mostly expensive, time-consuming, and require well-equipped labs. To overcome these challenges, machine learning models are a potential solution due to their robustness and high predictive capability with less time and cost. In this study, we explored the efficacy of stacking-based ensemble machine-learning techniques to identify AMPs with higher accuracy and precision. Five distinct feature extraction methods, namely amino acid composition, dipeptide composition, moran autocorrelation, geary autocorrelation, and pseudoamino acid composition, were employed to represent the sequence characteristics of peptides. To build robust predictive models, different traditional machine learning algorithms were applied. Additionally, we developed a novel stacking classifier, aptly named StackAMP, to harness the collective power of these algorithms. Our results demonstrated the exceptional performance of the proposed StackAMP ensemble method in AMP identification, achieving an accuracy of 99.97%, 99.93% specificity, and 100% sensitivity. This high accuracy underscores the effectiveness of our approach, which has promising outcomes for the rapid and accurate identification of AMPs in various biological contexts. This study not only contributes to the growing body of knowledge in the field of AMP recognition but also offers a practical tool with potential applications in drug discovery, biotechnology, and disease prevention.