{"title":"Construction of Protein Sequence Databases for Metaproteomics: A Review of the Current Tools and Databases.","authors":"Muzaffer Arıkan, Başak Atabay","doi":"10.1021/acs.jproteome.4c00665","DOIUrl":null,"url":null,"abstract":"<p><p>In metaproteomics studies, constructing a reference protein sequence database that is both comprehensive and not overly large is critical for the peptide identification step. Therefore, the availability of well-curated reference databases and tools for custom database construction is essential to enhance the performance of metaproteomics analyses. In this review, we first provide an overview of metaproteomics by presenting a concise historical background, outlining a typical experimental and bioinformatics workflow, emphasizing the crucial step of constructing a protein sequence database for metaproteomics. We then delve into the current tools available for building such databases, highlighting their individual approaches, utility, and advantages and limitations. Next, we examine existing protein sequence databases, detailing their scope and relevance in metaproteomics research. Then, we provide practical recommendations for constructing protein sequence databases for metaproteomics, along with an overview of the current challenges in this area. We conclude with a discussion of anticipated advancements, emerging trends, and future directions in the construction of protein sequence databases for metaproteomics.</p>","PeriodicalId":3,"journal":{"name":"ACS Applied Electronic Materials","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Electronic Materials","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1021/acs.jproteome.4c00665","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
In metaproteomics studies, constructing a reference protein sequence database that is both comprehensive and not overly large is critical for the peptide identification step. Therefore, the availability of well-curated reference databases and tools for custom database construction is essential to enhance the performance of metaproteomics analyses. In this review, we first provide an overview of metaproteomics by presenting a concise historical background, outlining a typical experimental and bioinformatics workflow, emphasizing the crucial step of constructing a protein sequence database for metaproteomics. We then delve into the current tools available for building such databases, highlighting their individual approaches, utility, and advantages and limitations. Next, we examine existing protein sequence databases, detailing their scope and relevance in metaproteomics research. Then, we provide practical recommendations for constructing protein sequence databases for metaproteomics, along with an overview of the current challenges in this area. We conclude with a discussion of anticipated advancements, emerging trends, and future directions in the construction of protein sequence databases for metaproteomics.