{"title":"FEEDS,食品废料生物肽识别器:从微生物基因组和底物到生物肽功能","authors":"Victor Borin Centurion , Edoardo Bizzotto , Stefano Tonini , Pasquale Filannino , Raffaella Di Cagno , Guido Zampieri , Stefano Campanaro","doi":"10.1016/j.crbiot.2024.100186","DOIUrl":null,"url":null,"abstract":"<div><p>The production of biopeptides from food waste through microbial fermentation faces challenges arising from the diverse proteolytic abilities of microorganisms and substrate variability, impacting both the quality and yield of generated biopeptides. To address these challenges, preliminary in-silico bioinformatics analyses play a crucial role in evaluating suitable substrates and proteases for the fermentation process. However, existing tools lack comprehensive predictive capabilities for relevant proteases, substrate performance assessment, and final biopeptide family characterization. To overcome these limitations, we developed FEEDS (Food wastE biopEptiDe claSsifier), a novel biopeptide prediction and classification tool. FEEDS predicts biopeptide families based on microbial genome protease profiles and substrate composition during proteolysis. The tool also employs a machine learning approach for functional biopeptide classification. Results from testing on 1000 microbial genomes demonstrate the effectiveness of biopeptide classification, particularly in categorizing peptides derived from substrates like <em>Hordeum vulgare</em> and <em>Vitis vinifera</em> seed storage proteins. In addition to biopeptide classification, our study delves into the distinctive protease profiles of bacteria and yeast genomes. Bacterial genomes exhibited 60 to 100 proteases across 40–55 families. Contrastingly, yeast genomes displayed a more evenly distributed pattern with 150 to 160 protease-encoding genes across 60 to 67 families, surpassing bacterial counts. Remarkably, a substantial portion of yeast proteases (∼66 %) was secreted. Moreover, our integration of a machine learning methodology within the FEEDS pipeline proved highly effective, achieving over 80 % accuracy in predicting the function of peptides derived from seed storage proteins. Notably, longer peptide sequences exceeding 20 amino acids consistently displayed a higher probability of correct assignment compared to shorter counterparts.</p></div>","PeriodicalId":52676,"journal":{"name":"Current Research in Biotechnology","volume":null,"pages":null},"PeriodicalIF":3.6000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590262824000121/pdfft?md5=dc14271b17d72c293fd1596de17ef07f&pid=1-s2.0-S2590262824000121-main.pdf","citationCount":"0","resultStr":"{\"title\":\"FEEDS, the Food wastE biopEptiDe claSsifier: From microbial genomes and substrates to biopeptides function\",\"authors\":\"Victor Borin Centurion , Edoardo Bizzotto , Stefano Tonini , Pasquale Filannino , Raffaella Di Cagno , Guido Zampieri , Stefano Campanaro\",\"doi\":\"10.1016/j.crbiot.2024.100186\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The production of biopeptides from food waste through microbial fermentation faces challenges arising from the diverse proteolytic abilities of microorganisms and substrate variability, impacting both the quality and yield of generated biopeptides. To address these challenges, preliminary in-silico bioinformatics analyses play a crucial role in evaluating suitable substrates and proteases for the fermentation process. However, existing tools lack comprehensive predictive capabilities for relevant proteases, substrate performance assessment, and final biopeptide family characterization. To overcome these limitations, we developed FEEDS (Food wastE biopEptiDe claSsifier), a novel biopeptide prediction and classification tool. FEEDS predicts biopeptide families based on microbial genome protease profiles and substrate composition during proteolysis. The tool also employs a machine learning approach for functional biopeptide classification. Results from testing on 1000 microbial genomes demonstrate the effectiveness of biopeptide classification, particularly in categorizing peptides derived from substrates like <em>Hordeum vulgare</em> and <em>Vitis vinifera</em> seed storage proteins. In addition to biopeptide classification, our study delves into the distinctive protease profiles of bacteria and yeast genomes. Bacterial genomes exhibited 60 to 100 proteases across 40–55 families. Contrastingly, yeast genomes displayed a more evenly distributed pattern with 150 to 160 protease-encoding genes across 60 to 67 families, surpassing bacterial counts. Remarkably, a substantial portion of yeast proteases (∼66 %) was secreted. Moreover, our integration of a machine learning methodology within the FEEDS pipeline proved highly effective, achieving over 80 % accuracy in predicting the function of peptides derived from seed storage proteins. Notably, longer peptide sequences exceeding 20 amino acids consistently displayed a higher probability of correct assignment compared to shorter counterparts.</p></div>\",\"PeriodicalId\":52676,\"journal\":{\"name\":\"Current Research in Biotechnology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2590262824000121/pdfft?md5=dc14271b17d72c293fd1596de17ef07f&pid=1-s2.0-S2590262824000121-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Current Research in Biotechnology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2590262824000121\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOTECHNOLOGY & APPLIED MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Research in Biotechnology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590262824000121","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
FEEDS, the Food wastE biopEptiDe claSsifier: From microbial genomes and substrates to biopeptides function
The production of biopeptides from food waste through microbial fermentation faces challenges arising from the diverse proteolytic abilities of microorganisms and substrate variability, impacting both the quality and yield of generated biopeptides. To address these challenges, preliminary in-silico bioinformatics analyses play a crucial role in evaluating suitable substrates and proteases for the fermentation process. However, existing tools lack comprehensive predictive capabilities for relevant proteases, substrate performance assessment, and final biopeptide family characterization. To overcome these limitations, we developed FEEDS (Food wastE biopEptiDe claSsifier), a novel biopeptide prediction and classification tool. FEEDS predicts biopeptide families based on microbial genome protease profiles and substrate composition during proteolysis. The tool also employs a machine learning approach for functional biopeptide classification. Results from testing on 1000 microbial genomes demonstrate the effectiveness of biopeptide classification, particularly in categorizing peptides derived from substrates like Hordeum vulgare and Vitis vinifera seed storage proteins. In addition to biopeptide classification, our study delves into the distinctive protease profiles of bacteria and yeast genomes. Bacterial genomes exhibited 60 to 100 proteases across 40–55 families. Contrastingly, yeast genomes displayed a more evenly distributed pattern with 150 to 160 protease-encoding genes across 60 to 67 families, surpassing bacterial counts. Remarkably, a substantial portion of yeast proteases (∼66 %) was secreted. Moreover, our integration of a machine learning methodology within the FEEDS pipeline proved highly effective, achieving over 80 % accuracy in predicting the function of peptides derived from seed storage proteins. Notably, longer peptide sequences exceeding 20 amino acids consistently displayed a higher probability of correct assignment compared to shorter counterparts.
期刊介绍:
Current Research in Biotechnology (CRBIOT) is a new primary research, gold open access journal from Elsevier. CRBIOT publishes original papers, reviews, and short communications (including viewpoints and perspectives) resulting from research in biotechnology and biotech-associated disciplines.
Current Research in Biotechnology is a peer-reviewed gold open access (OA) journal and upon acceptance all articles are permanently and freely available. It is a companion to the highly regarded review journal Current Opinion in Biotechnology (2018 CiteScore 8.450) and is part of the Current Opinion and Research (CO+RE) suite of journals. All CO+RE journals leverage the Current Opinion legacy-of editorial excellence, high-impact, and global reach-to ensure they are a widely read resource that is integral to scientists' workflow.