{"title":"Guiding generative AI","authors":"Russell Johnson","doi":"10.1038/s41589-025-01854-y","DOIUrl":null,"url":null,"abstract":"<p>Lab-based protein evolution is an approach that improves enzyme activity or development of protein-based binders, but it is very labor intensive. By contrast, computational approaches (potentially) offer rapid results without laborious lab work; however, the designs proposed by computational workflows have often not provided the required improvement in activity. More recently, generative machine learning methods have shown progress in proposing static protein scaffolds, but expanding these approaches to design in enzymatic reactivity or other protein functions has been problematic, and optimizing an initial model typically still requires lab-based directed evolution. Now, Jiang et al. have developed a computational method called EVOLVEpro to help guide experimental directed evolution efforts.</p><p>EVOLVEpro combines a protein language model with an active learning layer. A protein language model is a machine learning algorithm that analyzes and learns patterns from datasets of protein sequences. The active learning layer interprets the protein language model using an iterative process to decipher the relationship between sequence and experimentally determined activities. Based on a random forest design, the active learning layer can simultaneously optimize multiple protein properties during iterative rounds of experimental testing using as few as 10 data points per round.</p>","PeriodicalId":18832,"journal":{"name":"Nature chemical biology","volume":"4 1","pages":""},"PeriodicalIF":12.9000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature chemical biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1038/s41589-025-01854-y","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Lab-based protein evolution is an approach that improves enzyme activity or development of protein-based binders, but it is very labor intensive. By contrast, computational approaches (potentially) offer rapid results without laborious lab work; however, the designs proposed by computational workflows have often not provided the required improvement in activity. More recently, generative machine learning methods have shown progress in proposing static protein scaffolds, but expanding these approaches to design in enzymatic reactivity or other protein functions has been problematic, and optimizing an initial model typically still requires lab-based directed evolution. Now, Jiang et al. have developed a computational method called EVOLVEpro to help guide experimental directed evolution efforts.
EVOLVEpro combines a protein language model with an active learning layer. A protein language model is a machine learning algorithm that analyzes and learns patterns from datasets of protein sequences. The active learning layer interprets the protein language model using an iterative process to decipher the relationship between sequence and experimentally determined activities. Based on a random forest design, the active learning layer can simultaneously optimize multiple protein properties during iterative rounds of experimental testing using as few as 10 data points per round.
期刊介绍:
Nature Chemical Biology stands as an esteemed international monthly journal, offering a prominent platform for the chemical biology community to showcase top-tier original research and commentary. Operating at the crossroads of chemistry, biology, and related disciplines, chemical biology utilizes scientific ideas and approaches to comprehend and manipulate biological systems with molecular precision.
The journal embraces contributions from the growing community of chemical biologists, encompassing insights from chemists applying principles and tools to biological inquiries and biologists striving to comprehend and control molecular-level biological processes. We prioritize studies unveiling significant conceptual or practical advancements in areas where chemistry and biology intersect, emphasizing basic research, especially those reporting novel chemical or biological tools and offering profound molecular-level insights into underlying biological mechanisms.
Nature Chemical Biology also welcomes manuscripts describing applied molecular studies at the chemistry-biology interface due to the broad utility of chemical biology approaches in manipulating or engineering biological systems. Irrespective of scientific focus, we actively seek submissions that creatively blend chemistry and biology, particularly those providing substantial conceptual or methodological breakthroughs with the potential to open innovative research avenues. The journal maintains a robust and impartial review process, emphasizing thorough chemical and biological characterization.