Integrating protein language models and automatic biofoundry for enhanced protein evolution

IF 15.7 1区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES Nature Communications Pub Date : 2025-02-11 DOI:10.1038/s41467-025-56751-8
Qiang Zhang, Wanyi Chen, Ming Qin, Yuhao Wang, Zhongji Pu, Keyan Ding, Yuyue Liu, Qunfeng Zhang, Dongfang Li, Xinjia Li, Yu Zhao, Jianhua Yao, Lei Huang, Jianping Wu, Lirong Yang, Huajun Chen, Haoran Yu
{"title":"Integrating protein language models and automatic biofoundry for enhanced protein evolution","authors":"Qiang Zhang, Wanyi Chen, Ming Qin, Yuhao Wang, Zhongji Pu, Keyan Ding, Yuyue Liu, Qunfeng Zhang, Dongfang Li, Xinjia Li, Yu Zhao, Jianhua Yao, Lei Huang, Jianping Wu, Lirong Yang, Huajun Chen, Haoran Yu","doi":"10.1038/s41467-025-56751-8","DOIUrl":null,"url":null,"abstract":"<p>Traditional protein engineering methods, such as directed evolution, while effective, are often slow and labor-intensive. Advances in machine learning and automated biofoundry present new opportunities for optimizing these processes. This study devises a protein language model-enabled automatic evolution platform, a closed-loop system for automated protein engineering within the Design-Build-Test-Learn cycle. The protein language model ESM-2 makes zero-shot prediction of 96 variants to initiate the cycle. The biofoundry constructs and evaluates these variants, and feeds the results back to a multi-layer perceptron to train a fitness predictor, which then makes prediction of second round of 96 variants with improved fitness. With the tRNA synthetase as a model enzyme, four-rounds of evolution carried out within 10 days lead to mutants with enzyme activity improved by up to 2.4-fold. Our system significantly enhances the speed and accuracy of protein evolution, driving faster advancements in protein engineering for industrial applications.</p>","PeriodicalId":19066,"journal":{"name":"Nature Communications","volume":"5 1","pages":""},"PeriodicalIF":15.7000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Communications","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41467-025-56751-8","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Traditional protein engineering methods, such as directed evolution, while effective, are often slow and labor-intensive. Advances in machine learning and automated biofoundry present new opportunities for optimizing these processes. This study devises a protein language model-enabled automatic evolution platform, a closed-loop system for automated protein engineering within the Design-Build-Test-Learn cycle. The protein language model ESM-2 makes zero-shot prediction of 96 variants to initiate the cycle. The biofoundry constructs and evaluates these variants, and feeds the results back to a multi-layer perceptron to train a fitness predictor, which then makes prediction of second round of 96 variants with improved fitness. With the tRNA synthetase as a model enzyme, four-rounds of evolution carried out within 10 days lead to mutants with enzyme activity improved by up to 2.4-fold. Our system significantly enhances the speed and accuracy of protein evolution, driving faster advancements in protein engineering for industrial applications.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
整合蛋白质语言模型和自动生物铸造以增强蛋白质进化
传统的蛋白质工程方法,如定向进化,虽然有效,但往往是缓慢和劳动密集型的。机器学习和自动化生物铸造的进步为优化这些过程提供了新的机会。本研究设计了一个支持蛋白质语言模型的自动进化平台,这是一个在设计-构建-测试-学习周期内自动化蛋白质工程的闭环系统。蛋白质语言模型ESM-2对96个变异进行零概率预测以启动周期。生物铸造厂构建并评估这些变量,并将结果反馈给多层感知器以训练适应度预测器,然后该预测器以改进的适应度对第二轮96个变量进行预测。以tRNA合成酶为模型酶,在10天内进行的4轮进化导致酶活性提高高达2.4倍的突变体。我们的系统显著提高了蛋白质进化的速度和准确性,推动了工业应用中蛋白质工程的更快发展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
文献相关原料
公司名称
产品信息
麦克林
O-tert-butyltyrosine
麦克林
O-methyl-Tyr
麦克林
benzylserine
麦克林
p-phenylphenylalanine
麦克林
p-nitrophenylalanine
麦克林
p-isopropylphenylalanine
麦克林
p-zidophenylalanine
麦克林
p-fluorophenylalanine
麦克林
p-chlorophenylalanine
麦克林
p-cyanophenylalanine
麦克林
O-tert-butyltyrosine
麦克林
O-methyl-Tyr
麦克林
benzylserine
麦克林
p-phenylphenylalanine
麦克林
p-nitrophenylalanine
麦克林
p-isopropylphenylalanine
麦克林
p-azidophenylalanine
麦克林
p-fluorophenylalanine
麦克林
p-chlorophenylalanine
麦克林
p-cyanophenylalanine
阿拉丁
O-tert-butyltyrosine
阿拉丁
O-methyl-Tyr
阿拉丁
benzylserine
阿拉丁
p-phenylphenylalanine
阿拉丁
p-nitrophenylalanine
阿拉丁
p-isopropylphenylalanine
阿拉丁
p-azidophenylalanine
阿拉丁
p-fluorophenylalanine
阿拉丁
p-chlorophenylalanine
阿拉丁
p-cyanophenylalanine
来源期刊
Nature Communications
Nature Communications Biological Science Disciplines-
CiteScore
24.90
自引率
2.40%
发文量
6928
审稿时长
3.7 months
期刊介绍: Nature Communications, an open-access journal, publishes high-quality research spanning all areas of the natural sciences. Papers featured in the journal showcase significant advances relevant to specialists in each respective field. With a 2-year impact factor of 16.6 (2022) and a median time of 8 days from submission to the first editorial decision, Nature Communications is committed to rapid dissemination of research findings. As a multidisciplinary journal, it welcomes contributions from biological, health, physical, chemical, Earth, social, mathematical, applied, and engineering sciences, aiming to highlight important breakthroughs within each domain.
期刊最新文献
Cardiomyocyte-derived GPX4 stabilizes BNIP3 to facilitate mitophagy and mitigate myocardial ischemia/reperfusion injury. Phosphorylation of SF3B1 by CDK11 orchestrates spliceosome activation via SNIP1-dependent RES complex recruitment. Multi-field coupling enhanced plasmonic Moδ+ active site to efficiently hydrolyze ammonia borane. Exploring how deep learning decodes anomalous diffusion via Grad-CAM A controllable human spinal cord model with full dorsoventral patterning.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1