Optimizing enzyme thermostability by combining multiple mutations using protein language model.

IF 4.5 Q1 MICROBIOLOGY mLife Pub Date : 2024-12-26 eCollection Date: 2024-12-01 DOI:10.1002/mlf2.12151
Jiahao Bian, Pan Tan, Ting Nie, Liang Hong, Guang-Yu Yang
{"title":"Optimizing enzyme thermostability by combining multiple mutations using protein language model.","authors":"Jiahao Bian, Pan Tan, Ting Nie, Liang Hong, Guang-Yu Yang","doi":"10.1002/mlf2.12151","DOIUrl":null,"url":null,"abstract":"<p><p>Optimizing enzyme thermostability is essential for advancements in protein science and industrial applications. Currently, (semi-)rational design and random mutagenesis methods can accurately identify single-point mutations that enhance enzyme thermostability. However, complex epistatic interactions often arise when multiple mutation sites are combined, leading to the complete inactivation of combinatorial mutants. As a result, constructing an optimized enzyme often requires repeated rounds of design to incrementally incorporate single mutation sites, which is highly time-consuming. In this study, we developed an AI-aided strategy for enzyme thermostability engineering that efficiently facilitates the recombination of beneficial single-point mutations. We utilized thermostability data from creatinase, including 18 single-point mutants, 22 double-point mutants, 21 triple-point mutants, and 12 quadruple-point mutants. Using these data as inputs, we used a temperature-guided protein language model, Pro-PRIME, to learn epistatic features and design combinatorial mutants. After two rounds of design, we obtained 50 combinatorial mutants with superior thermostability, achieving a success rate of 100%. The best mutant, 13M4, contained 13 mutation sites and maintained nearly full catalytic activity compared to the wild-type. It showed a 10.19°C increase in the melting temperature and an ~655-fold increase in the half-life at 58°C. Additionally, the model successfully captured epistasis in high-order combinatorial mutants, including sign epistasis (K351E) and synergistic epistasis (D17V/I149V). We elucidated the mechanism of long-range epistasis in detail using a dynamics cross-correlation matrix method. Our work provides an efficient framework for designing enzyme thermostability and studying high-order epistatic effects in protein-directed evolution.</p>","PeriodicalId":94145,"journal":{"name":"mLife","volume":"3 4","pages":"492-504"},"PeriodicalIF":4.5000,"publicationDate":"2024-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11685841/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"mLife","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/mlf2.12151","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Optimizing enzyme thermostability is essential for advancements in protein science and industrial applications. Currently, (semi-)rational design and random mutagenesis methods can accurately identify single-point mutations that enhance enzyme thermostability. However, complex epistatic interactions often arise when multiple mutation sites are combined, leading to the complete inactivation of combinatorial mutants. As a result, constructing an optimized enzyme often requires repeated rounds of design to incrementally incorporate single mutation sites, which is highly time-consuming. In this study, we developed an AI-aided strategy for enzyme thermostability engineering that efficiently facilitates the recombination of beneficial single-point mutations. We utilized thermostability data from creatinase, including 18 single-point mutants, 22 double-point mutants, 21 triple-point mutants, and 12 quadruple-point mutants. Using these data as inputs, we used a temperature-guided protein language model, Pro-PRIME, to learn epistatic features and design combinatorial mutants. After two rounds of design, we obtained 50 combinatorial mutants with superior thermostability, achieving a success rate of 100%. The best mutant, 13M4, contained 13 mutation sites and maintained nearly full catalytic activity compared to the wild-type. It showed a 10.19°C increase in the melting temperature and an ~655-fold increase in the half-life at 58°C. Additionally, the model successfully captured epistasis in high-order combinatorial mutants, including sign epistasis (K351E) and synergistic epistasis (D17V/I149V). We elucidated the mechanism of long-range epistasis in detail using a dynamics cross-correlation matrix method. Our work provides an efficient framework for designing enzyme thermostability and studying high-order epistatic effects in protein-directed evolution.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用蛋白质语言模型结合多种突变优化酶的耐热性。
优化酶的热稳定性对蛋白质科学和工业应用的进步至关重要。目前,(半)理性设计和随机诱变方法可以准确地识别增强酶热稳定性的单点突变。然而,当多个突变位点组合时,往往会出现复杂的上位相互作用,导致组合突变体完全失活。因此,构建一个优化的酶通常需要反复的设计来增加单个突变位点,这是非常耗时的。在这项研究中,我们开发了一种人工智能辅助的酶热稳定性工程策略,有效地促进了有益的单点突变的重组。我们利用了肌酶的热稳定性数据,包括18个单点突变体,22个双点突变体,21个三点突变体和12个四点突变体。使用这些数据作为输入,我们使用温度引导的蛋白质语言模型Pro-PRIME来学习上位性特征并设计组合突变体。经过两轮设计,我们获得了50个具有优异热稳定性的组合突变体,成功率为100%。最好的突变体13M4包含13个突变位点,与野生型相比保持了几乎完全的催化活性。熔点温度提高10.19℃,半衰期提高~655倍。此外,该模型成功捕获了高阶组合突变体的上位性,包括符号上位性(K351E)和协同上位性(D17V/I149V)。我们利用动态相互关联矩阵方法详细阐明了远程上位机制。我们的工作为设计酶的热稳定性和研究蛋白质定向进化中的高阶上位性效应提供了一个有效的框架。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
2.30
自引率
0.00%
发文量
0
期刊最新文献
Optimizing enzyme thermostability by combining multiple mutations using protein language model. Phospholipase PlcH is involved in the secretion of cell wall glycoproteins and contributes to the host immune response of Aspergillus fumigatus. Protein engineering in the deep learning era. NAC4ED: A high-throughput computational platform for the rational design of enzyme activity and substrate selectivity. Bacterial abundance and diversity in 64-74 Ma subseafloor igneous basement from the Louisville Seamount Chain.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1