Variational Autoencoders and Evolutionary Algorithms for Targeted Novel Enzyme Design

2022 IEEE Congress on Evolutionary Computation (CEC) Pub Date : 2022-07-18 DOI:10.1109/CEC55065.2022.9870421

Miguel Martins, M. Rocha, Vítor Pereira

{"title":"Variational Autoencoders and Evolutionary Algorithms for Targeted Novel Enzyme Design","authors":"Miguel Martins, M. Rocha, Vítor Pereira","doi":"10.1109/CEC55065.2022.9870421","DOIUrl":null,"url":null,"abstract":"Recent developments in Generative Deep Learning have fostered new engineering methods for protein design. Although deep generative models trained on protein sequence can learn biologically meaningful representations, the design of proteins with optimised properties remains a challenge. We combined deep learning architectures with evolutionary computation to steer the protein generative process towards specific sets of properties to address this problem. The latent space of a Variational Autoencoder is explored by evolutionary algorithms to find the best candidates. A set of single-objective and multi-objective problems were conceived to evaluate the algorithms' capacity to optimise proteins. The optimisation tasks consider the average proteins' hydrophobicity, their solubility and the probability of being generated by a defined functional Hidden Markov Model profile. The results show that Evolutionary Algorithms can achieve good results while allowing for more variability in the design of the experiment, thus resulting in a much greater set of possibly functional novel proteins.","PeriodicalId":153241,"journal":{"name":"2022 IEEE Congress on Evolutionary Computation (CEC)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Congress on Evolutionary Computation (CEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CEC55065.2022.9870421","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Recent developments in Generative Deep Learning have fostered new engineering methods for protein design. Although deep generative models trained on protein sequence can learn biologically meaningful representations, the design of proteins with optimised properties remains a challenge. We combined deep learning architectures with evolutionary computation to steer the protein generative process towards specific sets of properties to address this problem. The latent space of a Variational Autoencoder is explored by evolutionary algorithms to find the best candidates. A set of single-objective and multi-objective problems were conceived to evaluate the algorithms' capacity to optimise proteins. The optimisation tasks consider the average proteins' hydrophobicity, their solubility and the probability of being generated by a defined functional Hidden Markov Model profile. The results show that Evolutionary Algorithms can achieve good results while allowing for more variability in the design of the experiment, thus resulting in a much greater set of possibly functional novel proteins.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

针对新型酶设计的变分自编码器和进化算法

生成式深度学习的最新发展为蛋白质设计提供了新的工程方法。尽管在蛋白质序列上训练的深度生成模型可以学习有生物学意义的表示，但具有优化特性的蛋白质设计仍然是一个挑战。我们将深度学习架构与进化计算结合起来，引导蛋白质生成过程朝着特定的属性集发展，以解决这个问题。利用进化算法对变分自编码器的潜在空间进行了探索，以寻找最佳候选。设计了一组单目标和多目标问题来评估算法优化蛋白质的能力。优化任务考虑平均蛋白质的疏水性、溶解度和由定义的功能隐马尔可夫模型剖面生成的概率。结果表明，进化算法可以获得良好的结果，同时允许在实验设计中有更多的可变性，从而产生更多可能具有功能的新蛋白质。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2022 IEEE Congress on Evolutionary Computation (CEC)

自引率

0.00%

发文量