Genetic algorithms for protein threading.

J Yadgari, A Amir, R Unger
{"title":"Genetic algorithms for protein threading.","authors":"J Yadgari,&nbsp;A Amir,&nbsp;R Unger","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>Despite many years of efforts, a direct prediction of protein structure from sequence is still not possible. As a result, in the last few years researchers have started to address the \"inverse folding problem\": Identifying and aligning a sequence to the fold with which it is most compatible, a process known as \"threading\". In two meetings in which protein folding predictions were objectively evaluated, it became clear that threading as a concept promises a real breakthrough, but that much improvement is still needed in the technique itself. Threading is a NP-hard problem, and thus no general polynomial solution can be expected. Still a practical approach with demonstrated ability to find optimal solutions in many cases, and acceptable solutions in other cases, is needed. We applied the technique of Genetic Algorithms in order to significantly improve the ability of threading algorithms to find the optimal alignment of a sequence to a structure, i.e. the alignment with the minimum free energy. A major progress reported here is the design of a representation of the threading alignment as a string of fixed length. With this representation validation of alignments and genetic operators are effectively implemented. Appropriate data structure and parameters have been selected. It is shown that Genetic Algorithm threading is effective and is able to find the optimal alignment in a few test cases. Furthermore, the described algorithm is shown to perform well even without pre-definition of core elements. Existing threading methods are dependent on such constraints to make their calculations feasible. But the concept of core elements is inherently arbitrary and should be avoided if possible. While a rigorous proof is hard to submit yet an, we present indications that indeed Genetic Algorithm threading is capable of finding consistently good solutions of full alignments in search spaces of size up to 10(70).</p>","PeriodicalId":79420,"journal":{"name":"Proceedings. International Conference on Intelligent Systems for Molecular Biology","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"1998-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. International Conference on Intelligent Systems for Molecular Biology","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Despite many years of efforts, a direct prediction of protein structure from sequence is still not possible. As a result, in the last few years researchers have started to address the "inverse folding problem": Identifying and aligning a sequence to the fold with which it is most compatible, a process known as "threading". In two meetings in which protein folding predictions were objectively evaluated, it became clear that threading as a concept promises a real breakthrough, but that much improvement is still needed in the technique itself. Threading is a NP-hard problem, and thus no general polynomial solution can be expected. Still a practical approach with demonstrated ability to find optimal solutions in many cases, and acceptable solutions in other cases, is needed. We applied the technique of Genetic Algorithms in order to significantly improve the ability of threading algorithms to find the optimal alignment of a sequence to a structure, i.e. the alignment with the minimum free energy. A major progress reported here is the design of a representation of the threading alignment as a string of fixed length. With this representation validation of alignments and genetic operators are effectively implemented. Appropriate data structure and parameters have been selected. It is shown that Genetic Algorithm threading is effective and is able to find the optimal alignment in a few test cases. Furthermore, the described algorithm is shown to perform well even without pre-definition of core elements. Existing threading methods are dependent on such constraints to make their calculations feasible. But the concept of core elements is inherently arbitrary and should be avoided if possible. While a rigorous proof is hard to submit yet an, we present indications that indeed Genetic Algorithm threading is capable of finding consistently good solutions of full alignments in search spaces of size up to 10(70).

分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
蛋白质穿线的遗传算法。
尽管经过多年的努力,从序列中直接预测蛋白质结构仍然是不可能的。因此,在过去的几年里,研究人员已经开始解决“反折叠问题”:识别一个序列并将其与最兼容的折叠对齐,这一过程被称为“穿线”。在对蛋白质折叠预测进行客观评估的两次会议上,很明显,线程作为一个概念有望取得真正的突破,但技术本身仍需要大量改进。线程是一个np困难的问题,因此不能期望有一般的多项式解。仍然需要一种实际的方法,证明在许多情况下能够找到最优解,在其他情况下能够找到可接受的解。为了显著提高线程算法寻找序列与结构的最优对齐(即自由能最小的对齐)的能力,我们应用了遗传算法技术。这里报告的一个主要进展是将线程对齐的表示设计为固定长度的字符串。通过这种表示,可以有效地实现对比对和遗传算子的验证。选择了合适的数据结构和参数。结果表明,遗传算法线程化是有效的,能够在少数测试用例中找到最优对齐。此外,即使没有预先定义核心元素,所描述的算法也能很好地执行。现有的线程方法依赖于这样的约束,以使它们的计算可行。但核心元素的概念本质上是武断的,应该尽可能避免。虽然严格的证明还很难提交,但我们提出的迹象表明,遗传算法线程确实能够在大小为10的搜索空间中找到一致的完整对齐的良好解决方案(70)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Reducing Mass Degeneracy in SAR by MS by Stable Isotopic Labeling Intelligent aids for parallel experiment planning and macromolecular crystallization. A practical algorithm for optimal inference of haplotypes from diploid populations. Analysis of yeast's ORF upstream regions by parallel processing, microarrays, and computational methods. Finding regulatory elements using joint likelihoods for sequence and expression profile data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1