Genetic algorithms with permutation coding for multiple sequence alignment.

Mohamed Tahar Ben Othman, Gamil Abdel-Azim
{"title":"Genetic algorithms with permutation coding for multiple sequence alignment.","authors":"Mohamed Tahar Ben Othman,&nbsp;Gamil Abdel-Azim","doi":"10.2174/1872215611307020004","DOIUrl":null,"url":null,"abstract":"<p><p>Multiple sequence alignment (MSA) is one of the topics of bio informatics that has seriously been researched. It is known as NP-complete problem. It is also considered as one of the most important and daunting tasks in computational biology. Concerning this a wide number of heuristic algorithms have been proposed to find optimal alignment. Among these heuristic algorithms are genetic algorithms (GA). The GA has mainly two major weaknesses: it is time consuming and can cause local minima. One of the significant aspects in the GA process in MSA is to maximize the similarities between sequences by adding and shuffling the gaps of Solution Coding (SC). Several ways for SC have been introduced. One of them is the Permutation Coding (PC). We propose a hybrid algorithm based on genetic algorithms (GAs) with a PC and 2-opt algorithm. The PC helps to code the MSA solution which maximizes the gain of resources, reliability and diversity of GA. The use of the PC opens the area by applying all functions over permutations for MSA. Thus, we suggest an algorithm to calculate the scoring function for multiple alignments based on PC, which is used as fitness function. The time complexity of the GA is reduced by using this algorithm. Our GA is implemented with different selections strategies and different crossovers. The probability of crossover and mutation is set as one strategy. Relevant patents have been probed in the topic.</p>","PeriodicalId":74646,"journal":{"name":"Recent patents on DNA & gene sequences","volume":"7 2","pages":"105-14"},"PeriodicalIF":0.0000,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Recent patents on DNA & gene sequences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/1872215611307020004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Multiple sequence alignment (MSA) is one of the topics of bio informatics that has seriously been researched. It is known as NP-complete problem. It is also considered as one of the most important and daunting tasks in computational biology. Concerning this a wide number of heuristic algorithms have been proposed to find optimal alignment. Among these heuristic algorithms are genetic algorithms (GA). The GA has mainly two major weaknesses: it is time consuming and can cause local minima. One of the significant aspects in the GA process in MSA is to maximize the similarities between sequences by adding and shuffling the gaps of Solution Coding (SC). Several ways for SC have been introduced. One of them is the Permutation Coding (PC). We propose a hybrid algorithm based on genetic algorithms (GAs) with a PC and 2-opt algorithm. The PC helps to code the MSA solution which maximizes the gain of resources, reliability and diversity of GA. The use of the PC opens the area by applying all functions over permutations for MSA. Thus, we suggest an algorithm to calculate the scoring function for multiple alignments based on PC, which is used as fitness function. The time complexity of the GA is reduced by using this algorithm. Our GA is implemented with different selections strategies and different crossovers. The probability of crossover and mutation is set as one strategy. Relevant patents have been probed in the topic.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
多序列比对的排列编码遗传算法。
多序列比对(MSA)是生物信息学研究的热点之一。它被称为np完全问题。它也被认为是计算生物学中最重要和最艰巨的任务之一。关于这一点,已经提出了大量的启发式算法来寻找最优对齐。在这些启发式算法中有遗传算法(GA)。遗传算法主要有两个缺点:耗时长,可能导致局部最小值。MSA遗传过程的一个重要方面是通过增加和洗牌解编码(SC)的间隙来最大化序列之间的相似性。介绍了SC的几种方法。其中之一是排列编码(PC)。提出了一种基于遗传算法(GAs)与PC和2-opt算法的混合算法。PC机帮助编码MSA解决方案,从而最大限度地提高遗传算法的资源增益、可靠性和多样性。PC的使用打开了该区域通过应用所有功能的排列为MSA。因此,我们提出了一种基于PC计算多组队得分函数的算法,并将其作为适应度函数。该算法降低了遗传算法的时间复杂度。我们的遗传算法通过不同的选择策略和不同的交叉来实现。将交叉和变异的概率设为一种策略。本课题对相关专利进行了探讨。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Patented aptamers for C-reactive protein detection: a review about their use in clinical diagnostics. Patent landscape for biological hydrogen production. Biotechnological advances in amaranths species and their future outlook in crop improvement--a review. Recent advances in gene therapy of endometriosis. Energy crops for biofuel feedstocks: facts and recent patents on genetic manipulation to improve biofuel crops.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1