{"title":"一种基于多目标遗传算法的结构基序提取方法","authors":"Mehmet Kaya, Melikali Güç","doi":"10.1109/CBMS.2008.99","DOIUrl":null,"url":null,"abstract":"The functional motifs composed of several sequential blocks are difficult to find. Current mining methods might individually find each motif block but fail to connect them with large irregular gaps. In this paper we propose a novel method for the efficient extraction of structured motifs from DNA sequences using multi-objective genetic algorithm. The main advantage of our approach is that a large number of nondominated motifs can be obtained by a single run with respect to conflicting objectives: similarity and support maximization and gap minimization. To the best of our knowledge, this is the first effort in this direction. The proposed method can be applied to any data set with a sequential character. Furthermore, it allows any choice of similarity measures for finding motifs. By analyzing the obtained optimal motifs, the decision maker can understand the tradeoff between the objectives. We compare our method with the two well-known structured motif extraction methods, EXMOTIF and RISOTTO. Experimental results on synthetics data set demonstrate that the proposed method exhibits good performance over the other methods in terms of runtime.","PeriodicalId":377855,"journal":{"name":"2008 21st IEEE International Symposium on Computer-Based Medical Systems","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"A Novel Approach to Extract Structured Motifs by Multi-Objective Genetic Algorithm\",\"authors\":\"Mehmet Kaya, Melikali Güç\",\"doi\":\"10.1109/CBMS.2008.99\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The functional motifs composed of several sequential blocks are difficult to find. Current mining methods might individually find each motif block but fail to connect them with large irregular gaps. In this paper we propose a novel method for the efficient extraction of structured motifs from DNA sequences using multi-objective genetic algorithm. The main advantage of our approach is that a large number of nondominated motifs can be obtained by a single run with respect to conflicting objectives: similarity and support maximization and gap minimization. To the best of our knowledge, this is the first effort in this direction. The proposed method can be applied to any data set with a sequential character. Furthermore, it allows any choice of similarity measures for finding motifs. By analyzing the obtained optimal motifs, the decision maker can understand the tradeoff between the objectives. We compare our method with the two well-known structured motif extraction methods, EXMOTIF and RISOTTO. Experimental results on synthetics data set demonstrate that the proposed method exhibits good performance over the other methods in terms of runtime.\",\"PeriodicalId\":377855,\"journal\":{\"name\":\"2008 21st IEEE International Symposium on Computer-Based Medical Systems\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-06-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 21st IEEE International Symposium on Computer-Based Medical Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CBMS.2008.99\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 21st IEEE International Symposium on Computer-Based Medical Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMS.2008.99","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Novel Approach to Extract Structured Motifs by Multi-Objective Genetic Algorithm
The functional motifs composed of several sequential blocks are difficult to find. Current mining methods might individually find each motif block but fail to connect them with large irregular gaps. In this paper we propose a novel method for the efficient extraction of structured motifs from DNA sequences using multi-objective genetic algorithm. The main advantage of our approach is that a large number of nondominated motifs can be obtained by a single run with respect to conflicting objectives: similarity and support maximization and gap minimization. To the best of our knowledge, this is the first effort in this direction. The proposed method can be applied to any data set with a sequential character. Furthermore, it allows any choice of similarity measures for finding motifs. By analyzing the obtained optimal motifs, the decision maker can understand the tradeoff between the objectives. We compare our method with the two well-known structured motif extraction methods, EXMOTIF and RISOTTO. Experimental results on synthetics data set demonstrate that the proposed method exhibits good performance over the other methods in terms of runtime.