{"title":"The Maximal Frequent Pattern mining of DNA sequence","authors":"S. Bai, Sixue Bai","doi":"10.1109/GRC.2009.5255169","DOIUrl":null,"url":null,"abstract":"The DNA sequence data is one of the basic and important data among biological data. The DNA sequence pattern mining has got wide attention and rapid development. Traditional algorithms for the sequential pattern mining may generate lots of redundant patterns when dealing with the DNA sequence. The Maximal Frequent Pattern is preferable to express the function and structure of the DNA sequence. Base on the characteristics of the DNA sequence, the author develops the Joined Maximal Pattern Segments algorithm—JMPS, for the maximal frequent patterns mining of the DNA sequence. First, the maximal frequent pattern segments base on adjacent generated. Then, longer Maximal Frequent Pattern can be obtained by combining the above segments, at the same time deleting the Non-maximal patterns. The algorithm can deal with the DNA sequence data efficiently.","PeriodicalId":388774,"journal":{"name":"2009 IEEE International Conference on Granular Computing","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE International Conference on Granular Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GRC.2009.5255169","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 14
Abstract
The DNA sequence data is one of the basic and important data among biological data. The DNA sequence pattern mining has got wide attention and rapid development. Traditional algorithms for the sequential pattern mining may generate lots of redundant patterns when dealing with the DNA sequence. The Maximal Frequent Pattern is preferable to express the function and structure of the DNA sequence. Base on the characteristics of the DNA sequence, the author develops the Joined Maximal Pattern Segments algorithm—JMPS, for the maximal frequent patterns mining of the DNA sequence. First, the maximal frequent pattern segments base on adjacent generated. Then, longer Maximal Frequent Pattern can be obtained by combining the above segments, at the same time deleting the Non-maximal patterns. The algorithm can deal with the DNA sequence data efficiently.