{"title":"Fast Priority: A strategy to reduce processing time in a motif discovery tool like Priority","authors":"C. A. Sierra, P. Reyes-Herrera","doi":"10.1109/COLCOMCON.2014.6860428","DOIUrl":null,"url":null,"abstract":"Motif recognition is key to identify common sequences and specific elements recognized by different proteins such as Transcription Factors or RNA Binding Proteins. One interesting approach to computationally identify motifs consists on using additional sources of information besides the input sequences to guide the motif search. However, depending on the information and input data size the motif recovery process can take several hours. One of the most used algorithms for motif recovery is Priority [1]. This algorithm uses a Gibbs sampling strategy and includes additional information, in informative priors, to guide the motif search. We propose to reduce motif recovery processing time by using threads to execute independent processes in parallel. We apply the strategy to Priority and the results show the strategy has a positive effect on time and it does not affect the quality of the resultant motif. Reducing processing time in motif recovery algorithms is important considering that massive parallel sequencing is frequently used today. By using massive sequencing the number of sequences derived by experimental techniques is only getting bigger.","PeriodicalId":346697,"journal":{"name":"2014 IEEE Colombian Conference on Communications and Computing (COLCOM)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE Colombian Conference on Communications and Computing (COLCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COLCOMCON.2014.6860428","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Motif recognition is key to identify common sequences and specific elements recognized by different proteins such as Transcription Factors or RNA Binding Proteins. One interesting approach to computationally identify motifs consists on using additional sources of information besides the input sequences to guide the motif search. However, depending on the information and input data size the motif recovery process can take several hours. One of the most used algorithms for motif recovery is Priority [1]. This algorithm uses a Gibbs sampling strategy and includes additional information, in informative priors, to guide the motif search. We propose to reduce motif recovery processing time by using threads to execute independent processes in parallel. We apply the strategy to Priority and the results show the strategy has a positive effect on time and it does not affect the quality of the resultant motif. Reducing processing time in motif recovery algorithms is important considering that massive parallel sequencing is frequently used today. By using massive sequencing the number of sequences derived by experimental techniques is only getting bigger.