Pushpa Susant Mahapatro , Jatinderkumar R. Saini , Shraddha Vaidya
{"title":"Genome Motif Discovery in Zika Virus: Computational Techniques and Validation Using Greedy Method","authors":"Pushpa Susant Mahapatro , Jatinderkumar R. Saini , Shraddha Vaidya","doi":"10.1016/j.procs.2024.12.028","DOIUrl":null,"url":null,"abstract":"<div><div>Identifying patterns in the genome sequences is an essential yet tricky task in Bioinformatics. It provides information about gene activity and gene functionality. In Deoxyribose Nucleic Acid (DNA) sequence analysis, computational approaches like Greedy motif search can be applied for motif identification. It finds recurring patterns called motifs by iteratively selecting the most promising sequence of a specified length from each DNA string. The selected string maximizes the scoring function and hence is selected. First, a set of initial motifs is selected for each set in the input string. Then, a subsequence that best aligns with the selected string is selected for the next iteration. The score is calculated and needs to be minimized. The validation of the obtained motif is also performed. This study focuses on applying the algorithm to identify patterns in the genome sequence of the Zika virus. In finding conserved patterns in the Zika virus genome sequence, the Greedy motif search is known for its efficiency and precision. The Greedy motif search results are compared with Gibbs sampler method of motif identification. This study adds knowledge of the viral genome and suggests new treatment development methods by confining these patterns.</div></div>","PeriodicalId":20465,"journal":{"name":"Procedia Computer Science","volume":"252 ","pages":"Pages 260-269"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Procedia Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1877050924034604","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Identifying patterns in the genome sequences is an essential yet tricky task in Bioinformatics. It provides information about gene activity and gene functionality. In Deoxyribose Nucleic Acid (DNA) sequence analysis, computational approaches like Greedy motif search can be applied for motif identification. It finds recurring patterns called motifs by iteratively selecting the most promising sequence of a specified length from each DNA string. The selected string maximizes the scoring function and hence is selected. First, a set of initial motifs is selected for each set in the input string. Then, a subsequence that best aligns with the selected string is selected for the next iteration. The score is calculated and needs to be minimized. The validation of the obtained motif is also performed. This study focuses on applying the algorithm to identify patterns in the genome sequence of the Zika virus. In finding conserved patterns in the Zika virus genome sequence, the Greedy motif search is known for its efficiency and precision. The Greedy motif search results are compared with Gibbs sampler method of motif identification. This study adds knowledge of the viral genome and suggests new treatment development methods by confining these patterns.