{"title":"Finding identical sequence repeats in multiple protein sequences: An algorithm","authors":"Vikas Kumar Maurya, Madhumathi Sanjeevi, Chandrasekar Narayanan Rahul, Ajitha Mohan, Dhanalakshmi Ramachandran, Rashmi Siddalingappa, Roshan Rauniyar, Sekar Kanagaraj","doi":"10.1007/s12038-023-00410-x","DOIUrl":null,"url":null,"abstract":"<p>In recent years, several experimental evidences suggest that amino acid repeats are closely linked to many disease conditions, as they have a significant role in evolution of disordered regions of the polypeptide segments. Even though many algorithms and databases were developed for such analysis, each algorithm has some caveats, like limitation on the number of amino acids within the repeat patterns and number of query protein sequences. To this end, in the present work, a new method called the internal sequence repeats across multiple protein sequences (ISRMPS) is proposed for the first time to identify identical repeats across multiple protein sequences. It also identifies distantly located repeat patterns in various protein sequences. Our method can be applied to study evolutionary relationships, epitope mapping, CRISPR-Cas sequencing methods, and other comparative analytical assessments of protein sequences.</p>","PeriodicalId":15171,"journal":{"name":"Journal of Biosciences","volume":"6 1","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biosciences","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s12038-023-00410-x","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, several experimental evidences suggest that amino acid repeats are closely linked to many disease conditions, as they have a significant role in evolution of disordered regions of the polypeptide segments. Even though many algorithms and databases were developed for such analysis, each algorithm has some caveats, like limitation on the number of amino acids within the repeat patterns and number of query protein sequences. To this end, in the present work, a new method called the internal sequence repeats across multiple protein sequences (ISRMPS) is proposed for the first time to identify identical repeats across multiple protein sequences. It also identifies distantly located repeat patterns in various protein sequences. Our method can be applied to study evolutionary relationships, epitope mapping, CRISPR-Cas sequencing methods, and other comparative analytical assessments of protein sequences.
期刊介绍:
The Journal of Biosciences is a quarterly journal published by the Indian Academy of Sciences, Bangalore. It covers all areas of Biology and is the premier journal in the country within its scope. It is indexed in Current Contents and other standard Biological and Medical databases. The Journal of Biosciences began in 1934 as the Proceedings of the Indian Academy of Sciences (Section B). This continued until 1978 when it was split into three parts : Proceedings-Animal Sciences, Proceedings-Plant Sciences and Proceedings-Experimental Biology. Proceedings-Experimental Biology was renamed Journal of Biosciences in 1979; and in 1991, Proceedings-Animal Sciences and Proceedings-Plant Sciences merged with it.