{"title":"DNA序列的几何特征","authors":"Hongjie Xu","doi":"10.2174/0118722121271190230928072933","DOIUrl":null,"url":null,"abstract":"Background:: The primary goal of molecular phylogenetics is to characterize the similarity/ dissimilarity of DNA sequences. Existing sequence comparison methods with some patented are mostly alignment-based and remain computationally arduous. background: The primary goal of molecular phylogenetics is to characterize similarity/dissimilarity of DNA sequences. Existing sequence comparison methods are mostly alignment-based and remain computationally arduous. Objective:: In this study, we propose a novel alignment-free approach based on a previous DNA curve representation without degeneracy. Method:: The method combines two important geometric elements that describe the global and local features of the curve, respectively. It allows us to use a 24-dimensional vector called a characterization vector to numerically characterize a DNA sequence. We then measure the dissimilarity/ similarity of various DNA sequences by the Euclidean distances between their characterization vectors. Results:: we compare our approach with other existing algorithms on 4 data sets including COVID-19, and find that our apporach can produce consistent results and is faster than the alignment-based methods. Conclusion:: The method stated in this study, can assist in analyzing biological molecular sequences efficiently and will be helpful to molecular biologists. other: none","PeriodicalId":40022,"journal":{"name":"Recent Patents on Engineering","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Geometric Feature of DNA Sequences\",\"authors\":\"Hongjie Xu\",\"doi\":\"10.2174/0118722121271190230928072933\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background:: The primary goal of molecular phylogenetics is to characterize the similarity/ dissimilarity of DNA sequences. Existing sequence comparison methods with some patented are mostly alignment-based and remain computationally arduous. background: The primary goal of molecular phylogenetics is to characterize similarity/dissimilarity of DNA sequences. Existing sequence comparison methods are mostly alignment-based and remain computationally arduous. Objective:: In this study, we propose a novel alignment-free approach based on a previous DNA curve representation without degeneracy. Method:: The method combines two important geometric elements that describe the global and local features of the curve, respectively. It allows us to use a 24-dimensional vector called a characterization vector to numerically characterize a DNA sequence. We then measure the dissimilarity/ similarity of various DNA sequences by the Euclidean distances between their characterization vectors. Results:: we compare our approach with other existing algorithms on 4 data sets including COVID-19, and find that our apporach can produce consistent results and is faster than the alignment-based methods. Conclusion:: The method stated in this study, can assist in analyzing biological molecular sequences efficiently and will be helpful to molecular biologists. other: none\",\"PeriodicalId\":40022,\"journal\":{\"name\":\"Recent Patents on Engineering\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Recent Patents on Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2174/0118722121271190230928072933\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Engineering\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Recent Patents on Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/0118722121271190230928072933","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Engineering","Score":null,"Total":0}
Background:: The primary goal of molecular phylogenetics is to characterize the similarity/ dissimilarity of DNA sequences. Existing sequence comparison methods with some patented are mostly alignment-based and remain computationally arduous. background: The primary goal of molecular phylogenetics is to characterize similarity/dissimilarity of DNA sequences. Existing sequence comparison methods are mostly alignment-based and remain computationally arduous. Objective:: In this study, we propose a novel alignment-free approach based on a previous DNA curve representation without degeneracy. Method:: The method combines two important geometric elements that describe the global and local features of the curve, respectively. It allows us to use a 24-dimensional vector called a characterization vector to numerically characterize a DNA sequence. We then measure the dissimilarity/ similarity of various DNA sequences by the Euclidean distances between their characterization vectors. Results:: we compare our approach with other existing algorithms on 4 data sets including COVID-19, and find that our apporach can produce consistent results and is faster than the alignment-based methods. Conclusion:: The method stated in this study, can assist in analyzing biological molecular sequences efficiently and will be helpful to molecular biologists. other: none
期刊介绍:
Recent Patents on Engineering publishes review articles by experts on recent patents in the major fields of engineering. A selection of important and recent patents on engineering is also included in the journal. The journal is essential reading for all researchers involved in engineering sciences.