一种新的序列不相似性测量方法及其在系统发育中的应用

Xiao-hui Niu, Nana Li, Feng Shi, Xue-yan Li
{"title":"一种新的序列不相似性测量方法及其在系统发育中的应用","authors":"Xiao-hui Niu, Nana Li, Feng Shi, Xue-yan Li","doi":"10.1109/ICNC.2008.299","DOIUrl":null,"url":null,"abstract":"We present a new computational approach to measure the distance between two biological sequences. A biological sequence quantifies as a Markov Chain with 20 states. Stochastic state transition matrix is computed as the quantitative index of the biological sequence. The Kullback-Leibler discrimination information is used as a diversity indicator to measure the dissimilarity of each pair of the rows in the two state transition matrix. Distance between the two sequences is defined as the average value with the weight of the occurrence possibility of each amino acid. We illustrate its application in reconstructing a phylogeny of the Eutherian orders using concatenated H-stranded amino acid sequences. This phylogeny is consistent with the commonly accepted one for the Eutherians.","PeriodicalId":6404,"journal":{"name":"2008 Fourth International Conference on Natural Computation","volume":"37 1","pages":"231-234"},"PeriodicalIF":0.0000,"publicationDate":"2008-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Novel Measurement of Sequence Dissimilarity and Its Application to Phylogeny\",\"authors\":\"Xiao-hui Niu, Nana Li, Feng Shi, Xue-yan Li\",\"doi\":\"10.1109/ICNC.2008.299\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present a new computational approach to measure the distance between two biological sequences. A biological sequence quantifies as a Markov Chain with 20 states. Stochastic state transition matrix is computed as the quantitative index of the biological sequence. The Kullback-Leibler discrimination information is used as a diversity indicator to measure the dissimilarity of each pair of the rows in the two state transition matrix. Distance between the two sequences is defined as the average value with the weight of the occurrence possibility of each amino acid. We illustrate its application in reconstructing a phylogeny of the Eutherian orders using concatenated H-stranded amino acid sequences. This phylogeny is consistent with the commonly accepted one for the Eutherians.\",\"PeriodicalId\":6404,\"journal\":{\"name\":\"2008 Fourth International Conference on Natural Computation\",\"volume\":\"37 1\",\"pages\":\"231-234\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-10-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 Fourth International Conference on Natural Computation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICNC.2008.299\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 Fourth International Conference on Natural Computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNC.2008.299","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

我们提出了一种新的计算方法来测量两个生物序列之间的距离。一个生物序列可以量化为一个有20个状态的马尔可夫链。计算随机状态转移矩阵作为生物序列的定量指标。利用Kullback-Leibler判别信息作为多样性指标,衡量两状态转移矩阵中每对行之间的不相似性。两个序列之间的距离定义为每个氨基酸出现可能性的加权平均值。我们说明了它的应用在重建真兽目系统发育使用连接的h链氨基酸序列。这种系统发育与人们普遍接受的真瑟利亚人的系统发育是一致的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A Novel Measurement of Sequence Dissimilarity and Its Application to Phylogeny
We present a new computational approach to measure the distance between two biological sequences. A biological sequence quantifies as a Markov Chain with 20 states. Stochastic state transition matrix is computed as the quantitative index of the biological sequence. The Kullback-Leibler discrimination information is used as a diversity indicator to measure the dissimilarity of each pair of the rows in the two state transition matrix. Distance between the two sequences is defined as the average value with the weight of the occurrence possibility of each amino acid. We illustrate its application in reconstructing a phylogeny of the Eutherian orders using concatenated H-stranded amino acid sequences. This phylogeny is consistent with the commonly accepted one for the Eutherians.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Two-Level Content-Based Endoscope Image Retrieval A New PSO Scheduling Simulation Algorithm Based on an Intelligent Compensation Particle Position Rounding off Genetic Algorithm with an Application to Complex Portfolio Selection Some Operations of L-Fuzzy Approximate Spaces On Residuated Lattices Image Edge Detection Based on Improved Local Fractal Dimension
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1