连续易位距离问题的精确和近似算法

IF 1 4区 计算机科学 Q3 COMPUTER SCIENCE, THEORY & METHODS Theoretical Computer Science Pub Date : 2025-02-12 Epub Date: 2024-11-28 DOI:10.1016/j.tcs.2024.115003
Maria Constantin , Alexandru Popa
{"title":"连续易位距离问题的精确和近似算法","authors":"Maria Constantin ,&nbsp;Alexandru Popa","doi":"10.1016/j.tcs.2024.115003","DOIUrl":null,"url":null,"abstract":"<div><div>Biological computation is the field that studies the computations performed by the biological systems and includes the development of algorithms or other computational techniques inspired by nature. The genome rearrangements that occur during genome evolution are an important example of a natural computation process which inspired multiple problems that can be solved using combinatorial models. A popular problem inspired by genome evolution is computing the genetic distance between two organisms by identifying the minimum number of genome rearrangements needed to obtain one genome from the other. Given two chromosomes, represented as strings over the DNA alphabet {A, C, G, T}, the translocation operation is defined as a prefix interchange between these chromosomes. Thus, two new chromosomes are obtained after a translocation. When the chromosomes are interchanging equal length prefixes, the translocation operation is called uniform. A translocation sequence is a series of translocation operations applied to an initial genome, represented as a set of strings (initial set), resulting in a new genome, also represented as a set of strings (target set). Given a translocation sequence, if the strings exchanging prefixes are part of the initial set or have additional copies created by preceding translocations than those utilised, then the translocation sequence is referred to as contiguous. The translocation distance between two given genomes is defined as the minimum number of translocation operations necessary to obtain one genome from the other. We introduce a new polynomial time exact algorithm to compute the uniform contiguous translocation distance for a target genome of size two. Then, we present a polynomial time 2-approximation algorithm for the non-uniform contiguous translocation distance considering a target genome of size one.</div></div>","PeriodicalId":49438,"journal":{"name":"Theoretical Computer Science","volume":"1026 ","pages":"Article 115003"},"PeriodicalIF":1.0000,"publicationDate":"2025-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exact and approximation algorithms for the contiguous translocation distance problem\",\"authors\":\"Maria Constantin ,&nbsp;Alexandru Popa\",\"doi\":\"10.1016/j.tcs.2024.115003\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Biological computation is the field that studies the computations performed by the biological systems and includes the development of algorithms or other computational techniques inspired by nature. The genome rearrangements that occur during genome evolution are an important example of a natural computation process which inspired multiple problems that can be solved using combinatorial models. A popular problem inspired by genome evolution is computing the genetic distance between two organisms by identifying the minimum number of genome rearrangements needed to obtain one genome from the other. Given two chromosomes, represented as strings over the DNA alphabet {A, C, G, T}, the translocation operation is defined as a prefix interchange between these chromosomes. Thus, two new chromosomes are obtained after a translocation. When the chromosomes are interchanging equal length prefixes, the translocation operation is called uniform. A translocation sequence is a series of translocation operations applied to an initial genome, represented as a set of strings (initial set), resulting in a new genome, also represented as a set of strings (target set). Given a translocation sequence, if the strings exchanging prefixes are part of the initial set or have additional copies created by preceding translocations than those utilised, then the translocation sequence is referred to as contiguous. The translocation distance between two given genomes is defined as the minimum number of translocation operations necessary to obtain one genome from the other. We introduce a new polynomial time exact algorithm to compute the uniform contiguous translocation distance for a target genome of size two. Then, we present a polynomial time 2-approximation algorithm for the non-uniform contiguous translocation distance considering a target genome of size one.</div></div>\",\"PeriodicalId\":49438,\"journal\":{\"name\":\"Theoretical Computer Science\",\"volume\":\"1026 \",\"pages\":\"Article 115003\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2025-02-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Theoretical Computer Science\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0304397524006200\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/11/28 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Theoretical Computer Science","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0304397524006200","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/28 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0

摘要

生物计算是研究由生物系统执行的计算的领域,包括受自然启发的算法或其他计算技术的发展。在基因组进化过程中发生的基因组重排是自然计算过程的一个重要例子,它激发了可以使用组合模型解决的多个问题。一个受基因组进化启发的流行问题是,通过确定从一个基因组获得另一个基因组所需的最小基因组重排数来计算两个生物体之间的遗传距离。给定两条染色体,以DNA字母表{A, C, G, T}上的字符串表示,易位操作被定义为这些染色体之间的前缀交换。因此,易位后获得两条新染色体。当染色体互换等长前缀时,这种易位操作称为均匀易位。易位序列是应用于初始基因组的一系列易位操作,表示为一组字符串(initial set),从而产生一个新的基因组,也表示为一组字符串(target set)。给定一个易位序列,如果交换前缀的字符串是初始集合的一部分,或者由先前的易位创建的副本比所使用的那些副本多,则该易位序列被称为连续的。两个给定基因组之间的易位距离定义为从另一个基因组获得一个基因组所需的最小易位操作数。我们引入了一种新的多项式时间精确算法来计算大小为2的目标基因组的均匀连续易位距离。然后,我们提出了考虑大小为1的目标基因组的非均匀连续易位距离的多项式时间2逼近算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Exact and approximation algorithms for the contiguous translocation distance problem
Biological computation is the field that studies the computations performed by the biological systems and includes the development of algorithms or other computational techniques inspired by nature. The genome rearrangements that occur during genome evolution are an important example of a natural computation process which inspired multiple problems that can be solved using combinatorial models. A popular problem inspired by genome evolution is computing the genetic distance between two organisms by identifying the minimum number of genome rearrangements needed to obtain one genome from the other. Given two chromosomes, represented as strings over the DNA alphabet {A, C, G, T}, the translocation operation is defined as a prefix interchange between these chromosomes. Thus, two new chromosomes are obtained after a translocation. When the chromosomes are interchanging equal length prefixes, the translocation operation is called uniform. A translocation sequence is a series of translocation operations applied to an initial genome, represented as a set of strings (initial set), resulting in a new genome, also represented as a set of strings (target set). Given a translocation sequence, if the strings exchanging prefixes are part of the initial set or have additional copies created by preceding translocations than those utilised, then the translocation sequence is referred to as contiguous. The translocation distance between two given genomes is defined as the minimum number of translocation operations necessary to obtain one genome from the other. We introduce a new polynomial time exact algorithm to compute the uniform contiguous translocation distance for a target genome of size two. Then, we present a polynomial time 2-approximation algorithm for the non-uniform contiguous translocation distance considering a target genome of size one.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Theoretical Computer Science
Theoretical Computer Science 工程技术-计算机:理论方法
CiteScore
2.60
自引率
18.20%
发文量
471
审稿时长
12.6 months
期刊介绍: Theoretical Computer Science is mathematical and abstract in spirit, but it derives its motivation from practical and everyday computation. Its aim is to understand the nature of computation and, as a consequence of this understanding, provide more efficient methodologies. All papers introducing or studying mathematical, logic and formal concepts and methods are welcome, provided that their motivation is clearly drawn from the field of computing.
期刊最新文献
Byzantine-tolerant privacy-preserving atomic register Correctness and fairness of committee-based blockchains: A case study of Tendermint’s repeated consensus On languages of transduced-input finite automata with translucent letters Self-stabilizing graph exploration by a single agent Grammar index by induced suffix sorting
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1