{"title":"Exact and approximation algorithms for the contiguous translocation distance problem","authors":"Maria Constantin , Alexandru Popa","doi":"10.1016/j.tcs.2024.115003","DOIUrl":null,"url":null,"abstract":"<div><div>Biological computation is the field that studies the computations performed by the biological systems and includes the development of algorithms or other computational techniques inspired by nature. The genome rearrangements that occur during genome evolution are an important example of a natural computation process which inspired multiple problems that can be solved using combinatorial models. A popular problem inspired by genome evolution is computing the genetic distance between two organisms by identifying the minimum number of genome rearrangements needed to obtain one genome from the other. Given two chromosomes, represented as strings over the DNA alphabet {A, C, G, T}, the translocation operation is defined as a prefix interchange between these chromosomes. Thus, two new chromosomes are obtained after a translocation. When the chromosomes are interchanging equal length prefixes, the translocation operation is called uniform. A translocation sequence is a series of translocation operations applied to an initial genome, represented as a set of strings (initial set), resulting in a new genome, also represented as a set of strings (target set). Given a translocation sequence, if the strings exchanging prefixes are part of the initial set or have additional copies created by preceding translocations than those utilised, then the translocation sequence is referred to as contiguous. The translocation distance between two given genomes is defined as the minimum number of translocation operations necessary to obtain one genome from the other. We introduce a new polynomial time exact algorithm to compute the uniform contiguous translocation distance for a target genome of size two. Then, we present a polynomial time 2-approximation algorithm for the non-uniform contiguous translocation distance considering a target genome of size one.</div></div>","PeriodicalId":49438,"journal":{"name":"Theoretical Computer Science","volume":"1026 ","pages":"Article 115003"},"PeriodicalIF":0.9000,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Theoretical Computer Science","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0304397524006200","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Biological computation is the field that studies the computations performed by the biological systems and includes the development of algorithms or other computational techniques inspired by nature. The genome rearrangements that occur during genome evolution are an important example of a natural computation process which inspired multiple problems that can be solved using combinatorial models. A popular problem inspired by genome evolution is computing the genetic distance between two organisms by identifying the minimum number of genome rearrangements needed to obtain one genome from the other. Given two chromosomes, represented as strings over the DNA alphabet {A, C, G, T}, the translocation operation is defined as a prefix interchange between these chromosomes. Thus, two new chromosomes are obtained after a translocation. When the chromosomes are interchanging equal length prefixes, the translocation operation is called uniform. A translocation sequence is a series of translocation operations applied to an initial genome, represented as a set of strings (initial set), resulting in a new genome, also represented as a set of strings (target set). Given a translocation sequence, if the strings exchanging prefixes are part of the initial set or have additional copies created by preceding translocations than those utilised, then the translocation sequence is referred to as contiguous. The translocation distance between two given genomes is defined as the minimum number of translocation operations necessary to obtain one genome from the other. We introduce a new polynomial time exact algorithm to compute the uniform contiguous translocation distance for a target genome of size two. Then, we present a polynomial time 2-approximation algorithm for the non-uniform contiguous translocation distance considering a target genome of size one.
期刊介绍:
Theoretical Computer Science is mathematical and abstract in spirit, but it derives its motivation from practical and everyday computation. Its aim is to understand the nature of computation and, as a consequence of this understanding, provide more efficient methodologies. All papers introducing or studying mathematical, logic and formal concepts and methods are welcome, provided that their motivation is clearly drawn from the field of computing.