Katharina T. Huber , Leo van Iersel , Vincent Moulton , Guillaume E. Scholz
{"title":"这个网络是以森林为基础的吗?","authors":"Katharina T. Huber , Leo van Iersel , Vincent Moulton , Guillaume E. Scholz","doi":"10.1016/j.ipl.2024.106500","DOIUrl":null,"url":null,"abstract":"<div><p>In evolutionary biology, networks are becoming increasingly used to represent evolutionary histories for species that have undergone non-treelike or reticulate evolution. Such networks are essentially directed acyclic graphs with a leaf set that corresponds to a collection of species, and in which non-leaf vertices with indegree 1 correspond to speciation events and vertices with indegree greater than 1 correspond to reticulate events such as gene transfer. Recently forest-based networks have been introduced, which are essentially (multi-rooted) networks that can be formed by adding some arcs to a collection of phylogenetic trees (or phylogenetic forest), where each arc is added in such a way that its ends always lie in two different trees in the forest. In this paper, we consider the complexity of deciding whether a given network is proper forest-based, that is, whether it can be formed by adding arcs to some underlying phylogenetic forest which contains the same number of trees as there are roots in the network. More specifically, we show that it is NP-complete to decide whether a tree-child network with <em>m</em> roots is proper forest-based, for each <span><math><mi>m</mi><mo>≥</mo><mn>2</mn></math></span>. Moreover, for binary networks the problem remains NP-complete when <span><math><mi>m</mi><mo>≥</mo><mn>3</mn></math></span> but becomes polynomial-time solvable for <span><math><mi>m</mi><mo>=</mo><mn>2</mn></math></span>. We also give a fixed parameter tractable (FPT) algorithm, with parameters the maximum outdegree of a vertex, the number of roots, and the number of indegree 2 vertices, for deciding if a semi-binary network is proper forest-based. A key element in proving our results is a new characterization for when a network with <em>m</em> roots is proper forest-based in terms of certain <em>m</em>-colorings.</p></div>","PeriodicalId":56290,"journal":{"name":"Information Processing Letters","volume":"187 ","pages":"Article 106500"},"PeriodicalIF":0.7000,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0020019024000309/pdfft?md5=6bb24ac246fe9301af0f20ad64b94c09&pid=1-s2.0-S0020019024000309-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Is this network proper forest-based?\",\"authors\":\"Katharina T. Huber , Leo van Iersel , Vincent Moulton , Guillaume E. Scholz\",\"doi\":\"10.1016/j.ipl.2024.106500\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In evolutionary biology, networks are becoming increasingly used to represent evolutionary histories for species that have undergone non-treelike or reticulate evolution. Such networks are essentially directed acyclic graphs with a leaf set that corresponds to a collection of species, and in which non-leaf vertices with indegree 1 correspond to speciation events and vertices with indegree greater than 1 correspond to reticulate events such as gene transfer. Recently forest-based networks have been introduced, which are essentially (multi-rooted) networks that can be formed by adding some arcs to a collection of phylogenetic trees (or phylogenetic forest), where each arc is added in such a way that its ends always lie in two different trees in the forest. In this paper, we consider the complexity of deciding whether a given network is proper forest-based, that is, whether it can be formed by adding arcs to some underlying phylogenetic forest which contains the same number of trees as there are roots in the network. More specifically, we show that it is NP-complete to decide whether a tree-child network with <em>m</em> roots is proper forest-based, for each <span><math><mi>m</mi><mo>≥</mo><mn>2</mn></math></span>. Moreover, for binary networks the problem remains NP-complete when <span><math><mi>m</mi><mo>≥</mo><mn>3</mn></math></span> but becomes polynomial-time solvable for <span><math><mi>m</mi><mo>=</mo><mn>2</mn></math></span>. We also give a fixed parameter tractable (FPT) algorithm, with parameters the maximum outdegree of a vertex, the number of roots, and the number of indegree 2 vertices, for deciding if a semi-binary network is proper forest-based. A key element in proving our results is a new characterization for when a network with <em>m</em> roots is proper forest-based in terms of certain <em>m</em>-colorings.</p></div>\",\"PeriodicalId\":56290,\"journal\":{\"name\":\"Information Processing Letters\",\"volume\":\"187 \",\"pages\":\"Article 106500\"},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2024-05-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S0020019024000309/pdfft?md5=6bb24ac246fe9301af0f20ad64b94c09&pid=1-s2.0-S0020019024000309-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Processing Letters\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0020019024000309\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020019024000309","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
摘要
在进化生物学中,人们越来越多地使用网络来表示物种的进化历史,这些物种经历了非簇状或网状进化。这类网络本质上是有向无环图,其叶集对应于物种集合,其中indegree为1的非叶顶点对应于物种分化事件,indegree大于1的顶点对应于网状事件,如基因转移。最近有人提出了基于森林的网络,这种网络本质上是(多根)网络,可以通过在系统发育树集合(或系统发育森林)中添加一些弧来形成,其中添加每个弧的方式是使其末端总是位于森林中的两棵不同的树上。在本文中,我们考虑了判定给定网络是否基于适当的森林的复杂性,即判定该网络是否可以通过向某个底层系统发育森林添加弧来形成,该森林包含的树的数量与网络中根的数量相同。更具体地说,我们证明了在每个 m≥2 的情况下,判定一个有 m 根的树子网络是否基于适当的森林是 NP-完全的。此外,对于二元网络,当 m≥3 时,问题仍然是 NP-完全的,但当 m=2 时,问题变得多项式时间可解。我们还给出了一种固定参数可控(FPT)算法,其参数为顶点的最大外度、根数和指数为 2 的顶点数,用于判断半二元网络是否是适当的森林型网络。证明我们结果的一个关键要素是,根据特定的 m-着色,对具有 m 个根的网络何时是适当的森林网络进行了新的描述。
In evolutionary biology, networks are becoming increasingly used to represent evolutionary histories for species that have undergone non-treelike or reticulate evolution. Such networks are essentially directed acyclic graphs with a leaf set that corresponds to a collection of species, and in which non-leaf vertices with indegree 1 correspond to speciation events and vertices with indegree greater than 1 correspond to reticulate events such as gene transfer. Recently forest-based networks have been introduced, which are essentially (multi-rooted) networks that can be formed by adding some arcs to a collection of phylogenetic trees (or phylogenetic forest), where each arc is added in such a way that its ends always lie in two different trees in the forest. In this paper, we consider the complexity of deciding whether a given network is proper forest-based, that is, whether it can be formed by adding arcs to some underlying phylogenetic forest which contains the same number of trees as there are roots in the network. More specifically, we show that it is NP-complete to decide whether a tree-child network with m roots is proper forest-based, for each . Moreover, for binary networks the problem remains NP-complete when but becomes polynomial-time solvable for . We also give a fixed parameter tractable (FPT) algorithm, with parameters the maximum outdegree of a vertex, the number of roots, and the number of indegree 2 vertices, for deciding if a semi-binary network is proper forest-based. A key element in proving our results is a new characterization for when a network with m roots is proper forest-based in terms of certain m-colorings.
期刊介绍:
Information Processing Letters invites submission of original research articles that focus on fundamental aspects of information processing and computing. This naturally includes work in the broadly understood field of theoretical computer science; although papers in all areas of scientific inquiry will be given consideration, provided that they describe research contributions credibly motivated by applications to computing and involve rigorous methodology. High quality experimental papers that address topics of sufficiently broad interest may also be considered.
Since its inception in 1971, Information Processing Letters has served as a forum for timely dissemination of short, concise and focused research contributions. Continuing with this tradition, and to expedite the reviewing process, manuscripts are generally limited in length to nine pages when they appear in print.