M. Frohn, N. Holtgrefe, L. van Iersel, M. Jones, S. Kelk
{"title":"Invariants for level-1 phylogenetic networks under the random walk 4-state Markov model","authors":"M. Frohn, N. Holtgrefe, L. van Iersel, M. Jones, S. Kelk","doi":"arxiv-2407.11720","DOIUrl":null,"url":null,"abstract":"Phylogenetic networks can represent evolutionary events that cannot be\ndescribed by phylogenetic trees, such as hybridization, introgression, and\nlateral gene transfer. Studying phylogenetic networks under a statistical model\nof DNA sequence evolution can aid the inference of phylogenetic networks. Most\nnotably Markov models like the Jukes-Cantor or Kimura-3 model can been employed\nto infer a phylogenetic network using phylogenetic invariants. In this article\nwe determine all quadratic invariants for sunlet networks under the random walk\n4-state Markov model, which includes the aforementioned models. Taking toric\nfiber products of trees and sunlet networks, we obtain a new class of\ninvariants for level-1 phylogenetic networks under the same model. Furthermore,\nwe apply our results to the identifiability problem of a network parameter. In\nparticular, we prove that our new class of invariants of the studied model is\nnot sufficient to derive identifiability of quarnets (4-leaf networks).\nMoreover, we provide an efficient method that is faster and more reliable than\nthe state-of-the-art in finding a significant number of invariants for many\nlevel-1 phylogenetic networks.","PeriodicalId":501044,"journal":{"name":"arXiv - QuanBio - Populations and Evolution","volume":"30 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Populations and Evolution","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.11720","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Phylogenetic networks can represent evolutionary events that cannot be
described by phylogenetic trees, such as hybridization, introgression, and
lateral gene transfer. Studying phylogenetic networks under a statistical model
of DNA sequence evolution can aid the inference of phylogenetic networks. Most
notably Markov models like the Jukes-Cantor or Kimura-3 model can been employed
to infer a phylogenetic network using phylogenetic invariants. In this article
we determine all quadratic invariants for sunlet networks under the random walk
4-state Markov model, which includes the aforementioned models. Taking toric
fiber products of trees and sunlet networks, we obtain a new class of
invariants for level-1 phylogenetic networks under the same model. Furthermore,
we apply our results to the identifiability problem of a network parameter. In
particular, we prove that our new class of invariants of the studied model is
not sufficient to derive identifiability of quarnets (4-leaf networks).
Moreover, we provide an efficient method that is faster and more reliable than
the state-of-the-art in finding a significant number of invariants for many
level-1 phylogenetic networks.
系统发育网络可以代表系统发育树无法描述的进化事件,如杂交、引种和侧向基因转移。在 DNA 序列进化统计模型下研究系统发育网络有助于系统发育网络的推断。最值得注意的是,像朱克斯-康托(Jukes-Cantor)或木村-3(Kimura-3)模型这样的马尔可夫模型可以用来利用系统发育不变式推断系统发育网络。在本文中,我们确定了随机行走4态马尔可夫模型(包括上述模型)下小太阳网络的所有二次不变式。通过树和 Sunlet 网络的环状纤维乘积,我们得到了同一模型下一级系统发育网络的一类新不变式。此外,我们还将结果应用于网络参数的可识别性问题。特别是,我们证明了所研究模型的新一类不变式不足以推导出四叶网络(quarnets)的可识别性。此外,我们还提供了一种高效的方法,它比最先进的方法更快、更可靠地找到了许多一级系统发育网络的大量不变式。