{"title":"The structured coalescent in the context of gene copy number variation","authors":"Moritz Otto, Thomas Wiehe","doi":"10.1016/j.tpb.2023.08.001","DOIUrl":null,"url":null,"abstract":"<div><p>The <em>Structured Coalescent</em> was introduced to describe the coalescent process in spatially subdivided populations with migration. Here, we re-interpret migration routes of individuals in the original model as “migration routes” of single genes in tandemly arranged gene arrays. A gene copy may change its position within the array via unequal recombination. Hence, in a coalescent framework, two copies sampled from two chromosomes may coalesce only if they are at exactly homologous positions. Otherwise, one or multiple recombination events have to occur before they can coalesce, thereby increasing mean coalescence time and expected genetic diversity among the copies in a gene array.</p><p>We explicitly calculate the transition probabilities on these routes backward in time. We simulate the structured coalescent with migration and coalescence rates informed by the unequal recombination process of gene copies. With this novel interpretation of population structure models we determine coalescence times and expected genetic diversity in samples of orthologous and paralogous copies from a gene family. As a case study, we discuss the site frequency spectrum of a small gene family in the two scenarios of high and of no gene copy number variation among individuals. These examples underline the significance of our model, since standard test-statistics may lead to misinterpretations when analyzing sequence data of multi-copy genes due to their different expected genetic diversity.</p></div>","PeriodicalId":49437,"journal":{"name":"Theoretical Population Biology","volume":"154 ","pages":"Pages 67-78"},"PeriodicalIF":1.2000,"publicationDate":"2023-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Theoretical Population Biology","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0040580923000564","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The Structured Coalescent was introduced to describe the coalescent process in spatially subdivided populations with migration. Here, we re-interpret migration routes of individuals in the original model as “migration routes” of single genes in tandemly arranged gene arrays. A gene copy may change its position within the array via unequal recombination. Hence, in a coalescent framework, two copies sampled from two chromosomes may coalesce only if they are at exactly homologous positions. Otherwise, one or multiple recombination events have to occur before they can coalesce, thereby increasing mean coalescence time and expected genetic diversity among the copies in a gene array.
We explicitly calculate the transition probabilities on these routes backward in time. We simulate the structured coalescent with migration and coalescence rates informed by the unequal recombination process of gene copies. With this novel interpretation of population structure models we determine coalescence times and expected genetic diversity in samples of orthologous and paralogous copies from a gene family. As a case study, we discuss the site frequency spectrum of a small gene family in the two scenarios of high and of no gene copy number variation among individuals. These examples underline the significance of our model, since standard test-statistics may lead to misinterpretations when analyzing sequence data of multi-copy genes due to their different expected genetic diversity.
期刊介绍:
An interdisciplinary journal, Theoretical Population Biology presents articles on theoretical aspects of the biology of populations, particularly in the areas of demography, ecology, epidemiology, evolution, and genetics. Emphasis is on the development of mathematical theory and models that enhance the understanding of biological phenomena.
Articles highlight the motivation and significance of the work for advancing progress in biology, relying on a substantial mathematical effort to obtain biological insight. The journal also presents empirical results and computational and statistical methods directly impinging on theoretical problems in population biology.