首页 > 最新文献

Genome Biology and Evolution最新文献

英文 中文
Species Tree Branch Length Estimation despite Incomplete Lineage Sorting, Duplication, and Loss. 不完全谱系分类、重复和丢失情况下的物种树枝长度估计。
IF 2.8 2区 生物学 Q2 EVOLUTIONARY BIOLOGY Pub Date : 2025-10-29 DOI: 10.1093/gbe/evaf200
Yasamin Tabatabaee, Chao Zhang, Shayesteh Arasti, Siavash Mirarab

Phylogenetic branch lengths are essential for many analyses, such as estimating divergence times, analyzing rate changes, and studying adaptation. However, true gene tree heterogeneity due to incomplete lineage sorting, gene duplication and loss, and horizontal gene transfer can complicate the estimation of species tree branch lengths. While several tools exist for estimating the topology of a species tree addressing various causes of gene tree discordance, much less attention has been paid to branch length estimation on multi-locus datasets. For single-copy gene trees, some methods are available that summarize gene tree branch lengths onto a species tree, including coalescent-based methods that account for heterogeneity due to incomplete lineage sorting. However, no such branch length estimation method exists for multi-copy gene family trees that have evolved with gene duplication and loss. To address this gap, we introduce the CASTLES-Pro algorithm for estimating species tree branch lengths while accounting for both gene duplication and loss and incomplete lineage sorting. CASTLES-Pro improves on the existing coalescent-based branch length estimation method CASTLES by increasing its accuracy for single-copy gene trees and extending it to handle multi-copy ones. Our simulation studies show that CASTLES-Pro is generally more accurate than alternatives, eliminating the systematic bias toward overestimating terminal branch lengths often observed when using concatenation. Moreover, while not theoretically designed for horizontal gene transfer, we show that CASTLES-Pro is relatively robust to random horizontal gene transfer, though its accuracy can degrade at the highest levels of horizontal gene transfer.

系统发育分支长度在许多分析中都是必不可少的,例如估计分化时间、分析速率变化和研究适应性。然而,由于谱系分类不完整、基因复制和丢失以及基因水平转移等原因,真正的基因树异质性会使物种树枝长度的估计复杂化。虽然有几种工具用于估计物种树的拓扑结构,以解决基因树不一致的各种原因,但对多位点数据集的分支长度估计的关注较少。对于单拷贝基因树,一些方法可以将基因树分支长度总结到物种树上,包括基于聚结的方法,该方法可以解释由于谱系分类不完整而导致的异质性。然而,对于基因重复和丢失的多拷贝基因家族树,还没有这样的分支长度估计方法。为了解决这一差距,我们引入了CASTLES-Pro算法来估计物种树枝长度,同时考虑到基因复制和丢失以及不完整的谱系分类。CASTLES- pro对现有的基于聚结的分支长度估计方法CASTLES进行了改进,提高了其对单拷贝基因树的准确性,并将其扩展到多拷贝基因树。我们的模拟研究表明,CASTLES-Pro通常比其他选择更准确,消除了在使用串联时经常观察到的对终端分支长度高估的系统性偏见。此外,虽然理论上不是为水平基因转移设计的,但我们表明CASTLES-Pro对随机水平基因转移相对稳健,尽管其准确性在水平基因转移的最高水平时会下降。
{"title":"Species Tree Branch Length Estimation despite Incomplete Lineage Sorting, Duplication, and Loss.","authors":"Yasamin Tabatabaee, Chao Zhang, Shayesteh Arasti, Siavash Mirarab","doi":"10.1093/gbe/evaf200","DOIUrl":"10.1093/gbe/evaf200","url":null,"abstract":"<p><p>Phylogenetic branch lengths are essential for many analyses, such as estimating divergence times, analyzing rate changes, and studying adaptation. However, true gene tree heterogeneity due to incomplete lineage sorting, gene duplication and loss, and horizontal gene transfer can complicate the estimation of species tree branch lengths. While several tools exist for estimating the topology of a species tree addressing various causes of gene tree discordance, much less attention has been paid to branch length estimation on multi-locus datasets. For single-copy gene trees, some methods are available that summarize gene tree branch lengths onto a species tree, including coalescent-based methods that account for heterogeneity due to incomplete lineage sorting. However, no such branch length estimation method exists for multi-copy gene family trees that have evolved with gene duplication and loss. To address this gap, we introduce the CASTLES-Pro algorithm for estimating species tree branch lengths while accounting for both gene duplication and loss and incomplete lineage sorting. CASTLES-Pro improves on the existing coalescent-based branch length estimation method CASTLES by increasing its accuracy for single-copy gene trees and extending it to handle multi-copy ones. Our simulation studies show that CASTLES-Pro is generally more accurate than alternatives, eliminating the systematic bias toward overestimating terminal branch lengths often observed when using concatenation. Moreover, while not theoretically designed for horizontal gene transfer, we show that CASTLES-Pro is relatively robust to random horizontal gene transfer, though its accuracy can degrade at the highest levels of horizontal gene transfer.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":"17 11","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12648238/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145603891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Testing the Mother's Curse Hypothesis in Human Mitochondrial Genome Evolution. 在人类线粒体基因组进化中测试母亲诅咒假说。
IF 2.8 2区 生物学 Q2 EVOLUTIONARY BIOLOGY Pub Date : 2025-10-29 DOI: 10.1093/gbe/evaf207
Ruiqi Yuan, Jianzhi Zhang

In species where mitochondrial DNA (mtDNA) is maternally inherited such as vertebrates, mtDNA mutations harming males only are not subject to purifying selection and thus can spread in a population, especially when these mutations benefit females. Therefore, the mother's curse hypothesis (MCH) posits a greater mtDNA mutation load in males than in females. MCH is potentially important for human health, disease, and evolution, but a systematic test that considers the vast human mtDNA variation is lacking. Analyzing the genotypic and phenotypic data of approximately 0.5 million British participants in the UK Biobank, we estimate the reproductive fitness of mtDNA variants in each sex. Contradicting MCH, a positive intersexual correlation in the number of offspring exists across mitochondrial haplogroups. While a significant variation in the number of opposite-sex sexual partners-a proxy for reproductive fitness in premodern societies-is present among mitochondrial haplogroups, no significant intersexual correlation in this quantity is detected. The frequencies of a few mtDNA variants differ significantly between males and females, suggesting that these variants differentially affect the survival in the two sexes, but the number of such variants with lower male frequencies is not significantly different from that with lower female frequencies. Analysis of disease associations also finds no enrichment of male disease-associated mtDNA variants despite the discovery of multiple sex-biased disease associations. Together, these findings provide no genomic support to MCH in humans and suggest no difference in mtDNA mutation load between the two sexes that is detectable in the UK Biobank.

在线粒体DNA (mtDNA)是母系遗传的物种中,如脊椎动物,mtDNA突变只伤害雄性,不受净化选择的影响,因此可以在种群中传播,特别是当这些突变有利于雌性时。因此,母亲诅咒假说(MCH)假设男性的mtDNA突变负荷大于女性。MCH对人类健康、疾病和进化具有潜在的重要意义,但缺乏考虑人类mtDNA巨大变异的系统测试。通过分析英国生物银行中约50万英国参与者的基因型和表型数据,我们估计了每个性别mtDNA变异的生殖适合度。与MCH相反,在线粒体单倍群中存在后代数量的正性间相关性。在线粒体单倍群中,异性伴侣的数量(前现代社会中生殖适应性的代表)存在显著差异,但在这一数量上没有发现显著的两性相关性。少数mtDNA变异的频率在男性和女性之间存在显著差异,表明这些变异对两性生存的影响存在差异,但男性频率较低的mtDNA变异的数量与女性频率较低的mtDNA变异的数量没有显著差异。疾病关联分析还发现,尽管发现了多种性别偏倚的疾病关联,但男性疾病相关的mtDNA变异并未富集。总之,这些发现没有提供人类MCH的基因组支持,并且表明在英国生物银行中检测到的两性之间的mtDNA突变负荷没有差异。
{"title":"Testing the Mother's Curse Hypothesis in Human Mitochondrial Genome Evolution.","authors":"Ruiqi Yuan, Jianzhi Zhang","doi":"10.1093/gbe/evaf207","DOIUrl":"10.1093/gbe/evaf207","url":null,"abstract":"<p><p>In species where mitochondrial DNA (mtDNA) is maternally inherited such as vertebrates, mtDNA mutations harming males only are not subject to purifying selection and thus can spread in a population, especially when these mutations benefit females. Therefore, the mother's curse hypothesis (MCH) posits a greater mtDNA mutation load in males than in females. MCH is potentially important for human health, disease, and evolution, but a systematic test that considers the vast human mtDNA variation is lacking. Analyzing the genotypic and phenotypic data of approximately 0.5 million British participants in the UK Biobank, we estimate the reproductive fitness of mtDNA variants in each sex. Contradicting MCH, a positive intersexual correlation in the number of offspring exists across mitochondrial haplogroups. While a significant variation in the number of opposite-sex sexual partners-a proxy for reproductive fitness in premodern societies-is present among mitochondrial haplogroups, no significant intersexual correlation in this quantity is detected. The frequencies of a few mtDNA variants differ significantly between males and females, suggesting that these variants differentially affect the survival in the two sexes, but the number of such variants with lower male frequencies is not significantly different from that with lower female frequencies. Analysis of disease associations also finds no enrichment of male disease-associated mtDNA variants despite the discovery of multiple sex-biased disease associations. Together, these findings provide no genomic support to MCH in humans and suggest no difference in mtDNA mutation load between the two sexes that is detectable in the UK Biobank.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12629233/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145476974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SmithRNAs: A Common Feature among Metazoa. SmithRNAs:后生动物的共同特征。
IF 2.8 2区 生物学 Q2 EVOLUTIONARY BIOLOGY Pub Date : 2025-10-29 DOI: 10.1093/gbe/evaf208
Giovanni Marturano, Diego Carli, Claudio Cucini, Elena Cardaioli, Antonio Carapelli, Federico Plazzi, Francesco Frati, Marco Passamonti, Francesco Nardi

SmithRNAs are a novel class of small noncoding RNAs that are encoded in the mitochondrial genome and regulate the expression of nuclear transcripts. They have been recently described in the Manila clam Ruditapes philippinarum and their biological function has been confirmed in vivo. It is currently unclear whether smithRNAs are a unique feature of this species, possibly related to the peculiar mechanism of sex determination observed in bivalves, or a common feature of Metazoa. Aiming at a broader survey on the presence and biological features of smithRNAs across Metazoa, 14 species were selected to represent major lineages, and for each species small RNAseq data, as well as the transcriptome, mitochondrial and nuclear genomes, were collected from the literature or sequenced/assembled de novo. Data were analyzed using the SmithHunter pipeline, a recently published tool specifically designed to identify smithRNAs and their targets. Candidate smithRNAs were identified in all species studied, supporting the notion that smithRNAs are a common feature across Metazoa. SmithRNAs are generally encoded within other genes, on the same strand, and with a preference for mitochondrial rRNAs and tRNAs. Based on their strandedness and preferential position at the 5'-end of the encompassing gene, a transcription mechanism is proposed where smithRNAs are cleaved off from gene-specific transcripts after the maturation of the two primary mitochondrial transcripts. A substantial variability was identified concerning the possible nuclear targets of smithRNAs, with a preference for regulation/response terms, mitochondrial functions, and sex/germline associated terms.

SmithRNAs是一类新型的小非编码rna,编码于线粒体基因组并调节核转录物的表达。它们最近在马尼拉蛤Ruditapes philippinarum中被描述,其生物学功能已在体内得到证实。目前尚不清楚smithrna是否是该物种的独特特征,可能与在双壳类动物中观察到的特殊性别决定机制有关,或者是后生动物的共同特征。为了对后生动物中smithRNAs的存在和生物学特征进行更广泛的调查,我们选择了14个物种作为主要谱系,并从文献中收集每个物种的小RNAseq数据,以及转录组、线粒体和核基因组,或重新测序/组装。数据分析使用SmithHunter管道,这是一个最近发布的专门用于识别smithrna及其靶标的工具。候选smithRNAs在所有被研究的物种中都被鉴定出来,这支持了smithRNAs是后生动物共同特征的观点。SmithRNAs通常在其他基因中编码,在同一条链上,并且偏爱线粒体rnas和tRNAs。基于它们的链结性和在包含基因5'端的优先位置,提出了一种转录机制,即在两个主要线粒体转录物成熟后,smithrna从基因特异性转录物中分离出来。在smithrna可能的核靶标上发现了大量的可变性,其中包括调控/响应术语、线粒体功能和性别/生殖系相关术语的偏好。
{"title":"SmithRNAs: A Common Feature among Metazoa.","authors":"Giovanni Marturano, Diego Carli, Claudio Cucini, Elena Cardaioli, Antonio Carapelli, Federico Plazzi, Francesco Frati, Marco Passamonti, Francesco Nardi","doi":"10.1093/gbe/evaf208","DOIUrl":"10.1093/gbe/evaf208","url":null,"abstract":"<p><p>SmithRNAs are a novel class of small noncoding RNAs that are encoded in the mitochondrial genome and regulate the expression of nuclear transcripts. They have been recently described in the Manila clam Ruditapes philippinarum and their biological function has been confirmed in vivo. It is currently unclear whether smithRNAs are a unique feature of this species, possibly related to the peculiar mechanism of sex determination observed in bivalves, or a common feature of Metazoa. Aiming at a broader survey on the presence and biological features of smithRNAs across Metazoa, 14 species were selected to represent major lineages, and for each species small RNAseq data, as well as the transcriptome, mitochondrial and nuclear genomes, were collected from the literature or sequenced/assembled de novo. Data were analyzed using the SmithHunter pipeline, a recently published tool specifically designed to identify smithRNAs and their targets. Candidate smithRNAs were identified in all species studied, supporting the notion that smithRNAs are a common feature across Metazoa. SmithRNAs are generally encoded within other genes, on the same strand, and with a preference for mitochondrial rRNAs and tRNAs. Based on their strandedness and preferential position at the 5'-end of the encompassing gene, a transcription mechanism is proposed where smithRNAs are cleaved off from gene-specific transcripts after the maturation of the two primary mitochondrial transcripts. A substantial variability was identified concerning the possible nuclear targets of smithRNAs, with a preference for regulation/response terms, mitochondrial functions, and sex/germline associated terms.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12628786/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145476893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chromosome-scale Genomes Show Rapid Diversification and Ancient Gene Flow Among Bear Species. 染色体尺度的基因组显示了熊类物种之间的快速多样化和古老的基因流动。
IF 2.8 2区 生物学 Q2 EVOLUTIONARY BIOLOGY Pub Date : 2025-10-29 DOI: 10.1093/gbe/evaf188
T Brock Wooldridge, Merly Escalona, Blair W Perry, Alexis N Enstrom, Dalya Salih, William E Seligmann, Samuel Sacco, Katherine L Moon, Ruta Sahasrabudhe, Noravit Chumchim, Oanh Nguyen, Joanna L Kelley, Ross D E MacPhee, Beth Shapiro

Reconstructions of evolutionary history can be restricted by a lack of high-quality reference genomes. To date, only four of the eight species of bears (family Ursidae) have chromosome-level genome assemblies. Here, we present assemblies for three additional species-the sun, sloth, and Andean bears-and use a whole-genome alignment of all bear species and other carnivores to reconstruct the evolution of Ursidae. Multiple divergence dating approaches suggest that the six Ursine bears likely diversified in the last 5 Ma, but that divergence times within Ursinae are significantly impacted by gene tree heterogeneity. Consistent with this, we observe that nearly 50% of gene trees conflict with our highly supported species tree, a pattern driven by a significant early hybridization event within Ursinae. We also find that the karyotype of Ursinae is largely similar to the ancestral karyotype of all bears twenty million years prior. In contrast to this conservation of structure, dozens of chromosomal fissions and fusions associated with LINE/L1 retrotransposons dramatically restructured the genomes of the giant panda and Andean bear. Finally, we leverage these genomes to identify species-specific evidence for positive selection on genes associated with color, diet, and metabolism. One of these genes, TCPN2, has a role in pigmentation and shows a series of amino acid mutations in the polar bear over the last 0.5 Ma. Collectively, these new genomic resources enable improved reconstruction of the complex evolutionary history of bears and clarify how this enigmatic group diversified.

由于缺乏高质量的参考基因组,进化历史的重建可能受到限制。迄今为止,熊(熊科)的八种中只有四种具有染色体水平的基因组组装。在这里,我们展示了另外三个物种——太阳熊、树懒和眼镜熊的集合,并使用所有熊类和其他食肉动物的全基因组比对来重建熊科的进化。基于融合模式的分化年代测定表明,熊类的分化速度比之前报道的要快,所有熊类的起源时间为~ 19 Ma,而熊科的6个物种的起源时间为~ 3.3 Ma。令人惊讶的是,我们观察到近50%的基因树拓扑结构与我们高度支持的物种树相冲突,这一模式是由熊科中一个重要的早期杂交事件驱动的。我们还发现熊科的祖先核型在大约1500万年的时间里与所有熊类的祖先核型在很大程度上保持保守。与这种稳定性相反,与LINE/L1反转录转座子相关的数十个染色体分裂和融合戏剧性地重组了大熊猫和眼镜熊的基因组。最后,我们利用这些基因组来确定与颜色、饮食和新陈代谢相关的基因正选择的物种特异性证据。其中一个基因TCPN2在色素沉着中起作用,在过去的0.5 Ma中,北极熊出现了一系列氨基酸突变。总的来说,这些新的基因组资源能够更好地重建熊的复杂进化历史,并阐明这个神秘的群体是如何多样化的。
{"title":"Chromosome-scale Genomes Show Rapid Diversification and Ancient Gene Flow Among Bear Species.","authors":"T Brock Wooldridge, Merly Escalona, Blair W Perry, Alexis N Enstrom, Dalya Salih, William E Seligmann, Samuel Sacco, Katherine L Moon, Ruta Sahasrabudhe, Noravit Chumchim, Oanh Nguyen, Joanna L Kelley, Ross D E MacPhee, Beth Shapiro","doi":"10.1093/gbe/evaf188","DOIUrl":"10.1093/gbe/evaf188","url":null,"abstract":"<p><p>Reconstructions of evolutionary history can be restricted by a lack of high-quality reference genomes. To date, only four of the eight species of bears (family Ursidae) have chromosome-level genome assemblies. Here, we present assemblies for three additional species-the sun, sloth, and Andean bears-and use a whole-genome alignment of all bear species and other carnivores to reconstruct the evolution of Ursidae. Multiple divergence dating approaches suggest that the six Ursine bears likely diversified in the last 5 Ma, but that divergence times within Ursinae are significantly impacted by gene tree heterogeneity. Consistent with this, we observe that nearly 50% of gene trees conflict with our highly supported species tree, a pattern driven by a significant early hybridization event within Ursinae. We also find that the karyotype of Ursinae is largely similar to the ancestral karyotype of all bears twenty million years prior. In contrast to this conservation of structure, dozens of chromosomal fissions and fusions associated with LINE/L1 retrotransposons dramatically restructured the genomes of the giant panda and Andean bear. Finally, we leverage these genomes to identify species-specific evidence for positive selection on genes associated with color, diet, and metabolism. One of these genes, TCPN2, has a role in pigmentation and shows a series of amino acid mutations in the polar bear over the last 0.5 Ma. Collectively, these new genomic resources enable improved reconstruction of the complex evolutionary history of bears and clarify how this enigmatic group diversified.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12631120/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145232508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Before the East African radiation: sex chromosome systems in basal haplotilapiine cichlids. 东非辐射之前:基型单罗非鱼的性染色体系统。
IF 2.8 2区 生物学 Q2 EVOLUTIONARY BIOLOGY Pub Date : 2025-10-09 DOI: 10.1093/gbe/evaf191
Kristen A Behrens, Zexuan Zhao, Michael R Kidd, Alfred Maluwa, Avner Cnaani, Stephan Koblmüller, Thomas D Kocher

Cichlid fishes have undergone an extraordinary diversification in East Africa. They also have a high rate of sex chromosome turnover. This clade provides an opportunity to study the rates and patterns of sex chromosome turnover, and the interactions of sex chromosome turnover with adaptation and speciation. Here we investigate the evolution sex chromosomes in the tribes Tilapiini, Coptodonini, Heterotilapiini, Gobiocichlini, Pelmatolapiini and Oreochromini. We assembled chromosome-scale genomes of male and female Pelmatotilapia mariae. We then mapped pooled sequencing reads for males and females of P. mariae and 12 additional species on several genome assemblies to identify sex chromosomes. Tilapia sparrmanii and Oreochromis aureus share a ZW system on LG3 that overlaps the ZW system identified in P. mariae. Heterotilapia buettikoferi, T. brevimanus and Coptodon bakossiorum share an XY system mapping to another region of LG3. Coptodon zilli, Sarotherodon galilaeus, S. melanotheron and O. niloticus share an XY system on LG1. Finally, O. mossambicus and O. shiranus share an XY system on LG14 and we find evidence of an XY system on LG20 in Danakilia sp. 'shukoray'. The phylogenetic distribution of these sex determination systems suggests a long period of polymorphism for the systems on LG1 and LG3 and a generally lower rate of sex chromosome turnover in these lineages compared to the lacustrine lineages of the East African radiation. Our data is not consistent with the recent suggestion of figla and banf2 as candidate genes for the LG1XY and LG3ZW systems. We suggest a possible role for ubiquitination in the XY systems on LG3.

慈鲷在东非经历了非同寻常的多样化。它们的性染色体周转率也很高。这个分支提供了一个机会来研究性染色体更替的速率和模式,以及性染色体更替与适应和物种形成的相互作用。本文研究了罗非鱼(Tilapiini)、Coptodonini、Heterotilapiini、Gobiocichlini、Pelmatolapiini和Oreochromini部落的性染色体进化。我们组装了雄性和雌性马里罗非鱼的染色体尺度基因组。然后,我们对mariae和其他12个物种的雄性和雌性基因组序列进行了汇总测序,以确定性染色体。sparrmanii罗非鱼和Oreochromis aureus在LG3上共享一个ZW系统,该系统与P. mariae中发现的ZW系统重叠。buettikoferi异罗非鱼,brevimanus和Coptodon bakossiorum共享一个XY系统映射到LG3的另一个区域。Coptodon zilli, Sarotherodon galilaeus, S. melanotheron和O. niloticus在LG1上共享一个XY系统。最后,mosambicus和O. shiranus在LG14上共享一个XY系统,并在Danakilia sp. l20上发现了XY系统的证据。“shukoray”。这些性别决定系统的系统发育分布表明,LG1和LG3上的系统存在长时间的多态性,与东非辐射的湖泊谱系相比,这些谱系的性染色体周转率普遍较低。我们的数据与最近关于figla和banf2是LG1XY和LG3ZW系统的候选基因的建议不一致。我们认为泛素化可能在LG3上的XY系统中起作用。
{"title":"Before the East African radiation: sex chromosome systems in basal haplotilapiine cichlids.","authors":"Kristen A Behrens, Zexuan Zhao, Michael R Kidd, Alfred Maluwa, Avner Cnaani, Stephan Koblmüller, Thomas D Kocher","doi":"10.1093/gbe/evaf191","DOIUrl":"https://doi.org/10.1093/gbe/evaf191","url":null,"abstract":"<p><p>Cichlid fishes have undergone an extraordinary diversification in East Africa. They also have a high rate of sex chromosome turnover. This clade provides an opportunity to study the rates and patterns of sex chromosome turnover, and the interactions of sex chromosome turnover with adaptation and speciation. Here we investigate the evolution sex chromosomes in the tribes Tilapiini, Coptodonini, Heterotilapiini, Gobiocichlini, Pelmatolapiini and Oreochromini. We assembled chromosome-scale genomes of male and female Pelmatotilapia mariae. We then mapped pooled sequencing reads for males and females of P. mariae and 12 additional species on several genome assemblies to identify sex chromosomes. Tilapia sparrmanii and Oreochromis aureus share a ZW system on LG3 that overlaps the ZW system identified in P. mariae. Heterotilapia buettikoferi, T. brevimanus and Coptodon bakossiorum share an XY system mapping to another region of LG3. Coptodon zilli, Sarotherodon galilaeus, S. melanotheron and O. niloticus share an XY system on LG1. Finally, O. mossambicus and O. shiranus share an XY system on LG14 and we find evidence of an XY system on LG20 in Danakilia sp. 'shukoray'. The phylogenetic distribution of these sex determination systems suggests a long period of polymorphism for the systems on LG1 and LG3 and a generally lower rate of sex chromosome turnover in these lineages compared to the lacustrine lineages of the East African radiation. Our data is not consistent with the recent suggestion of figla and banf2 as candidate genes for the LG1XY and LG3ZW systems. We suggest a possible role for ubiquitination in the XY systems on LG3.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145250890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recurrent Evolutionary Innovations in Rodent and Primate Schlafen Genes. 啮齿动物和灵长类动物施拉芬基因的循环进化创新。
IF 2.8 2区 生物学 Q2 EVOLUTIONARY BIOLOGY Pub Date : 2025-09-30 DOI: 10.1093/gbe/evaf172
Joris Mordier, Marine Fraisse, Michel Cohen-Tannoudji, Antoine Molaro

SCHLAFEN proteins are a large family of RNase-related enzymes carrying essential immune and developmental functions. Despite these important roles, Schlafen genes display varying degrees of evolutionary conservation in mammals. While this appears to influence their molecular activities, a detailed understanding of these evolutionary innovations is still lacking. Here, we used in-depth phylogenomic approaches to characterize the evolutionary trajectories and selective forces shaping mammalian Schlafen genes. We traced lineage-specific Schlafen amplifications and found that recent duplicates evolved under distinct selective forces, supporting repeated subfunctionalization cycles. Codon-level natural selection analyses in primates and rodents identified recurrent positive selection over Schlafen protein domains engaged in viral interactions. Combining known crystal structures and predicted protein structures, we discovered a novel class of rapidly evolving residues enriched at the contact interface of SCHLAFEN protein dimers. Our results suggest that inter-SCHLAFEN compatibilities are under strong selective pressures and are likely to impact their molecular functions. We posit that cycles of genetic conflicts with pathogens and between paralogs drove Schlafens' recurrent evolutionary innovations in mammals.

SCHLAFEN蛋白是rnase相关的一个大家族,具有重要的免疫和发育功能。尽管有这些重要的作用,施拉芬基因在哺乳动物中表现出不同程度的进化保守性。虽然这似乎影响了它们的分子活动,但对这些进化创新的详细了解仍然缺乏。在这里,我们使用深入的系统基因组方法来表征进化轨迹和选择力量塑造哺乳动物Schlafen基因。我们追踪了谱系特异性Schlafen扩增,发现最近的重复在不同的选择力下进化,支持重复的亚功能化周期。在灵长类动物和啮齿类动物的密码子水平的自然选择分析中,发现了参与病毒相互作用的Schlafen蛋白域的反复正选择。结合已知的晶体和预测的蛋白质结构,我们发现了一类新的快速进化的残基富集在SCHLAFEN蛋白二聚体的接触界面上。我们的研究结果表明,SCHLAFEN之间的相容性受到强大的选择压力,并可能影响它们的分子功能。我们假设,与病原体的遗传冲突循环以及同类之间的遗传冲突驱动了施拉芬斯在哺乳动物中反复出现的进化创新。
{"title":"Recurrent Evolutionary Innovations in Rodent and Primate Schlafen Genes.","authors":"Joris Mordier, Marine Fraisse, Michel Cohen-Tannoudji, Antoine Molaro","doi":"10.1093/gbe/evaf172","DOIUrl":"10.1093/gbe/evaf172","url":null,"abstract":"<p><p>SCHLAFEN proteins are a large family of RNase-related enzymes carrying essential immune and developmental functions. Despite these important roles, Schlafen genes display varying degrees of evolutionary conservation in mammals. While this appears to influence their molecular activities, a detailed understanding of these evolutionary innovations is still lacking. Here, we used in-depth phylogenomic approaches to characterize the evolutionary trajectories and selective forces shaping mammalian Schlafen genes. We traced lineage-specific Schlafen amplifications and found that recent duplicates evolved under distinct selective forces, supporting repeated subfunctionalization cycles. Codon-level natural selection analyses in primates and rodents identified recurrent positive selection over Schlafen protein domains engaged in viral interactions. Combining known crystal structures and predicted protein structures, we discovered a novel class of rapidly evolving residues enriched at the contact interface of SCHLAFEN protein dimers. Our results suggest that inter-SCHLAFEN compatibilities are under strong selective pressures and are likely to impact their molecular functions. We posit that cycles of genetic conflicts with pathogens and between paralogs drove Schlafens' recurrent evolutionary innovations in mammals.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12492265/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145091368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
De Novo Genome Sequence Assembly of the RNAi-Tractable Paramecium bursaria 186b: An Endosymbiotic Model System. rnai易感滑囊草履虫186b内共生模型系统的从头基因组序列组装。
IF 2.8 2区 生物学 Q2 EVOLUTIONARY BIOLOGY Pub Date : 2025-09-30 DOI: 10.1093/gbe/evaf183
Guy Leonard, Benjamin H Jenkins, Fiona R Savory, Estelle S Kilias, Finlay Maguire, David S Milner, Thomas A Richards

How two species engage in stable endosymbiosis is a biological quandary. The study of facultative endosymbiotic interactions has emerged as a useful approach to understand how endosymbiotic functions can arise. The ciliate protist Paramecium bursaria hosts green algae of the order Chlorellales in a facultative photo-endosymbiosis. We have recently reported RNAi as a tool for understanding gene function in P. bursaria 186b (CCAP strain 1660/18). To complement this work, here we report a near complete host genome and transcriptome sequence dataset, using both Illumina and PacBio sequencing methods, in order to aid genome analysis and to enable the design of RNAi experiments. Our analyses demonstrate P. bursaria 186b, like other ciliates such as diverse species of Paramecia, possess numerous tiny introns. These data patterns, combined with the alternative genetic code common to ciliates, make gene identification and annotation challenging; as such, we identify gene models using Iso-Seq methodologies. These data will aid the investigation of genome evolution in the Paramecia and provide additional source data for the exploration of endosymbiotic functions.

两个物种如何进行稳定的内共生是一个生物学难题。兼性内共生相互作用的研究已经成为了解内共生功能如何产生的有用方法。在兼性光内共生中,纤毛原生生物法氏草草虫寄生小球藻目绿藻。我们最近报道了RNAi作为了解滑囊草履虫186b (CCAP菌株1660/18)基因功能的工具。为了补充这项工作,在这里,我们报告了一个近乎完整的宿主基因组和转录组序列数据集,使用Illumina和PacBio测序方法,以帮助基因组分析和设计RNAi实验。我们的分析表明,草履虫bursaria 186b与其他纤毛虫(如各种草履虫)一样,具有许多微小的内含子。这些数据,结合其他遗传密码共同的纤毛虫,使基因鉴定和注释具有挑战性;因此,我们使用Iso-Seq方法识别基因模型。这些数据将有助于草履虫基因组进化的研究,并为探索内共生功能提供额外的来源数据。
{"title":"De Novo Genome Sequence Assembly of the RNAi-Tractable Paramecium bursaria 186b: An Endosymbiotic Model System.","authors":"Guy Leonard, Benjamin H Jenkins, Fiona R Savory, Estelle S Kilias, Finlay Maguire, David S Milner, Thomas A Richards","doi":"10.1093/gbe/evaf183","DOIUrl":"10.1093/gbe/evaf183","url":null,"abstract":"<p><p>How two species engage in stable endosymbiosis is a biological quandary. The study of facultative endosymbiotic interactions has emerged as a useful approach to understand how endosymbiotic functions can arise. The ciliate protist Paramecium bursaria hosts green algae of the order Chlorellales in a facultative photo-endosymbiosis. We have recently reported RNAi as a tool for understanding gene function in P. bursaria 186b (CCAP strain 1660/18). To complement this work, here we report a near complete host genome and transcriptome sequence dataset, using both Illumina and PacBio sequencing methods, in order to aid genome analysis and to enable the design of RNAi experiments. Our analyses demonstrate P. bursaria 186b, like other ciliates such as diverse species of Paramecia, possess numerous tiny introns. These data patterns, combined with the alternative genetic code common to ciliates, make gene identification and annotation challenging; as such, we identify gene models using Iso-Seq methodologies. These data will aid the investigation of genome evolution in the Paramecia and provide additional source data for the exploration of endosymbiotic functions.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12569599/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145148740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Laboratory Evolution Reveals Transcriptional Mechanisms Underlying Thermal Adaptation of Escherichia coli. 实验室进化揭示了大肠杆菌热适应的转录机制。
IF 2.8 2区 生物学 Q2 EVOLUTIONARY BIOLOGY Pub Date : 2025-09-30 DOI: 10.1093/gbe/evaf171
Kevin Rychel, Ke Chen, Edward A Catoiu, Elina Olson, Troy E Sandberg, Ye Gao, Sibei Xu, Ying Hefner, Richard Szubin, Arjun Patel, Adam M Feist, Bernhard O Palsson

Adaptive laboratory evolution is able to generate microbial strains, which exhibit extreme phenotypes, revealing fundamental biological adaptation mechanisms. Here, we use adaptive laboratory evolution to evolve Escherichia coli strains that grow at temperatures as high as 45.3 °C, a temperature lethal to wild-type cells. The strains adopted a hypermutator phenotype and employed multiple systems-level adaptations that made global analysis of the DNA mutations difficult. Given the challenge at the genomic level, we were motivated to uncover high-temperature tolerance adaptation mechanisms at the transcriptomic level. We employed independently modulated gene set (iModulon) analysis to reveal five transcriptional mechanisms underlying growth at high temperatures. These mechanisms were connected to acquired mutations, changes in transcriptome composition, sensory inputs, phenotypes, and protein structures. They are as follows: (i) downregulation of general stress responses while upregulating the specific heat stress responses, (ii) upregulation of flagellar basal bodies without upregulating motility and upregulation fimbriae, (iii) shift toward anaerobic metabolism, (iv) shift in regulation of iron uptake away from siderophore production, and (v) upregulation of yjfIJKL, a novel heat tolerance operon whose structures we predicted with AlphaFold. iModulons associated with these five mechanisms explain nearly half of all variance in the gene expression in the adapted strains. These thermotolerance strategies reveal that optimal coordination of known stress responses and metabolism can be achieved with a small number of regulatory mutations and may suggest a new role for large protein export systems. Adaptive laboratory evolution with transcriptomic characterization is a productive approach for elucidating and interpreting adaptation to otherwise lethal stresses.

适应性实验室进化能够产生具有极端表型的微生物菌株,揭示了基本的生物适应机制。在这里,我们使用适应性实验室进化来进化大肠杆菌菌株,这些菌株在高达45.3°C的温度下生长,这是对野生型细胞致命的温度。这些菌株采用了超突变表型,并采用了多个系统级适应,这使得DNA突变的全局分析变得困难。鉴于基因组水平上的挑战,我们有动力在转录组水平上揭示高温耐受性适应机制。我们利用独立调节基因集(iModulon)分析揭示了高温下生长的五种转录机制。这些机制与获得性突变、转录组组成、感觉输入、表型和蛋白质结构的变化有关。它们分别是:(i)一般应激反应下调而特异热应激反应上调,(ii)鞭毛基体上调而不上调运动性和上调毛,(iii)向厌氧代谢转变,(iv)铁载体产生的铁摄取调节转变,以及(v) yjfIJKL上调,这是一种新的耐热操纵子,我们用AlphaFold预测了其结构。与这五种机制相关的调节子解释了适应菌株中近一半的基因表达差异。这些耐热策略表明,已知的应激反应和代谢的最佳协调可以通过少量的调控突变来实现,并可能为大型蛋白质输出系统提供新的作用。具有转录组学特征的适应性实验室进化是阐明和解释对其他致死胁迫的适应的有效方法。
{"title":"Laboratory Evolution Reveals Transcriptional Mechanisms Underlying Thermal Adaptation of Escherichia coli.","authors":"Kevin Rychel, Ke Chen, Edward A Catoiu, Elina Olson, Troy E Sandberg, Ye Gao, Sibei Xu, Ying Hefner, Richard Szubin, Arjun Patel, Adam M Feist, Bernhard O Palsson","doi":"10.1093/gbe/evaf171","DOIUrl":"10.1093/gbe/evaf171","url":null,"abstract":"<p><p>Adaptive laboratory evolution is able to generate microbial strains, which exhibit extreme phenotypes, revealing fundamental biological adaptation mechanisms. Here, we use adaptive laboratory evolution to evolve Escherichia coli strains that grow at temperatures as high as 45.3 °C, a temperature lethal to wild-type cells. The strains adopted a hypermutator phenotype and employed multiple systems-level adaptations that made global analysis of the DNA mutations difficult. Given the challenge at the genomic level, we were motivated to uncover high-temperature tolerance adaptation mechanisms at the transcriptomic level. We employed independently modulated gene set (iModulon) analysis to reveal five transcriptional mechanisms underlying growth at high temperatures. These mechanisms were connected to acquired mutations, changes in transcriptome composition, sensory inputs, phenotypes, and protein structures. They are as follows: (i) downregulation of general stress responses while upregulating the specific heat stress responses, (ii) upregulation of flagellar basal bodies without upregulating motility and upregulation fimbriae, (iii) shift toward anaerobic metabolism, (iv) shift in regulation of iron uptake away from siderophore production, and (v) upregulation of yjfIJKL, a novel heat tolerance operon whose structures we predicted with AlphaFold. iModulons associated with these five mechanisms explain nearly half of all variance in the gene expression in the adapted strains. These thermotolerance strategies reveal that optimal coordination of known stress responses and metabolism can be achieved with a small number of regulatory mutations and may suggest a new role for large protein export systems. Adaptive laboratory evolution with transcriptomic characterization is a productive approach for elucidating and interpreting adaptation to otherwise lethal stresses.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":"17 10","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12492005/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145212504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inference of Cytochrome P450 Evolutionary History Using Structural and Physicochemical Metrics. 利用结构和理化指标推断细胞色素P450的进化史。
IF 2.8 2区 生物学 Q2 EVOLUTIONARY BIOLOGY Pub Date : 2025-09-30 DOI: 10.1093/gbe/evaf178
Jamie D Dixson, Abhijay Azad, Pamela A Padilla, Rajeev K Azad

Cytochrome P450s are a superfamily of heme-binding monooxygenases involved with the detoxification of intrinsic and extrinsic toxins. They are near ubiquitous within biological domains and are found in all domains. Members of families within the superfamily are defined based on amino acid identity thresholds, with thresholds as low as 40% in some families. Relationships among Cytochrome P450 families have proven elusive due to sub-Twilight Zone interfamily identities (<30%) that result in poor multiple sequence alignment quality and thus low levels of support for downstream phylogenetic reconstructions. Despite the low identities, Cytochrome P450 structures are remarkably well conserved both within and among families. In such cases, structural phylogenetics has the potential to unveil elusive relationships because the selectively favored physicochemical properties giving rise to the structure and function of the proteins persist despite sequence-level divergence. Recently, in two separate publications, we demonstrated that by utilizing physicochemical vectors, dynamic time warping, and hierarchical clustering (PCDTW), large swaths of protein domain families and betacoronavirus receptor-binding domain clades were congruent with validated functional/structural relationships. These were important findings because anomalous sequence alignment-based maximum likelihood phylogenetic findings, which were not congruent with the known functional relationships, were resolved. That also validated the use of physicochemical vectors in making inferences about structural/functional homology. Additionally, it illuminated that the same methods might be applied to other protein families with relationships that are difficult to resolve from sequence data alone. Herein, we used Molecular Weight and Hydrophobicity Physicochemical Dynamic Time Warping (MWHP PCDTW) along with structural and sequence alignment-based phylogenetic methodologies to analyze all of the Cytochrome P450s found both in the high-fidelity Structural Classificaction of Proteins (SCOP) database and the reviewed sequences with both experimentally resolved and de novo predicted structures in the Protein Data Bank and the AlphaFold (AF) Protein Structure Database, respectively. We compared the resulting phylogenetic topologies and found that in some cases, structure-based methods may be less able to resolve random/convergent similarity than physicochemical and sequence-based methodologies. This finding agrees with previous findings that demonstrate the usefulness of physicochemical properties in resolving both random structural similarity and potentially convergent relationships.

细胞色素P450's是血红素结合单加氧酶的超家族,参与内源性和外源性毒素的解毒。它们在生物领域中几乎无处不在,在所有领域都有发现。超家族内的家族成员是根据氨基酸识别阈值来定义的,某些家族的阈值低至40%。细胞色素P450家族之间的关系被证明是难以捉摸的,因为家族间的身份处于亚模糊区(
{"title":"Inference of Cytochrome P450 Evolutionary History Using Structural and Physicochemical Metrics.","authors":"Jamie D Dixson, Abhijay Azad, Pamela A Padilla, Rajeev K Azad","doi":"10.1093/gbe/evaf178","DOIUrl":"10.1093/gbe/evaf178","url":null,"abstract":"<p><p>Cytochrome P450s are a superfamily of heme-binding monooxygenases involved with the detoxification of intrinsic and extrinsic toxins. They are near ubiquitous within biological domains and are found in all domains. Members of families within the superfamily are defined based on amino acid identity thresholds, with thresholds as low as 40% in some families. Relationships among Cytochrome P450 families have proven elusive due to sub-Twilight Zone interfamily identities (<30%) that result in poor multiple sequence alignment quality and thus low levels of support for downstream phylogenetic reconstructions. Despite the low identities, Cytochrome P450 structures are remarkably well conserved both within and among families. In such cases, structural phylogenetics has the potential to unveil elusive relationships because the selectively favored physicochemical properties giving rise to the structure and function of the proteins persist despite sequence-level divergence. Recently, in two separate publications, we demonstrated that by utilizing physicochemical vectors, dynamic time warping, and hierarchical clustering (PCDTW), large swaths of protein domain families and betacoronavirus receptor-binding domain clades were congruent with validated functional/structural relationships. These were important findings because anomalous sequence alignment-based maximum likelihood phylogenetic findings, which were not congruent with the known functional relationships, were resolved. That also validated the use of physicochemical vectors in making inferences about structural/functional homology. Additionally, it illuminated that the same methods might be applied to other protein families with relationships that are difficult to resolve from sequence data alone. Herein, we used Molecular Weight and Hydrophobicity Physicochemical Dynamic Time Warping (MWHP PCDTW) along with structural and sequence alignment-based phylogenetic methodologies to analyze all of the Cytochrome P450s found both in the high-fidelity Structural Classificaction of Proteins (SCOP) database and the reviewed sequences with both experimentally resolved and de novo predicted structures in the Protein Data Bank and the AlphaFold (AF) Protein Structure Database, respectively. We compared the resulting phylogenetic topologies and found that in some cases, structure-based methods may be less able to resolve random/convergent similarity than physicochemical and sequence-based methodologies. This finding agrees with previous findings that demonstrate the usefulness of physicochemical properties in resolving both random structural similarity and potentially convergent relationships.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12502919/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145091130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrative Genotyping and Analysis of Canine Structural Variation Using Long-read and Short-read Data. 利用长读和短读数据进行犬结构变异的综合基因分型和分析。
IF 2.8 2区 生物学 Q2 EVOLUTIONARY BIOLOGY Pub Date : 2025-09-30 DOI: 10.1093/gbe/evaf173
Peter Z Schall, Jeffrey M Kidd

Structural variation makes an important contribution to canine evolution and phenotypic differences. Although recent advances in long-read sequencing have enabled the generation of multiple canine genome assemblies, most prior analyses of structural variation have relied on short-read sequencing. To offer a more complete assessment of structural variation in canines, we performed an integrative analysis of structural variants present in 12 canine samples with available long-read and short-read sequencing data along with genome assemblies. Use of long-reads permits the discovery of heterozygous variation that is absent in existing haploid assembly representations while offering a marked increase in the ability to identify insertion variants relative to short-read approaches. Examination of the size spectrum of structural variants shows that dimorphic LINE-1 and SINE variants account for over 45% of all deletions and identified 1,410 LINE-1s with intact open reading frames that show presence-absence dimorphism. Using a graph-based approach, we genotype newly discovered structural variants in an existing collection of 1,879 resequenced dogs and wolves, generating a variant catalog containing a 56.5% increase in the number of deletions and 705% increase in the number of insertions previously found in the analyzed samples. Examination of allele frequencies across admixture components present across breed clades identified 283 structural variants evolving with a signature of selection.

结构变异在犬类进化和表型差异中起着重要作用。尽管最近在长读测序方面的进展已经能够产生多个犬基因组组装,但大多数先前的结构变异分析都依赖于短读测序。为了对犬科动物的结构变异进行更全面的评估,我们对12个犬科动物样本的结构变异进行了综合分析,包括长读和短读测序数据以及基因组组装。使用长读取允许发现杂合变异,这在现有的单倍体组装表示中是不存在的,同时提供了相对于短读取方法识别插入变异的能力的显着增加。对结构变异的大小谱的检查表明,二态的LINE-1和SINE变异占所有缺失的45%以上,并鉴定出1,410个具有完整开放阅读框的LINE-1,显示存在-缺失二态性。使用基于图的方法,我们在现有的1879只狗和狼的重测序集合中对新发现的结构变异进行了基因分型,生成了一个变异目录,其中缺失数量增加了56.5%,插入数量增加了705%。对跨品种分支的混合成分的等位基因频率进行检查,发现283个结构变异具有选择的特征。
{"title":"Integrative Genotyping and Analysis of Canine Structural Variation Using Long-read and Short-read Data.","authors":"Peter Z Schall, Jeffrey M Kidd","doi":"10.1093/gbe/evaf173","DOIUrl":"10.1093/gbe/evaf173","url":null,"abstract":"<p><p>Structural variation makes an important contribution to canine evolution and phenotypic differences. Although recent advances in long-read sequencing have enabled the generation of multiple canine genome assemblies, most prior analyses of structural variation have relied on short-read sequencing. To offer a more complete assessment of structural variation in canines, we performed an integrative analysis of structural variants present in 12 canine samples with available long-read and short-read sequencing data along with genome assemblies. Use of long-reads permits the discovery of heterozygous variation that is absent in existing haploid assembly representations while offering a marked increase in the ability to identify insertion variants relative to short-read approaches. Examination of the size spectrum of structural variants shows that dimorphic LINE-1 and SINE variants account for over 45% of all deletions and identified 1,410 LINE-1s with intact open reading frames that show presence-absence dimorphism. Using a graph-based approach, we genotype newly discovered structural variants in an existing collection of 1,879 resequenced dogs and wolves, generating a variant catalog containing a 56.5% increase in the number of deletions and 705% increase in the number of insertions previously found in the analyzed samples. Examination of allele frequencies across admixture components present across breed clades identified 283 structural variants evolving with a signature of selection.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":" ","pages":""},"PeriodicalIF":2.8,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12481690/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145091397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Genome Biology and Evolution
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1