首页 > 最新文献

Molecular biology and evolution最新文献

英文 中文
Asymmetrical evolution of promoter methylation of mammalian genes after duplication.
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-12-17 DOI: 10.1093/molbev/msae259
Mercedes de la Fuente, Isabel Mendizabal, Mira Han, Soojin V Yi, David Alvarez-Ponce

Even though gene duplication is a key source of new genes and evolutionary innovation, it is unclear how duplicates survive the period immediately following gene duplication, in which both copies are functionally redundant. In the absence of epigenetic silencing, the abundance of the gene product would double after gene duplication, which would often have deleterious effects. However, recent duplicates exhibit low expression levels, which could be at least partially explained by high levels of promoter methylation. What evolutionary paths lead to duplicate hypermethylation, and does it affect both duplicates, or only one? Here, we compare levels of promoter methylation in 10 human and 16 mouse tissues, between singletons and duplicates and among human-mouse orthologs of different kinds (one-to-one, one-to-many, many-to-one, and many-to-many). Our results indicate that: (1) on average, duplicates are more methylated than singletons in mouse, but less methylated than singletons in human, (2) recently duplicated genes tend to exhibit high levels of promoter methylation, (3) genes that undergo duplication tend to be highly methylated before duplication, (4) after gene duplication, one of the copies (the daughter copy, i.e. the one that relocates to a new genomic context) tends to undergo an additional increase in promoter methylation, whereas the other (the parental copy, which remains in the original genomic location) tends to retain pre-duplication methylation levels, and (5) daughter copies tend to be lowly expressed. These observations support a model in which daughter copies are repressed via promoter hypermethylation and can thus survive the filter of purifying selection until both copies diverge functionally.

{"title":"Asymmetrical evolution of promoter methylation of mammalian genes after duplication.","authors":"Mercedes de la Fuente, Isabel Mendizabal, Mira Han, Soojin V Yi, David Alvarez-Ponce","doi":"10.1093/molbev/msae259","DOIUrl":"10.1093/molbev/msae259","url":null,"abstract":"<p><p>Even though gene duplication is a key source of new genes and evolutionary innovation, it is unclear how duplicates survive the period immediately following gene duplication, in which both copies are functionally redundant. In the absence of epigenetic silencing, the abundance of the gene product would double after gene duplication, which would often have deleterious effects. However, recent duplicates exhibit low expression levels, which could be at least partially explained by high levels of promoter methylation. What evolutionary paths lead to duplicate hypermethylation, and does it affect both duplicates, or only one? Here, we compare levels of promoter methylation in 10 human and 16 mouse tissues, between singletons and duplicates and among human-mouse orthologs of different kinds (one-to-one, one-to-many, many-to-one, and many-to-many). Our results indicate that: (1) on average, duplicates are more methylated than singletons in mouse, but less methylated than singletons in human, (2) recently duplicated genes tend to exhibit high levels of promoter methylation, (3) genes that undergo duplication tend to be highly methylated before duplication, (4) after gene duplication, one of the copies (the daughter copy, i.e. the one that relocates to a new genomic context) tends to undergo an additional increase in promoter methylation, whereas the other (the parental copy, which remains in the original genomic location) tends to retain pre-duplication methylation levels, and (5) daughter copies tend to be lowly expressed. These observations support a model in which daughter copies are repressed via promoter hypermethylation and can thus survive the filter of purifying selection until both copies diverge functionally.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11683416/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142837553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evolutionary novelty of Apolipoprotein D facilitates metabolic plasticity in lepidopteran wings.
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-12-12 DOI: 10.1093/molbev/msae252
Shunze Jia, Rongqiao Li, Yinghui Li, Yuxin Huang, Minmin Liu, Yanyan Zhou, Yanting Liang, Zhihua Hao, Yusong Xu, Huabing Wang

Understanding metabolic plasticity of animal evolution is a fundamental challenge in evolutionary biology. Owing to the diversification of insect wing morphology and dynamic energy requirements, the molecular adaptation mechanisms underlying the metabolic pathways in wing evolution remain largely unknown. This study reveals the pivotal role of the duplicated Apolipoprotein D (ApoD) gene in lipid and energy homeostasis in the lepidopteran wing. ApoD underwent significant expansion in insects, with gene duplication and consistent retention observed in Lepidoptera. Notably, duplicated ApoD2 was highly expressed in lepidopteran wings and encoded a unique C-terminal tail, conferring distinct ligand-binding properties. Using Bombyx mori as a model organism, we integrated evolutionary analysis, multiomics, and in vivo functional experiments to elucidate the way duplicated ApoD2 mediates lipid trafficking and homeostasis via the AMP-activated protein kinase pathway in wings. Moreover, we revealed the specific expression and functional divergence of duplicated ApoD as a key mechanism regulating lipid homeostasis in the lepidopteran wing. These findings highlight an evolutionary scenario in which neofunctionalization conferred a novel role of ApoD in shaping adaptive lipid metabolic regulatory networks during wing phenotypic evolution. Overall, we provide in vivo evidence for the functional differentiation of duplicate genes in shaping adaptive metabolic regulatory networks during phenotypic evolution.

{"title":"Evolutionary novelty of Apolipoprotein D facilitates metabolic plasticity in lepidopteran wings.","authors":"Shunze Jia, Rongqiao Li, Yinghui Li, Yuxin Huang, Minmin Liu, Yanyan Zhou, Yanting Liang, Zhihua Hao, Yusong Xu, Huabing Wang","doi":"10.1093/molbev/msae252","DOIUrl":"10.1093/molbev/msae252","url":null,"abstract":"<p><p>Understanding metabolic plasticity of animal evolution is a fundamental challenge in evolutionary biology. Owing to the diversification of insect wing morphology and dynamic energy requirements, the molecular adaptation mechanisms underlying the metabolic pathways in wing evolution remain largely unknown. This study reveals the pivotal role of the duplicated Apolipoprotein D (ApoD) gene in lipid and energy homeostasis in the lepidopteran wing. ApoD underwent significant expansion in insects, with gene duplication and consistent retention observed in Lepidoptera. Notably, duplicated ApoD2 was highly expressed in lepidopteran wings and encoded a unique C-terminal tail, conferring distinct ligand-binding properties. Using Bombyx mori as a model organism, we integrated evolutionary analysis, multiomics, and in vivo functional experiments to elucidate the way duplicated ApoD2 mediates lipid trafficking and homeostasis via the AMP-activated protein kinase pathway in wings. Moreover, we revealed the specific expression and functional divergence of duplicated ApoD as a key mechanism regulating lipid homeostasis in the lepidopteran wing. These findings highlight an evolutionary scenario in which neofunctionalization conferred a novel role of ApoD in shaping adaptive lipid metabolic regulatory networks during wing phenotypic evolution. Overall, we provide in vivo evidence for the functional differentiation of duplicate genes in shaping adaptive metabolic regulatory networks during phenotypic evolution.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11683417/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142813580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evolutionary Dynamics of Proinflammatory Caspases in Primates and Rodents. 灵长类和啮齿类动物中促炎 Caspases 的进化动态。
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-12-06 DOI: 10.1093/molbev/msae220
Mische Holland, Rachel Rutkowski, Tera C Levin

Caspase-1 and related proteases are key players in inflammation and innate immunity. Here, we characterize the evolutionary history of caspase-1 and its close relatives across 19 primates and 21 rodents, focusing on differences that may cause discrepancies between humans and animal studies. While caspase-1 has been retained in all these taxa, other members of the caspase-1 subfamily (caspase-4, caspase-5, caspase-11, and caspase-12 and CARD16, 17, and 18) each have unique evolutionary trajectories. Caspase-4 is found across simian primates, whereas we identified multiple pseudogenization and gene loss events in caspase-5, caspase-11, and the CARDs. Because caspase-4 and caspase-11 are both key players in the noncanonical inflammasome pathway, we expected that these proteins would be likely to evolve rapidly. Instead, we found that these two proteins are largely conserved, whereas caspase-4's close paralog, caspase-5, showed significant indications of positive selection, as did primate caspase-1. Caspase-12 is a nonfunctional pseudogene in humans. We find this extends across most primates, although many rodents and some primates retain an intact, and likely functional, caspase-12. In mouse laboratory lines, we found that 50% of common strains carry nonsynonymous variants that may impact the functions of caspase-11 and caspase-12 and therefore recommend specific strains to be used (and avoided). Finally, unlike rodents, primate caspases have undergone repeated rounds of gene conversion, duplication, and loss leading to a highly dynamic proinflammatory caspase repertoire. Thus, we uncovered many differences in the evolution of primate and rodent proinflammatory caspases and discuss the potential implications of this history for caspase gene functions.

Caspase-1 和相关蛋白酶是炎症和先天性免疫中的关键角色。在这里,我们描述了 caspase-1 及其近亲在 19 种灵长类动物和 21 种啮齿类动物中的进化史,重点关注可能导致人类与动物研究之间差异的不同之处。在所有这些类群中都保留了 caspase-1,而 caspase-1 亚家族的其他成员(caspase-4、-5、-11、-12 和 CARD16、17 和 18)则各自有着独特的进化轨迹。Caspase-4在整个类灵长类动物中都有发现,而我们在caspase-5、caspase-11和CARDs中发现了多种假基因和基因缺失事件。由于 Caspase-4 和 Caspase-11 都是非典型炎症小体通路中的关键角色,我们预计这两种蛋白可能会迅速进化。相反,我们发现这两种蛋白在很大程度上是保守的,而 Caspase-4 的近似同源物 Caspase-5 和灵长类的 Caspase-1 则表现出明显的正选择迹象。Caspase-12在人类中是一种无功能的假基因。我们发现,尽管许多啮齿类动物和一些灵长类动物保留了完整的、可能具有功能的 Caspase-12,但大多数灵长类动物都存在这种情况。在小鼠实验室品系中,我们发现 50%的常见品系携带可能影响 caspase-11 和 -12 功能的非同义变异,因此建议使用(和避免)特定的品系。最后,与啮齿类动物不同,灵长类动物的 caspase 经历了反复的基因转换、复制和缺失,从而形成了高度动态的促炎性 caspase 基因库。因此,我们发现了灵长类和啮齿类动物促炎性 caspase 进化过程中的许多差异,并讨论了这一历史对 caspase 基因功能的潜在影响。
{"title":"Evolutionary Dynamics of Proinflammatory Caspases in Primates and Rodents.","authors":"Mische Holland, Rachel Rutkowski, Tera C Levin","doi":"10.1093/molbev/msae220","DOIUrl":"10.1093/molbev/msae220","url":null,"abstract":"<p><p>Caspase-1 and related proteases are key players in inflammation and innate immunity. Here, we characterize the evolutionary history of caspase-1 and its close relatives across 19 primates and 21 rodents, focusing on differences that may cause discrepancies between humans and animal studies. While caspase-1 has been retained in all these taxa, other members of the caspase-1 subfamily (caspase-4, caspase-5, caspase-11, and caspase-12 and CARD16, 17, and 18) each have unique evolutionary trajectories. Caspase-4 is found across simian primates, whereas we identified multiple pseudogenization and gene loss events in caspase-5, caspase-11, and the CARDs. Because caspase-4 and caspase-11 are both key players in the noncanonical inflammasome pathway, we expected that these proteins would be likely to evolve rapidly. Instead, we found that these two proteins are largely conserved, whereas caspase-4's close paralog, caspase-5, showed significant indications of positive selection, as did primate caspase-1. Caspase-12 is a nonfunctional pseudogene in humans. We find this extends across most primates, although many rodents and some primates retain an intact, and likely functional, caspase-12. In mouse laboratory lines, we found that 50% of common strains carry nonsynonymous variants that may impact the functions of caspase-11 and caspase-12 and therefore recommend specific strains to be used (and avoided). Finally, unlike rodents, primate caspases have undergone repeated rounds of gene conversion, duplication, and loss leading to a highly dynamic proinflammatory caspase repertoire. Thus, we uncovered many differences in the evolution of primate and rodent proinflammatory caspases and discuss the potential implications of this history for caspase gene functions.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11630849/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142470121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiple-Wave Admixture and Adaptive Evolution of the Pamirian Wakhi People. 帕米尔瓦希人的多波混血和适应性进化。
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-12-06 DOI: 10.1093/molbev/msae237
Wanxing Xu, Jiaojiao Liu, Xiaoxi Zhang, Jia Wen, Qidi Feng, Yang Gao, Yuwen Pan, Yan Lu, Asifullah Khan, Shuhua Xu

While whole-genome sequencing has been applied extensively to investigate the genetic diversity of global populations, ethnic minority groups in Pakistan are generally underrepresented. In particular, little is known about the genetic origin and highland adaptation of the Pamirian Wakhi people. According to Chinese historical records, the geographical location and language usage of Wakhi may be closely related to Xinjiang Tajiks. In this study, based on high-coverage (∼30×) whole-genome sequencing of eight Wakhi and 25 Xinjiang Tajik individuals, we performed data analyses together with worldwide populations to gain insights into their genetic composition, demography, and adaptive evolution to the highland environment. The Wakhi derived more than 85% of their ancestry from West Eurasian populations (European ∼44.5%, South Asian ∼42.2%) and 10% from East Eurasian populations (Siberian ∼6.0%, East Asian ∼4.3%). Modeling the admixture history of the Wakhi indicated that the early West-East admixture occurred ∼3,875 to 2,250 years ago and that the recent admixture occurred ∼750 to 375 years ago. We identified selection signatures across EGLN3, in particular, a distinctive evolutionary signature was observed, and a certain underlying selected haplotype showed higher frequency (87.5%) in the Wakhi than in nearby Xinjiang Tajiks and other highlanders. Interestingly, we found high-frequency archaic sequences in the Wakhi genome, which overlapped with several genes related to cellular signaling transduction, including MAGI2, previously associated with high-altitude adaptation. Our analysis indicates that the Wakhi are distinct from the Xinjiang Tajiks and Tajikistan Tajiks and sheds light on the Wakhi's ancestral origin and genetic basis of high-altitude adaptation.

尽管全基因组测序已被广泛应用于研究全球人口的遗传多样性,但巴基斯坦少数民族群体的代表性普遍不足。特别是,人们对帕米尔瓦其人的基因起源和高原适应性知之甚少。根据中国历史记载,瓦其人的地理位置和语言使用可能与新疆塔吉克人(XJT)密切相关。在本研究中,我们基于对8个瓦希族个体和25个新疆塔吉克族个体的高覆盖率(∼30×)全基因组测序,与世界范围内的人群一起进行了数据分析,以深入了解他们的遗传组成、人口结构以及对高原环境的适应性进化。瓦希人85%以上的祖先来自欧亚西部人群(欧洲人∼44.5%,南亚人∼42.2%),10%来自欧亚东部人群(西伯利亚人∼6.0%,东亚人∼4.3%)。对瓦希族的混血历史建模表明,早期的西东混血发生在大约 3,875-2,250 年前,近期的混血发生在 750-375 年前。我们发现了整个 EGLN3 的选择特征,尤其是观察到了一个独特的进化特征,与邻近的 XJT 和其他高原人相比,某个潜在的选择单倍型在瓦希人中的频率更高(87.5%)。有趣的是,我们在瓦希人的基因组中发现了高频率的古老序列,这些序列与几个与细胞信号传导有关的基因重叠,包括以前与高海拔适应有关的 MAGI2。我们的分析表明,瓦希人有别于新疆塔吉克人和塔吉克斯坦塔吉克人,并揭示了瓦希人的祖先起源和高海拔适应的遗传基础。
{"title":"Multiple-Wave Admixture and Adaptive Evolution of the Pamirian Wakhi People.","authors":"Wanxing Xu, Jiaojiao Liu, Xiaoxi Zhang, Jia Wen, Qidi Feng, Yang Gao, Yuwen Pan, Yan Lu, Asifullah Khan, Shuhua Xu","doi":"10.1093/molbev/msae237","DOIUrl":"10.1093/molbev/msae237","url":null,"abstract":"<p><p>While whole-genome sequencing has been applied extensively to investigate the genetic diversity of global populations, ethnic minority groups in Pakistan are generally underrepresented. In particular, little is known about the genetic origin and highland adaptation of the Pamirian Wakhi people. According to Chinese historical records, the geographical location and language usage of Wakhi may be closely related to Xinjiang Tajiks. In this study, based on high-coverage (∼30×) whole-genome sequencing of eight Wakhi and 25 Xinjiang Tajik individuals, we performed data analyses together with worldwide populations to gain insights into their genetic composition, demography, and adaptive evolution to the highland environment. The Wakhi derived more than 85% of their ancestry from West Eurasian populations (European ∼44.5%, South Asian ∼42.2%) and 10% from East Eurasian populations (Siberian ∼6.0%, East Asian ∼4.3%). Modeling the admixture history of the Wakhi indicated that the early West-East admixture occurred ∼3,875 to 2,250 years ago and that the recent admixture occurred ∼750 to 375 years ago. We identified selection signatures across EGLN3, in particular, a distinctive evolutionary signature was observed, and a certain underlying selected haplotype showed higher frequency (87.5%) in the Wakhi than in nearby Xinjiang Tajiks and other highlanders. Interestingly, we found high-frequency archaic sequences in the Wakhi genome, which overlapped with several genes related to cellular signaling transduction, including MAGI2, previously associated with high-altitude adaptation. Our analysis indicates that the Wakhi are distinct from the Xinjiang Tajiks and Tajikistan Tajiks and sheds light on the Wakhi's ancestral origin and genetic basis of high-altitude adaptation.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11631187/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142591253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative Genomics Provides Insights into Adaptive Evolution and Demographics of Bats. 比较基因组学为蝙蝠的适应性进化和人口统计提供了洞察力。
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-12-06 DOI: 10.1093/molbev/msae208
Gaoming Liu, Qi Pan, Pingfen Zhu, Xinyu Guo, Zhan Zhang, Zihao Li, Yaolei Zhang, Xiaoxiao Zhang, Jiahao Wang, Weiqiang Liu, Chunyan Hu, Yang Yu, Xiao Wang, Weixiao Chen, Meng Li, Wenhua Yu, Xin Liu, Inge Seim, Guangyi Fan, Xuming Zhou

Bats possess a range of distinctive characteristics, including flight, echolocation, impressive longevity, and the ability to harbor various zoonotic pathogens. Additionally, they account for the second-highest species diversity among mammalian orders, yet their phylogenetic relationships and demographic history remain underexplored. Here, we generated de novo assembled genomes for 17 bat species and 2 of their mammalian relatives (the Amur hedgehog and Chinese mole shrew), with 12 genomes reaching chromosome-level assembly. Comparative genomics and ChIP-seq assays identified newly gained genomic regions in bats potentially linked to the regulation of gene activity and expression. Notably, some antiviral infection-related gene under positive selection exhibited the activity of suppressing cancer, evidencing the linkage between virus tolerance and cancer resistance in bats. By integrating published bat genome assemblies, phylogenetic reconstruction established the proximity of noctilionoid bats to vesper bats. Interestingly, we found 2 distinct patterns of ancient population dynamics in bats and population changes since the last glacial maximum does not reflect species phylogenetic relationships. These findings enriched our understanding of adaptive mechanisms and demographic history of bats.

蝙蝠具有一系列与众不同的特征,包括飞行、回声定位、惊人的寿命以及携带各种人畜共患病病原体的能力。此外,蝙蝠的物种多样性在哺乳动物中位居第二,但它们的系统发育关系和人口历史仍未得到充分探索。在这里,我们生成了 17 种蝙蝠及其两种哺乳动物近亲(阿穆尔刺猬和中国鼹鼩)的全新组装基因组,其中 12 个基因组达到了染色体组水平的组装。比较基因组学和 ChIP-seq 分析确定了蝙蝠新获得的基因组区域,这些区域可能与基因活性和表达调控有关。值得注意的是,一些抗病毒感染相关基因在正向选择下表现出抑制癌症的活性,证明了蝙蝠的病毒耐受性和癌症抵抗力之间的联系。通过整合已发表的蝙蝠基因组汇编,系统发育重建确定了夜叉蝙蝠与紫斑蝙蝠的亲缘关系。有趣的是,我们发现了蝙蝠古老种群动态的两种截然不同的模式,而且自上一个冰期最大值以来的种群变化并不反映物种的系统发育关系。这些发现丰富了我们对蝙蝠适应机制和人口历史的理解。
{"title":"Comparative Genomics Provides Insights into Adaptive Evolution and Demographics of Bats.","authors":"Gaoming Liu, Qi Pan, Pingfen Zhu, Xinyu Guo, Zhan Zhang, Zihao Li, Yaolei Zhang, Xiaoxiao Zhang, Jiahao Wang, Weiqiang Liu, Chunyan Hu, Yang Yu, Xiao Wang, Weixiao Chen, Meng Li, Wenhua Yu, Xin Liu, Inge Seim, Guangyi Fan, Xuming Zhou","doi":"10.1093/molbev/msae208","DOIUrl":"10.1093/molbev/msae208","url":null,"abstract":"<p><p>Bats possess a range of distinctive characteristics, including flight, echolocation, impressive longevity, and the ability to harbor various zoonotic pathogens. Additionally, they account for the second-highest species diversity among mammalian orders, yet their phylogenetic relationships and demographic history remain underexplored. Here, we generated de novo assembled genomes for 17 bat species and 2 of their mammalian relatives (the Amur hedgehog and Chinese mole shrew), with 12 genomes reaching chromosome-level assembly. Comparative genomics and ChIP-seq assays identified newly gained genomic regions in bats potentially linked to the regulation of gene activity and expression. Notably, some antiviral infection-related gene under positive selection exhibited the activity of suppressing cancer, evidencing the linkage between virus tolerance and cancer resistance in bats. By integrating published bat genome assemblies, phylogenetic reconstruction established the proximity of noctilionoid bats to vesper bats. Interestingly, we found 2 distinct patterns of ancient population dynamics in bats and population changes since the last glacial maximum does not reflect species phylogenetic relationships. These findings enriched our understanding of adaptive mechanisms and demographic history of bats.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11646704/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142623955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Graph-based Goat Pangenome Reveals Structural Variations Involved in Domestication and Adaptation.
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-12-06 DOI: 10.1093/molbev/msae251
Peipei Bian, Jiaxin Li, Shishuo Zhou, Xingquan Wang, Mian Gong, Xi Guo, Yudong Cai, Qimeng Yang, Jiaqi Fu, Rongrong Li, Shuhong Huang, Funong Luo, Ali Mujtaba Shah, Johannes A Lenstra, Joram M Mwacharo, Ran Li, Gang Ren, Xiaolong Wang, Cong Li, Wenxin Zheng, Yu Jiang, Xihong Wang

Pangenomes can facilitate a deeper understanding of genome complexity. Using de novo phased long-read assemblies of eight representative goat breeds, we constructed a graph-based pangenome of goats (Capra hircus) and discovered 113-Mb autosomal novel sequences. Combining this multi-assembly pangenome with low-coverage PacBio HiFi sequences, we constructed a long-read structural variations (SVs) database containing 59,325 SV deletions, 84,910 SV insertions, and 24,954 other complex SV alleles. This resource allowed reliable graph-based genotyping from short reads of 79 wild and 1,148 worldwide domestic goats. Selection signal analysis of SV captured a novel immune-related domestication locus containing the galectin-9 gene and extra copies of the ruminant-specific galectin-9-like genes (LGALS9L), which have high tissue specificity. A segmental duplication in domestic goats generates three additional LGALS9L copies. Ancient goat genome sequences show a gradual increase in frequency of this duplication from the Neolithic to the present. Two other newly detected SVs also have higher selection signals than adjacent SNPs, a truncated-LINE1 deletion in EDAR2 associated with cashmere production and a VNTR-related insertion in PAPSS2 linked to high-altitude adaptation. In summary, the multi-assembly goat pangenome and long-read SV database facilitates detecting complex variations that are important in evolution and selection.

{"title":"A Graph-based Goat Pangenome Reveals Structural Variations Involved in Domestication and Adaptation.","authors":"Peipei Bian, Jiaxin Li, Shishuo Zhou, Xingquan Wang, Mian Gong, Xi Guo, Yudong Cai, Qimeng Yang, Jiaqi Fu, Rongrong Li, Shuhong Huang, Funong Luo, Ali Mujtaba Shah, Johannes A Lenstra, Joram M Mwacharo, Ran Li, Gang Ren, Xiaolong Wang, Cong Li, Wenxin Zheng, Yu Jiang, Xihong Wang","doi":"10.1093/molbev/msae251","DOIUrl":"10.1093/molbev/msae251","url":null,"abstract":"<p><p>Pangenomes can facilitate a deeper understanding of genome complexity. Using de novo phased long-read assemblies of eight representative goat breeds, we constructed a graph-based pangenome of goats (Capra hircus) and discovered 113-Mb autosomal novel sequences. Combining this multi-assembly pangenome with low-coverage PacBio HiFi sequences, we constructed a long-read structural variations (SVs) database containing 59,325 SV deletions, 84,910 SV insertions, and 24,954 other complex SV alleles. This resource allowed reliable graph-based genotyping from short reads of 79 wild and 1,148 worldwide domestic goats. Selection signal analysis of SV captured a novel immune-related domestication locus containing the galectin-9 gene and extra copies of the ruminant-specific galectin-9-like genes (LGALS9L), which have high tissue specificity. A segmental duplication in domestic goats generates three additional LGALS9L copies. Ancient goat genome sequences show a gradual increase in frequency of this duplication from the Neolithic to the present. Two other newly detected SVs also have higher selection signals than adjacent SNPs, a truncated-LINE1 deletion in EDAR2 associated with cashmere production and a VNTR-related insertion in PAPSS2 linked to high-altitude adaptation. In summary, the multi-assembly goat pangenome and long-read SV database facilitates detecting complex variations that are important in evolution and selection.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11669723/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142813464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Elevated Rates of Molecular Evolution Genome-wide in Mutualist Legumes and Rhizobia. 互生豆科植物和根瘤菌全基因组的分子进化速度加快。
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-12-06 DOI: 10.1093/molbev/msae245
Tia L Harrison, John R Stinchcombe, Megan E Frederickson

Rates of molecular evolution vary greatly among even closely related species. Although theory predicts that antagonistic interactions between species increase rates of molecular evolution, predictions for how mutualism affects evolutionary rates are mixed. We compared rates of molecular evolution between (i) mutualistic and non-mutualistic legumes, (ii) an independent set of symbiotic rhizobia and their non-symbiotic close relatives, and (iii) symbiotic and non-symbiotic clades within Ensifer, a diverse genus of bacteria with various lifestyles. We assembled transcriptomes de novo for 12 legume species and calculated dN/dS ratios at orthologous genes in all species to determine if genes in mutualistic plants evolve faster or slower than in their non-mutualistic relatives. We also calculated dN/dS ratios in genes known to be important for symbiosis. We found that mutualists have higher rates of molecular evolution genome-wide compared to non-mutualistic legumes, but this pattern did not hold in symbiosis genes. We next calculated dN/dS ratios in 14 bacteria species across the proteobacteria phylogeny that differ in whether they associate mutualistically with plants, using published data. In most pairs, symbiotic rhizobia show higher dN/dS values compared to their non-symbiotic relatives. Within a bacterial genus with many well-characterized mutualist species (Ensifer), we calculated dN/dS ratios in symbiotic and non-symbiotic clades and found that symbiotic lineages have higher rates of molecular evolution genome-wide, but not at genes on the symbiotic plasmid pSymB. Our results suggest that although mutualism between legumes and rhizobia is associated with elevated rates of molecular evolution genome-wide, symbiosis genes may be evolutionarily stagnant.

即使是亲缘关系很近的物种,其分子进化速度也有很大差异。虽然理论预测物种间的对抗性相互作用会提高分子进化的速度,但对互利性如何影响进化速度的预测却不尽相同。我们比较了以下物种之间的分子进化速度:1)互生和非互生豆科植物;2)一组独立的共生根瘤菌和它们的非共生近亲;3)Ensifer(具有各种生活方式的多种细菌属)中的共生和非共生支系。我们为 12 个豆科植物物种重新组装了转录组,并计算了所有物种中同源基因的 dN/dS 比值,以确定互生植物中的基因比非互生近缘植物中的基因进化得更快或更慢。我们还计算了已知对共生很重要的基因的 dN/dS 比率。我们发现,与非互生豆科植物相比,互生植物的全基因组分子进化速度较快,但这种模式在共生基因中并不成立。接下来,我们利用已发表的数据计算了蛋白细菌系统发育过程中 14 种细菌的 dN/dS 比率,这些细菌在是否与植物共生方面存在差异。在大多数菌对中,共生根瘤菌的 dN/dS 值高于非共生根瘤菌。在一个有许多特征明确的互生物种的细菌属(Ensifer)中,我们计算了共生和非共生支系的 dN/dS 比值,发现共生支系在整个基因组中的分子进化速度较快,但在共生质粒 pSymB 上的基因中则不然。我们的研究结果表明,虽然豆科植物与根瘤菌之间的互生关系与整个基因组的分子进化率升高有关,但共生基因可能在进化上停滞不前。
{"title":"Elevated Rates of Molecular Evolution Genome-wide in Mutualist Legumes and Rhizobia.","authors":"Tia L Harrison, John R Stinchcombe, Megan E Frederickson","doi":"10.1093/molbev/msae245","DOIUrl":"10.1093/molbev/msae245","url":null,"abstract":"<p><p>Rates of molecular evolution vary greatly among even closely related species. Although theory predicts that antagonistic interactions between species increase rates of molecular evolution, predictions for how mutualism affects evolutionary rates are mixed. We compared rates of molecular evolution between (i) mutualistic and non-mutualistic legumes, (ii) an independent set of symbiotic rhizobia and their non-symbiotic close relatives, and (iii) symbiotic and non-symbiotic clades within Ensifer, a diverse genus of bacteria with various lifestyles. We assembled transcriptomes de novo for 12 legume species and calculated dN/dS ratios at orthologous genes in all species to determine if genes in mutualistic plants evolve faster or slower than in their non-mutualistic relatives. We also calculated dN/dS ratios in genes known to be important for symbiosis. We found that mutualists have higher rates of molecular evolution genome-wide compared to non-mutualistic legumes, but this pattern did not hold in symbiosis genes. We next calculated dN/dS ratios in 14 bacteria species across the proteobacteria phylogeny that differ in whether they associate mutualistically with plants, using published data. In most pairs, symbiotic rhizobia show higher dN/dS values compared to their non-symbiotic relatives. Within a bacterial genus with many well-characterized mutualist species (Ensifer), we calculated dN/dS ratios in symbiotic and non-symbiotic clades and found that symbiotic lineages have higher rates of molecular evolution genome-wide, but not at genes on the symbiotic plasmid pSymB. Our results suggest that although mutualism between legumes and rhizobia is associated with elevated rates of molecular evolution genome-wide, symbiosis genes may be evolutionarily stagnant.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11632370/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142739862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Saltational Episodes of Reticulate Evolution in the Drosophila saltans Species Group.
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-12-06 DOI: 10.1093/molbev/msae250
Carolina Prediger, Erina A Ferreira, Samara Videira Zorzato, Aurélie Hua-Van, Lisa Klasson, Wolfgang J Miller, Amir Yassin, Lilian Madi-Ravazzi

Phylogenomics reveals reticulate evolution to be widespread across taxa, but whether reticulation is due to low statistical power or it is a true evolutionary pattern remains a field of study. Here, we investigate the phylogeny and quantify reticulation in the Drosophila saltans species group, a Neotropical clade of the subgenus Sophophora comprising 23 species whose relationships have long been problematic. Phylogenetic analyses revealed conflicting topologies between the X chromosome, autosomes and the mitochondria. We extended the ABBA-BABA test of asymmetry in phylogenetic discordance to cases where no "true" species tree could be inferred, and applied our new test (called 2A2B) to whole genome data and to individual loci. We used four strategies, two based on our new assemblies using either conserved genes or ≥50 kb-long syntenic blocks with conserved collinearity across Neotropical Sophophora, and two consisted of windows from pseudo-reference genomes aligned to either an ingroup or outgroup species. Evidence for reticulation varied among the strategies, being lowest in the synteny-based approach, where it did not exceed ∼7% of the blocks in the most conflicting species quartets. High incidences of reticulation were restricted to three nodes on the tree that coincided with major paleogeographical events in South America. Our results identify possible technical biases in quantifying reticulate evolution and indicate that episodic rapid radiations have played a major role in the evolution of a largely understudied Neotropical clade.

{"title":"Saltational Episodes of Reticulate Evolution in the Drosophila saltans Species Group.","authors":"Carolina Prediger, Erina A Ferreira, Samara Videira Zorzato, Aurélie Hua-Van, Lisa Klasson, Wolfgang J Miller, Amir Yassin, Lilian Madi-Ravazzi","doi":"10.1093/molbev/msae250","DOIUrl":"10.1093/molbev/msae250","url":null,"abstract":"<p><p>Phylogenomics reveals reticulate evolution to be widespread across taxa, but whether reticulation is due to low statistical power or it is a true evolutionary pattern remains a field of study. Here, we investigate the phylogeny and quantify reticulation in the Drosophila saltans species group, a Neotropical clade of the subgenus Sophophora comprising 23 species whose relationships have long been problematic. Phylogenetic analyses revealed conflicting topologies between the X chromosome, autosomes and the mitochondria. We extended the ABBA-BABA test of asymmetry in phylogenetic discordance to cases where no \"true\" species tree could be inferred, and applied our new test (called 2A2B) to whole genome data and to individual loci. We used four strategies, two based on our new assemblies using either conserved genes or ≥50 kb-long syntenic blocks with conserved collinearity across Neotropical Sophophora, and two consisted of windows from pseudo-reference genomes aligned to either an ingroup or outgroup species. Evidence for reticulation varied among the strategies, being lowest in the synteny-based approach, where it did not exceed ∼7% of the blocks in the most conflicting species quartets. High incidences of reticulation were restricted to three nodes on the tree that coincided with major paleogeographical events in South America. Our results identify possible technical biases in quantifying reticulate evolution and indicate that episodic rapid radiations have played a major role in the evolution of a largely understudied Neotropical clade.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11662284/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142813519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Characterization of Ancient Methanococcales Malate Dehydrogenases Reveals That Strong Thermal Stability Prevents Unfolding Under Intense γ-Irradiation. 对古老的 Methanococcales 苹果酸脱氢酶进行表征后发现,其强大的热稳定性可防止其在强γ-辐照下解折。
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-12-06 DOI: 10.1093/molbev/msae231
Dominique Madern, Frédéric Halgand, Chantal Houée-Levin, Anne-Béatrice Dufour, Sandrine Coquille, Salomé Ansanay-Alex, Sophie Sacquin-Mora, Céline Brochier-Armanet

Malate dehydrogenases (MalDHs) (EC.1.1.1.37), which are involved in the conversion of oxaloacetate to pyruvate in the tricarboxylic acid cycle, are a relevant model for the study of enzyme evolution and adaptation. Likewise, a recent study showed that Methanococcales, a major lineage of Archaea, is a good model to study the molecular processes of proteome thermoadaptation in prokaryotes. Here, we use ancestral sequence reconstruction and paleoenzymology to characterize both ancient and extant MalDHs. We observe a good correlation between inferred optimal growth temperatures and experimental optimal temperatures for activity (A-Topt). In particular, we show that the MalDH present in the ancestor of Methanococcales was hyperthermostable and had an A-Topt of 80 °C, consistent with a hyperthermophilic lifestyle. This ancestor gave rise to two lineages with different thermal constraints: one remained hyperthermophilic, while the other underwent several independent adaptations to colder environments. Surprisingly, the enzymes of the first lineage have retained a thermoresistant behavior (i.e. strong thermostability and high A-Topt), whereas the ancestor of the second lineage shows a strong thermostability, but a reduced A-Topt. Using mutants, we mimic the adaptation trajectory toward mesophily and show that it is possible to significantly reduce the A-Topt without altering the thermostability of the enzyme by introducing a few mutations. Finally, we reveal an unexpected link between thermostability and the ability to resist γ-irradiation-induced unfolding.

苹果酸脱氢酶(MalDH)(EC.1.1.1.37)参与三羧酸循环中草酰乙酸到丙酮酸的转化,是研究酶进化和适应的相关模型。同样,最近的一项研究表明,古细菌的一个主要品系--甲烷球菌(Methanococcales)是研究原核生物蛋白质组热适应分子过程的良好模型。在这里,我们利用祖先序列重建和古酶学来描述古代和现生 MalDHs 的特征。我们观察到推断的最适生长温度(OGTs)与实验的最适活性温度(A-Topt)之间存在良好的相关性。特别是,我们发现存在于 Methanococcales 的祖先中的 MalDH 具有超恒温性,其 A-Topt 为 80°C,与嗜热生活方式一致。这个祖先产生了两个具有不同热限制的品系,其中一个品系仍然是嗜热的,而另一个品系则经历了数次对寒冷环境的独立适应。令人惊讶的是,第一系的酶保留了耐热性(即较强的耐热性和较高的A-Topt),而第二系的祖先则表现出较强的耐热性,但A-Topt却降低了。我们利用突变体模拟了嗜中性的适应轨迹,并证明通过引入少量突变,可以在不改变酶的恒温性的情况下显著降低 A-Topt。最后,我们揭示了耐热性与抗γ-辐照诱导的解折能力之间意想不到的联系。
{"title":"The Characterization of Ancient Methanococcales Malate Dehydrogenases Reveals That Strong Thermal Stability Prevents Unfolding Under Intense γ-Irradiation.","authors":"Dominique Madern, Frédéric Halgand, Chantal Houée-Levin, Anne-Béatrice Dufour, Sandrine Coquille, Salomé Ansanay-Alex, Sophie Sacquin-Mora, Céline Brochier-Armanet","doi":"10.1093/molbev/msae231","DOIUrl":"10.1093/molbev/msae231","url":null,"abstract":"<p><p>Malate dehydrogenases (MalDHs) (EC.1.1.1.37), which are involved in the conversion of oxaloacetate to pyruvate in the tricarboxylic acid cycle, are a relevant model for the study of enzyme evolution and adaptation. Likewise, a recent study showed that Methanococcales, a major lineage of Archaea, is a good model to study the molecular processes of proteome thermoadaptation in prokaryotes. Here, we use ancestral sequence reconstruction and paleoenzymology to characterize both ancient and extant MalDHs. We observe a good correlation between inferred optimal growth temperatures and experimental optimal temperatures for activity (A-Topt). In particular, we show that the MalDH present in the ancestor of Methanococcales was hyperthermostable and had an A-Topt of 80 °C, consistent with a hyperthermophilic lifestyle. This ancestor gave rise to two lineages with different thermal constraints: one remained hyperthermophilic, while the other underwent several independent adaptations to colder environments. Surprisingly, the enzymes of the first lineage have retained a thermoresistant behavior (i.e. strong thermostability and high A-Topt), whereas the ancestor of the second lineage shows a strong thermostability, but a reduced A-Topt. Using mutants, we mimic the adaptation trajectory toward mesophily and show that it is possible to significantly reduce the A-Topt without altering the thermostability of the enzyme by introducing a few mutations. Finally, we reveal an unexpected link between thermostability and the ability to resist γ-irradiation-induced unfolding.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11631191/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142568125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Digital Image Processing to Detect Adaptive Evolution. 检测适应性进化的数字图像处理
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-12-06 DOI: 10.1093/molbev/msae242
Md Ruhul Amin, Mahmudul Hasan, Michael DeGiorgio

In recent years, advances in image processing and machine learning have fueled a paradigm shift in detecting genomic regions under natural selection. Early machine learning techniques employed population-genetic summary statistics as features, which focus on specific genomic patterns expected by adaptive and neutral processes. Though such engineered features are important when training data are limited, the ease at which simulated data can now be generated has led to the recent development of approaches that take in image representations of haplotype alignments and automatically extract important features using convolutional neural networks. Digital image processing methods termed α-molecules are a class of techniques for multiscale representation of objects that can extract a diverse set of features from images. One such α-molecule method, termed wavelet decomposition, lends greater control over high-frequency components of images. Another α-molecule method, termed curvelet decomposition, is an extension of the wavelet concept that considers events occurring along curves within images. We show that application of these α-molecule techniques to extract features from image representations of haplotype alignments yield high true positive rate and accuracy to detect hard and soft selective sweep signatures from genomic data with both linear and nonlinear machine learning classifiers. Moreover, we find that such models are easy to visualize and interpret, with performance rivaling those of contemporary deep learning approaches for detecting sweeps.

近年来,图像处理和机器学习的进步推动了检测自然选择下基因组区域的模式转变。早期的机器学习技术采用群体遗传汇总统计作为特征,重点关注适应性和中性过程所预期的特定基因组模式。虽然在训练数据有限的情况下,这种工程特征非常重要,但由于现在可以轻松生成模拟数据,因此最近开发出了一些方法,这些方法采用单倍型排列的图像表示,并利用卷积神经网络自动提取重要特征。被称为α分子的数字图像处理方法是一类用于多尺度表示物体的技术,可以从图像中提取各种特征。其中一种α分子方法被称为小波分解法,它能更好地控制图像的高频成分。另一种α-分子方法被称为小曲线分解法,它是小波概念的延伸,考虑了图像中沿曲线发生的事件。我们的研究表明,应用这些α-分子技术从单体型配对的图像表征中提取特征,可获得较高的真阳性率和准确率,从而利用线性和非线性机器学习分类器从基因组数据中检测出硬和软选择性扫描特征。此外,我们发现这种模型易于可视化和解释,其性能可与当代深度学习方法检测横扫的性能相媲美。
{"title":"Digital Image Processing to Detect Adaptive Evolution.","authors":"Md Ruhul Amin, Mahmudul Hasan, Michael DeGiorgio","doi":"10.1093/molbev/msae242","DOIUrl":"10.1093/molbev/msae242","url":null,"abstract":"<p><p>In recent years, advances in image processing and machine learning have fueled a paradigm shift in detecting genomic regions under natural selection. Early machine learning techniques employed population-genetic summary statistics as features, which focus on specific genomic patterns expected by adaptive and neutral processes. Though such engineered features are important when training data are limited, the ease at which simulated data can now be generated has led to the recent development of approaches that take in image representations of haplotype alignments and automatically extract important features using convolutional neural networks. Digital image processing methods termed α-molecules are a class of techniques for multiscale representation of objects that can extract a diverse set of features from images. One such α-molecule method, termed wavelet decomposition, lends greater control over high-frequency components of images. Another α-molecule method, termed curvelet decomposition, is an extension of the wavelet concept that considers events occurring along curves within images. We show that application of these α-molecule techniques to extract features from image representations of haplotype alignments yield high true positive rate and accuracy to detect hard and soft selective sweep signatures from genomic data with both linear and nonlinear machine learning classifiers. Moreover, we find that such models are easy to visualize and interpret, with performance rivaling those of contemporary deep learning approaches for detecting sweeps.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11631197/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142681669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Molecular biology and evolution
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1