Fast and Accurate Estimation of Selection Coefficients and Allele Histories from Ancient and Modern DNA.

IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Molecular biology and evolution Pub Date : 2024-08-02 DOI:10.1093/molbev/msae156
Andrew H Vaughn, Rasmus Nielsen
{"title":"Fast and Accurate Estimation of Selection Coefficients and Allele Histories from Ancient and Modern DNA.","authors":"Andrew H Vaughn, Rasmus Nielsen","doi":"10.1093/molbev/msae156","DOIUrl":null,"url":null,"abstract":"<p><p>We here present CLUES2, a full-likelihood method to infer natural selection from sequence data that is an extension of the method CLUES. We make several substantial improvements to the CLUES method that greatly increases both its applicability and its speed. We add the ability to use ancestral recombination graphs on ancient data as emissions to the underlying hidden Markov model, which enables CLUES2 to use both temporal and linkage information to make estimates of selection coefficients. We also fully implement the ability to estimate distinct selection coefficients in different epochs, which allows for the analysis of changes in selective pressures through time, as well as selection with dominance. In addition, we greatly increase the computational efficiency of CLUES2 over CLUES using several approximations to the forward-backward algorithms and develop a new way to reconstruct historic allele frequencies by integrating over the uncertainty in the estimation of the selection coefficients. We illustrate the accuracy of CLUES2 through extensive simulations and validate the importance sampling framework for integrating over the uncertainty in the inference of gene trees. We also show that CLUES2 is well-calibrated by showing that under the null hypothesis, the distribution of log-likelihood ratios follows a χ2 distribution with the appropriate degrees of freedom. We run CLUES2 on a set of recently published ancient human data from Western Eurasia and test for evidence of changing selection coefficients through time. We find significant evidence of changing selective pressures in several genes correlated with the introduction of agriculture to Europe and the ensuing dietary and demographic shifts of that time. In particular, our analysis supports previous hypotheses of strong selection on lactase persistence during periods of ancient famines and attenuated selection in more modern periods.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0000,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11321360/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular biology and evolution","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/molbev/msae156","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

We here present CLUES2, a full-likelihood method to infer natural selection from sequence data that is an extension of the method CLUES. We make several substantial improvements to the CLUES method that greatly increases both its applicability and its speed. We add the ability to use ancestral recombination graphs on ancient data as emissions to the underlying hidden Markov model, which enables CLUES2 to use both temporal and linkage information to make estimates of selection coefficients. We also fully implement the ability to estimate distinct selection coefficients in different epochs, which allows for the analysis of changes in selective pressures through time, as well as selection with dominance. In addition, we greatly increase the computational efficiency of CLUES2 over CLUES using several approximations to the forward-backward algorithms and develop a new way to reconstruct historic allele frequencies by integrating over the uncertainty in the estimation of the selection coefficients. We illustrate the accuracy of CLUES2 through extensive simulations and validate the importance sampling framework for integrating over the uncertainty in the inference of gene trees. We also show that CLUES2 is well-calibrated by showing that under the null hypothesis, the distribution of log-likelihood ratios follows a χ2 distribution with the appropriate degrees of freedom. We run CLUES2 on a set of recently published ancient human data from Western Eurasia and test for evidence of changing selection coefficients through time. We find significant evidence of changing selective pressures in several genes correlated with the introduction of agriculture to Europe and the ensuing dietary and demographic shifts of that time. In particular, our analysis supports previous hypotheses of strong selection on lactase persistence during periods of ancient famines and attenuated selection in more modern periods.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
从古代和现代 DNA 中快速准确地估算选择系数和等位基因历史。
我们在此介绍 CLUES2,这是一种从序列数据推断自然选择的全似然方法,是 CLUES 方法的扩展。我们对 CLUES 方法进行了多项重大改进,大大提高了其适用性和速度。我们增加了在古代数据上使用 ARG 作为底层 HMM 排放的能力,这使得 CLUES2 能够使用时间信息和联系信息来估计选择系数。我们还完全实现了在不同时代估算不同选择系数的功能,从而可以分析选择压力随时间的变化以及优势选择。此外,与 CLUES 相比,我们使用了几种近似的前向后向算法,大大提高了 CLUES2 的计算效率,并开发了一种新方法,通过对选择系数估计中的不确定性进行积分,重建历史等位基因频率。我们通过大量模拟说明了 CLUES2 的准确性,并验证了在推断基因树时对不确定性进行整合的重要性采样框架。我们还通过证明在零假设下,对数似然比的分布遵循具有适当自由度的秩方分布,证明 CLUES2 经过了良好校准。我们在一组最近公布的欧亚大陆西部古人类数据上运行了 CLUES2,并检验了选择系数随时间变化的证据。我们发现了一些基因的选择压力发生变化的重要证据,这些变化与农业传入欧洲以及当时随之而来的饮食和人口变化有关。特别是,我们的分析支持了之前的假设,即在古代饥荒时期,乳糖酶的持久性受到了强烈的选择,而在更现代的时期,选择则有所减弱。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Molecular biology and evolution
Molecular biology and evolution 生物-进化生物学
CiteScore
19.70
自引率
3.70%
发文量
257
审稿时长
1 months
期刊介绍: Molecular Biology and Evolution Journal Overview: Publishes research at the interface of molecular (including genomics) and evolutionary biology Considers manuscripts containing patterns, processes, and predictions at all levels of organization: population, taxonomic, functional, and phenotypic Interested in fundamental discoveries, new and improved methods, resources, technologies, and theories advancing evolutionary research Publishes balanced reviews of recent developments in genome evolution and forward-looking perspectives suggesting future directions in molecular evolution applications.
期刊最新文献
Remarkable evolutionary rate variations among lineages and among genome compartments in malaria parasites of mammals. Digital image processing to detect adaptive evolution. Accurate Inference of the Polyploid Continuum using Forward-time Simulations. Comparative genomics provides insights into adaptive evolution and demographics of bats. Multiple-wave admixture and adaptive evolution of the Pamirian Wakhi people.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1