Modelling the demographic history of human North African genomes points to a recent soft split divergence between populations.

IF 10.1 1区 生物学 Q1 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Genome Biology Pub Date : 2024-07-30 DOI:10.1186/s13059-024-03341-4
Jose M Serradell, Jose M Lorenzo-Salazar, Carlos Flores, Oscar Lao, David Comas
{"title":"Modelling the demographic history of human North African genomes points to a recent soft split divergence between populations.","authors":"Jose M Serradell, Jose M Lorenzo-Salazar, Carlos Flores, Oscar Lao, David Comas","doi":"10.1186/s13059-024-03341-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>North African human populations present a complex demographic scenario due to the presence of an autochthonous genetic component and population substructure, plus extensive gene flow from the Middle East, Europe, and sub-Saharan Africa.</p><p><strong>Results: </strong>We conducted a comprehensive analysis of 364 genomes to construct detailed demographic models for the North African region, encompassing its two primary ethnic groups, the Arab and Amazigh populations. This was achieved through an Approximate Bayesian Computation with Deep Learning (ABC-DL) framework and a novel algorithm called Genetic Programming for Population Genetics (GP4PG). This innovative approach enabled us to effectively model intricate demographic scenarios, utilizing a subset of 16 whole genomes at > 30X coverage. The demographic model suggested by GP4PG exhibited a closer alignment with the observed data compared to the ABC-DL model. Both point to a back-to-Africa origin of North African individuals and a close relationship with Eurasian populations. Results support different origins for Amazigh and Arab populations, with Amazigh populations originating back in Epipaleolithic times, while GP4PG supports Arabization as the main source of Middle Eastern ancestry. The GP4PG model includes population substructure in surrounding populations (sub-Saharan Africa and Middle East) with continuous decaying gene flow after population split. Contrary to ABC-DL, the best GP4PG model does not require pulses of admixture from surrounding populations into North Africa pointing to soft splits as drivers of divergence in North Africa.</p><p><strong>Conclusions: </strong>We have built a demographic model on North Africa that points to a back-to-Africa expansion and a differential origin between Arab and Amazigh populations.</p>","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":10.1000,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11290046/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genome Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13059-024-03341-4","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: North African human populations present a complex demographic scenario due to the presence of an autochthonous genetic component and population substructure, plus extensive gene flow from the Middle East, Europe, and sub-Saharan Africa.

Results: We conducted a comprehensive analysis of 364 genomes to construct detailed demographic models for the North African region, encompassing its two primary ethnic groups, the Arab and Amazigh populations. This was achieved through an Approximate Bayesian Computation with Deep Learning (ABC-DL) framework and a novel algorithm called Genetic Programming for Population Genetics (GP4PG). This innovative approach enabled us to effectively model intricate demographic scenarios, utilizing a subset of 16 whole genomes at > 30X coverage. The demographic model suggested by GP4PG exhibited a closer alignment with the observed data compared to the ABC-DL model. Both point to a back-to-Africa origin of North African individuals and a close relationship with Eurasian populations. Results support different origins for Amazigh and Arab populations, with Amazigh populations originating back in Epipaleolithic times, while GP4PG supports Arabization as the main source of Middle Eastern ancestry. The GP4PG model includes population substructure in surrounding populations (sub-Saharan Africa and Middle East) with continuous decaying gene flow after population split. Contrary to ABC-DL, the best GP4PG model does not require pulses of admixture from surrounding populations into North Africa pointing to soft splits as drivers of divergence in North Africa.

Conclusions: We have built a demographic model on North Africa that points to a back-to-Africa expansion and a differential origin between Arab and Amazigh populations.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
对北非人类基因组的人口历史进行建模表明,最近出现了种群之间的软分裂分化。
背景:北非地区的人口构成非常复杂,这是因为这里存在自生遗传成分和人口亚结构,另外还有来自中东、欧洲和撒哈拉以南非洲的大量基因流:我们对 364 个基因组进行了全面分析,为北非地区构建了详细的人口模型,其中包括两个主要民族--阿拉伯人和阿马齐格人。这是通过深度学习近似贝叶斯计算(ABC-DL)框架和一种名为人口遗传学遗传编程(GP4PG)的新型算法实现的。这种创新方法使我们能够利用覆盖率大于 30 倍的 16 个全基因组子集,有效地建立复杂的人口统计模型。与 ABC-DL 模型相比,GP4PG 提出的人口模型与观察到的数据更为吻合。两者都表明北非人起源于非洲,与欧亚人群关系密切。结果支持阿马齐格人和阿拉伯人的不同起源,阿马齐格人起源于旧石器时代,而 GP4PG 支持阿拉伯化是中东祖先的主要来源。GP4PG 模型包括周边种群(撒哈拉以南非洲和中东)的种群亚结构,种群分裂后基因流持续衰减。与 ABC-DL 模型相反,最佳的 GP4PG 模型并不需要周围人群向北非的掺杂脉冲,这表明软分裂是北非分化的驱动因素:我们建立了一个北非人口模型,该模型指出了阿拉伯人和阿马齐格人回溯到非洲的扩张和不同的起源。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Genome Biology
Genome Biology Biochemistry, Genetics and Molecular Biology-Genetics
CiteScore
21.00
自引率
3.30%
发文量
241
审稿时长
2 months
期刊介绍: Genome Biology stands as a premier platform for exceptional research across all domains of biology and biomedicine, explored through a genomic and post-genomic lens. With an impressive impact factor of 12.3 (2022),* the journal secures its position as the 3rd-ranked research journal in the Genetics and Heredity category and the 2nd-ranked research journal in the Biotechnology and Applied Microbiology category by Thomson Reuters. Notably, Genome Biology holds the distinction of being the highest-ranked open-access journal in this category. Our dedicated team of highly trained in-house Editors collaborates closely with our esteemed Editorial Board of international experts, ensuring the journal remains on the forefront of scientific advances and community standards. Regular engagement with researchers at conferences and institute visits underscores our commitment to staying abreast of the latest developments in the field.
期刊最新文献
Atlas of telomeric repeat diversity in Arabidopsis thaliana ESCHR: a hyperparameter-randomized ensemble approach for robust clustering across diverse datasets Splam: a deep-learning-based splice site predictor that improves spliced alignments Dimension reduction, cell clustering, and cell–cell communication inference for single-cell transcriptomics with DcjComm A comprehensive map of the aging blood methylome in humans
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1