Steganinae果蝇(双翅目:果蝇科)的系统基因组研究:强大的基因树异质性和单系性证据。

IF 3.4 Q1 Agricultural and Biological Sciences BMC Evolutionary Biology Pub Date : 2020-11-02 DOI:10.1186/s12862-020-01703-7
Guilherme Rezende Dias, Eduardo Guimarães Dupim, Thyago Vanderlinde, Beatriz Mello, Antonio Bernardo Carvalho
{"title":"Steganinae果蝇(双翅目:果蝇科)的系统基因组研究:强大的基因树异质性和单系性证据。","authors":"Guilherme Rezende Dias,&nbsp;Eduardo Guimarães Dupim,&nbsp;Thyago Vanderlinde,&nbsp;Beatriz Mello,&nbsp;Antonio Bernardo Carvalho","doi":"10.1186/s12862-020-01703-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The Drosophilidae family is traditionally divided into two subfamilies: Drosophilinae and Steganinae. This division is based on morphological characters, and the two subfamilies have been treated as monophyletic in most of the literature, but some molecular phylogenies have suggested Steganinae to be paraphyletic. To test the paraphyletic-Steganinae hypothesis, here, we used genomic sequences of eight Drosophilidae (three Steganinae and five Drosophilinae) and two Ephydridae (outgroup) species and inferred the phylogeny for the group based on a dataset of 1,028 orthologous genes present in all species (> 1,000,000 bp). This dataset includes three genera that broke the monophyly of the subfamilies in previous works. To investigate possible biases introduced by small sample sizes and automatic gene annotation, we used the same methods to infer species trees from a set of 10 manually annotated genes that are commonly used in phylogenetics.</p><p><strong>Results: </strong>Most of the 1,028 gene trees depicted Steganinae as paraphyletic with distinct topologies, but the most common topology depicted it as monophyletic (43.7% of the gene trees). Despite the high levels of gene tree heterogeneity observed, species tree inference in ASTRAL, in PhyloNet, and with the concatenation approach strongly supported the monophyly of both subfamilies for the 1,028-gene dataset. However, when using the concatenation approach to infer a species tree from the smaller set of 10 genes, we recovered Steganinae as a paraphyletic group. The pattern of gene tree heterogeneity was asymmetrical and thus could not be explained solely by incomplete lineage sorting (ILS).</p><p><strong>Conclusions: </strong>Steganinae was clearly a monophyletic group in the dataset that we analyzed. In addition to ILS, gene tree discordance was possibly the result of introgression, suggesting complex branching processes during the early evolution of Drosophilidae with short speciation intervals and gene flow. Our study highlights the importance of genomic data in elucidating contentious phylogenetic relationships and suggests that phylogenetic inference for drosophilids based on small molecular datasets should be performed cautiously. Finally, we suggest an approach for the correction and cleaning of BUSCO-derived genomic datasets that will be useful to other researchers planning to use this tool for phylogenomic studies.</p>","PeriodicalId":9111,"journal":{"name":"BMC Evolutionary Biology","volume":null,"pages":null},"PeriodicalIF":3.4000,"publicationDate":"2020-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s12862-020-01703-7","citationCount":"6","resultStr":"{\"title\":\"A phylogenomic study of Steganinae fruit flies (Diptera: Drosophilidae): strong gene tree heterogeneity and evidence for monophyly.\",\"authors\":\"Guilherme Rezende Dias,&nbsp;Eduardo Guimarães Dupim,&nbsp;Thyago Vanderlinde,&nbsp;Beatriz Mello,&nbsp;Antonio Bernardo Carvalho\",\"doi\":\"10.1186/s12862-020-01703-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The Drosophilidae family is traditionally divided into two subfamilies: Drosophilinae and Steganinae. This division is based on morphological characters, and the two subfamilies have been treated as monophyletic in most of the literature, but some molecular phylogenies have suggested Steganinae to be paraphyletic. To test the paraphyletic-Steganinae hypothesis, here, we used genomic sequences of eight Drosophilidae (three Steganinae and five Drosophilinae) and two Ephydridae (outgroup) species and inferred the phylogeny for the group based on a dataset of 1,028 orthologous genes present in all species (> 1,000,000 bp). This dataset includes three genera that broke the monophyly of the subfamilies in previous works. To investigate possible biases introduced by small sample sizes and automatic gene annotation, we used the same methods to infer species trees from a set of 10 manually annotated genes that are commonly used in phylogenetics.</p><p><strong>Results: </strong>Most of the 1,028 gene trees depicted Steganinae as paraphyletic with distinct topologies, but the most common topology depicted it as monophyletic (43.7% of the gene trees). Despite the high levels of gene tree heterogeneity observed, species tree inference in ASTRAL, in PhyloNet, and with the concatenation approach strongly supported the monophyly of both subfamilies for the 1,028-gene dataset. However, when using the concatenation approach to infer a species tree from the smaller set of 10 genes, we recovered Steganinae as a paraphyletic group. The pattern of gene tree heterogeneity was asymmetrical and thus could not be explained solely by incomplete lineage sorting (ILS).</p><p><strong>Conclusions: </strong>Steganinae was clearly a monophyletic group in the dataset that we analyzed. In addition to ILS, gene tree discordance was possibly the result of introgression, suggesting complex branching processes during the early evolution of Drosophilidae with short speciation intervals and gene flow. Our study highlights the importance of genomic data in elucidating contentious phylogenetic relationships and suggests that phylogenetic inference for drosophilids based on small molecular datasets should be performed cautiously. Finally, we suggest an approach for the correction and cleaning of BUSCO-derived genomic datasets that will be useful to other researchers planning to use this tool for phylogenomic studies.</p>\",\"PeriodicalId\":9111,\"journal\":{\"name\":\"BMC Evolutionary Biology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2020-11-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1186/s12862-020-01703-7\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Evolutionary Biology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/s12862-020-01703-7\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Agricultural and Biological Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Evolutionary Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s12862-020-01703-7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Agricultural and Biological Sciences","Score":null,"Total":0}
引用次数: 6

摘要

背景:果蝇科传统上分为两个亚科:果蝇科和隐蝇科。这种划分是基于形态特征,在大多数文献中,这两个亚科被认为是单系的,但一些分子系统发育表明Steganinae是副系的。为了验证这一假说,我们使用了8个果蝇科(3个剑蝇科和5个果蝇科)和2个剑蝇科(外群)物种的基因组序列,并基于所有物种(> 1,000,000 bp)中存在的1,028个同源基因数据集推断了该群体的系统发育。该数据集包括三个属,打破了以往工作中亚科的单一性。为了研究小样本量和自动基因注释可能带来的偏差,我们使用相同的方法从一组10个系统发育中常用的人工注释基因中推断物种树。结果:在1028个基因树中,大多数将Steganinae描述为具有不同拓扑结构的副葡萄系,但最常见的拓扑结构将其描述为单系(43.7%)。尽管观察到高水平的基因树异质性,但ASTRAL、PhyloNet和连接方法中的物种树推断强烈支持1,028个基因数据集中两个亚科的单系性。然而,当使用串联方法从较小的10个基因集推断物种树时,我们恢复了Steganinae作为一个副类群。基因树异质性的模式是不对称的,因此不能完全用不完全谱系分类(ILS)来解释。结论:在我们分析的数据集中,Steganinae显然是一个单系类群。除ILS外,基因树不一致可能是基因渗入的结果,表明果蝇在早期进化过程中存在复杂的分支过程,物种形成间隔短,基因流动频繁。我们的研究强调了基因组数据在阐明有争议的系统发育关系中的重要性,并表明基于小分子数据集的果蝇系统发育推断应谨慎进行。最后,我们提出了一种校正和清理busco衍生基因组数据集的方法,这将对其他计划使用该工具进行系统基因组学研究的研究人员有用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

摘要图片

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A phylogenomic study of Steganinae fruit flies (Diptera: Drosophilidae): strong gene tree heterogeneity and evidence for monophyly.

Background: The Drosophilidae family is traditionally divided into two subfamilies: Drosophilinae and Steganinae. This division is based on morphological characters, and the two subfamilies have been treated as monophyletic in most of the literature, but some molecular phylogenies have suggested Steganinae to be paraphyletic. To test the paraphyletic-Steganinae hypothesis, here, we used genomic sequences of eight Drosophilidae (three Steganinae and five Drosophilinae) and two Ephydridae (outgroup) species and inferred the phylogeny for the group based on a dataset of 1,028 orthologous genes present in all species (> 1,000,000 bp). This dataset includes three genera that broke the monophyly of the subfamilies in previous works. To investigate possible biases introduced by small sample sizes and automatic gene annotation, we used the same methods to infer species trees from a set of 10 manually annotated genes that are commonly used in phylogenetics.

Results: Most of the 1,028 gene trees depicted Steganinae as paraphyletic with distinct topologies, but the most common topology depicted it as monophyletic (43.7% of the gene trees). Despite the high levels of gene tree heterogeneity observed, species tree inference in ASTRAL, in PhyloNet, and with the concatenation approach strongly supported the monophyly of both subfamilies for the 1,028-gene dataset. However, when using the concatenation approach to infer a species tree from the smaller set of 10 genes, we recovered Steganinae as a paraphyletic group. The pattern of gene tree heterogeneity was asymmetrical and thus could not be explained solely by incomplete lineage sorting (ILS).

Conclusions: Steganinae was clearly a monophyletic group in the dataset that we analyzed. In addition to ILS, gene tree discordance was possibly the result of introgression, suggesting complex branching processes during the early evolution of Drosophilidae with short speciation intervals and gene flow. Our study highlights the importance of genomic data in elucidating contentious phylogenetic relationships and suggests that phylogenetic inference for drosophilids based on small molecular datasets should be performed cautiously. Finally, we suggest an approach for the correction and cleaning of BUSCO-derived genomic datasets that will be useful to other researchers planning to use this tool for phylogenomic studies.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Evolutionary Biology
BMC Evolutionary Biology 生物-进化生物学
CiteScore
5.80
自引率
0.00%
发文量
0
审稿时长
6 months
期刊介绍: BMC Evolutionary Biology is an open access, peer-reviewed journal that considers articles on all aspects of molecular and non-molecular evolution of all organisms, as well as phylogenetics and palaeontology.
期刊最新文献
Correlation between bioluminescent blinks and swimming behavior in the splitfin flashlight fish Anomalops katoptron Home range of three turtle species in Central Yucatan. A comparative study Variation and plasticity in life-history traits and fitness of wild Arabidopsis thaliana populations are not related to their genotypic and ecological diversity Does local soil factor drive functional leaf trait variation? A test on Neilingding Island, South China Biodiversity and distribution patterns of blooming jellyfish in the Bohai Sea revealed by eDNA metabarcoding
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1