Benefits of alignment quality-control processing steps and an Angiosperms353 phylogenomics pipeline applied to the Celastrales

IF 3.9 2区 生物学 Q1 EVOLUTIONARY BIOLOGY Cladistics Pub Date : 2022-05-15 DOI:10.1111/cla.12507
Mark P. Simmons, Olivier Maurin, Paul Bailey, Grace E. Brewer, Shyamali Roy, Julio A. Lombardi, Félix Forest, William J. Baker
{"title":"Benefits of alignment quality-control processing steps and an Angiosperms353 phylogenomics pipeline applied to the Celastrales","authors":"Mark P. Simmons,&nbsp;Olivier Maurin,&nbsp;Paul Bailey,&nbsp;Grace E. Brewer,&nbsp;Shyamali Roy,&nbsp;Julio A. Lombardi,&nbsp;Félix Forest,&nbsp;William J. Baker","doi":"10.1111/cla.12507","DOIUrl":null,"url":null,"abstract":"<p>We examined the impact of successive alignment quality-control steps on downstream phylogenomic analyses. We applied a recently published phylogenomics pipeline that was developed for the Angiosperms353 target-sequence-capture probe set to the flowering plant order Celastrales. Our final dataset consists of 158 species, including at least one exemplar from all 109 currently recognized Celastrales genera. We performed nine quality-control steps and compared the inferred resolution, branch support, and topological congruence of the inferred gene and species trees with those generated after each of the first six steps. We describe and justify each of our quality-control steps, including manual masking, in detail so that they may be readily applied to other lineages. We found that highly supported clades could generally be relied upon even if stringent orthology and alignment quality-control measures had not been applied. But separate instances were identified, for both concatenation and coalescence, wherein a clade was highly supported before manual masking but then subsequently contradicted. These results are generally reassuring for broad-scale analyses that use phylogenomics pipelines, but also indicate that we cannot rely exclusively on these analyses to conclude how challenging phylogenetic problems are best resolved.</p>","PeriodicalId":50688,"journal":{"name":"Cladistics","volume":"38 5","pages":"595-611"},"PeriodicalIF":3.9000,"publicationDate":"2022-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cladistics","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/cla.12507","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EVOLUTIONARY BIOLOGY","Score":null,"Total":0}
引用次数: 1

Abstract

We examined the impact of successive alignment quality-control steps on downstream phylogenomic analyses. We applied a recently published phylogenomics pipeline that was developed for the Angiosperms353 target-sequence-capture probe set to the flowering plant order Celastrales. Our final dataset consists of 158 species, including at least one exemplar from all 109 currently recognized Celastrales genera. We performed nine quality-control steps and compared the inferred resolution, branch support, and topological congruence of the inferred gene and species trees with those generated after each of the first six steps. We describe and justify each of our quality-control steps, including manual masking, in detail so that they may be readily applied to other lineages. We found that highly supported clades could generally be relied upon even if stringent orthology and alignment quality-control measures had not been applied. But separate instances were identified, for both concatenation and coalescence, wherein a clade was highly supported before manual masking but then subsequently contradicted. These results are generally reassuring for broad-scale analyses that use phylogenomics pipelines, but also indicate that we cannot rely exclusively on these analyses to conclude how challenging phylogenetic problems are best resolved.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
校准质量控制处理步骤和应用于Celastrales的Angiosperms353系统基因组学管道的好处
我们研究了连续的序列质量控制步骤对下游系统基因组分析的影响。我们应用了最近发表的系统基因组学管道,该管道是为开花植物目Celastrales的Angiosperms353目标序列捕获探针开发的。我们最终的数据集包括158个物种,包括目前已知的所有109个Celastrales属中的至少一个样本。我们执行了9个质量控制步骤,并将推断出的基因和物种树的分辨率、分支支持度和拓扑一致性与前6个步骤后产生的结果进行了比较。我们详细地描述并证明了我们的每一个质量控制步骤,包括手动掩蔽,以便它们可以很容易地应用于其他谱系。我们发现,即使没有应用严格的正形学和对准质量控制措施,高度支持的枝通常也可以依赖。但是,在手工掩蔽之前,一个分支被高度支持,但随后又被反驳,从而确定了连接和合并的单独实例。这些结果对于使用系统基因组学管道的大规模分析通常是令人放心的,但也表明我们不能完全依赖这些分析来得出如何最好地解决具有挑战性的系统发育问题的结论。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Cladistics
Cladistics 生物-进化生物学
CiteScore
8.60
自引率
5.60%
发文量
34
期刊介绍: Cladistics publishes high quality research papers on systematics, encouraging debate on all aspects of the field, from philosophy, theory and methodology to empirical studies and applications in biogeography, coevolution, conservation biology, ontogeny, genomics and paleontology. Cladistics is read by scientists working in the research fields of evolution, systematics and integrative biology and enjoys a consistently high position in the ISI® rankings for evolutionary biology.
期刊最新文献
Issue Information Incomplete barriers to heterospecific mating among Somatochlora species (Odonata: Corduliidae) as revealed in multi-gene phylogenies Rethinking spatial history: envisioning a mechanistic historical biogeography Robust phylogenomics settles controversies of classification and reveals evolution of male embolic complex of the Laufeia clade (Araneae, Salticidae, Euophryini) Issue Information
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1