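Auto-phylo v2 and auto-phylo-pipeliner: building advanced, flexible, and reusable pipelines for phylogenetic inferences, estimation of variability levels and identification of positively selected amino acid sites.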

Journal of Integrative Bioinformatics (IF 1.5; Q3, Mathematical & Computational Biology). Pub Date: 2024-03-27; eCollection Date: 2024-06-01; DOI: 10.1515/jib-2023-0046
Hugo López-Fernández, Miguel Pinto, Cristina P Vieira, Pedro Duque, Miguel Reboiro-Jato, Jorge Vieira
Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11378518/pdf/
Citations: 0

Abstract

Auto-phylo v2 and auto-phylo-pipeliner: building advanced, flexible, and reusable pipelines for phylogenetic inferences, estimation of variability levels and identification of positively selected amino acid sites.

The vast amount of genome sequence data that is available, and that is predicted to increase drastically in the near future, can only be dealt with efficiently by building automated pipelines. Indeed, the Earth Biogenome Project will produce high-quality reference genome sequences for all 1.8 million named living eukaryote species, providing unprecedented insight into the evolution of genes and gene families, and thus into biological issues. Here, new modules for gene annotation, further BLAST search algorithms, further multiple sequence alignment methods, the addition of reference sequences, further tree rooting methods, the estimation of rates of synonymous and nonsynonymous substitutions, and the identification of positively selected amino acid sites have been added to auto-phylo (version 2), recently developed software for addressing biological problems using phylogenetic inferences. Additionally, we present auto-phylo-pipeliner, a graphical user interface application that further facilitates the creation and running of auto-phylo pipelines. Inferences on S-RNase specificity are critical both for cross-based breeding and for the establishment of pollination requirements. Therefore, as a test case, we develop an auto-phylo pipeline to identify amino acid sites under positive selection, which are, in principle, those determining S-RNase specificity, starting from both non-annotated Prunus genomes and sequences available in public databases.
