基因组测序和注释的计算环境:生命研究人员项目应用的工作流程

Matheus Pedron Cassol, Alexandre Rafael Lenz, Rudinei Zacaria, Scheila de Avila e Silva
{"title":"基因组测序和注释的计算环境:生命研究人员项目应用的工作流程","authors":"Matheus Pedron Cassol, Alexandre Rafael Lenz, Rudinei Zacaria, Scheila de Avila e Silva","doi":"10.36995/j.recyt.2022.38.008","DOIUrl":null,"url":null,"abstract":"The current paper seeks to approach, using a workflow, basics subjects of the bioinformatic field and also useful informations to consider during the development of in silico researches. Installation and general usage of multiple softwares related to different sections of the genome annotation process were also presented. At last, an model organism, Staphylococcus aureus, was sequenced in two different softwares, SPAdes and IDBA-UD, seeking further comparison and evaluation of the process as a whole. The quality evaluation of the assemble was established by tests on QUAST, BUSCO and Augustus, supported by BLASTP. Results: QUAST evaluation returned genome coverage values above 98% in both test cases, pointing towards a trustworthy assemble for this organism. Via SPAdes were needed less computational resources, but, using IDBA-UD the sequences found were more contiguous. Results deriving from BUSCO showed only one expected gene difference. Some proteins and genes predicted by Augustus led to hits, sequences already studied in that organism, using the BLASTP program.","PeriodicalId":21243,"journal":{"name":"Revista de Ciencia y Tecnología","volume":"40 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Computational Environment for Genomic Sequencing and Annotation: a workflow for application in projects by life researchers\",\"authors\":\"Matheus Pedron Cassol, Alexandre Rafael Lenz, Rudinei Zacaria, Scheila de Avila e Silva\",\"doi\":\"10.36995/j.recyt.2022.38.008\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The current paper seeks to approach, using a workflow, basics subjects of the bioinformatic field and also useful informations to consider during the development of in silico researches. Installation and general usage of multiple softwares related to different sections of the genome annotation process were also presented. At last, an model organism, Staphylococcus aureus, was sequenced in two different softwares, SPAdes and IDBA-UD, seeking further comparison and evaluation of the process as a whole. The quality evaluation of the assemble was established by tests on QUAST, BUSCO and Augustus, supported by BLASTP. Results: QUAST evaluation returned genome coverage values above 98% in both test cases, pointing towards a trustworthy assemble for this organism. Via SPAdes were needed less computational resources, but, using IDBA-UD the sequences found were more contiguous. Results deriving from BUSCO showed only one expected gene difference. Some proteins and genes predicted by Augustus led to hits, sequences already studied in that organism, using the BLASTP program.\",\"PeriodicalId\":21243,\"journal\":{\"name\":\"Revista de Ciencia y Tecnología\",\"volume\":\"40 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Revista de Ciencia y Tecnología\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.36995/j.recyt.2022.38.008\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Revista de Ciencia y Tecnología","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36995/j.recyt.2022.38.008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本论文试图利用工作流程,探讨生物信息学领域的基本主题,以及在计算机研究发展过程中考虑的有用信息。还介绍了与基因组注释过程的不同部分相关的多个软件的安装和一般用法。最后,利用SPAdes和IDBA-UD两种不同的软件对模式生物金黄色葡萄球菌(Staphylococcus aureus)进行测序,进一步对整个过程进行比较和评价。在BLASTP的支持下,通过QUAST, BUSCO和Augustus测试建立了组件的质量评估。结果:QUAST评估在两个测试案例中返回的基因组覆盖率值均超过98%,指向该生物体的可靠组合。通过SPAdes所需的计算资源较少,但使用IDBA-UD找到的序列更连续。来自BUSCO的结果显示只有一个预期的基因差异。奥古斯都预测的一些蛋白质和基因导致了撞击,这些序列已经在该生物体中使用BLASTP程序进行了研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Computational Environment for Genomic Sequencing and Annotation: a workflow for application in projects by life researchers
The current paper seeks to approach, using a workflow, basics subjects of the bioinformatic field and also useful informations to consider during the development of in silico researches. Installation and general usage of multiple softwares related to different sections of the genome annotation process were also presented. At last, an model organism, Staphylococcus aureus, was sequenced in two different softwares, SPAdes and IDBA-UD, seeking further comparison and evaluation of the process as a whole. The quality evaluation of the assemble was established by tests on QUAST, BUSCO and Augustus, supported by BLASTP. Results: QUAST evaluation returned genome coverage values above 98% in both test cases, pointing towards a trustworthy assemble for this organism. Via SPAdes were needed less computational resources, but, using IDBA-UD the sequences found were more contiguous. Results deriving from BUSCO showed only one expected gene difference. Some proteins and genes predicted by Augustus led to hits, sequences already studied in that organism, using the BLASTP program.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Pb, Cd, and As in soils and yerba mate branches from different cultivation areas Evaluation of cardiovascular risk in obese children and adolescents attended at a Level III hospital in Posadas, Misiones, Argentina Resilience profile of students of the National University of Misiones (UNaM) First report of pseudogenes of SABATH enzymes from xanthinesmetabolic pathway in Ilex paraguariensis Biological activity of Lemongrass (Cymbopogon citratus) and its applications in industry
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1