ARGNet:使用深度神经网络从序列中对抗生素耐药基因进行稳健识别和分类。

IF 13.8 1区 生物学 Q1 MICROBIOLOGY Microbiome Pub Date : 2024-05-09 DOI:10.1186/s40168-024-01805-0
Yao Pei, Marcus Ho-Hin Shum, Yunshi Liao, Vivian W Leung, Yu-Nong Gong, David K Smith, Xiaole Yin, Yi Guan, Ruibang Luo, Tong Zhang, Tommy Tsan-Yuk Lam
{"title":"ARGNet:使用深度神经网络从序列中对抗生素耐药基因进行稳健识别和分类。","authors":"Yao Pei, Marcus Ho-Hin Shum, Yunshi Liao, Vivian W Leung, Yu-Nong Gong, David K Smith, Xiaole Yin, Yi Guan, Ruibang Luo, Tong Zhang, Tommy Tsan-Yuk Lam","doi":"10.1186/s40168-024-01805-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Emergence of antibiotic resistance in bacteria is an important threat to global health. Antibiotic resistance genes (ARGs) are some of the key components to define bacterial resistance and their spread in different environments. Identification of ARGs, particularly from high-throughput sequencing data of the specimens, is the state-of-the-art method for comprehensively monitoring their spread and evolution. Current computational methods to identify ARGs mainly rely on alignment-based sequence similarities with known ARGs. Such approaches are limited by choice of reference databases and may potentially miss novel ARGs. The similarity thresholds are usually simple and could not accommodate variations across different gene families and regions. It is also difficult to scale up when sequence data are increasing.</p><p><strong>Results: </strong>In this study, we developed ARGNet, a deep neural network that incorporates an unsupervised learning autoencoder model to identify ARGs and a multiclass classification convolutional neural network to classify ARGs that do not depend on sequence alignment. This approach enables a more efficient discovery of both known and novel ARGs. ARGNet accepts both amino acid and nucleotide sequences of variable lengths, from partial (30-50 aa; 100-150 nt) sequences to full-length protein or genes, allowing its application in both target sequencing and metagenomic sequencing. Our performance evaluation showed that ARGNet outperformed other deep learning models including DeepARG and HMD-ARG in most of the application scenarios especially quasi-negative test and the analysis of prediction consistency with phylogenetic tree. ARGNet has a reduced inference runtime by up to 57% relative to DeepARG.</p><p><strong>Conclusions: </strong>ARGNet is flexible, efficient, and accurate at predicting a broad range of ARGs from the sequencing data. ARGNet is freely available at https://github.com/id-bioinfo/ARGNet , with an online service provided at https://ARGNet.hku.hk . Video Abstract.</p>","PeriodicalId":18447,"journal":{"name":"Microbiome","volume":null,"pages":null},"PeriodicalIF":13.8000,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11080312/pdf/","citationCount":"0","resultStr":"{\"title\":\"ARGNet: using deep neural networks for robust identification and classification of antibiotic resistance genes from sequences.\",\"authors\":\"Yao Pei, Marcus Ho-Hin Shum, Yunshi Liao, Vivian W Leung, Yu-Nong Gong, David K Smith, Xiaole Yin, Yi Guan, Ruibang Luo, Tong Zhang, Tommy Tsan-Yuk Lam\",\"doi\":\"10.1186/s40168-024-01805-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Emergence of antibiotic resistance in bacteria is an important threat to global health. Antibiotic resistance genes (ARGs) are some of the key components to define bacterial resistance and their spread in different environments. Identification of ARGs, particularly from high-throughput sequencing data of the specimens, is the state-of-the-art method for comprehensively monitoring their spread and evolution. Current computational methods to identify ARGs mainly rely on alignment-based sequence similarities with known ARGs. Such approaches are limited by choice of reference databases and may potentially miss novel ARGs. The similarity thresholds are usually simple and could not accommodate variations across different gene families and regions. It is also difficult to scale up when sequence data are increasing.</p><p><strong>Results: </strong>In this study, we developed ARGNet, a deep neural network that incorporates an unsupervised learning autoencoder model to identify ARGs and a multiclass classification convolutional neural network to classify ARGs that do not depend on sequence alignment. This approach enables a more efficient discovery of both known and novel ARGs. ARGNet accepts both amino acid and nucleotide sequences of variable lengths, from partial (30-50 aa; 100-150 nt) sequences to full-length protein or genes, allowing its application in both target sequencing and metagenomic sequencing. Our performance evaluation showed that ARGNet outperformed other deep learning models including DeepARG and HMD-ARG in most of the application scenarios especially quasi-negative test and the analysis of prediction consistency with phylogenetic tree. ARGNet has a reduced inference runtime by up to 57% relative to DeepARG.</p><p><strong>Conclusions: </strong>ARGNet is flexible, efficient, and accurate at predicting a broad range of ARGs from the sequencing data. ARGNet is freely available at https://github.com/id-bioinfo/ARGNet , with an online service provided at https://ARGNet.hku.hk . Video Abstract.</p>\",\"PeriodicalId\":18447,\"journal\":{\"name\":\"Microbiome\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":13.8000,\"publicationDate\":\"2024-05-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11080312/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Microbiome\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s40168-024-01805-0\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microbiome","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s40168-024-01805-0","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:细菌中出现的抗生素耐药性是对全球健康的重要威胁。抗生素耐药性基因(ARGs)是确定细菌耐药性及其在不同环境中传播的一些关键要素。鉴定 ARGs,特别是从标本的高通量测序数据中鉴定 ARGs,是全面监测其传播和进化的最先进方法。目前识别 ARGs 的计算方法主要依赖于与已知 ARGs 的基于比对的序列相似性。这种方法受到参考数据库选择的限制,可能会遗漏新的 ARGs。相似性阈值通常比较简单,不能适应不同基因家族和区域的变化。当序列数据不断增加时,这种方法也很难扩展:在这项研究中,我们开发了一种深度神经网络 ARGNet,它结合了一个无监督学习的自动编码器模型来识别 ARGs,并结合了一个多类分类卷积神经网络来对不依赖序列比对的 ARGs 进行分类。这种方法能更有效地发现已知和新的 ARGs。ARGNet 可接受不同长度的氨基酸和核苷酸序列,从部分序列(30-50 aa; 100-150 nt)到全长蛋白质或基因,因此可应用于目标测序和元基因组测序。性能评估结果表明,ARGNet 在大多数应用场景中都优于其他深度学习模型,包括 DeepARG 和 HMD-ARG,特别是在准负性测试和分析预测与系统发生树的一致性方面。与 DeepARG 相比,ARGNet 的推理运行时间最多缩短了 57%:ARGNet在从测序数据中预测各种ARG方面灵活、高效、准确。ARGNet 可在 https://github.com/id-bioinfo/ARGNet 上免费获取,并可在 https://ARGNet.hku.hk 上提供在线服务。视频摘要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
ARGNet: using deep neural networks for robust identification and classification of antibiotic resistance genes from sequences.

Background: Emergence of antibiotic resistance in bacteria is an important threat to global health. Antibiotic resistance genes (ARGs) are some of the key components to define bacterial resistance and their spread in different environments. Identification of ARGs, particularly from high-throughput sequencing data of the specimens, is the state-of-the-art method for comprehensively monitoring their spread and evolution. Current computational methods to identify ARGs mainly rely on alignment-based sequence similarities with known ARGs. Such approaches are limited by choice of reference databases and may potentially miss novel ARGs. The similarity thresholds are usually simple and could not accommodate variations across different gene families and regions. It is also difficult to scale up when sequence data are increasing.

Results: In this study, we developed ARGNet, a deep neural network that incorporates an unsupervised learning autoencoder model to identify ARGs and a multiclass classification convolutional neural network to classify ARGs that do not depend on sequence alignment. This approach enables a more efficient discovery of both known and novel ARGs. ARGNet accepts both amino acid and nucleotide sequences of variable lengths, from partial (30-50 aa; 100-150 nt) sequences to full-length protein or genes, allowing its application in both target sequencing and metagenomic sequencing. Our performance evaluation showed that ARGNet outperformed other deep learning models including DeepARG and HMD-ARG in most of the application scenarios especially quasi-negative test and the analysis of prediction consistency with phylogenetic tree. ARGNet has a reduced inference runtime by up to 57% relative to DeepARG.

Conclusions: ARGNet is flexible, efficient, and accurate at predicting a broad range of ARGs from the sequencing data. ARGNet is freely available at https://github.com/id-bioinfo/ARGNet , with an online service provided at https://ARGNet.hku.hk . Video Abstract.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Microbiome
Microbiome MICROBIOLOGY-
CiteScore
21.90
自引率
2.60%
发文量
198
审稿时长
4 weeks
期刊介绍: Microbiome is a journal that focuses on studies of microbiomes in humans, animals, plants, and the environment. It covers both natural and manipulated microbiomes, such as those in agriculture. The journal is interested in research that uses meta-omics approaches or novel bioinformatics tools and emphasizes the community/host interaction and structure-function relationship within the microbiome. Studies that go beyond descriptive omics surveys and include experimental or theoretical approaches will be considered for publication. The journal also encourages research that establishes cause and effect relationships and supports proposed microbiome functions. However, studies of individual microbial isolates/species without exploring their impact on the host or the complex microbiome structures and functions will not be considered for publication. Microbiome is indexed in BIOSIS, Current Contents, DOAJ, Embase, MEDLINE, PubMed, PubMed Central, and Science Citations Index Expanded.
期刊最新文献
Microbial colonisation rewires the composition and content of poplar root exudates, root and shoot metabolomes Succession of rumen microbiota and metabolites across different reproductive periods in different sheep breeds and their impact on the growth and development of offspring lambs Predicting how varying moisture conditions impact the microbiome of dust collected from the International Space Station Microbial mechanisms for higher hydrogen production in anaerobic digestion at constant temperature versus gradient heating Enrichment of novel entomopathogenic Pseudomonas species enhances willow resistance to leaf beetles
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1