Enhancing novel isoform discovery: leveraging nanopore long-read sequencing and machine learning approaches.

IF 2.5 3区生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Briefings in Functional Genomics Pub Date : 2024-12-06 DOI:10.1093/bfgp/elae031

Kristina Santucci, Yuning Cheng, Si-Mei Xu, Michael Janitz

{"title":"Enhancing novel isoform discovery: leveraging nanopore long-read sequencing and machine learning approaches.","authors":"Kristina Santucci, Yuning Cheng, Si-Mei Xu, Michael Janitz","doi":"10.1093/bfgp/elae031","DOIUrl":null,"url":null,"abstract":"<p><p>Long-read sequencing technologies can capture entire RNA transcripts in a single sequencing read, reducing the ambiguity in constructing and quantifying transcript models in comparison to more common and earlier methods, such as short-read sequencing. Recent improvements in the accuracy of long-read sequencing technologies have expanded the scope for novel splice isoform detection and have also enabled a far more accurate reconstruction of complex splicing patterns and transcriptomes. Additionally, the incorporation and advancements of machine learning and deep learning algorithms in bioinformatic software have significantly improved the reliability of long-read sequencing transcriptomic studies. However, there is a lack of consensus on what bioinformatic tools and pipelines produce the most precise and consistent results. Thus, this review aims to discuss and compare the performance of available methods for novel isoform discovery with long-read sequencing technologies, with 25 tools being presented. Furthermore, this review intends to demonstrate the need for developing standard analytical pipelines, tools, and transcript model conventions for novel isoform discovery and transcriptomic studies.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"683-694"},"PeriodicalIF":2.5000,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Briefings in Functional Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/bfgp/elae031","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Long-read sequencing technologies can capture entire RNA transcripts in a single sequencing read, reducing the ambiguity in constructing and quantifying transcript models in comparison to more common and earlier methods, such as short-read sequencing. Recent improvements in the accuracy of long-read sequencing technologies have expanded the scope for novel splice isoform detection and have also enabled a far more accurate reconstruction of complex splicing patterns and transcriptomes. Additionally, the incorporation and advancements of machine learning and deep learning algorithms in bioinformatic software have significantly improved the reliability of long-read sequencing transcriptomic studies. However, there is a lack of consensus on what bioinformatic tools and pipelines produce the most precise and consistent results. Thus, this review aims to discuss and compare the performance of available methods for novel isoform discovery with long-read sequencing technologies, with 25 tools being presented. Furthermore, this review intends to demonstrate the need for developing standard analytical pipelines, tools, and transcript model conventions for novel isoform discovery and transcriptomic studies.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

加强新型同工酶的发现：利用纳米孔长读数测序和机器学习方法。

与短线程测序等更常见和更早的方法相比，长线程测序技术可以在单个测序读数中捕获整个 RNA 转录本，从而减少了构建和量化转录本模型的模糊性。最近，长读程测序技术的准确性有所提高，扩大了新型剪接异构体的检测范围，也能更准确地重建复杂的剪接模式和转录组。此外，机器学习和深度学习算法在生物信息学软件中的应用和进步也大大提高了长片段测序转录组研究的可靠性。然而，对于什么样的生物信息学工具和管道能产生最精确、最一致的结果还缺乏共识。因此，本综述旨在讨论和比较利用长线程测序技术发现新型同工酶的现有方法的性能，共介绍了 25 种工具。此外，本综述还旨在说明有必要为新型同工酶发现和转录组研究开发标准的分析管道、工具和转录本模型惯例。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Briefings in Functional Genomics BIOTECHNOLOGY & APPLIED MICROBIOLOGY-GENETICS & HEREDITY

CiteScore

6.30

自引率

2.50%

发文量

审稿时长

6-12 weeks

期刊介绍： Briefings in Functional Genomics publishes high quality peer reviewed articles that focus on the use, development or exploitation of genomic approaches, and their application to all areas of biological research. As well as exploring thematic areas where these techniques and protocols are being used, articles review the impact that these approaches have had, or are likely to have, on their field. Subjects covered by the Journal include but are not restricted to: the identification and functional characterisation of coding and non-coding features in genomes, microarray technologies, gene expression profiling, next generation sequencing, pharmacogenomics, phenomics, SNP technologies, transgenic systems, mutation screens and genotyping. Articles range in scope and depth from the introductory level to specific details of protocols and analyses, encompassing bacterial, fungal, plant, animal and human data. The editorial board welcome the submission of review articles for publication. Essential criteria for the publication of papers is that they do not contain primary data, and that they are high quality, clearly written review articles which provide a balanced, highly informative and up to date perspective to researchers in the field of functional genomics.