An Island-Based Approach for Differential Expression Analysis.

Abdallah M Eteleeb, Robert M Flight, Benjamin J Harrison, Jeffrey C Petruska, Eric C Rouchka
{"title":"An Island-Based Approach for Differential Expression Analysis.","authors":"Abdallah M Eteleeb,&nbsp;Robert M Flight,&nbsp;Benjamin J Harrison,&nbsp;Jeffrey C Petruska,&nbsp;Eric C Rouchka","doi":"10.1145/2506583.2506589","DOIUrl":null,"url":null,"abstract":"<p><p>High-throughput mRNA sequencing (also known as RNA-Seq) promises to be the technique of choice for studying transcriptome profiles. This technique provides the ability to develop precise methodologies for transcript and gene expression quantification, novel transcript and exon discovery, and splice variant detection. One of the limitations of current RNA-Seq methods is the dependency on annotated biological features (e.g. exons, transcripts, genes) to detect expression differences across samples. This forces the identification of expression levels and the detection of significant changes to known genomic regions. Any significant changes that occur in unannotated regions will not be captured. To overcome this limitation, we developed a novel segmentation approach, Island-Based (IB), for analyzing differential expression in RNA-Seq and targeted sequencing (exome capture) data without specific knowledge of an isoform. The IB segmentation determines individual islands of expression based on windowed read counts that can be compared across experimental conditions to determine differential island expression. In order to detect differentially expressed genes, the significance of islands (<i>p</i>-values) are combined using <i>Fisher's</i> method. We tested and evaluated the performance of our approach by comparing it to the existing differentially expressed gene (DEG) methods: CuffDiff, DESeq, and edgeR using two benchmark MAQC RNA-Seq datasets. The IB algorithm outperforms all three methods in both datasets as illustrated by an increased auROC.</p>","PeriodicalId":90404,"journal":{"name":"2013 ACM Conference on Bioinformatics, Computational Biology and Biomedical Informatics : ACM - BCB 2013 : Washington, D.C., U.S.A., September 22 - 25, 2013. ACM Conference on Bioinformatics, Computational Biology and Biomedical Informa...","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2013-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/2506583.2506589","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 ACM Conference on Bioinformatics, Computational Biology and Biomedical Informatics : ACM - BCB 2013 : Washington, D.C., U.S.A., September 22 - 25, 2013. ACM Conference on Bioinformatics, Computational Biology and Biomedical Informa...","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2506583.2506589","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

High-throughput mRNA sequencing (also known as RNA-Seq) promises to be the technique of choice for studying transcriptome profiles. This technique provides the ability to develop precise methodologies for transcript and gene expression quantification, novel transcript and exon discovery, and splice variant detection. One of the limitations of current RNA-Seq methods is the dependency on annotated biological features (e.g. exons, transcripts, genes) to detect expression differences across samples. This forces the identification of expression levels and the detection of significant changes to known genomic regions. Any significant changes that occur in unannotated regions will not be captured. To overcome this limitation, we developed a novel segmentation approach, Island-Based (IB), for analyzing differential expression in RNA-Seq and targeted sequencing (exome capture) data without specific knowledge of an isoform. The IB segmentation determines individual islands of expression based on windowed read counts that can be compared across experimental conditions to determine differential island expression. In order to detect differentially expressed genes, the significance of islands (p-values) are combined using Fisher's method. We tested and evaluated the performance of our approach by comparing it to the existing differentially expressed gene (DEG) methods: CuffDiff, DESeq, and edgeR using two benchmark MAQC RNA-Seq datasets. The IB algorithm outperforms all three methods in both datasets as illustrated by an increased auROC.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于岛的差异表达分析方法。
高通量mRNA测序(也称为RNA-Seq)有望成为研究转录组谱的首选技术。这项技术为转录物和基因表达定量、新转录物和外显子发现以及剪接变异检测提供了精确的方法。当前RNA-Seq方法的局限性之一是依赖于注释的生物学特征(例如外显子,转录本,基因)来检测样品之间的表达差异。这迫使鉴定表达水平和检测显著变化的已知基因组区域。在未注释的区域中发生的任何重大更改都不会被捕获。为了克服这一限制,我们开发了一种新的分割方法,基于岛的(IB),用于分析RNA-Seq和靶向测序(外显子组捕获)数据中的差异表达,而不需要特定的同种异构体知识。IB分割基于窗口读取计数确定单个表达岛,可以在不同的实验条件下进行比较,以确定差异岛表达。为了检测差异表达的基因,使用Fisher的方法将岛的显著性(p值)结合起来。我们使用两个基准MAQC RNA-Seq数据集,将我们的方法与现有的差异表达基因(DEG)方法(CuffDiff、DESeq和edgeR)进行比较,测试和评估了我们的方法的性能。在两个数据集中,IB算法都优于所有三种方法,如增加的auROC所示。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
An Island-Based Approach for Differential Expression Analysis. Classification of Alzheimer Diagnosis from ADNI Plasma Biomarker Data. Three-Dimensional Spot Detection in Ratiometric Fluorescence Imaging For Measurement of Subcellular Organelles.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1