Boosting edgeR (Robust) by dealing with missing observations and gene-specific outliers in RNA-Seq profiles and its application to explore biomarker genes for diagnosis and therapies of ovarian cancer

IF 3.4 2区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Genomics Pub Date : 2024-03-26 DOI:10.1016/j.ygeno.2024.110834
Bandhan Sarker , Md. Matiur Rahaman , Muhammad Habibulla Alamin , Md. Ariful Islam , Md. Nurul Haque Mollah
{"title":"Boosting edgeR (Robust) by dealing with missing observations and gene-specific outliers in RNA-Seq profiles and its application to explore biomarker genes for diagnosis and therapies of ovarian cancer","authors":"Bandhan Sarker ,&nbsp;Md. Matiur Rahaman ,&nbsp;Muhammad Habibulla Alamin ,&nbsp;Md. Ariful Islam ,&nbsp;Md. Nurul Haque Mollah","doi":"10.1016/j.ygeno.2024.110834","DOIUrl":null,"url":null,"abstract":"<div><p>The edgeR (Robust) is a popular approach for identifying differentially expressed genes (DEGs) from RNA-Seq profiles. However, it shows weak performance against gene-specific outliers and is unable to handle missing observations. To address these issues, we proposed a pre-processing approach of RNA-Seq count data by combining the iLOO-based outlier detection and random forest-based missing imputation approach for boosting the performance of edgeR (Robust). Both simulation and real RNA-Seq count data analysis results showed that the proposed edgeR (Robust) outperformed than the conventional edgeR (Robust). To investigate the effectiveness of identified DEGs for diagnosis, and therapies of ovarian cancer (OC), we selected top-ranked 12 DEGs (<em>IL6, XCL1, CXCL8, C1QC, C1QB, SNAI2, TYROBP, COL1A2, SNAP25, NTS, CXCL2,</em> and <em>AGT</em>) and suggested hub-DEGs guided top-ranked 10 candidate drug-molecules for the treatment against OC. Hence, our proposed procedure might be an effective computational tool for exploring potential DEGs from RNA-Seq profiles for diagnosis and therapies of any disease.</p></div>","PeriodicalId":12521,"journal":{"name":"Genomics","volume":null,"pages":null},"PeriodicalIF":3.4000,"publicationDate":"2024-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0888754324000557/pdfft?md5=5947ffca20222991f38fad62d0e4ad43&pid=1-s2.0-S0888754324000557-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genomics","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0888754324000557","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

The edgeR (Robust) is a popular approach for identifying differentially expressed genes (DEGs) from RNA-Seq profiles. However, it shows weak performance against gene-specific outliers and is unable to handle missing observations. To address these issues, we proposed a pre-processing approach of RNA-Seq count data by combining the iLOO-based outlier detection and random forest-based missing imputation approach for boosting the performance of edgeR (Robust). Both simulation and real RNA-Seq count data analysis results showed that the proposed edgeR (Robust) outperformed than the conventional edgeR (Robust). To investigate the effectiveness of identified DEGs for diagnosis, and therapies of ovarian cancer (OC), we selected top-ranked 12 DEGs (IL6, XCL1, CXCL8, C1QC, C1QB, SNAI2, TYROBP, COL1A2, SNAP25, NTS, CXCL2, and AGT) and suggested hub-DEGs guided top-ranked 10 candidate drug-molecules for the treatment against OC. Hence, our proposed procedure might be an effective computational tool for exploring potential DEGs from RNA-Seq profiles for diagnosis and therapies of any disease.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过处理 RNA-Seq 图谱中的缺失观测值和基因特异性异常值来增强 edgeR(鲁棒性),并将其应用于探索卵巢癌诊断和治疗的生物标记基因。
edgeR(Robust)是从 RNA-Seq 图谱中识别差异表达基因(DEG)的常用方法。然而,它对特定基因异常值的处理能力较弱,而且无法处理缺失观测数据。为了解决这些问题,我们提出了一种 RNA-Seq 计数数据预处理方法,将基于 iLOO 的离群点检测和基于随机森林的缺失归因方法结合起来,以提高 edgeR(Robust)的性能。模拟和真实的 RNA-Seq 计数数据分析结果表明,提出的 edgeR (Robust) 优于传统的 edgeR (Robust)。为了研究已识别的 DEGs 对卵巢癌(OC)诊断和治疗的有效性,我们选择了排名前 12 位的 DEGs(IL6、XCL1、CXCL8、C1QC、C1QB、SNAI2、TYROBP、COL1A2、SNAP25、NTS、CXCL2 和 AGT),并建议枢纽 DEGs 引导排名前 10 位的候选药物分子用于治疗 OC。因此,我们提出的程序可能是一种有效的计算工具,可从 RNA-Seq 图谱中探索潜在的 DEGs,用于任何疾病的诊断和治疗。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Genomics
Genomics 生物-生物工程与应用微生物
CiteScore
9.60
自引率
2.30%
发文量
260
审稿时长
60 days
期刊介绍: Genomics is a forum for describing the development of genome-scale technologies and their application to all areas of biological investigation. As a journal that has evolved with the field that carries its name, Genomics focuses on the development and application of cutting-edge methods, addressing fundamental questions with potential interest to a wide audience. Our aim is to publish the highest quality research and to provide authors with rapid, fair and accurate review and publication of manuscripts falling within our scope.
期刊最新文献
Key role of CYP17A1 in Leydig cell function and testicular development in Qianbei Ma goats. Multiomics reveals blood differential metabolites and differential genes in the early onset of ketosis in dairy cows STRIP2 is regulated by the transcription factor Sp1 and promotes lung adenocarcinoma progression via activating the PI3K/AKT/mTOR/MYC signaling pathway Brain lncRNA-mRNA co-expression regulatory networks and alcohol use disorder Whole-genome sequence of Sclerotium delphinii, a pathogenic fungus of Dendrobium officinale southern blight
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1