MLML2R: an R package for maximum likelihood estimation of DNA methylation and hydroxymethylation proportions.

IF 0.9 4区 数学 Q3 Mathematics Statistical Applications in Genetics and Molecular Biology Pub Date : 2019-01-17 DOI:10.1515/sagmb-2018-0031
Samara F Kiihl, Maria Jose Martinez-Garrido, Arce Domingo-Relloso, Jose Bermudez, Maria Tellez-Plaza
{"title":"MLML2R: an R package for maximum likelihood estimation of DNA methylation and hydroxymethylation proportions.","authors":"Samara F Kiihl,&nbsp;Maria Jose Martinez-Garrido,&nbsp;Arce Domingo-Relloso,&nbsp;Jose Bermudez,&nbsp;Maria Tellez-Plaza","doi":"10.1515/sagmb-2018-0031","DOIUrl":null,"url":null,"abstract":"<p><p>Accurately measuring epigenetic marks such as 5-methylcytosine (5-mC) and 5-hydroxymethylcytosine (5-hmC) at the single-nucleotide level, requires combining data from DNA processing methods including traditional (BS), oxidative (oxBS) or Tet-Assisted (TAB) bisulfite conversion. We introduce the R package MLML2R, which provides maximum likelihood estimates (MLE) of 5-mC and 5-hmC proportions. While all other available R packages provide 5-mC and 5-hmC MLEs only for the oxBS+BS combination, MLML2R also provides MLE for TAB combinations. For combinations of any two of the methods, we derived the pool-adjacent-violators algorithm (PAVA) exact constrained MLE in analytical form. For the three methods combination, we implemented both the iterative method by Qu et al. [Qu, J., M. Zhou, Q. Song, E. E. Hong and A. D. Smith (2013): \"Mlml: consistent simultaneous estimates of dna methylation and hydroxymethylation,\" Bioinformatics, 29, 2645-2646.], and also a novel non iterative approximation using Lagrange multipliers. The newly proposed non iterative solutions greatly decrease computational time, common bottlenecks when processing high-throughput data. The MLML2R package is flexible as it takes as input both, preprocessed intensities from Infinium Methylation arrays and counts from Next Generation Sequencing technologies. The MLML2R package is freely available at https://CRAN.R-project.org/package=MLML2R.</p>","PeriodicalId":49477,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":null,"pages":null},"PeriodicalIF":0.9000,"publicationDate":"2019-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/sagmb-2018-0031","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistical Applications in Genetics and Molecular Biology","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1515/sagmb-2018-0031","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 10

Abstract

Accurately measuring epigenetic marks such as 5-methylcytosine (5-mC) and 5-hydroxymethylcytosine (5-hmC) at the single-nucleotide level, requires combining data from DNA processing methods including traditional (BS), oxidative (oxBS) or Tet-Assisted (TAB) bisulfite conversion. We introduce the R package MLML2R, which provides maximum likelihood estimates (MLE) of 5-mC and 5-hmC proportions. While all other available R packages provide 5-mC and 5-hmC MLEs only for the oxBS+BS combination, MLML2R also provides MLE for TAB combinations. For combinations of any two of the methods, we derived the pool-adjacent-violators algorithm (PAVA) exact constrained MLE in analytical form. For the three methods combination, we implemented both the iterative method by Qu et al. [Qu, J., M. Zhou, Q. Song, E. E. Hong and A. D. Smith (2013): "Mlml: consistent simultaneous estimates of dna methylation and hydroxymethylation," Bioinformatics, 29, 2645-2646.], and also a novel non iterative approximation using Lagrange multipliers. The newly proposed non iterative solutions greatly decrease computational time, common bottlenecks when processing high-throughput data. The MLML2R package is flexible as it takes as input both, preprocessed intensities from Infinium Methylation arrays and counts from Next Generation Sequencing technologies. The MLML2R package is freely available at https://CRAN.R-project.org/package=MLML2R.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
MLML2R:一个R包DNA甲基化和羟甲基化比例的最大似然估计。
在单核苷酸水平上精确测量5-甲基胞嘧啶(5-mC)和5-羟甲基胞嘧啶(5-hmC)等表观遗传标记,需要结合DNA处理方法的数据,包括传统(BS),氧化(oxBS)或et辅助(TAB)亚硫酸氢盐转化。我们介绍了R包MLML2R,它提供了5-mC和5-hmC比例的最大似然估计(MLE)。虽然所有其他可用的R包仅为oxBS+BS组合提供5-mC和5-hmC MLE,但MLML2R还为TAB组合提供了MLE。对于任意两种方法的组合,我们以解析形式导出了池邻接违反者算法(PAVA)的精确约束MLE。对于这三种方法的组合,我们实现了Qu等人的迭代方法[Qu, J, M. Zhou, Q. Song, E. E. Hong and A. D. Smith(2013):“Mlml: dna甲基化和羟甲基化的一致同时估计”,生物信息学,29,2645-2646。],以及使用拉格朗日乘法器的一种新颖的非迭代近似。新提出的非迭代解决方案大大减少了处理高吞吐量数据时常见的计算时间瓶颈。MLML2R封装是灵活的,因为它需要输入,来自Infinium甲基化阵列的预处理强度和来自下一代测序技术的计数。MLML2R包可在https://CRAN.R-project.org/package=MLML2R免费获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
1.20
自引率
11.10%
发文量
8
审稿时长
6-12 weeks
期刊介绍: Statistical Applications in Genetics and Molecular Biology seeks to publish significant research on the application of statistical ideas to problems arising from computational biology. The focus of the papers should be on the relevant statistical issues but should contain a succinct description of the relevant biological problem being considered. The range of topics is wide and will include topics such as linkage mapping, association studies, gene finding and sequence alignment, protein structure prediction, design and analysis of microarray data, molecular evolution and phylogenetic trees, DNA topology, and data base search strategies. Both original research and review articles will be warmly received.
期刊最新文献
Empirically adjusted fixed-effects meta-analysis methods in genomic studies. A CNN-CBAM-BIGRU model for protein function prediction. A heavy-tailed model for analyzing miRNA-seq raw read counts. Flexible model-based non-negative matrix factorization with application to mutational signatures. Choice of baseline hazards in joint modeling of longitudinal and time-to-event cancer survival data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1