{"title":"DSLE2 random-effects meta-analysis model for high-throughput methylation data.","authors":"Nan Wang, Yang Zhou, Fengping Zhu, Shuilin Jin","doi":"10.1186/s12864-025-11316-3","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>With the rapid development of high-throughput sequencing technology, high-throughput sequencing data has grown on a massive scale, leading to the emergence of multiple public databases, such as EBI and GEO. Conducting secondary mining of high-throughput sequencing data in these databases can yield more valuable insights. Meta-analysis can quantitatively combine high-throughput sequencing data from the the same topic. It increases the sample size for data analysis, enhances statistical power, and results in more consistent and reliable conclusions.</p><p><strong>Results: </strong>This study proposes a new between-study variance estimator <math><msub><mi>E</mi> <mi>m</mi></msub> </math> . We prove that <math><msub><mi>E</mi> <mi>m</mi></msub> </math> is non-negative and <math> <mrow><msub><mi>E</mi> <mi>m</mi></msub> <mfenced> <msubsup><mover><mi>τ</mi> <mo>^</mo></mover> <mrow><mi>m</mi></mrow> <mn>2</mn></msubsup> </mfenced> </mrow> </math> increases with the increase of <math> <msubsup><mover><mi>τ</mi> <mo>^</mo></mover> <mrow><mi>m</mi></mrow> <mn>2</mn></msubsup> </math> , satisfying the general conditions of the between-study variance estimator. We get the DSLE2 (two-step estimation starting with the DSL estimate and the <math><msub><mi>E</mi> <mi>m</mi></msub> </math> in the second step) random-effects meta-analysis model based on the between-study variance estimator Em. The accuracy and a series of evaluation metrics of the DSLE2 model are better than those of the other 6 meta-analysis models. DSLE2 model is applied to lung cancer and Parkinson's methylation data. Significantly differentially methylated sites identified by DSLE2 model and the genes with significantly differentially methylated sites are closely related to two diseases, indicating the effectiveness of DSLE2 random-effects model.</p><p><strong>Conclusions: </strong>This paper propose the DSLE2 random-effects meta-analysis model based on new between-study variance estimator Em. The DSLE2 model performs well for methylation data.</p>","PeriodicalId":9030,"journal":{"name":"BMC Genomics","volume":"26 1","pages":"219"},"PeriodicalIF":3.7000,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11884006/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12864-025-11316-3","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: With the rapid development of high-throughput sequencing technology, high-throughput sequencing data has grown on a massive scale, leading to the emergence of multiple public databases, such as EBI and GEO. Conducting secondary mining of high-throughput sequencing data in these databases can yield more valuable insights. Meta-analysis can quantitatively combine high-throughput sequencing data from the the same topic. It increases the sample size for data analysis, enhances statistical power, and results in more consistent and reliable conclusions.
Results: This study proposes a new between-study variance estimator . We prove that is non-negative and increases with the increase of , satisfying the general conditions of the between-study variance estimator. We get the DSLE2 (two-step estimation starting with the DSL estimate and the in the second step) random-effects meta-analysis model based on the between-study variance estimator Em. The accuracy and a series of evaluation metrics of the DSLE2 model are better than those of the other 6 meta-analysis models. DSLE2 model is applied to lung cancer and Parkinson's methylation data. Significantly differentially methylated sites identified by DSLE2 model and the genes with significantly differentially methylated sites are closely related to two diseases, indicating the effectiveness of DSLE2 random-effects model.
Conclusions: This paper propose the DSLE2 random-effects meta-analysis model based on new between-study variance estimator Em. The DSLE2 model performs well for methylation data.
期刊介绍:
BMC Genomics is an open access, peer-reviewed journal that considers articles on all aspects of genome-scale analysis, functional genomics, and proteomics.
BMC Genomics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.