A selective review of statistical methods using calibration information from similar studies

IF 1.3 Q3 STATISTICS & PROBABILITY Statistical Theory and Related Fields Pub Date : 2022-02-17 DOI:10.1080/24754269.2022.2037201

J. Qin, Yukun Liu, Pengfei Li

{"title":"A selective review of statistical methods using calibration information from similar studies","authors":"J. Qin, Yukun Liu, Pengfei Li","doi":"10.1080/24754269.2022.2037201","DOIUrl":null,"url":null,"abstract":"In the era of big data, divide-and-conquer, parallel, and distributed inference methods have become increasingly popular. How to effectively use the calibration information from each machine in parallel computation has become a challenging task for statisticians and computer scientists. Many newly developed methods have roots in traditional statistical approaches that make use of calibration information. In this paper, we first review some classical statistical methods for using calibration information, including simple meta-analysis methods, parametric likelihood, empirical likelihood, and the generalized method of moments. We further investigate how these methods incorporate summarized or auxiliary information from previous studies, related studies, or populations. We find that the methods based on summarized data usually have little or nearly no efficiency loss compared with the corresponding methods based on all-individual data. Finally, we review some recently developed big data analysis methods including communication-efficient distributed approaches, renewal estimation, and incremental inference as examples of the latest developments in methods using calibration information.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"6 1","pages":"175 - 190"},"PeriodicalIF":1.3000,"publicationDate":"2022-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistical Theory and Related Fields","FirstCategoryId":"96","ListUrlMain":"https://doi.org/10.1080/24754269.2022.2037201","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}

引用次数: 3

Abstract

In the era of big data, divide-and-conquer, parallel, and distributed inference methods have become increasingly popular. How to effectively use the calibration information from each machine in parallel computation has become a challenging task for statisticians and computer scientists. Many newly developed methods have roots in traditional statistical approaches that make use of calibration information. In this paper, we first review some classical statistical methods for using calibration information, including simple meta-analysis methods, parametric likelihood, empirical likelihood, and the generalized method of moments. We further investigate how these methods incorporate summarized or auxiliary information from previous studies, related studies, or populations. We find that the methods based on summarized data usually have little or nearly no efficiency loss compared with the corresponding methods based on all-individual data. Finally, we review some recently developed big data analysis methods including communication-efficient distributed approaches, renewal estimation, and incremental inference as examples of the latest developments in methods using calibration information.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

使用类似研究的校准信息的统计方法的选择性回顾

在大数据时代，分而治之、并行和分布式推理方法越来越流行。如何在并行计算中有效地使用来自每台机器的校准信息已成为统计学家和计算机科学家的一项具有挑战性的任务。许多新开发的方法都源于利用校准信息的传统统计方法。在本文中，我们首先回顾了一些使用校准信息的经典统计方法，包括简单的荟萃分析方法、参数似然、经验似然和广义矩方法。我们进一步研究了这些方法如何结合先前研究、相关研究或人群的总结或辅助信息。我们发现，与基于所有单个数据的相应方法相比，基于汇总数据的方法通常很少或几乎没有效率损失。最后，我们回顾了一些最近开发的大数据分析方法，包括高效通信的分布式方法、更新估计和增量推理，作为使用校准信息的方法的最新发展的例子。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊