首页 > 最新文献

Statistics Surveys最新文献

英文 中文
Variable selection methods for model-based clustering 基于模型聚类的变量选择方法
IF 3.3 Q1 STATISTICS & PROBABILITY Pub Date : 2017-07-02 DOI: 10.1214/18-SS119
Michael Fop, T. B. Murphy
Model-based clustering is a popular approach for clustering multivariate data which has seen applications in numerous fields. Nowadays, high-dimensional data are more and more common and the model-based clustering approach has adapted to deal with the increasing dimensionality. In particular, the development of variable selection techniques has received a lot of attention and research effort in recent years. Even for small size problems, variable selection has been advocated to facilitate the interpretation of the clustering results. This review provides a summary of the methods developed for variable selection in model-based clustering. Existing R packages implementing the different methods are indicated and illustrated in application to two data analysis examples.
基于模型的聚类是一种流行的多变量数据聚类方法,在许多领域都有应用。现如今,高维数据越来越普遍,基于模型的聚类方法已经适应了高维数据的处理。特别是近年来,变量选择技术的发展受到了广泛的关注和研究。即使是小规模的问题,变量选择也一直被提倡,以方便对聚类结果的解释。本文综述了基于模型的聚类中变量选择的方法。现有的R包实现了不同的方法,并在两个数据分析实例中进行了说明和应用。
{"title":"Variable selection methods for model-based clustering","authors":"Michael Fop, T. B. Murphy","doi":"10.1214/18-SS119","DOIUrl":"https://doi.org/10.1214/18-SS119","url":null,"abstract":"Model-based clustering is a popular approach for clustering multivariate data which has seen applications in numerous fields. Nowadays, high-dimensional data are more and more common and the model-based clustering approach has adapted to deal with the increasing dimensionality. In particular, the development of variable selection techniques has received a lot of attention and research effort in recent years. Even for small size problems, variable selection has been advocated to facilitate the interpretation of the clustering results. This review provides a summary of the methods developed for variable selection in model-based clustering. Existing R packages implementing the different methods are indicated and illustrated in application to two data analysis examples.","PeriodicalId":46627,"journal":{"name":"Statistics Surveys","volume":"210 1","pages":"18-65"},"PeriodicalIF":3.3,"publicationDate":"2017-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76116167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 73
A survey of bootstrap methods in finite population sampling 有限总体抽样中的自举方法综述
IF 3.3 Q1 STATISTICS & PROBABILITY Pub Date : 2016-01-01 DOI: 10.1214/16-SS113
Z. Mashreghi, D. Haziza, C. Léger
{"title":"A survey of bootstrap methods in finite population sampling","authors":"Z. Mashreghi, D. Haziza, C. Léger","doi":"10.1214/16-SS113","DOIUrl":"https://doi.org/10.1214/16-SS113","url":null,"abstract":"","PeriodicalId":46627,"journal":{"name":"Statistics Surveys","volume":"12 1","pages":"1-52"},"PeriodicalIF":3.3,"publicationDate":"2016-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73260692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 44
Measuring multivariate association and beyond. 测量多变量关联及其他。
IF 3.3 Q1 STATISTICS & PROBABILITY Pub Date : 2016-01-01 Epub Date: 2016-11-17 DOI: 10.1214/16-SS116
Julie Josse, Susan Holmes

Simple correlation coefficients between two variables have been generalized to measure association between two matrices in many ways. Coefficients such as the RV coefficient, the distance covariance (dCov) coefficient and kernel based coefficients are being used by different research communities. Scientists use these coefficients to test whether two random vectors are linked. Once it has been ascertained that there is such association through testing, then a next step, often ignored, is to explore and uncover the association's underlying patterns. This article provides a survey of various measures of dependence between random vectors and tests of independence and emphasizes the connections and differences between the various approaches. After providing definitions of the coefficients and associated tests, we present the recent improvements that enhance their statistical properties and ease of interpretation. We summarize multi-table approaches and provide scenarii where the indices can provide useful summaries of heterogeneous multi-block data. We illustrate these different strategies on several examples of real data and suggest directions for future research.

两个变量之间的简单相关系数在许多方面被推广到度量两个矩阵之间的关联。RV系数、距离协方差(distance covariance, dCov)系数和基于核的系数等系数被不同的研究界广泛使用。科学家们用这些系数来测试两个随机向量是否相连。一旦通过测试确定了存在这样的关联,那么下一步(通常被忽略)就是探索和揭示关联的潜在模式。本文概述了随机向量之间的各种依赖性度量和独立性检验,并强调了各种方法之间的联系和区别。在提供了系数的定义和相关的测试之后,我们提出了最近的改进,增强了它们的统计特性和易于解释。我们总结了多表方法,并提供了索引可以为异构多块数据提供有用摘要的场景。我们用几个真实数据的例子说明了这些不同的策略,并提出了未来研究的方向。
{"title":"Measuring multivariate association and beyond.","authors":"Julie Josse,&nbsp;Susan Holmes","doi":"10.1214/16-SS116","DOIUrl":"https://doi.org/10.1214/16-SS116","url":null,"abstract":"<p><p>Simple correlation coefficients between two variables have been generalized to measure association between two matrices in many ways. Coefficients such as the RV coefficient, the distance covariance (dCov) coefficient and kernel based coefficients are being used by different research communities. Scientists use these coefficients to test whether two random vectors are linked. Once it has been ascertained that there is such association through testing, then a next step, often ignored, is to explore and uncover the association's underlying patterns. This article provides a survey of various measures of dependence between random vectors and tests of independence and emphasizes the connections and differences between the various approaches. After providing definitions of the coefficients and associated tests, we present the recent improvements that enhance their statistical properties and ease of interpretation. We summarize multi-table approaches and provide scenarii where the indices can provide useful summaries of heterogeneous multi-block data. We illustrate these different strategies on several examples of real data and suggest directions for future research.</p>","PeriodicalId":46627,"journal":{"name":"Statistics Surveys","volume":"10 ","pages":"132-167"},"PeriodicalIF":3.3,"publicationDate":"2016-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1214/16-SS116","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35553938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 71
Some models and methods for the analysis of observational data 一些观测资料分析的模式和方法
IF 3.3 Q1 STATISTICS & PROBABILITY Pub Date : 2015-01-01 DOI: 10.1214/15-SS110
J. A. Ferreira
{"title":"Some models and methods for the analysis of observational data","authors":"J. A. Ferreira","doi":"10.1214/15-SS110","DOIUrl":"https://doi.org/10.1214/15-SS110","url":null,"abstract":"","PeriodicalId":46627,"journal":{"name":"Statistics Surveys","volume":"214 1","pages":"106-208"},"PeriodicalIF":3.3,"publicationDate":"2015-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75584060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Semi-Parametric Estimation for Conditional Independence Multivariate Finite Mixture Models 条件无关多元有限混合模型的半参数估计
IF 3.3 Q1 STATISTICS & PROBABILITY Pub Date : 2015-01-01 DOI: 10.1214/15-SS108
D. Chauveau, D. Hunter, M. Levine
The conditional independence assumption for nonparametric multivariate finite mixture models, a weaker form of the well-known conditional independence assumption for random effects models for longitudinal data, is the subject of an increasing number of theoretical and algorithmic developments in the statistical literature. After presenting a survey of this literature, including an in-depth discussion of the all-important identifiability results, this article describes and extends an algorithm for estimation of the parameters in these models. The algorithm works for any number of components in three or more dimensions. It possesses a descent property and can be easily adapted to situations where the data are grouped in blocks of conditionally independent variables. We discuss how to adapt this algorithm to various location-scale models that link component densities, and we even adapt it to a particular class of univariate mixture problems in which the components are assumed symmetric. We give a bandwidth selection procedure for our algorithm. Finally, we demonstrate the effectiveness of our algorithm using a simulation study and two psychometric datasets.
非参数多元有限混合模型的条件独立假设,是众所周知的纵向数据随机效应模型条件独立假设的较弱形式,是统计文献中越来越多的理论和算法发展的主题。在对这些文献进行综述之后,包括对所有重要的可辨识性结果的深入讨论,本文描述并扩展了用于估计这些模型中参数的算法。该算法适用于三维或三维以上的任意数量的组件。它具有下降属性,可以很容易地适应数据分组在条件独立变量块中的情况。我们讨论了如何使该算法适用于连接组件密度的各种位置尺度模型,我们甚至将其适用于假设组件对称的单变量混合问题的特定类别。给出了算法的带宽选择过程。最后,我们使用模拟研究和两个心理测量数据集证明了算法的有效性。
{"title":"Semi-Parametric Estimation for Conditional Independence Multivariate Finite Mixture Models","authors":"D. Chauveau, D. Hunter, M. Levine","doi":"10.1214/15-SS108","DOIUrl":"https://doi.org/10.1214/15-SS108","url":null,"abstract":"The conditional independence assumption for nonparametric multivariate finite mixture models, a weaker form of the well-known conditional independence assumption for random effects models for longitudinal data, is the subject of an increasing number of theoretical and algorithmic developments in the statistical literature. After presenting a survey of this literature, including an in-depth discussion of the all-important identifiability results, this article describes and extends an algorithm for estimation of the parameters in these models. The algorithm works for any number of components in three or more dimensions. It possesses a descent property and can be easily adapted to situations where the data are grouped in blocks of conditionally independent variables. We discuss how to adapt this algorithm to various location-scale models that link component densities, and we even adapt it to a particular class of univariate mixture problems in which the components are assumed symmetric. We give a bandwidth selection procedure for our algorithm. Finally, we demonstrate the effectiveness of our algorithm using a simulation study and two psychometric datasets.","PeriodicalId":46627,"journal":{"name":"Statistics Surveys","volume":"1 1","pages":"1-31"},"PeriodicalIF":3.3,"publicationDate":"2015-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82153457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
A comparison of spatial predictors when datasets could be very large 当数据集可能非常大时,空间预测因子的比较
IF 3.3 Q1 STATISTICS & PROBABILITY Pub Date : 2014-10-28 DOI: 10.1214/16-SS115
J. Bradley, N. Cressie, Tao Shi
In this article, we review and compare a number of methods of spatial prediction. To demonstrate the breadth of available choices, we consider both traditional and more-recently-introduced spatial predictors. Specifically, in our exposition we review: traditional stationary kriging, smoothing splines, negative-exponential distance-weighting, Fixed Rank Kriging, modified predictive processes, a stochastic partial differential equation approach, and lattice kriging. This comparison is meant to provide a service to practitioners wishing to decide between spatial predictors. Hence, we provide technical material for the unfamiliar, which includes the definition and motivation for each (deterministic and stochastic) spatial predictor. We use a benchmark dataset of $mathrm{CO}_{2}$ data from NASA's AIRS instrument to address computational efficiencies that include CPU time and memory usage. Furthermore, the predictive performance of each spatial predictor is assessed empirically using a hold-out subset of the AIRS data.
在本文中,我们回顾和比较了一些空间预测的方法。为了展示可用选择的广度,我们考虑了传统的和最近引入的空间预测因子。具体来说,在我们的阐述中,我们回顾了:传统的平稳克里格,平滑样条,负指数距离加权,固定秩克里格,修正预测过程,随机偏微分方程方法和晶格克里格。这种比较旨在为希望在空间预测器之间做出决定的从业者提供服务。因此,我们为不熟悉的人提供技术材料,其中包括每个(确定性和随机)空间预测器的定义和动机。我们使用来自NASA AIRS仪器的$ mathm {CO}_{2}$数据的基准数据集来解决包括CPU时间和内存使用在内的计算效率问题。此外,使用AIRS数据的保留子集对每个空间预测器的预测性能进行了经验评估。
{"title":"A comparison of spatial predictors when datasets could be very large","authors":"J. Bradley, N. Cressie, Tao Shi","doi":"10.1214/16-SS115","DOIUrl":"https://doi.org/10.1214/16-SS115","url":null,"abstract":"In this article, we review and compare a number of methods of spatial prediction. To demonstrate the breadth of available choices, we consider both traditional and more-recently-introduced spatial predictors. Specifically, in our exposition we review: traditional stationary kriging, smoothing splines, negative-exponential distance-weighting, Fixed Rank Kriging, modified predictive processes, a stochastic partial differential equation approach, and lattice kriging. This comparison is meant to provide a service to practitioners wishing to decide between spatial predictors. Hence, we provide technical material for the unfamiliar, which includes the definition and motivation for each (deterministic and stochastic) spatial predictor. We use a benchmark dataset of $mathrm{CO}_{2}$ data from NASA's AIRS instrument to address computational efficiencies that include CPU time and memory usage. Furthermore, the predictive performance of each spatial predictor is assessed empirically using a hold-out subset of the AIRS data.","PeriodicalId":46627,"journal":{"name":"Statistics Surveys","volume":"9 1","pages":"100-131"},"PeriodicalIF":3.3,"publicationDate":"2014-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81924992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 55
Log-Concavity and Strong Log-Concavity: a review. 对数凹性与强对数凹性综述。
IF 3.3 Q1 STATISTICS & PROBABILITY Pub Date : 2014-01-01 Epub Date: 2014-12-09 DOI: 10.1214/14-SS107
Adrien Saumard, Jon A Wellner

We review and formulate results concerning log-concavity and strong-log-concavity in both discrete and continuous settings. We show how preservation of log-concavity and strongly log-concavity on ℝ under convolution follows from a fundamental monotonicity result of Efron (1969). We provide a new proof of Efron's theorem using the recent asymmetric Brascamp-Lieb inequality due to Otto and Menz (2013). Along the way we review connections between log-concavity and other areas of mathematics and statistics, including concentration of measure, log-Sobolev inequalities, convex geometry, MCMC algorithms, Laplace approximations, and machine learning.

我们回顾并表述了离散和连续两种情况下关于对数凹性和强对数凹性的结果。我们展示了如何从Efron(1969)的一个基本单调性结果出发,在卷积条件下保持对数凹性和强对数凹性。我们使用Otto和Menz(2013)最近提出的不对称Brascamp-Lieb不等式提供了Efron定理的新证明。在此过程中,我们回顾了对数凹性与其他数学和统计学领域之间的联系,包括测度的集中、对数-索博列夫不等式、凸几何、MCMC算法、拉普拉斯近似和机器学习。
{"title":"Log-Concavity and Strong Log-Concavity: a review.","authors":"Adrien Saumard,&nbsp;Jon A Wellner","doi":"10.1214/14-SS107","DOIUrl":"https://doi.org/10.1214/14-SS107","url":null,"abstract":"<p><p>We review and formulate results concerning log-concavity and strong-log-concavity in both discrete and continuous settings. We show how preservation of log-concavity and strongly log-concavity on ℝ under convolution follows from a fundamental monotonicity result of Efron (1969). We provide a new proof of Efron's theorem using the recent asymmetric Brascamp-Lieb inequality due to Otto and Menz (2013). Along the way we review connections between log-concavity and other areas of mathematics and statistics, including concentration of measure, log-Sobolev inequalities, convex geometry, MCMC algorithms, Laplace approximations, and machine learning.</p>","PeriodicalId":46627,"journal":{"name":"Statistics Surveys","volume":"8 ","pages":"45-114"},"PeriodicalIF":3.3,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1214/14-SS107","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"34446771","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 237
Adaptive clinical trial designs for phase I cancer studies 一期癌症研究的适应性临床试验设计
IF 3.3 Q1 STATISTICS & PROBABILITY Pub Date : 2014-01-01 DOI: 10.1214/14-SS106
O. Sverdlov, W. Wong, Y. Ryeznik
Adaptive clinical trials are becoming increasingly popular research designs for clinical investigation. Adaptive designs are particularly useful in phase I cancer studies where clinical data are scant and the goals are to assess the drug dose-toxicity profile and to determine the maximum tolerated dose while minimizing the number of study patients treated at suboptimal dose levels. In the current work we give an overview of adaptive design methods for phase I cancer trials. We find that modern statistical literature is replete with novel adaptive designs that have clearly defined objectives and established statistical properties, and are shown to outperform conventional dose finding methods such as the 3+3 design, both in terms of statistical efficiency and in terms of minimizing the number of patients treated at highly toxic or nonefficacious doses. We discuss statistical, logistical, and regulatory aspects of these designs and present some links to non-commercial statistical software for implementing these methods in practice. MSC 2010 subject classifications: Primary 62L05, 62L10, 62L12; secondary 62L20.
适应性临床试验已成为越来越流行的临床研究设计。适应性设计在临床数据不足的I期癌症研究中特别有用,目的是评估药物剂量-毒性概况,确定最大耐受剂量,同时尽量减少在次优剂量水平下治疗的研究患者人数。在目前的工作中,我们概述了一期癌症试验的适应性设计方法。我们发现,现代统计文献中充满了新颖的适应性设计,这些设计具有明确定义的目标和已建立的统计特性,并且在统计效率和最小化高毒性或无效剂量治疗的患者数量方面都优于传统的剂量查找方法,如3+3设计。我们讨论了这些设计的统计、后勤和管理方面,并提供了一些在实践中实现这些方法的非商业统计软件的链接。MSC 2010学科分类:初级62L05, 62L10, 62L12;二次62活用。
{"title":"Adaptive clinical trial designs for phase I cancer studies","authors":"O. Sverdlov, W. Wong, Y. Ryeznik","doi":"10.1214/14-SS106","DOIUrl":"https://doi.org/10.1214/14-SS106","url":null,"abstract":"Adaptive clinical trials are becoming increasingly popular research designs for clinical investigation. Adaptive designs are particularly useful in phase I cancer studies where clinical data are scant and the goals are to assess the drug dose-toxicity profile and to determine the maximum tolerated dose while minimizing the number of study patients treated at suboptimal dose levels. In the current work we give an overview of adaptive design methods for phase I cancer trials. We find that modern statistical literature is replete with novel adaptive designs that have clearly defined objectives and established statistical properties, and are shown to outperform conventional dose finding methods such as the 3+3 design, both in terms of statistical efficiency and in terms of minimizing the number of patients treated at highly toxic or nonefficacious doses. We discuss statistical, logistical, and regulatory aspects of these designs and present some links to non-commercial statistical software for implementing these methods in practice. MSC 2010 subject classifications: Primary 62L05, 62L10, 62L12; secondary 62L20.","PeriodicalId":46627,"journal":{"name":"Statistics Surveys","volume":"154 1","pages":"2-44"},"PeriodicalIF":3.3,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77126913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
$M$-functionals of multivariate scatter $M$-多元散射的泛函
IF 3.3 Q1 STATISTICS & PROBABILITY Pub Date : 2013-12-19 DOI: 10.1214/15-SS109
L. Duembgen, Markus Pauly, T. Schweizer
This survey provides a self-contained account of M-estimation of multivariate scatter. In particular, we present new proofs for existence of the underlying M-functionals and discuss their weak continuity and differentiability. This is done in a rather general framework with matrix-valued random variables. By doing so we reveal a connection between Tyler's (1987) M-functional of scatter and the estimation of proportional covariance matrices. Moreover, this general framework allows us to treat a new class of scatter estimators, based on symmetrizations of arbitrary order. Finally these results are applied to M-estimation of multivariate location and scatter via multivariate t-distributions.
这项调查提供了一个独立的多变量散点的m估计的帐户。特别地,我们给出了新的m泛函存在性的证明,并讨论了它们的弱连续性和可微性。这是在具有矩阵值随机变量的相当一般的框架中完成的。通过这样做,我们揭示了泰勒(1987)散点的m泛函和比例协方差矩阵的估计之间的联系。此外,这个一般框架允许我们处理一类新的基于任意阶对称的散射估计。最后,将这些结果应用于多变量位置的m估计,并通过多变量t分布进行分散。
{"title":"$M$-functionals of multivariate scatter","authors":"L. Duembgen, Markus Pauly, T. Schweizer","doi":"10.1214/15-SS109","DOIUrl":"https://doi.org/10.1214/15-SS109","url":null,"abstract":"This survey provides a self-contained account of M-estimation of multivariate scatter. In particular, we present new proofs for existence of the underlying M-functionals and discuss their weak continuity and differentiability. This is done in a rather general framework with matrix-valued random variables. By doing so we reveal a connection between Tyler's (1987) M-functional of scatter and the estimation of proportional covariance matrices. Moreover, this general framework allows us to treat a new class of scatter estimators, based on symmetrizations of arbitrary order. Finally these results are applied to M-estimation of multivariate location and scatter via multivariate t-distributions.","PeriodicalId":46627,"journal":{"name":"Statistics Surveys","volume":"49 1","pages":"32-105"},"PeriodicalIF":3.3,"publicationDate":"2013-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76269753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
The implementation of cross-sectional weights in household panel surveys 横截面权值在住户小组调查中的应用
IF 3.3 Q1 STATISTICS & PROBABILITY Pub Date : 2013-01-01 DOI: 10.1214/13-SS104
Matthias Schonlau, M. Kroh, N. Watson
While household panel surveys are longitudinal in nature crosssectional sampling weights are also of interest. The computation of crosssectional weights is challenging because household compositions change over time. Sampling probabilities of household entrants after wave 1 are generally not known and assigning them zero weight is not satisfying. Two common approaches to cross-sectional weighting address this issue: (1) “shared weights” and (2) modeling or estimating unobserved sampling probabilities based on person-level characteristics. We survey how several well-known national household panels address cross-sectional weights for different groups of respondents (including immigrants and births) and in different situations (including household mergers and splits). When a new person moves into a household, both “shared weights” and “modeling” lead to reduced individual weights of pre-existing household members, but differences due to the approach arise elsewhere. The implementation of “shared weights” is problematic when the panel contains households without a household member already present in wave 1. Panels also differ in the treatment of immigrants, household merges, and sometimes on how weights are assigned to children born to wave 1 panel members.
虽然住户小组调查本质上是纵向的,但横截面抽样权重也令人感兴趣。横截面权重的计算具有挑战性,因为家庭组成随时间而变化。第一波之后住户进入的抽样概率通常是未知的,给它们赋零权重是不令人满意的。横截面加权的两种常见方法解决了这个问题:(1)“共享权重”和(2)基于个人水平特征建模或估计未观察到的抽样概率。我们调查了几个知名的全国家庭小组如何处理不同受访者群体(包括移民和出生)和不同情况(包括家庭合并和分裂)的横截面权重。当一个新人搬进一个家庭时,“共享权重”和“建模”都会导致现有家庭成员的个体权重降低,但由于这种方法而产生的差异会在其他地方出现。当面板中包含没有家庭成员的家庭时,“共享权重”的实现是有问题的。小组在对待移民、家庭合并、有时如何分配第一波小组成员所生孩子的权重等方面也存在差异。
{"title":"The implementation of cross-sectional weights in household panel surveys","authors":"Matthias Schonlau, M. Kroh, N. Watson","doi":"10.1214/13-SS104","DOIUrl":"https://doi.org/10.1214/13-SS104","url":null,"abstract":"While household panel surveys are longitudinal in nature crosssectional sampling weights are also of interest. The computation of crosssectional weights is challenging because household compositions change over time. Sampling probabilities of household entrants after wave 1 are generally not known and assigning them zero weight is not satisfying. Two common approaches to cross-sectional weighting address this issue: (1) “shared weights” and (2) modeling or estimating unobserved sampling probabilities based on person-level characteristics. We survey how several well-known national household panels address cross-sectional weights for different groups of respondents (including immigrants and births) and in different situations (including household mergers and splits). When a new person moves into a household, both “shared weights” and “modeling” lead to reduced individual weights of pre-existing household members, but differences due to the approach arise elsewhere. The implementation of “shared weights” is problematic when the panel contains households without a household member already present in wave 1. Panels also differ in the treatment of immigrants, household merges, and sometimes on how weights are assigned to children born to wave 1 panel members.","PeriodicalId":46627,"journal":{"name":"Statistics Surveys","volume":" 10","pages":"37-57"},"PeriodicalIF":3.3,"publicationDate":"2013-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1214/13-SS104","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72385387","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
期刊
Statistics Surveys
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1