A comparison of likelihood-based methods for size-biased sampling

IF 0.8 4区数学 Q3 STATISTICS & PROBABILITY Journal of Statistical Planning and Inference Pub Date : 2023-10-13 DOI:10.1016/j.jspi.2023.106115

Victoria L. Leaver , Robert G. Clark , Pavel N. Krivitsky , Carole L. Birrell

{"title":"A comparison of likelihood-based methods for size-biased sampling","authors":"Victoria L. Leaver , Robert G. Clark , Pavel N. Krivitsky , Carole L. Birrell","doi":"10.1016/j.jspi.2023.106115","DOIUrl":null,"url":null,"abstract":"<div>Three likelihood approaches to estimation under informative sampling are compared using a special case for which analytic expressions are possible to derive. An independent and identically distributed population of values of a variable of interest is drawn from a gamma distribution, with the shape parameter and the population size both assumed to be known. The sampling method is selection with probability proportional to a power of the variable with replacement, so that duplicate sample units are possible. Estimators of the unknown parameter, variance estimators and asymptotic variances of the estimators are derived for maximum likelihood, sample likelihood and pseudo-likelihood estimation. Theoretical derivations and simulation results show that the efficiency of the sample likelihood approaches that of full maximum likelihood estimation when the sample size <math><mi>n</mi></math> tends to infinity and the sampling fraction <math><mi>f</mi></math> tends to zero. However, when <math><mi>n</mi></math> tends to infinity and <math><mi>f</mi></math> is not negligible, the maximum likelihood estimator is more efficient than the other methods because it takes the possibility of duplicate sample units into account. Pseudo-likelihood can perform much more poorly than the other methods in some cases. For the special case when the superpopulation is exponential and the selection is probability proportional to size, the anticipated variance of the pseudo-likelihood estimate is infinite.</div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"230 ","pages":"Article 106115"},"PeriodicalIF":0.8000,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0378375823000848/pdfft?md5=34807a0d3caadad51aaee1e2b82b751e&pid=1-s2.0-S0378375823000848-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Statistical Planning and Inference","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378375823000848","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}

引用次数: 0

Abstract

Three likelihood approaches to estimation under informative sampling are compared using a special case for which analytic expressions are possible to derive. An independent and identically distributed population of values of a variable of interest is drawn from a gamma distribution, with the shape parameter and the population size both assumed to be known. The sampling method is selection with probability proportional to a power of the variable with replacement, so that duplicate sample units are possible. Estimators of the unknown parameter, variance estimators and asymptotic variances of the estimators are derived for maximum likelihood, sample likelihood and pseudo-likelihood estimation. Theoretical derivations and simulation results show that the efficiency of the sample likelihood approaches that of full maximum likelihood estimation when the sample size $n$ tends to infinity and the sampling fraction $f$ tends to zero. However, when $n$ tends to infinity and $f$ is not negligible, the maximum likelihood estimator is more efficient than the other methods because it takes the possibility of duplicate sample units into account. Pseudo-likelihood can perform much more poorly than the other methods in some cases. For the special case when the superpopulation is exponential and the selection is probability proportional to size, the anticipated variance of the pseudo-likelihood estimate is infinite.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于可能性的大小偏差抽样方法的比较

在信息抽样下的三种似然估计方法比较了一种可能推导出解析表达式的特殊情况。从伽马分布中绘制出感兴趣的变量值的独立和相同分布的总体，假设形状参数和总体大小都是已知的。抽样方法是选择与替换变量的幂成比例的概率，使重复的样本单位成为可能。给出了最大似然、样本似然和伪似然估计的未知参数估计量、方差估计量和渐近方差。理论推导和仿真结果表明，当样本容量n趋于无穷，采样分数f趋于零时，样本似然估计的效率接近完全极大似然估计的效率。然而，当n趋于无穷大且f不可忽略时，最大似然估计器比其他方法更有效，因为它考虑了重复样本单元的可能性。在某些情况下，伪似然方法的性能可能比其他方法差得多。对于超总体呈指数型且选择与大小成概率比例的特殊情况，拟似然估计的预期方差是无穷大的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Statistical Planning and Inference 数学-统计学与概率论

CiteScore

2.10

自引率

11.10%

发文量

审稿时长

3-6 weeks

期刊介绍： The Journal of Statistical Planning and Inference offers itself as a multifaceted and all-inclusive bridge between classical aspects of statistics and probability, and the emerging interdisciplinary aspects that have a potential of revolutionizing the subject. While we maintain our traditional strength in statistical inference, design, classical probability, and large sample methods, we also have a far more inclusive and broadened scope to keep up with the new problems that confront us as statisticians, mathematicians, and scientists. We publish high quality articles in all branches of statistics, probability, discrete mathematics, machine learning, and bioinformatics. We also especially welcome well written and up to date review articles on fundamental themes of statistics, probability, machine learning, and general biostatistics. Thoughtful letters to the editors, interesting problems in need of a solution, and short notes carrying an element of elegance or beauty are equally welcome.