Victoria L. Leaver , Robert G. Clark , Pavel N. Krivitsky , Carole L. Birrell
{"title":"A comparison of likelihood-based methods for size-biased sampling","authors":"Victoria L. Leaver , Robert G. Clark , Pavel N. Krivitsky , Carole L. Birrell","doi":"10.1016/j.jspi.2023.106115","DOIUrl":null,"url":null,"abstract":"<div><p>Three likelihood approaches to estimation under informative sampling are compared using a special case for which analytic expressions are possible to derive. An independent and identically distributed population of values of a variable of interest is drawn from a gamma distribution, with the shape parameter and the population size both assumed to be known. The sampling method is selection with probability proportional to a power of the variable with replacement, so that duplicate sample units are possible. Estimators of the unknown parameter, variance estimators and asymptotic variances of the estimators are derived for maximum likelihood, sample likelihood and pseudo-likelihood estimation. Theoretical derivations and simulation results show that the efficiency of the sample likelihood approaches that of full maximum likelihood estimation when the sample size <span><math><mi>n</mi></math></span> tends to infinity and the sampling fraction <span><math><mi>f</mi></math></span> tends to zero. However, when <span><math><mi>n</mi></math></span> tends to infinity and <span><math><mi>f</mi></math></span> is not negligible, the maximum likelihood estimator is more efficient than the other methods because it takes the possibility of duplicate sample units into account. Pseudo-likelihood can perform much more poorly than the other methods in some cases. For the special case when the superpopulation is exponential and the selection is probability proportional to size, the anticipated variance of the pseudo-likelihood estimate is infinite.</p></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"230 ","pages":"Article 106115"},"PeriodicalIF":0.8000,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0378375823000848/pdfft?md5=34807a0d3caadad51aaee1e2b82b751e&pid=1-s2.0-S0378375823000848-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Statistical Planning and Inference","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378375823000848","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
Abstract
Three likelihood approaches to estimation under informative sampling are compared using a special case for which analytic expressions are possible to derive. An independent and identically distributed population of values of a variable of interest is drawn from a gamma distribution, with the shape parameter and the population size both assumed to be known. The sampling method is selection with probability proportional to a power of the variable with replacement, so that duplicate sample units are possible. Estimators of the unknown parameter, variance estimators and asymptotic variances of the estimators are derived for maximum likelihood, sample likelihood and pseudo-likelihood estimation. Theoretical derivations and simulation results show that the efficiency of the sample likelihood approaches that of full maximum likelihood estimation when the sample size tends to infinity and the sampling fraction tends to zero. However, when tends to infinity and is not negligible, the maximum likelihood estimator is more efficient than the other methods because it takes the possibility of duplicate sample units into account. Pseudo-likelihood can perform much more poorly than the other methods in some cases. For the special case when the superpopulation is exponential and the selection is probability proportional to size, the anticipated variance of the pseudo-likelihood estimate is infinite.
期刊介绍:
The Journal of Statistical Planning and Inference offers itself as a multifaceted and all-inclusive bridge between classical aspects of statistics and probability, and the emerging interdisciplinary aspects that have a potential of revolutionizing the subject. While we maintain our traditional strength in statistical inference, design, classical probability, and large sample methods, we also have a far more inclusive and broadened scope to keep up with the new problems that confront us as statisticians, mathematicians, and scientists.
We publish high quality articles in all branches of statistics, probability, discrete mathematics, machine learning, and bioinformatics. We also especially welcome well written and up to date review articles on fundamental themes of statistics, probability, machine learning, and general biostatistics. Thoughtful letters to the editors, interesting problems in need of a solution, and short notes carrying an element of elegance or beauty are equally welcome.