Assessment of Required Sample Sizes for Estimating Proportions

Asian Journal of Probability and Statistics Pub Date : 2024-06-15 DOI:10.9734/ajpas/2024/v26i7629

S. Garren, Brooke A. Cleathero

{"title":"Assessment of Required Sample Sizes for Estimating Proportions","authors":"S. Garren, Brooke A. Cleathero","doi":"10.9734/ajpas/2024/v26i7629","DOIUrl":null,"url":null,"abstract":"When estimating a population proportion p within margin of error m, a preliminary sample of size n is taken to produce a preliminary sample proportion y/n, which is then used to determine the required sample size (y/n)(1-y/n)(z/m)2, where z is the critical value for a given level of confidence. The population is assumed to be infinite, so these Bernoulli(p) observations are mutually independent. Upon taking a new sample based on the required sample size, the coverage probabilities on p are determined exactly for various values of m, n, p, and z, using a commonly-used formula for a confidence interval on p. The coverage probabilities tend to be somewhat smaller than their nominal values, and tend to be a lot smaller when np or n(1 - p) is small, which would result in anti-conservative confidence intervals. As a more minor conclusion, since the given margin of error m is not relative to the population proportion p, then the required sample size is larger for values of p nearest to 0.5. The mean and standard deviation of the required sample size are also computed exactly to provide prospective, regarding just how large or how small these required sample sizes need to be.","PeriodicalId":8532,"journal":{"name":"Asian Journal of Probability and Statistics","volume":"8 24","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asian Journal of Probability and Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.9734/ajpas/2024/v26i7629","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

When estimating a population proportion p within margin of error m, a preliminary sample of size n is taken to produce a preliminary sample proportion y/n, which is then used to determine the required sample size (y/n)(1-y/n)(z/m)2, where z is the critical value for a given level of confidence. The population is assumed to be infinite, so these Bernoulli(p) observations are mutually independent. Upon taking a new sample based on the required sample size, the coverage probabilities on p are determined exactly for various values of m, n, p, and z, using a commonly-used formula for a confidence interval on p. The coverage probabilities tend to be somewhat smaller than their nominal values, and tend to be a lot smaller when np or n(1 - p) is small, which would result in anti-conservative confidence intervals. As a more minor conclusion, since the given margin of error m is not relative to the population proportion p, then the required sample size is larger for values of p nearest to 0.5. The mean and standard deviation of the required sample size are also computed exactly to provide prospective, regarding just how large or how small these required sample sizes need to be.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

评估估计比例所需的样本量

在误差范围 m 内估计人口比例 p 时，需要抽取规模为 n 的初步样本，得出初步样本比例 y/n，然后用它来确定所需的样本规模 (y/n)(1-y/n)(z/m)2，其中 z 是给定置信度的临界值。假设总体是无限的，因此这些伯努利（p）观测结果是相互独立的。根据所需的样本量重新抽取样本后，使用常用的 p 置信区间公式，可以精确地确定 m、n、p 和 z 的不同值时 p 的覆盖概率。一个更次要的结论是，由于给定的误差范围 m 不是相对于人口比例 p 而定的，因此当 p 值接近 0.5 时，所需的样本量会更大。我们还精确计算了所需样本量的平均值和标准偏差，以提供关于所需样本量需要多大或多小的前瞻性信息。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Asian Journal of Probability and Statistics

自引率

0.00%

发文量