小区域贫困指标估计的多元混合模型

IF 1.5 3区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS Journal of the Royal Statistical Society Series A-Statistics in Society Pub Date : 2022-12-12 DOI:10.1111/rssa.12965

Agne Bikauskaite, Isabel Molina, Domingo Morales

{"title":"小区域贫困指标估计的多元混合模型","authors":"Agne Bikauskaite, Isabel Molina, Domingo Morales","doi":"10.1111/rssa.12965","DOIUrl":null,"url":null,"abstract":"<p>When disaggregation of national estimates in several domains or areas is required, direct survey estimators, which use only the domain-specific survey data, are usually design-unbiased even under complex survey designs (at least approximately) and require no model assumptions. Nevertheless, they are appropriate only for domains or areas with sufficiently large sample size. For example, when estimating poverty in a domain with a small sample size (small area), the volatility of a direct estimator might make that area seems like very poor in one period and very rich in the next one. Small area (or indirect) estimators have been developed in order to avoid such undesired instability. Small area estimators borrow strength from the other areas so as to improve the precision and therefore obtain much more stable estimators. However, the usual model-based assumptions, which include some kind of area homogeneity, may not hold in real applications. A more flexible model based on multivariate mixtures of normal distributions that generalises the usual nested error linear regression model is proposed for estimation of general parameters in small areas. This flexibility makes the model adaptable to more general situations, where there may be areas with a different behaviour from the other ones, making the model less restrictive (hence, more close to nonparametric) and more robust to outlying areas. An expectation-maximisation (E-M) method is designed for fitting the proposed mixture model. Under the proposed mixture model, two different new predictors of general small area indicators are proposed. A parametric bootstrap method is used to estimate the mean squared errors of the proposed predictors. Small sample properties of the new predictors and of the bootstrap procedure are analysed by simulation studies and the new methodology is illustrated with an application to poverty mapping in Palestine.</p>","PeriodicalId":49983,"journal":{"name":"Journal of the Royal Statistical Society Series A-Statistics in Society","volume":"185 S2","pages":"S724-S755"},"PeriodicalIF":1.5000,"publicationDate":"2022-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://rss.onlinelibrary.wiley.com/doi/epdf/10.1111/rssa.12965","citationCount":"1","resultStr":"{\"title\":\"Multivariate mixture model for small area estimation of poverty indicators\",\"authors\":\"Agne Bikauskaite, Isabel Molina, Domingo Morales\",\"doi\":\"10.1111/rssa.12965\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>When disaggregation of national estimates in several domains or areas is required, direct survey estimators, which use only the domain-specific survey data, are usually design-unbiased even under complex survey designs (at least approximately) and require no model assumptions. Nevertheless, they are appropriate only for domains or areas with sufficiently large sample size. For example, when estimating poverty in a domain with a small sample size (small area), the volatility of a direct estimator might make that area seems like very poor in one period and very rich in the next one. Small area (or indirect) estimators have been developed in order to avoid such undesired instability. Small area estimators borrow strength from the other areas so as to improve the precision and therefore obtain much more stable estimators. However, the usual model-based assumptions, which include some kind of area homogeneity, may not hold in real applications. A more flexible model based on multivariate mixtures of normal distributions that generalises the usual nested error linear regression model is proposed for estimation of general parameters in small areas. This flexibility makes the model adaptable to more general situations, where there may be areas with a different behaviour from the other ones, making the model less restrictive (hence, more close to nonparametric) and more robust to outlying areas. An expectation-maximisation (E-M) method is designed for fitting the proposed mixture model. Under the proposed mixture model, two different new predictors of general small area indicators are proposed. A parametric bootstrap method is used to estimate the mean squared errors of the proposed predictors. Small sample properties of the new predictors and of the bootstrap procedure are analysed by simulation studies and the new methodology is illustrated with an application to poverty mapping in Palestine.</p>\",\"PeriodicalId\":49983,\"journal\":{\"name\":\"Journal of the Royal Statistical Society Series A-Statistics in Society\",\"volume\":\"185 S2\",\"pages\":\"S724-S755\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2022-12-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://rss.onlinelibrary.wiley.com/doi/epdf/10.1111/rssa.12965\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the Royal Statistical Society Series A-Statistics in Society\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/rssa.12965\",\"RegionNum\":3,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"SOCIAL SCIENCES, MATHEMATICAL METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Royal Statistical Society Series A-Statistics in Society","FirstCategoryId":"100","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/rssa.12965","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"SOCIAL SCIENCES, MATHEMATICAL METHODS","Score":null,"Total":0}

引用次数: 1

摘要

当需要对几个领域或地区的国家估计进行分解时，仅使用特定领域的调查数据的直接调查估计器通常是设计无偏的，即使在复杂的调查设计下(至少近似地)，也不需要模型假设。然而，它们仅适用于具有足够大样本量的域或区域。例如，当估计一个小样本量(小区域)领域的贫困时，直接估计器的波动性可能会使该地区在一个时期看起来非常贫穷，而在下一个时期看起来非常富有。为了避免这种不稳定，已经开发了小面积(或间接)估计器。小区域估计器从其他区域中汲取力量，从而提高精度，从而获得更稳定的估计器。然而，通常的基于模型的假设，包括某种面积同质性，在实际应用中可能不成立。提出了一种基于多元正态分布混合的更灵活的模型，推广了常用的嵌套误差线性回归模型，用于小范围内一般参数的估计。这种灵活性使模型适应于更一般的情况，其中可能存在与其他区域具有不同行为的区域，使模型限制更少(因此，更接近非参数)，并且对外围区域更健壮。设计了一种期望最大化(E-M)方法来拟合所提出的混合模型。在该混合模型下，提出了两种不同的小面积综合指标的新预测因子。采用参数自举法估计所提预测器的均方误差。通过模拟研究分析了新预测器和自举程序的小样本特性，并通过在巴勒斯坦贫困制图中的应用说明了新方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Multivariate mixture model for small area estimation of poverty indicators

When disaggregation of national estimates in several domains or areas is required, direct survey estimators, which use only the domain-specific survey data, are usually design-unbiased even under complex survey designs (at least approximately) and require no model assumptions. Nevertheless, they are appropriate only for domains or areas with sufficiently large sample size. For example, when estimating poverty in a domain with a small sample size (small area), the volatility of a direct estimator might make that area seems like very poor in one period and very rich in the next one. Small area (or indirect) estimators have been developed in order to avoid such undesired instability. Small area estimators borrow strength from the other areas so as to improve the precision and therefore obtain much more stable estimators. However, the usual model-based assumptions, which include some kind of area homogeneity, may not hold in real applications. A more flexible model based on multivariate mixtures of normal distributions that generalises the usual nested error linear regression model is proposed for estimation of general parameters in small areas. This flexibility makes the model adaptable to more general situations, where there may be areas with a different behaviour from the other ones, making the model less restrictive (hence, more close to nonparametric) and more robust to outlying areas. An expectation-maximisation (E-M) method is designed for fitting the proposed mixture model. Under the proposed mixture model, two different new predictors of general small area indicators are proposed. A parametric bootstrap method is used to estimate the mean squared errors of the proposed predictors. Small sample properties of the new predictors and of the bootstrap procedure are analysed by simulation studies and the new methodology is illustrated with an application to poverty mapping in Palestine.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of the Royal Statistical Society Series A-Statistics in Society 数学-统计学与概率论

CiteScore

2.90

自引率

5.00%

发文量

136

审稿时长

>12 weeks

期刊介绍： Series A (Statistics in Society) publishes high quality papers that demonstrate how statistical thinking, design and analyses play a vital role in all walks of life and benefit society in general. There is no restriction on subject-matter: any interesting, topical and revelatory applications of statistics are welcome. For example, important applications of statistical and related data science methodology in medicine, business and commerce, industry, economics and finance, education and teaching, physical and biomedical sciences, the environment, the law, government and politics, demography, psychology, sociology and sport all fall within the journal''s remit. The journal is therefore aimed at a wide statistical audience and at professional statisticians in particular. Its emphasis is on well-written and clearly reasoned quantitative approaches to problems in the real world rather than the exposition of technical detail. Thus, although the methodological basis of papers must be sound and adequately explained, methodology per se should not be the main focus of a Series A paper. Of particular interest are papers on topical or contentious statistical issues, papers which give reviews or exposés of current statistical concerns and papers which demonstrate how appropriate statistical thinking has contributed to our understanding of important substantive questions. Historical, professional and biographical contributions are also welcome, as are discussions of methods of data collection and of ethical issues, provided that all such papers have substantial statistical relevance.