Robust estimation of the population mean using quantile regression under systematic sampling

IF 1.4 3区 社会学 Q3 DEMOGRAPHY Mathematical Population Studies Pub Date : 2022-12-05 DOI:10.1080/08898480.2022.2139072
U. Shahzad, I. Ahmad, N. Al-Noor, M. Hanif, I. Almanjahie
{"title":"Robust estimation of the population mean using quantile regression under systematic sampling","authors":"U. Shahzad, I. Ahmad, N. Al-Noor, M. Hanif, I. Almanjahie","doi":"10.1080/08898480.2022.2139072","DOIUrl":null,"url":null,"abstract":"ABSTRACT Regression ratio mean estimators of a study variable are defined as the coefficients provided by the ordinary least-squares regression of on a given auxiliary variable . They can be improved by using the coefficient of variation and the coefficient of kurtosis of . The influence of outliers on the estimates of the population mean of is neutralized by calculating robust regression coefficients, obtained by the method of either least absolute deviations, Huber-M, Huber-MM, Hampel-M, Tukey-M, or adjusted least squares. These robust coefficients are used to estimate the population mean of under simple random sampling. Extension to systematic sampling—which is a probability sampling in which every element of the population has equal probability of inclusion to be drawn—using the coefficients provided by quantile regression—whose coefficients result from the minimization of the sum of absolute deviations rather than from the square deviations from the regression line—requires ratio estimators of the population mean of . The mean square errors of these estimators are expressed analytically. If the quantile regression coefficient is greater than the ratio of the covariance between the study and the auxiliary variables to the variance of the auxiliary variable minus a function of the mean or the coefficient of variation, skewness, or kurtosis of and , then the proposed robust quantile regression mean estimator of is more efficient than the ratio estimators in the presence of outliers under systematic sampling. The reason is that these estimators only use regression coefficients and not the ratio between the population mean and sample means of the auxiliary variable . The aforementioned condition occurs with the values of the case study. For empirical data of 176 forest strips, the proposed estimate of the volume of timber is over 30% more efficient than the ratio estimates based on quantile regression coefficients.","PeriodicalId":49859,"journal":{"name":"Mathematical Population Studies","volume":"30 1","pages":"195 - 207"},"PeriodicalIF":1.4000,"publicationDate":"2022-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Population Studies","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1080/08898480.2022.2139072","RegionNum":3,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"DEMOGRAPHY","Score":null,"Total":0}
引用次数: 2

Abstract

ABSTRACT Regression ratio mean estimators of a study variable are defined as the coefficients provided by the ordinary least-squares regression of on a given auxiliary variable . They can be improved by using the coefficient of variation and the coefficient of kurtosis of . The influence of outliers on the estimates of the population mean of is neutralized by calculating robust regression coefficients, obtained by the method of either least absolute deviations, Huber-M, Huber-MM, Hampel-M, Tukey-M, or adjusted least squares. These robust coefficients are used to estimate the population mean of under simple random sampling. Extension to systematic sampling—which is a probability sampling in which every element of the population has equal probability of inclusion to be drawn—using the coefficients provided by quantile regression—whose coefficients result from the minimization of the sum of absolute deviations rather than from the square deviations from the regression line—requires ratio estimators of the population mean of . The mean square errors of these estimators are expressed analytically. If the quantile regression coefficient is greater than the ratio of the covariance between the study and the auxiliary variables to the variance of the auxiliary variable minus a function of the mean or the coefficient of variation, skewness, or kurtosis of and , then the proposed robust quantile regression mean estimator of is more efficient than the ratio estimators in the presence of outliers under systematic sampling. The reason is that these estimators only use regression coefficients and not the ratio between the population mean and sample means of the auxiliary variable . The aforementioned condition occurs with the values of the case study. For empirical data of 176 forest strips, the proposed estimate of the volume of timber is over 30% more efficient than the ratio estimates based on quantile regression coefficients.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
系统抽样下使用分位数回归对总体均值的稳健估计
研究变量的回归比均值估计量定义为给定辅助变量的普通最小二乘回归所提供的系数。它们可以通过使用的变异系数和峰度系数来改进。通过计算稳健回归系数来抵消异常值对总体平均值估计的影响,稳健回归系数通过最小绝对偏差、Huber-M、Huber-MM、Hampel-M、Tukey-M或调整后的最小二乘法获得。这些稳健系数用于估计简单随机抽样下的总体平均值。系统抽样是一种概率抽样,其中群体的每个元素都有相等的被包含概率,使用分位数回归提供的系数,其系数来自绝对偏差之和的最小化,而不是回归线的平方偏差人口平均数。这些估计量的均方误差用解析方法表示。如果分位数回归系数大于研究和辅助变量之间的协方差与辅助变量的方差之比减去和的平均值或变异系数、偏度或峰度的函数,则在系统抽样条件下,所提出的稳健分位数回归均值估计比存在异常值的比率估计更有效。原因是这些估计量只使用回归系数,而不是辅助变量的总体平均值和样本平均值之间的比率。上述情况发生在案例研究的值中。对于176个林带的经验数据,所提出的木材体积估计比基于分位数回归系数的比率估计有效30%以上。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Mathematical Population Studies
Mathematical Population Studies 数学-数学跨学科应用
CiteScore
3.20
自引率
11.10%
发文量
7
审稿时长
>12 weeks
期刊介绍: Mathematical Population Studies publishes carefully selected research papers in the mathematical and statistical study of populations. The journal is strongly interdisciplinary and invites contributions by mathematicians, demographers, (bio)statisticians, sociologists, economists, biologists, epidemiologists, actuaries, geographers, and others who are interested in the mathematical formulation of population-related questions. The scope covers both theoretical and empirical work. Manuscripts should be sent to Manuscript central for review. The editor-in-chief has final say on the suitability for publication.
期刊最新文献
Researching algorithm awareness: methodological approaches to investigate how people perceive, know, and interact with algorithms Fractional Lindley distribution generated by time scale theory, with application to discrete-time lifetime data Estimating the structure by age and sex of the US sexually active population Optimizing criterion for the upper limit of the signal response of brain neurons Optimal estimators of the population mean of a skewed distribution using auxiliary variables in median ranked-set sampling
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1