{"title":"论在调查抽样的回归估计中最有效地使用连续辅助变量","authors":"Takis Merkouris","doi":"10.1111/insr.12561","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Auxiliary variables with known population totals are extensively used in survey sampling to construct generalised regression (GR) estimators or optimal regression (OR) estimators of totals or means of study variables. This article explores the possibility of improving the efficiency of such estimators when continuous auxiliary variables are used in the regression estimation jointly with appropriate power functions of them, provided that the values of the auxiliary variables are known for all units in the population. The efficiency gain is determined analytically in the case of the OR estimator. A practical criterion for choosing the power functions that maximise the efficiency gain, involving the coefficient of determination in the regression fit of the study variable, is proposed for both the OR estimation and the more practicable, but generally less efficient, GR estimation. Furthermore, the effect of adding a power function of a continuous auxiliary variable in regression estimation is investigated when this variable is also used at the design stage. A simulation study shows that the joint use of a continuous auxiliary variable and a power function of it chosen according to the proposed criterion may improve considerably the efficiency of OR estimation, and much more the efficiency of GR estimation.</p>\n </div>","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":null,"pages":null},"PeriodicalIF":1.7000,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"On the Most Effective Use of Continuous Auxiliary Variables in Regression Estimation in Survey Sampling\",\"authors\":\"Takis Merkouris\",\"doi\":\"10.1111/insr.12561\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Auxiliary variables with known population totals are extensively used in survey sampling to construct generalised regression (GR) estimators or optimal regression (OR) estimators of totals or means of study variables. This article explores the possibility of improving the efficiency of such estimators when continuous auxiliary variables are used in the regression estimation jointly with appropriate power functions of them, provided that the values of the auxiliary variables are known for all units in the population. The efficiency gain is determined analytically in the case of the OR estimator. A practical criterion for choosing the power functions that maximise the efficiency gain, involving the coefficient of determination in the regression fit of the study variable, is proposed for both the OR estimation and the more practicable, but generally less efficient, GR estimation. Furthermore, the effect of adding a power function of a continuous auxiliary variable in regression estimation is investigated when this variable is also used at the design stage. A simulation study shows that the joint use of a continuous auxiliary variable and a power function of it chosen according to the proposed criterion may improve considerably the efficiency of OR estimation, and much more the efficiency of GR estimation.</p>\\n </div>\",\"PeriodicalId\":14479,\"journal\":{\"name\":\"International Statistical Review\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2023-12-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Statistical Review\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/insr.12561\",\"RegionNum\":3,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Statistical Review","FirstCategoryId":"100","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/insr.12561","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
摘要
在调查抽样中,已知总体的辅助变量被广泛用于构建研究变量总体或均值的广义回归(GR)估计值或最优回归(OR)估计值。本文探讨了当连续辅助变量与适当的幂函数一起用于回归估计时,如果辅助变量的值是已知的,则可以提高此类估计器的效率。在 OR 估计器的情况下,效率增益是通过分析确定的。针对 OR 估计和更实用但通常效率较低的 GR 估计,提出了一个实用标准,用于选择能使效率增益最大化的幂函数,该标准涉及研究变量回归拟合的决定系数。此外,还研究了在回归估计中添加连续辅助变量的幂函数的效果,当该变量也在设计阶段使用时。模拟研究表明,联合使用连续辅助变量和根据所提标准选择的幂函数,可以显著提高 OR 估计的效率,并大大提高 GR 估计的效率。
On the Most Effective Use of Continuous Auxiliary Variables in Regression Estimation in Survey Sampling
Auxiliary variables with known population totals are extensively used in survey sampling to construct generalised regression (GR) estimators or optimal regression (OR) estimators of totals or means of study variables. This article explores the possibility of improving the efficiency of such estimators when continuous auxiliary variables are used in the regression estimation jointly with appropriate power functions of them, provided that the values of the auxiliary variables are known for all units in the population. The efficiency gain is determined analytically in the case of the OR estimator. A practical criterion for choosing the power functions that maximise the efficiency gain, involving the coefficient of determination in the regression fit of the study variable, is proposed for both the OR estimation and the more practicable, but generally less efficient, GR estimation. Furthermore, the effect of adding a power function of a continuous auxiliary variable in regression estimation is investigated when this variable is also used at the design stage. A simulation study shows that the joint use of a continuous auxiliary variable and a power function of it chosen according to the proposed criterion may improve considerably the efficiency of OR estimation, and much more the efficiency of GR estimation.
期刊介绍:
International Statistical Review is the flagship journal of the International Statistical Institute (ISI) and of its family of Associations. It publishes papers of broad and general interest in statistics and probability. The term Review is to be interpreted broadly. The types of papers that are suitable for publication include (but are not limited to) the following: reviews/surveys of significant developments in theory, methodology, statistical computing and graphics, statistical education, and application areas; tutorials on important topics; expository papers on emerging areas of research or application; papers describing new developments and/or challenges in relevant areas; papers addressing foundational issues; papers on the history of statistics and probability; white papers on topics of importance to the profession or society; and historical assessment of seminal papers in the field and their impact.