具有异质性的非概率样本的双稳健估计

IF 2.6 2区 数学 Q1 MATHEMATICS, APPLIED Journal of Computational and Applied Mathematics Pub Date : 2025-09-01 Epub Date: 2025-02-21 DOI:10.1016/j.cam.2025.116567
Zhan Liu, Yi Sun, Yong Li, Yuanmeng Li
{"title":"具有异质性的非概率样本的双稳健估计","authors":"Zhan Liu,&nbsp;Yi Sun,&nbsp;Yong Li,&nbsp;Yuanmeng Li","doi":"10.1016/j.cam.2025.116567","DOIUrl":null,"url":null,"abstract":"<div><div>With the development of network technology and the rise of big data, non-probability sampling has wider applications in practice. However, it brings a challenge to make inference from non-probability samples since the inclusion probabilities of non-probability samples are unknown. The propensity score approach, superpopulation model approach, doubly robust estimation are three main methods to infer the population from non-probability samples. However, the first two methods are sensitive to the misspecified models. Thus, they cannot generate desirable performances when deal with heterogeneous non-probability samples. In this paper, a doubly robust estimation method for non-probability samples with heterogeneous data is proposed. A heterogeneous superpopulation model is fitted based on a heterogeneous non-probability sample and used to construct a doubly robust estimator for the population mean. Specifically, the inverse estimated inclusion probabilities of the non-probability sample are added into the estimating equation as weights in model parameter estimation. The simulation results confirm that the proposed method outperforms the other contrastive methods in terms of bias, standard deviation and mean square error. Its application is illustrated with the Pew Research Center dataset and the Behavioral Risk Factor Surveillance System dataset, which is consistent with the simulation results.</div></div>","PeriodicalId":50226,"journal":{"name":"Journal of Computational and Applied Mathematics","volume":"465 ","pages":"Article 116567"},"PeriodicalIF":2.6000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Doubly robust estimation for non-probability samples with heterogeneity\",\"authors\":\"Zhan Liu,&nbsp;Yi Sun,&nbsp;Yong Li,&nbsp;Yuanmeng Li\",\"doi\":\"10.1016/j.cam.2025.116567\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>With the development of network technology and the rise of big data, non-probability sampling has wider applications in practice. However, it brings a challenge to make inference from non-probability samples since the inclusion probabilities of non-probability samples are unknown. The propensity score approach, superpopulation model approach, doubly robust estimation are three main methods to infer the population from non-probability samples. However, the first two methods are sensitive to the misspecified models. Thus, they cannot generate desirable performances when deal with heterogeneous non-probability samples. In this paper, a doubly robust estimation method for non-probability samples with heterogeneous data is proposed. A heterogeneous superpopulation model is fitted based on a heterogeneous non-probability sample and used to construct a doubly robust estimator for the population mean. Specifically, the inverse estimated inclusion probabilities of the non-probability sample are added into the estimating equation as weights in model parameter estimation. The simulation results confirm that the proposed method outperforms the other contrastive methods in terms of bias, standard deviation and mean square error. Its application is illustrated with the Pew Research Center dataset and the Behavioral Risk Factor Surveillance System dataset, which is consistent with the simulation results.</div></div>\",\"PeriodicalId\":50226,\"journal\":{\"name\":\"Journal of Computational and Applied Mathematics\",\"volume\":\"465 \",\"pages\":\"Article 116567\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computational and Applied Mathematics\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0377042725000822\",\"RegionNum\":2,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/2/21 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"MATHEMATICS, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational and Applied Mathematics","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0377042725000822","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/21 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
引用次数: 0

摘要

随着网络技术的发展和大数据的兴起,非概率抽样在实践中得到了更广泛的应用。然而,由于非概率样本的包含概率是未知的,这给从非概率样本中进行推理带来了挑战。倾向得分法、超总体模型法和双稳健估计是从非概率样本中推断总体的三种主要方法。然而,前两种方法对错误指定的模型很敏感。因此,当处理异构非概率样本时,它们不能产生理想的性能。本文提出了一种具有异构数据的非概率样本的双鲁棒估计方法。基于异质非概率样本拟合异质超总体模型,构建了总体均值的双鲁棒估计。具体而言,将非概率样本的包含概率逆估计作为模型参数估计的权重加入到估计方程中。仿真结果表明,该方法在偏置、标准差和均方误差方面优于其他对比方法。用皮尤研究中心数据集和行为风险因素监测系统数据集说明了其应用,与模拟结果一致。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Doubly robust estimation for non-probability samples with heterogeneity
With the development of network technology and the rise of big data, non-probability sampling has wider applications in practice. However, it brings a challenge to make inference from non-probability samples since the inclusion probabilities of non-probability samples are unknown. The propensity score approach, superpopulation model approach, doubly robust estimation are three main methods to infer the population from non-probability samples. However, the first two methods are sensitive to the misspecified models. Thus, they cannot generate desirable performances when deal with heterogeneous non-probability samples. In this paper, a doubly robust estimation method for non-probability samples with heterogeneous data is proposed. A heterogeneous superpopulation model is fitted based on a heterogeneous non-probability sample and used to construct a doubly robust estimator for the population mean. Specifically, the inverse estimated inclusion probabilities of the non-probability sample are added into the estimating equation as weights in model parameter estimation. The simulation results confirm that the proposed method outperforms the other contrastive methods in terms of bias, standard deviation and mean square error. Its application is illustrated with the Pew Research Center dataset and the Behavioral Risk Factor Surveillance System dataset, which is consistent with the simulation results.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
5.40
自引率
4.20%
发文量
437
审稿时长
3.0 months
期刊介绍: The Journal of Computational and Applied Mathematics publishes original papers of high scientific value in all areas of computational and applied mathematics. The main interest of the Journal is in papers that describe and analyze new computational techniques for solving scientific or engineering problems. Also the improved analysis, including the effectiveness and applicability, of existing methods and algorithms is of importance. The computational efficiency (e.g. the convergence, stability, accuracy, ...) should be proved and illustrated by nontrivial numerical examples. Papers describing only variants of existing methods, without adding significant new computational properties are not of interest. The audience consists of: applied mathematicians, numerical analysts, computational scientists and engineers.
期刊最新文献
A PDE-based image restoration method: Mathematical analysis and implementation Robust learning of minimum error entropy under heavy-tailed noise Dual-valued functions of dual matrices with applications in causal emergence On non-standard Isaac’s wall pursuit game using dynamic programming method A mixed Nitsche extended finite element method for Biot’s consolidation interface problems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1