Wenna Xi, Alice Hinton, Bo Lu, Karol Krotki, Brittney Keller-Hamilton, Amy Ferketich, Amang Sukasih
{"title":"Analysis of combined probability and nonprobability samples: A simulation evaluation and application to a teen smoking behavior survey.","authors":"Wenna Xi, Alice Hinton, Bo Lu, Karol Krotki, Brittney Keller-Hamilton, Amy Ferketich, Amang Sukasih","doi":"10.1080/03610918.2022.2102181","DOIUrl":null,"url":null,"abstract":"<p><p>In scientific studies with low-prevalence outcomes, probability sampling may be supplemented by nonprobability sampling to boost the sample size of desired subpopulation while remaining representative to the entire study population. To utilize both probability and nonprobability samples appropriately, several methods have been proposed in the literature to generate pseudo-weights, including ad-hoc weights, inclusion probability adjusted weights, and propensity score adjusted weights. We empirically compare various weighting strategies via an extensive simulation study, where probability and nonprobability samples are combined. Weight normalization and raking adjustment are also considered. Our simulation results suggest that the unity weight method (with weight normalization) and the inclusion probability adjusted weight method yield very good overall performance. This work is motivated by the Buckeye Teen Health Study, which examines risk factors for the initiation of smoking among teenage males in Ohio. To address the low response rate in the initial probability sample and low prevalence of smokers in the target population, a small convenience sample was collected as a supplement. Our proposed method yields estimates very close to the ones from the analysis using only the probability sample and enjoys the additional benefit of being able to track more teens with risky behaviors through follow-ups.</p>","PeriodicalId":49392,"journal":{"name":"Urban Water Journal","volume":"19 1","pages":"3285-3301"},"PeriodicalIF":1.6000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11343077/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Urban Water Journal","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1080/03610918.2022.2102181","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/7/25 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"WATER RESOURCES","Score":null,"Total":0}
引用次数: 0
Abstract
In scientific studies with low-prevalence outcomes, probability sampling may be supplemented by nonprobability sampling to boost the sample size of desired subpopulation while remaining representative to the entire study population. To utilize both probability and nonprobability samples appropriately, several methods have been proposed in the literature to generate pseudo-weights, including ad-hoc weights, inclusion probability adjusted weights, and propensity score adjusted weights. We empirically compare various weighting strategies via an extensive simulation study, where probability and nonprobability samples are combined. Weight normalization and raking adjustment are also considered. Our simulation results suggest that the unity weight method (with weight normalization) and the inclusion probability adjusted weight method yield very good overall performance. This work is motivated by the Buckeye Teen Health Study, which examines risk factors for the initiation of smoking among teenage males in Ohio. To address the low response rate in the initial probability sample and low prevalence of smokers in the target population, a small convenience sample was collected as a supplement. Our proposed method yields estimates very close to the ones from the analysis using only the probability sample and enjoys the additional benefit of being able to track more teens with risky behaviors through follow-ups.
在低流行率结果的科学研究中,概率抽样可辅以非概率抽样,以增加所需亚人群的样本量,同时保持对整个研究人群的代表性。为了合理利用概率和非概率样本,文献中提出了几种方法来生成伪权重,包括临时权重、纳入概率调整权重和倾向得分调整权重。我们通过广泛的模拟研究,结合概率样本和非概率样本,对各种加权策略进行了实证比较。我们还考虑了权重归一化和耙式调整。模拟结果表明,统一权重法(权重归一化)和包含概率调整权重法的总体性能非常好。这项工作由 "巴克基耶青少年健康研究"(Buckeye Teen Health Study)激发,该研究调查了俄亥俄州青少年男性开始吸烟的风险因素。为了解决初始概率样本响应率低和目标人群吸烟率低的问题,我们收集了一个小型便利样本作为补充。我们建议的方法得出的估计值与仅使用概率样本进行分析得出的估计值非常接近,而且还能通过随访跟踪更多有危险行为的青少年,从而获得额外的益处。
期刊介绍:
Urban Water Journal provides a forum for the research and professional communities dealing with water systems in the urban environment, directly contributing to the furtherance of sustainable development. Particular emphasis is placed on the analysis of interrelationships and interactions between the individual water systems, urban water bodies and the wider environment. The Journal encourages the adoption of an integrated approach, and system''s thinking to solve the numerous problems associated with sustainable urban water management.
Urban Water Journal focuses on the water-related infrastructure in the city: namely potable water supply, treatment and distribution; wastewater collection, treatment and management, and environmental return; storm drainage and urban flood management. Specific topics of interest include:
network design, optimisation, management, operation and rehabilitation;
novel treatment processes for water and wastewater, resource recovery, treatment plant design and optimisation as well as treatment plants as part of the integrated urban water system;
demand management and water efficiency, water recycling and source control;
stormwater management, urban flood risk quantification and management;
monitoring, utilisation and management of urban water bodies including groundwater;
water-sensitive planning and design (including analysis of interactions of the urban water cycle with city planning and green infrastructure);
resilience of the urban water system, long term scenarios to manage uncertainty, system stress testing;
data needs, smart metering and sensors, advanced data analytics for knowledge discovery, quantification and management of uncertainty, smart technologies for urban water systems;
decision-support and informatic tools;...