Heteroscedasticity-aware stratified sampling to improve uplift modeling

European Journal of Operational Research (IF 6; Zone 2, Management; JCR Q1, Operations Research & Management Science). Pub Date: 2025-08-16. Epub Date: 2025-03-10. DOI: 10.1016/j.ejor.2025.02.030
Björn Bokelmann, Stefan Lessmann
European Journal of Operational Research, Volume 325, Issue 1, Pages 118-131. Full text: https://www.sciencedirect.com/science/article/pii/S0377221725001535

Abstract

Randomized controlled trials (RCTs) are conducted in many business applications, such as online marketing and customer churn prevention, to investigate the effect of specific treatments (coupons, retention offers, mailings, etc.). RCTs allow for the estimation of average treatment effects (ATEs) and the training of (uplift) models that capture the heterogeneity of treatment effects across individuals. The problem with RCTs is that they are costly, and this cost increases with the number of individuals included. These costs have inspired research on how to conduct experiments with a small number of individuals while still obtaining precise treatment effect estimates. We contribute to this literature a heteroskedasticity-aware stratified sampling (HS) scheme. We leverage the fact that different individuals have different noise levels in their outcomes and that precise treatment effect estimation requires more observations from “high-noise” individuals than from “low-noise” individuals. We show theoretically and empirically that, compared to RCT data sampled completely at random, HS sampling yields significantly more precise estimates of the ATE, improves uplift models, and makes their evaluation more reliable. Due to these benefits and the simplicity of our approach, we expect HS sampling to be valuable in many real-world applications in business and beyond.
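
The intuition that "high-noise" individuals need more observations can be illustrated with a small numerical sketch. The abstract does not spell out the HS scheme itself, so the code below is not the authors' method. It uses classic Neyman allocation (sample sizes proportional to stratum share times noise level) as a stand-in, and it assumes two hypothetical strata with known noise levels, a 50/50 treatment/control split inside each stratum, and proportional allocation as a rough proxy for completely random sampling. All names and numbers are made up for illustration.

```python
import numpy as np


def neyman_allocation(weights, sigmas, n_total):
    """Split n_total across strata proportionally to weight * sigma.

    Illustrative stand-in only: this is textbook Neyman allocation,
    not necessarily the HS scheme from the paper.
    """
    weights = np.asarray(weights, dtype=float)
    sigmas = np.asarray(sigmas, dtype=float)
    shares = weights * sigmas
    return n_total * shares / shares.sum()


def ate_variance(weights, sigmas, n_per_stratum):
    """Variance of a stratum-weighted difference-in-means ATE estimate.

    Assumes half of each stratum is treated and both arms share the same
    outcome noise sigma_h, so each stratum-level effect estimate has
    variance sigma_h**2 / (n_h / 2) * 2 = 4 * sigma_h**2 / n_h.
    """
    weights = np.asarray(weights, dtype=float)
    sigmas = np.asarray(sigmas, dtype=float)
    n = np.asarray(n_per_stratum, dtype=float)
    return float(np.sum(weights**2 * 4.0 * sigmas**2 / n))


# Hypothetical setup: 80% "low-noise" and 20% "high-noise" individuals.
weights = [0.8, 0.2]
sigmas = [1.0, 5.0]
n_total = 1000

# Proportional allocation mirrors the population shares.
n_prop = np.asarray(weights) * n_total
# Noise-aware allocation shifts observations toward the noisy stratum.
n_hs = neyman_allocation(weights, sigmas, n_total)

var_prop = ate_variance(weights, sigmas, n_prop)
var_hs = ate_variance(weights, sigmas, n_hs)

print("proportional allocation:", n_prop, "variance:", var_prop)
print("noise-aware allocation: ", n_hs, "variance:", var_hs)
```

Under these assumptions, the noise-aware allocation assigns more than half of the budget to the stratum that makes up only a fifth of the population, and the resulting ATE variance is lower than under proportional allocation at the same total sample size. In practice the noise levels would have to be estimated, which this sketch does not address.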
Source journal
European Journal of Operational Research (Management Science: Operations Research & Management Science)
CiteScore: 11.90
Self-citation rate: 9.40%
Articles published: 786
Review time: 8.2 months
Journal description: The European Journal of Operational Research (EJOR) publishes high quality, original papers that contribute to the methodology of operational research (OR) and to the practice of decision making.