Sequential stratified inference for the mean

Jacob V. Spertus, Mayuri Sridhar, Philip B. Stark
{"title":"Sequential stratified inference for the mean","authors":"Jacob V. Spertus, Mayuri Sridhar, Philip B. Stark","doi":"arxiv-2409.06680","DOIUrl":null,"url":null,"abstract":"We develop conservative tests for the mean of a bounded population using data\nfrom a stratified sample. The sample may be drawn sequentially, with or without\nreplacement. The tests are \"anytime valid,\" allowing optional stopping and\ncontinuation in each stratum. We call this combination of properties\nsequential, finite-sample, nonparametric validity. The methods express a\nhypothesis about the population mean as a union of intersection hypotheses\ndescribing within-stratum means. They test each intersection hypothesis using\nindependent test supermartingales (TSMs) combined across strata by\nmultiplication. The $P$-value of the global null hypothesis is then the maximum\n$P$-value of any intersection hypothesis in the union. This approach has three\nprimary moving parts: (i) the rule for deciding which stratum to draw from next\nto test each intersection null, given the sample so far; (ii) the form of the\nTSM for each null in each stratum; and (iii) the method of combining evidence\nacross strata. These choices interact. We examine the performance of a variety\nof rules with differing computational complexity. Approximately optimal methods\nhave a prohibitive computational cost, while naive rules may be inconsistent --\nthey will never reject for some alternative populations, no matter how large\nthe sample. We present a method that is statistically comparable to optimal\nmethods in examples where optimal methods are computable, but computationally\ntractable for arbitrarily many strata. In numerical examples its expected\nsample size is substantially smaller than that of previous methods.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"42 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Methodology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.06680","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

We develop conservative tests for the mean of a bounded population using data from a stratified sample. The sample may be drawn sequentially, with or without replacement. The tests are "anytime valid," allowing optional stopping and continuation in each stratum. We call this combination of properties sequential, finite-sample, nonparametric validity. The methods express a hypothesis about the population mean as a union of intersection hypotheses describing within-stratum means. They test each intersection hypothesis using independent test supermartingales (TSMs) combined across strata by multiplication. The $P$-value of the global null hypothesis is then the maximum $P$-value of any intersection hypothesis in the union. This approach has three primary moving parts: (i) the rule for deciding which stratum to draw from next to test each intersection null, given the sample so far; (ii) the form of the TSM for each null in each stratum; and (iii) the method of combining evidence across strata. These choices interact. We examine the performance of a variety of rules with differing computational complexity. Approximately optimal methods have a prohibitive computational cost, while naive rules may be inconsistent -- they will never reject for some alternative populations, no matter how large the sample. We present a method that is statistically comparable to optimal methods in examples where optimal methods are computable, but computationally tractable for arbitrarily many strata. In numerical examples its expected sample size is substantially smaller than that of previous methods.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
均值的连续分层推断
我们利用分层抽样的数据,对有界群体的均值进行保守检验。样本可以连续抽取,也可以不替换。检验是 "随时有效 "的,允许在每个分层中选择停止和继续。我们把这种特性组合称为序列、有限样本、非参数有效性。这些方法将关于总体均值的假设表达为描述层内均值的交叉假设的结合。它们使用通过乘法跨层组合的独立检验超马尔廷公式(TSM)来检验每个交集假设。然后,全局零假设的 P$ 值就是联盟中任何交叉假设的最大 P$ 值。这种方法有三个主要的活动部分:(i) 根据迄今为止的样本情况,决定下一步从哪个层抽取样本来检验每个交叉零假设的规则;(ii) 每个层中每个零假设的全局矩阵形式;(iii) 组合跨层证据的方法。这些选择是相互影响的。我们研究了计算复杂度不同的各种规则的性能。近似最优方法的计算成本过高,而天真规则则可能不一致--无论样本量有多大,它们都不会拒绝某些替代人群。在最优方法可计算的例子中,我们提出了一种在统计上可与最优方法相媲美的方法,但对于任意多的分层来说,这种方法在计算上非常困难。在数值例子中,它的预期样本量大大小于以前的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Poisson approximate likelihood compared to the particle filter Optimising the Trade-Off Between Type I and Type II Errors: A Review and Extensions Bias Reduction in Matched Observational Studies with Continuous Treatments: Calipered Non-Bipartite Matching and Bias-Corrected Estimation and Inference Forecasting age distribution of life-table death counts via α-transformation Probability-scale residuals for event-time data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1