{"title":"Sequential stratified inference for the mean","authors":"Jacob V. Spertus, Mayuri Sridhar, Philip B. Stark","doi":"arxiv-2409.06680","DOIUrl":null,"url":null,"abstract":"We develop conservative tests for the mean of a bounded population using data\nfrom a stratified sample. The sample may be drawn sequentially, with or without\nreplacement. The tests are \"anytime valid,\" allowing optional stopping and\ncontinuation in each stratum. We call this combination of properties\nsequential, finite-sample, nonparametric validity. The methods express a\nhypothesis about the population mean as a union of intersection hypotheses\ndescribing within-stratum means. They test each intersection hypothesis using\nindependent test supermartingales (TSMs) combined across strata by\nmultiplication. The $P$-value of the global null hypothesis is then the maximum\n$P$-value of any intersection hypothesis in the union. This approach has three\nprimary moving parts: (i) the rule for deciding which stratum to draw from next\nto test each intersection null, given the sample so far; (ii) the form of the\nTSM for each null in each stratum; and (iii) the method of combining evidence\nacross strata. These choices interact. We examine the performance of a variety\nof rules with differing computational complexity. Approximately optimal methods\nhave a prohibitive computational cost, while naive rules may be inconsistent --\nthey will never reject for some alternative populations, no matter how large\nthe sample. We present a method that is statistically comparable to optimal\nmethods in examples where optimal methods are computable, but computationally\ntractable for arbitrarily many strata. In numerical examples its expected\nsample size is substantially smaller than that of previous methods.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"42 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Methodology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.06680","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We develop conservative tests for the mean of a bounded population using data
from a stratified sample. The sample may be drawn sequentially, with or without
replacement. The tests are "anytime valid," allowing optional stopping and
continuation in each stratum. We call this combination of properties
sequential, finite-sample, nonparametric validity. The methods express a
hypothesis about the population mean as a union of intersection hypotheses
describing within-stratum means. They test each intersection hypothesis using
independent test supermartingales (TSMs) combined across strata by
multiplication. The $P$-value of the global null hypothesis is then the maximum
$P$-value of any intersection hypothesis in the union. This approach has three
primary moving parts: (i) the rule for deciding which stratum to draw from next
to test each intersection null, given the sample so far; (ii) the form of the
TSM for each null in each stratum; and (iii) the method of combining evidence
across strata. These choices interact. We examine the performance of a variety
of rules with differing computational complexity. Approximately optimal methods
have a prohibitive computational cost, while naive rules may be inconsistent --
they will never reject for some alternative populations, no matter how large
the sample. We present a method that is statistically comparable to optimal
methods in examples where optimal methods are computable, but computationally
tractable for arbitrarily many strata. In numerical examples its expected
sample size is substantially smaller than that of previous methods.