Conformal prediction beyond exchangeability
R. Barber, E. Candès, Aaditya Ramdas, R. Tibshirani
The Annals of Statistics, https://doi.org/10.1214/23-aos2276 (2022-02-27)

Conformal prediction is a popular, modern technique for providing valid predictive inference for arbitrary machine learning models. Its validity relies on two assumptions: exchangeability of the data, and symmetry of the given model-fitting algorithm as a function of the data. However, exchangeability is often violated when predictive models are deployed in practice. For example, if the data distribution drifts over time, then the data points are no longer exchangeable; moreover, in such settings, we might want to use a nonsymmetric algorithm that treats recent observations as more relevant. This paper generalizes conformal prediction to handle both issues: we employ weighted quantiles to introduce robustness against distribution drift, and we design a new randomization technique to allow for algorithms that do not treat data points symmetrically. Our new methods are provably robust, losing substantially less coverage when exchangeability is violated by distribution drift or other challenging features of real data, while achieving the same coverage guarantees as existing conformal prediction methods when the data points are in fact exchangeable. We demonstrate the practical utility of these new tools with simulations and real-data experiments on electricity and election forecasting.
A general characterization of optimal tie-breaker designs
Harrison H. Li, A. Owen
The Annals of Statistics, https://doi.org/10.1214/23-aos2275 (2022-02-25)

Tie-breaker designs trade off a statistical design objective against the short-term gain from preferentially assigning a binary treatment to subjects with high values of a running variable $x$. The design objective is any continuous function of the expected information matrix in a two-line regression model, and short-term gain is expressed as the covariance between the running variable and the treatment indicator. We investigate how to specify design functions, which give treatment probabilities as a function of $x$, to optimize these competing objectives under external constraints on the number of subjects receiving treatment. Our results include sharp existence and uniqueness guarantees while accommodating the ethically appealing requirement that treatment probabilities be nondecreasing in $x$. Under such a constraint, there always exists an optimal design function that is constant below and above a single discontinuity. When the running variable distribution is not symmetric or the fraction of subjects receiving the treatment is not $1/2$, our optimal designs improve upon a $D$-optimality objective without sacrificing short-term gain, compared to the three-level tie-breaker designs of Owen and Varian (2020) that fix treatment probabilities at $0$, $1/2$, and $1$. We illustrate our optimal designs with data from Head Start, an early childhood government intervention program.
Optimal high-dimensional and nonparametric distributed testing under communication constraints
Botond Szabó, Lasse Vuursteen, H. van Zanten
The Annals of Statistics, https://doi.org/10.1214/23-aos2269 (2022-02-02)

We derive minimax testing errors in a distributed framework where the data are split over multiple machines and their communication to a central machine is limited to $b$ bits. We investigate both the $d$-dimensional and infinite-dimensional signal detection problems under Gaussian white noise, and we derive distributed testing algorithms that attain the theoretical lower bounds. Our results show that distributed testing is subject to fundamentally different phenomena than distributed estimation. Among our findings, we show that testing protocols with access to shared randomness can perform strictly better in some regimes than those without. We also observe that consistent nonparametric distributed testing is always possible, even with as little as $1$ bit of communication, and that the corresponding test outperforms the best local test using only the information available at a single machine. Furthermore, we derive adaptive nonparametric distributed testing strategies and the corresponding theoretical lower bounds.
{"title":"Minimax nonparametric estimation of pure quantum states","authors":"Samriddha Lahiry, M. Nussbaum","doi":"10.1214/21-aos2115","DOIUrl":"https://doi.org/10.1214/21-aos2115","url":null,"abstract":"","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"14 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83358734","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Testing community structure for hypergraphs
Mingao Yuan, Ruiqi Liu, Yang Feng, Zuofeng Shang
The Annals of Statistics, https://doi.org/10.1214/21-aos2099 (2022-02-01)

Many complex networks in the real world can be formulated as hypergraphs, on which community detection is widely used. However, the fundamental question of whether communities exist at all in an observed hypergraph remains open. This work tackles that problem. Specifically, we systematically study when a hypergraph with community structure can be distinguished from its Erdős–Rényi counterpart, and we propose concrete test statistics when the models are distinguishable. The contribution of this paper is threefold. First, we discover a phase transition in the hyperedge probability for distinguishability. Second, in the bounded-degree regime, we derive a sharp signal-to-noise ratio (SNR) threshold for distinguishability in two-community 3-uniform hypergraphs, and nearly tight SNR thresholds in general two-community m-uniform hypergraphs. Third, in the dense regime, we propose a computationally feasible test based on sub-hypergraph counts, obtain its asymptotic distribution, and analyze its power. Our results extend to nonuniform hypergraphs, for which we propose a new test involving both edge and hyperedge information. The proofs rely on Janson's contiguity theory (Combin. Probab. Comput. 4 (1995) 369–405), a high-moments-driven asymptotic normality result of Gao and Wormald (Probab. Theory Related Fields 130 (2004) 368–376), and a truncation technique for analyzing the likelihood ratio.
Dimension reduction for functional data based on weak conditional moments
Bing Li, Jun Song
The Annals of Statistics, https://doi.org/10.1214/21-aos2091 (2022-02-01)

We develop a general theory and estimation methods for functional linear sufficient dimension reduction, where both the predictor and the response can be random functions, or even vectors of functions. Unlike existing dimension reduction methods, our approach does not rely on estimating a conditional mean or conditional variance. Instead, it is based on a new statistical construction, the weak conditional expectation, built on Carleman operators and their inducing functions. The weak conditional expectation generalizes the conditional expectation: its key advantage is to replace the projection onto an L2-space, which defines conditional expectation, with a projection onto an arbitrary Hilbert space, while still maintaining the unbiasedness of the related dimension reduction methods. This flexibility is particularly important for functional data, because attempting to estimate a full-fledged conditional mean or conditional variance by slicing or smoothing over the space of vector-valued functions may be inefficient due to the curse of dimensionality. We evaluate the performance of our new methods by simulation and in several applied settings.
Half-trek criterion for identifiability of latent variable models
R. Barber, M. Drton, Nils Sturma, Luca Weihs
The Annals of Statistics, https://doi.org/10.1214/22-aos2221 (2022-01-12)

We consider linear structural equation models with latent variables and develop a criterion to certify whether the direct causal effects between the observable variables are identifiable from the observed covariance matrix. Linear structural equation models assume that both observed and latent variables solve a linear equation system featuring stochastic noise terms. Each model corresponds to a directed graph whose edges represent the direct effects that appear as coefficients in the equation system. Prior research has developed a variety of methods to decide identifiability of direct effects in a latent projection framework, in which the confounding effects of the latent variables are represented by correlation among noise terms. This approach is effective when the confounding is sparse and affects only small subsets of the observed variables. In contrast, the new latent-factor half-trek criterion (LF-HTC) developed in this paper operates on the original, unprojected latent variable model and can certify identifiability in settings where some latent variables have dense effects on many, or even all, of the observables. LF-HTC is a sufficient criterion for rational identifiability, under which the direct effects can be uniquely recovered as rational functions of the joint covariance matrix of the observed random variables. When the search steps in LF-HTC are restricted to subsets of latent variables of bounded size, the criterion can be verified in time polynomial in the size of the graph.
On robustness and local differential privacy
Mengchu Li, Thomas B. Berrett, Yi Yu
The Annals of Statistics, https://doi.org/10.1214/23-aos2267 (2022-01-03)

There is growing demand for statistical analysis tools that are robust against contamination while also preserving individual data owners' privacy. Although both topics host a rich body of literature, to the best of our knowledge we are the first to systematically study the connections between optimality under Huber's contamination model and local differential privacy (LDP) constraints. We start with a general minimax lower bound that disentangles the cost of being robust against Huber's contamination from the cost of preserving LDP. We then study four concrete examples: a two-point testing problem, a potentially diverging mean estimation problem, a nonparametric density estimation problem, and a univariate median estimation problem. For each problem, we demonstrate procedures that are optimal in the presence of both contamination and LDP constraints, comment on the connections with state-of-the-art methods studied under only one of the two constraints, and unveil the connections between robustness and LDP by partially answering whether LDP procedures are robust and whether robust procedures can be efficiently privatised. Overall, our work showcases the promise of jointly studying robustness and local differential privacy.
Variable selection, monotone likelihood ratio and group sparsity
C. Butucea, E. Mammen, M. Ndaoud, A. Tsybakov
The Annals of Statistics, https://doi.org/10.1214/22-aos2251 (2021-12-30)

In the pivotal variable selection problem, we derive the exact nonasymptotic minimax selector over the class of all $s$-sparse vectors, which is also the Bayes selector with respect to the uniform prior. While this optimal selector is, in general, not realizable in polynomial time, we show that its tractable counterpart (the scan selector) attains the minimax expected Hamming risk to within a factor of 2, and is also exactly minimax with respect to the probability of wrong recovery. As a consequence, we establish explicit lower bounds under the monotone likelihood ratio property, and we obtain a tight characterization of the minimax risk in terms of the best separable selector risk. We apply these general results to derive necessary and sufficient conditions for exact and almost full recovery in the location model with light-tailed distributions and in the problem of group variable selection under Gaussian noise.
General and feasible tests with multiply-imputed datasets
Kin Wai Chan
The Annals of Statistics, https://doi.org/10.1214/21-aos2132 (2021-12-30)

Multiple imputation (MI) is a technique designed especially for handling missing data in public-use datasets. It allows analysts to perform incomplete-data inference straightforwardly by using several already-imputed datasets released by the dataset owners. However, existing MI tests require either a restrictive assumption on the missing-data mechanism, known as equal odds of missing information (EOMI), or an infinite number of imputations. Some also require analysts to have access to restrictive or nonstandard computer subroutines. Moreover, existing MI testing procedures cover only Wald's tests and likelihood ratio tests, not Rao's score tests, and so are not fully general. In addition, MI Wald's tests and MI likelihood ratio tests are not procedurally identical, so analysts must resort to distinct algorithms for implementation. In this paper, we propose a general MI procedure, called stacked multiple imputation (SMI), that performs Wald's tests, likelihood ratio tests and Rao's score tests by a unified algorithm. SMI requires neither EOMI nor an infinite number of imputations. It is particularly feasible for analysts, who only need a complete-data testing device to perform the corresponding incomplete-data test.