首页 > 最新文献

The Annals of Statistics最新文献

英文 中文
Deep learning for the partially linear Cox model 部分线性Cox模型的深度学习
Pub Date : 2022-06-01 DOI: 10.1214/21-aos2153
Qixian Zhong, Jonas W. Mueller, Jane-ling Wang
{"title":"Deep learning for the partially linear Cox model","authors":"Qixian Zhong, Jonas W. Mueller, Jane-ling Wang","doi":"10.1214/21-aos2153","DOIUrl":"https://doi.org/10.1214/21-aos2153","url":null,"abstract":"","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"113 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79321490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Evidence factors from multiple, possibly invalid, instrumental variables 证据因素来自多个可能无效的工具变量
Pub Date : 2022-06-01 DOI: 10.1214/21-aos2148
Anqi Zhao, Youjin Lee, Dylan S. Small, B. Karmakar
{"title":"Evidence factors from multiple, possibly invalid, instrumental variables","authors":"Anqi Zhao, Youjin Lee, Dylan S. Small, B. Karmakar","doi":"10.1214/21-aos2148","DOIUrl":"https://doi.org/10.1214/21-aos2148","url":null,"abstract":"","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"146 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88635180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A data-adaptive method for estimating density level sets under shape conditions 形状条件下密度水平集估计的数据自适应方法
Pub Date : 2022-06-01 DOI: 10.1214/21-aos2168
A. Rodríguez-Casal, P. Saavedra-Nieves
Given a random sample of points from some unknown density, we propose a method for estimating density level sets, for a given threshold t, under the r ́convexity assumption. This shape condition generalizes the convexity property and allows to consider level sets with more than one connected component. The main problem in practice is that r is an unknown geometric characteristic of the set related to its curvature, which may depend on t. A stochastic algorithm is proposed for selecting its value from data. The resulting reconstruction of the level set is able to achieve minimax rates for Hausdorff metric and distance in measure uniformly on the level t.
给定来自未知密度点的随机样本,我们提出了一种方法来估计密度水平集,对于给定阈值t,在r凸性假设下。这个形状条件推广了凸性,并允许考虑具有多个连通分量的水平集。实践中的主要问题是r是与曲率相关的集合的未知几何特征,它可能依赖于t。提出了一种从数据中选择其值的随机算法。所得的水平集重建能够在水平t上均匀地实现豪斯多夫度量和测量距离的极小极大率。
{"title":"A data-adaptive method for estimating density level sets under shape conditions","authors":"A. Rodríguez-Casal, P. Saavedra-Nieves","doi":"10.1214/21-aos2168","DOIUrl":"https://doi.org/10.1214/21-aos2168","url":null,"abstract":"Given a random sample of points from some unknown density, we propose a method for estimating density level sets, for a given threshold t, under the r ́convexity assumption. This shape condition generalizes the convexity property and allows to consider level sets with more than one connected component. The main problem in practice is that r is an unknown geometric characteristic of the set related to its curvature, which may depend on t. A stochastic algorithm is proposed for selecting its value from data. The resulting reconstruction of the level set is able to achieve minimax rates for Hausdorff metric and distance in measure uniformly on the level t.","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"18 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75614441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Consistent order selection for ARFIMA processes 为ARFIMA流程选择一致的订单
Pub Date : 2022-06-01 DOI: 10.1214/21-aos2149
Hsueh-Han Huang, N. Chan, Kun Chen, C. Ing
{"title":"Consistent order selection for ARFIMA processes","authors":"Hsueh-Han Huang, N. Chan, Kun Chen, C. Ing","doi":"10.1214/21-aos2149","DOIUrl":"https://doi.org/10.1214/21-aos2149","url":null,"abstract":"","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"386 1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75542601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
False discovery rate control with unknown null distribution: Is it possible to mimic the oracle? 错误发现率控制与未知null分布:是否有可能模仿oracle?
Pub Date : 2022-04-01 DOI: 10.1214/21-aos2141
Étienne Roquain, N. Verzelen
Classical multiple testing theory prescribes the null distribution, which is often a too stringent assumption for nowadays large scale experiments. This paper presents theoretical foundations to understand the limitations caused by ignoring the null distribution, and how it can be properly learned from the (same) data-set, when possible. We explore this issue in the case where the null distributions are Gaussian with an unknown rescaling parameters (mean and variance) and the alternative distribution is let arbitrary. While an oracle procedure in that case is the Benjamini Hochberg procedure applied with the true (unknown) null distribution, we pursue the aim of building a procedure that asymptotically mimics the performance of the oracle (AMO in short). Our main result states that an AMO procedure exists if and only if the sparsity parameter k (number of false nulls) is of order less than n/ log(n), where n is the total number of tests. Further sparsity boundaries are derived for general location models where the shape of the null distribution is not necessarily Gaussian. Given our impossibility results, we also pursue a weaker objective, which is to find a confidence region for the oracle. To this end, we develop a distribution-dependent confidence region for the null distribution. As practical by-products, this provides a goodness of fit test for the null distribution, as well as a visual method assessing the reliability of empirical null multiple testing methods. Our results are illustrated with numerical experiments and a companion vignette Roquain and Verzelen (2020). AMS 2000 subject classifications: Primary 62G10; secondary 62C20.
经典的多重检验理论规定了零分布,这对于当今的大规模实验来说往往是一个过于严格的假设。本文提供了理解忽略零分布所造成的限制的理论基础,以及如何在可能的情况下从(相同)数据集中正确地学习它。我们在零分布是高斯分布的情况下探讨这个问题,其中零分布具有未知的重标参数(均值和方差),而替代分布是任意的。在这种情况下,oracle过程是应用真实(未知)零分布的Benjamini Hochberg过程,而我们追求的目标是构建一个渐进地模仿oracle(简称AMO)性能的过程。我们的主要结果表明,当且仅当稀疏性参数k(假空数)小于n/ log(n)的数量级时存在AMO过程,其中n是测试的总数。对于零分布形状不一定是高斯分布的一般位置模型,导出了进一步的稀疏性边界。鉴于我们的不可能结果,我们还追求一个较弱的目标,即为神谕找到一个置信区域。为此,我们为零分布建立了一个分布相关的置信区域。作为实际的副产品,这提供了零分布的拟合优度检验,以及评估经验零多重检验方法可靠性的可视化方法。我们的结果用数值实验和配套的小插图Roquain和Verzelen(2020)来说明。AMS 2000学科分类:初级62G10;二次62甜。
{"title":"False discovery rate control with unknown null distribution: Is it possible to mimic the oracle?","authors":"Étienne Roquain, N. Verzelen","doi":"10.1214/21-aos2141","DOIUrl":"https://doi.org/10.1214/21-aos2141","url":null,"abstract":"Classical multiple testing theory prescribes the null distribution, which is often a too stringent assumption for nowadays large scale experiments. This paper presents theoretical foundations to understand the limitations caused by ignoring the null distribution, and how it can be properly learned from the (same) data-set, when possible. We explore this issue in the case where the null distributions are Gaussian with an unknown rescaling parameters (mean and variance) and the alternative distribution is let arbitrary. While an oracle procedure in that case is the Benjamini Hochberg procedure applied with the true (unknown) null distribution, we pursue the aim of building a procedure that asymptotically mimics the performance of the oracle (AMO in short). Our main result states that an AMO procedure exists if and only if the sparsity parameter k (number of false nulls) is of order less than n/ log(n), where n is the total number of tests. Further sparsity boundaries are derived for general location models where the shape of the null distribution is not necessarily Gaussian. Given our impossibility results, we also pursue a weaker objective, which is to find a confidence region for the oracle. To this end, we develop a distribution-dependent confidence region for the null distribution. As practical by-products, this provides a goodness of fit test for the null distribution, as well as a visual method assessing the reliability of empirical null multiple testing methods. Our results are illustrated with numerical experiments and a companion vignette Roquain and Verzelen (2020). AMS 2000 subject classifications: Primary 62G10; secondary 62C20.","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"227 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80166690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Adaptive estimation in multivariate response regression with hidden variables 隐变量多元响应回归的自适应估计
Pub Date : 2022-04-01 DOI: 10.1214/21-aos2059
Xin Bing, Y. Ning, Yaosheng Xu
{"title":"Adaptive estimation in multivariate response regression with hidden variables","authors":"Xin Bing, Y. Ning, Yaosheng Xu","doi":"10.1214/21-aos2059","DOIUrl":"https://doi.org/10.1214/21-aos2059","url":null,"abstract":"","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"133 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84919837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Parametric copula adjusted for non- and semiparametric regression 为非参数和半参数回归调整的参数联结
Pub Date : 2022-04-01 DOI: 10.1214/21-aos2126
Yue Zhao, I. Gijbels, I. Van Keilegom
{"title":"Parametric copula adjusted for non- and semiparametric regression","authors":"Yue Zhao, I. Gijbels, I. Van Keilegom","doi":"10.1214/21-aos2126","DOIUrl":"https://doi.org/10.1214/21-aos2126","url":null,"abstract":"","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"7 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74837633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Sparse high-dimensional linear regression. Estimating squared error and a phase transition 稀疏高维线性回归。估计平方误差和相变
Pub Date : 2022-04-01 DOI: 10.1214/21-aos2130
D. Gamarnik, Ilias Zadik
We consider a sparse high dimensional regression model where the goal is to recover a k-sparse unknown binary vector β∗ from n noisy linear observations of the form Y = Xβ∗+W ∈ R where X ∈ Rn×p has i.i.d. N(0, 1) entries and W ∈ R has i.i.d. N(0, σ) entries. In the high signal-to-noise ratio regime and sublinear sparsity regime, while the order of the sample size needed to recover the unknown vector information-theoretially is known to be n∗ := 2k log p/ log(k/σ + 1), no polynomial-time algorithm is known to succeed unless n > nalg := (2k + σ) log p. In this work, we offer a series of results investigating multiple computational and statistical aspects of the recovery task in the regime n ∈ [n∗, nalg]. First, we establish a novel information-theoretic property of the MLE of the problem happening around n = n∗ samples, which we coin as an “all-or-nothing behavior”: when n > n∗ it recovers almost perfectly the support of β∗, while if n < n∗ it fails to recover any fraction of it correctly. Second, at an attempt to understand the computational hardness in the regime n ∈ [n∗, nalg] we prove that at order nalg samples there is an Overlap Gap Property (OGP) phase transition occurring at the landscape of the MLE: for constants c, C > 0 when n < cnalg OGP appears in the landscape of MLE while if n > Cnalg OGP disappears. OGP is a geometric “disconnectivity” property which initially appeared in the theory of spin glasses and is known to suggest algorithmic hardness when it occurs. Finally, using certain technical results obtained to establish the OGP phase transition, we additionally establish various novel positive and negative algorithmic results for the recovery task of interest, including the failure of LASSO with access to n < cnalg samples and the success of a simple Local Search method with access to n > Cnalg samples.
我们考虑一个稀疏的高维回归模型,其目标是从n个形式为Y = Xβ∗+W∈R的噪声线性观测中恢复k稀疏未知二进制向量β∗,其中X∈Rn×p有i.i.d.n(0,1)个条目,W∈R有i.i.d.n (0, σ)个条目。在高信噪比政权和次线性稀疏的政权,而恢复所需的样本大小的顺序已知未知向量information-theoretially n∗:= 2 k日志p /日志(k /σ+ 1),没有多项式时间算法被成功除非n > nalg:日志p = (2 k +σ)。在这工作,我们提供一系列的结果调查多个计算和统计方面的恢复任务的政权n∈(n∗,nalg)。首先,我们建立了发生在n = n∗样本周围的问题的MLE的一个新的信息理论性质,我们称之为“全有或全无行为”:当n > n∗时,它几乎完美地恢复β∗的支持,而如果n < n∗,它不能正确地恢复它的任何部分。其次,在试图理解n∈[n∗,nalg]区域的计算硬度时,我们证明了在nalg阶样本中,在MLE的横向上发生重叠间隙属性(OGP)相变:对于常数c, c > 0,当n < cnalg OGP出现在MLE的横向上,而如果n > cnalg OGP消失。OGP是一种几何“不连通性”性质,最初出现在自旋玻璃理论中,当它发生时,已知表明算法硬度。最后,利用获得的某些技术结果来建立OGP相变,我们还为感兴趣的恢复任务建立了各种新的正负算法结果,包括访问n < cnalg样本的LASSO失败以及访问n > cnalg样本的简单局部搜索方法的成功。
{"title":"Sparse high-dimensional linear regression. Estimating squared error and a phase transition","authors":"D. Gamarnik, Ilias Zadik","doi":"10.1214/21-aos2130","DOIUrl":"https://doi.org/10.1214/21-aos2130","url":null,"abstract":"We consider a sparse high dimensional regression model where the goal is to recover a k-sparse unknown binary vector β∗ from n noisy linear observations of the form Y = Xβ∗+W ∈ R where X ∈ Rn×p has i.i.d. N(0, 1) entries and W ∈ R has i.i.d. N(0, σ) entries. In the high signal-to-noise ratio regime and sublinear sparsity regime, while the order of the sample size needed to recover the unknown vector information-theoretially is known to be n∗ := 2k log p/ log(k/σ + 1), no polynomial-time algorithm is known to succeed unless n > nalg := (2k + σ) log p. In this work, we offer a series of results investigating multiple computational and statistical aspects of the recovery task in the regime n ∈ [n∗, nalg]. First, we establish a novel information-theoretic property of the MLE of the problem happening around n = n∗ samples, which we coin as an “all-or-nothing behavior”: when n > n∗ it recovers almost perfectly the support of β∗, while if n < n∗ it fails to recover any fraction of it correctly. Second, at an attempt to understand the computational hardness in the regime n ∈ [n∗, nalg] we prove that at order nalg samples there is an Overlap Gap Property (OGP) phase transition occurring at the landscape of the MLE: for constants c, C > 0 when n < cnalg OGP appears in the landscape of MLE while if n > Cnalg OGP disappears. OGP is a geometric “disconnectivity” property which initially appeared in the theory of spin glasses and is known to suggest algorithmic hardness when it occurs. Finally, using certain technical results obtained to establish the OGP phase transition, we additionally establish various novel positive and negative algorithmic results for the recovery task of interest, including the failure of LASSO with access to n < cnalg samples and the success of a simple Local Search method with access to n > Cnalg samples.","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"2000 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88295763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
On resampling schemes for particle filters with weakly informative observations 弱信息观测粒子滤波器的重采样方案
Pub Date : 2022-03-18 DOI: 10.1214/22-aos2222
N. Chopin, Sumeetpal S. Singh, Tom'as Soto, M. Vihola
We consider particle filters with weakly informative observations (or `potentials') relative to the latent state dynamics. The particular focus of this work is on particle filters to approximate time-discretisations of continuous-time Feynman--Kac path integral models -- a scenario that naturally arises when addressing filtering and smoothing problems in continuous time -- but our findings are indicative about weakly informative settings beyond this context too. We study the performance of different resampling schemes, such as systematic resampling, SSP (Srinivasan sampling process) and stratified resampling, as the time-discretisation becomes finer and also identify their continuous-time limit, which is expressed as a suitably defined `infinitesimal generator.' By contrasting these generators, we find that (certain modifications of) systematic and SSP resampling `dominate' stratified and independent `killing' resampling in terms of their limiting overall resampling rate. The reduced intensity of resampling manifests itself in lower variance in our numerical experiment. This efficiency result, through an ordering of the resampling rate, is new to the literature. The second major contribution of this work concerns the analysis of the limiting behaviour of the entire population of particles of the particle filter as the time discretisation becomes finer. We provide the first proof, under general conditions, that the particle approximation of the discretised continuous-time Feynman--Kac path integral models converges to a (uniformly weighted) continuous-time particle system.
我们考虑具有相对于潜在状态动力学的弱信息观测(或“势”)的粒子滤波器。这项工作的特别重点是粒子滤波器,以近似连续时间费曼-卡茨路径积分模型的时间离散——这是在处理连续时间的滤波和平滑问题时自然出现的一种情况——但我们的发现也表明了这种情况之外的弱信息设置。我们研究了不同重采样方案的性能,如系统重采样,SSP (Srinivasan采样过程)和分层重采样,因为时间离散化变得更精细,并且还确定了它们的连续时间极限,这被表示为适当定义的“无穷小发生器”。通过对比这些发生器,我们发现(某些修改)系统和SSP重采样“主导”分层和独立的“杀伤”重采样,就其限制总体重采样率而言。在我们的数值实验中,重采样强度的降低表现为方差的降低。这种效率结果,通过重新采样率的排序,是新的文献。这项工作的第二个主要贡献涉及随着时间离散化变得更细,粒子滤波器的整个粒子群的极限行为的分析。在一般条件下,我们首次证明了离散连续时间Feynman—Kac路径积分模型的粒子近似收敛于(均匀加权)连续时间粒子系统。
{"title":"On resampling schemes for particle filters with weakly informative observations","authors":"N. Chopin, Sumeetpal S. Singh, Tom'as Soto, M. Vihola","doi":"10.1214/22-aos2222","DOIUrl":"https://doi.org/10.1214/22-aos2222","url":null,"abstract":"We consider particle filters with weakly informative observations (or `potentials') relative to the latent state dynamics. The particular focus of this work is on particle filters to approximate time-discretisations of continuous-time Feynman--Kac path integral models -- a scenario that naturally arises when addressing filtering and smoothing problems in continuous time -- but our findings are indicative about weakly informative settings beyond this context too. We study the performance of different resampling schemes, such as systematic resampling, SSP (Srinivasan sampling process) and stratified resampling, as the time-discretisation becomes finer and also identify their continuous-time limit, which is expressed as a suitably defined `infinitesimal generator.' By contrasting these generators, we find that (certain modifications of) systematic and SSP resampling `dominate' stratified and independent `killing' resampling in terms of their limiting overall resampling rate. The reduced intensity of resampling manifests itself in lower variance in our numerical experiment. This efficiency result, through an ordering of the resampling rate, is new to the literature. The second major contribution of this work concerns the analysis of the limiting behaviour of the entire population of particles of the particle filter as the time discretisation becomes finer. We provide the first proof, under general conditions, that the particle approximation of the discretised continuous-time Feynman--Kac path integral models converges to a (uniformly weighted) continuous-time particle system.","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"128 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82914734","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A new and flexible design construction for orthogonal arrays for modern applications 面向现代应用的一种新型、灵活的正交阵列设计结构
Pub Date : 2022-03-12 DOI: 10.1214/21-aos2159
Yuanzhen He, C. D. Lin, Fasheng Sun
Orthogonal array, a classical and effective tool for collecting data, has been flourished with its applications in modern computer experiments and engineering statistics. Driven by the wide use of computer experiments with both qualitative and quantitative factors, multiple computer experiments, multi-fidelity computer experiments, cross-validation and stochastic optimization, orthogonal arrays with certain structures have been introduced. Sliced orthogonal arrays and nested orthogonal arrays are examples of such arrays. This article introduces a flexible, fresh construction method which uses smaller arrays and a special structure. The method uncovers the hidden structure of many existing fixed-level orthogonal arrays of given run sizes, possibly with more columns. It also allows fixed-level orthogonal arrays of nearly strength three to be constructed, which are useful as there are not many construction methods for fixed-level orthogonal arrays of strength three, and also helpful for generating Latin hypercube designs with desirable low-dimensional projections. Theoretical properties of the proposed method are explored. As by-products, several theoretical results on orthogonal arrays are obtained.
正交阵列作为一种经典而有效的数据收集工具,在现代计算机实验和工程统计中得到了广泛的应用。随着计算机实验在定性和定量因素、多重计算机实验、多保真计算机实验、交叉验证和随机优化等方面的广泛应用,引入了具有一定结构的正交阵列。切片正交阵列和嵌套正交阵列就是这种阵列的例子。本文介绍了一种灵活新颖的构造方法,该方法采用较小的阵列和特殊的结构。该方法揭示了许多现有的给定运行大小的固定水平正交数组的隐藏结构,可能有更多的列。它还允许构造强度接近3的固定水平正交阵列,这在强度为3的固定水平正交阵列的构造方法不多的情况下非常有用,并且对于生成具有理想低维投影的拉丁超立方体设计也很有帮助。探讨了该方法的理论性质。作为副产物,得到了几个关于正交阵列的理论结果。
{"title":"A new and flexible design construction for orthogonal arrays for modern applications","authors":"Yuanzhen He, C. D. Lin, Fasheng Sun","doi":"10.1214/21-aos2159","DOIUrl":"https://doi.org/10.1214/21-aos2159","url":null,"abstract":"Orthogonal array, a classical and effective tool for collecting data, has been flourished with its applications in modern computer experiments and engineering statistics. Driven by the wide use of computer experiments with both qualitative and quantitative factors, multiple computer experiments, multi-fidelity computer experiments, cross-validation and stochastic optimization, orthogonal arrays with certain structures have been introduced. Sliced orthogonal arrays and nested orthogonal arrays are examples of such arrays. This article introduces a flexible, fresh construction method which uses smaller arrays and a special structure. The method uncovers the hidden structure of many existing fixed-level orthogonal arrays of given run sizes, possibly with more columns. It also allows fixed-level orthogonal arrays of nearly strength three to be constructed, which are useful as there are not many construction methods for fixed-level orthogonal arrays of strength three, and also helpful for generating Latin hypercube designs with desirable low-dimensional projections. Theoretical properties of the proposed method are explored. As by-products, several theoretical results on orthogonal arrays are obtained.","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89375175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
The Annals of Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1