首页 > 最新文献

arXiv - MATH - Statistics Theory最新文献

英文 中文
Individualized Treatment Allocations with Distributional Welfare 个体化治疗分配与分配福利
Pub Date : 2023-11-27 DOI: arxiv-2311.15878
Yifan Cui, Sukjin Han
In this paper, we explore optimal treatment allocation policies that targetdistributional welfare. Most literature on treatment choice has consideredutilitarian welfare based on the conditional average treatment effect (ATE).While average welfare is intuitive, it may yield undesirable allocationsespecially when individuals are heterogeneous (e.g., with outliers) - the veryreason individualized treatments were introduced in the first place. Thisobservation motivates us to propose an optimal policy that allocates thetreatment based on the conditional emph{quantile of individual treatmenteffects} (QoTE). Depending on the choice of the quantile probability, thiscriterion can accommodate a policymaker who is either prudent or negligent. Thechallenge of identifying the QoTE lies in its requirement for knowledge of thejoint distribution of the counterfactual outcomes, which is generally hard torecover even with experimental data. Therefore, we introduce minimax optimalpolicies that are robust to model uncertainty. We then propose a range ofidentifying assumptions under which we can point or partially identify theQoTE. We establish the asymptotic bound on the regret of implementing theproposed policies. We consider both stochastic and deterministic rules. Insimulations and two empirical applications, we compare optimal decisions basedon the QoTE with decisions based on other criteria.
本文探讨了以分配福利为目标的最优待遇分配政策。大多数关于治疗选择的文献都考虑了基于条件平均治疗效果(ATE)的功利主义福利。虽然平均福利是直观的,但它可能会产生不受欢迎的分配,特别是当个体是异质的(例如,有异常值)——这就是首先引入个性化治疗的原因。这一观察结果促使我们提出一种基于emph{个体治疗效果的条件分位数分配治疗的最佳策略}(quote)。根据分位数概率的选择,这一标准可以容纳审慎或疏忽的政策制定者。识别引文的挑战在于它需要了解反事实结果的联合分布,这通常即使使用实验数据也很难恢复。因此,我们引入了对模型不确定性具有鲁棒性的极大极小最优策略。然后,我们提出一系列识别假设,在这些假设下,我们可以指向或部分识别报价。我们建立了实施所提出的政策的遗憾的渐近界。我们同时考虑随机规则和确定性规则。在模拟和两个经验应用中,我们比较了基于quote的最优决策与基于其他标准的决策。
{"title":"Individualized Treatment Allocations with Distributional Welfare","authors":"Yifan Cui, Sukjin Han","doi":"arxiv-2311.15878","DOIUrl":"https://doi.org/arxiv-2311.15878","url":null,"abstract":"In this paper, we explore optimal treatment allocation policies that target\u0000distributional welfare. Most literature on treatment choice has considered\u0000utilitarian welfare based on the conditional average treatment effect (ATE).\u0000While average welfare is intuitive, it may yield undesirable allocations\u0000especially when individuals are heterogeneous (e.g., with outliers) - the very\u0000reason individualized treatments were introduced in the first place. This\u0000observation motivates us to propose an optimal policy that allocates the\u0000treatment based on the conditional emph{quantile of individual treatment\u0000effects} (QoTE). Depending on the choice of the quantile probability, this\u0000criterion can accommodate a policymaker who is either prudent or negligent. The\u0000challenge of identifying the QoTE lies in its requirement for knowledge of the\u0000joint distribution of the counterfactual outcomes, which is generally hard to\u0000recover even with experimental data. Therefore, we introduce minimax optimal\u0000policies that are robust to model uncertainty. We then propose a range of\u0000identifying assumptions under which we can point or partially identify the\u0000QoTE. We establish the asymptotic bound on the regret of implementing the\u0000proposed policies. We consider both stochastic and deterministic rules. In\u0000simulations and two empirical applications, we compare optimal decisions based\u0000on the QoTE with decisions based on other criteria.","PeriodicalId":501330,"journal":{"name":"arXiv - MATH - Statistics Theory","volume":"17 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138526139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian Approach to Linear Bayesian Networks 线性贝叶斯网络的贝叶斯方法
Pub Date : 2023-11-27 DOI: arxiv-2311.15610
Seyong Hwang, Kyoungjae Lee, Sunmin Oh, Gunwoong Park
This study proposes the first Bayesian approach for learning high-dimensionallinear Bayesian networks. The proposed approach iteratively estimates eachelement of the topological ordering from backward and its parent using theinverse of a partial covariance matrix. The proposed method successfullyrecovers the underlying structure when Bayesian regularization for the inversecovariance matrix with unequal shrinkage is applied. Specifically, it showsthat the number of samples $n = Omega( d_M^2 log p)$ and $n = Omega(d_M^2p^{2/m})$ are sufficient for the proposed algorithm to learn linear Bayesiannetworks with sub-Gaussian and 4m-th bounded-moment error distributions,respectively, where $p$ is the number of nodes and $d_M$ is the maximum degreeof the moralized graph. The theoretical findings are supported by extensivesimulation studies including real data analysis. Furthermore the proposedmethod is demonstrated to outperform state-of-the-art frequentist approaches,such as the BHLSM, LISTEN, and TD algorithms in synthetic data.
本研究提出了学习高维线性贝叶斯网络的第一种贝叶斯方法。该方法使用部分协方差矩阵的逆来迭代地估计拓扑排序从后向及其父节点的每个元素。当对不等收缩的反方差矩阵进行贝叶斯正则化时,该方法成功地恢复了底层结构。具体来说,研究表明,样本数量$n = Omega( d_M^2 log p)$和$n = Omega(d_M^2p^{2/m})$足以使所提出的算法分别学习亚高斯和4m-th有界矩误差分布的线性贝叶斯网络,其中$p$为节点数,$d_M$为道德图的最大程度。理论发现得到了广泛的模拟研究的支持,包括实际数据分析。此外,所提出的方法被证明优于最先进的频率方法,如BHLSM, LISTEN和TD算法在合成数据中。
{"title":"Bayesian Approach to Linear Bayesian Networks","authors":"Seyong Hwang, Kyoungjae Lee, Sunmin Oh, Gunwoong Park","doi":"arxiv-2311.15610","DOIUrl":"https://doi.org/arxiv-2311.15610","url":null,"abstract":"This study proposes the first Bayesian approach for learning high-dimensional\u0000linear Bayesian networks. The proposed approach iteratively estimates each\u0000element of the topological ordering from backward and its parent using the\u0000inverse of a partial covariance matrix. The proposed method successfully\u0000recovers the underlying structure when Bayesian regularization for the inverse\u0000covariance matrix with unequal shrinkage is applied. Specifically, it shows\u0000that the number of samples $n = Omega( d_M^2 log p)$ and $n = Omega(d_M^2\u0000p^{2/m})$ are sufficient for the proposed algorithm to learn linear Bayesian\u0000networks with sub-Gaussian and 4m-th bounded-moment error distributions,\u0000respectively, where $p$ is the number of nodes and $d_M$ is the maximum degree\u0000of the moralized graph. The theoretical findings are supported by extensive\u0000simulation studies including real data analysis. Furthermore the proposed\u0000method is demonstrated to outperform state-of-the-art frequentist approaches,\u0000such as the BHLSM, LISTEN, and TD algorithms in synthetic data.","PeriodicalId":501330,"journal":{"name":"arXiv - MATH - Statistics Theory","volume":"62 6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138526138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pseudo-likelihood Estimators for Graphical Models: Existence and Uniqueness 图模型的伪似然估计:存在性与唯一性
Pub Date : 2023-11-27 DOI: arxiv-2311.15528
Benjamin Roycraft, Bala Rajaratnam
Graphical and sparse (inverse) covariance models have found widespread use inmodern sample-starved high dimensional applications. A part of their wideappeal stems from the significantly low sample sizes required for the existenceof estimators, especially in comparison with the classical full covariancemodel. For undirected Gaussian graphical models, the minimum sample sizerequired for the existence of maximum likelihood estimators had been an openquestion for almost half a century, and has been recently settled. The verysame question for pseudo-likelihood estimators has remained unsolved ever sincetheir introduction in the '70s. Pseudo-likelihood estimators have recentlyreceived renewed attention as they impose fewer restrictive assumptions andhave better computational tractability, improved statistical performance, andappropriateness in modern high dimensional applications, thus renewing interestin this longstanding problem. In this paper, we undertake a comprehensive studyof this open problem within the context of the two classes of pseudo-likelihoodmethods proposed in the literature. We provide a precise answer to thisquestion for both pseudo-likelihood approaches and relate the correspondingsolutions to their Gaussian counterpart.
图形化和稀疏(逆)协方差模型在现代缺乏样本的高维应用中得到了广泛的应用。它们的广泛吸引力的一部分源于存在估计量所需的显著低样本量,特别是与经典的全协方差模型相比。对于无向高斯图形模型,最大似然估计存在所需的最小样本量是近半个世纪以来一个悬而未决的问题,最近才得到解决。伪似然估计器自70年代问世以来,同样的问题一直没有得到解决。伪似然估计最近受到了新的关注,因为它们施加了更少的限制性假设,具有更好的计算可追溯性,改进的统计性能,以及在现代高维应用中的适用性,从而重新引起了对这个长期存在的问题的兴趣。在本文中,我们在文献中提出的两类伪似然方法的背景下对这个开放问题进行了全面的研究。我们为这两种伪似然方法提供了这个问题的精确答案,并将相应的解与它们的高斯对应解联系起来。
{"title":"Pseudo-likelihood Estimators for Graphical Models: Existence and Uniqueness","authors":"Benjamin Roycraft, Bala Rajaratnam","doi":"arxiv-2311.15528","DOIUrl":"https://doi.org/arxiv-2311.15528","url":null,"abstract":"Graphical and sparse (inverse) covariance models have found widespread use in\u0000modern sample-starved high dimensional applications. A part of their wide\u0000appeal stems from the significantly low sample sizes required for the existence\u0000of estimators, especially in comparison with the classical full covariance\u0000model. For undirected Gaussian graphical models, the minimum sample size\u0000required for the existence of maximum likelihood estimators had been an open\u0000question for almost half a century, and has been recently settled. The very\u0000same question for pseudo-likelihood estimators has remained unsolved ever since\u0000their introduction in the '70s. Pseudo-likelihood estimators have recently\u0000received renewed attention as they impose fewer restrictive assumptions and\u0000have better computational tractability, improved statistical performance, and\u0000appropriateness in modern high dimensional applications, thus renewing interest\u0000in this longstanding problem. In this paper, we undertake a comprehensive study\u0000of this open problem within the context of the two classes of pseudo-likelihood\u0000methods proposed in the literature. We provide a precise answer to this\u0000question for both pseudo-likelihood approaches and relate the corresponding\u0000solutions to their Gaussian counterpart.","PeriodicalId":501330,"journal":{"name":"arXiv - MATH - Statistics Theory","volume":"34 9","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138526166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low-Degree Hardness of Detection for Correlated Erdős-Rényi Graphs 关联Erdős-Rényi图的低度检测硬度
Pub Date : 2023-11-27 DOI: arxiv-2311.15931
Jian Ding, Hang Du, Zhangsong Li
Given two ErdH{o}s-R'enyi graphs with $n$ vertices whose edges arecorrelated through a latent vertex correspondence, we study complexity lowerbounds for the associated correlation detection problem for the class oflow-degree polynomial algorithms. We provide evidence that anydegree-$O(rho^{-1})$ polynomial algorithm fails for detection, where $rho$ isthe edge correlation. Furthermore, in the sparse regime where the edge density$q=n^{-1+o(1)}$, we provide evidence that any degree-$d$ polynomial algorithmfails for detection, as long as $log d=obig( frac{log n}{log nq} wedgesqrt{log n} big)$ and the correlation $rho
给定两个Erd H{o} s- r图,其$n$顶点的边通过潜在顶点对应关系相关联,我们研究了一类低次多项式算法的关联相关检测问题的复杂度下限。我们提供的证据表明,任何程度- $O(rho^{-1})$多项式算法失败的检测,其中$rho$是边缘相关。此外,在边缘密度$q=n^{-1+o(1)}$的稀疏状态下,我们提供证据表明,只要$log d=obig( frac{log n}{log nq} wedgesqrt{log n} big)$和相关性$rho
{"title":"Low-Degree Hardness of Detection for Correlated Erdős-Rényi Graphs","authors":"Jian Ding, Hang Du, Zhangsong Li","doi":"arxiv-2311.15931","DOIUrl":"https://doi.org/arxiv-2311.15931","url":null,"abstract":"Given two ErdH{o}s-R'enyi graphs with $n$ vertices whose edges are\u0000correlated through a latent vertex correspondence, we study complexity lower\u0000bounds for the associated correlation detection problem for the class of\u0000low-degree polynomial algorithms. We provide evidence that any\u0000degree-$O(rho^{-1})$ polynomial algorithm fails for detection, where $rho$ is\u0000the edge correlation. Furthermore, in the sparse regime where the edge density\u0000$q=n^{-1+o(1)}$, we provide evidence that any degree-$d$ polynomial algorithm\u0000fails for detection, as long as $log d=obig( frac{log n}{log nq} wedge\u0000sqrt{log n} big)$ and the correlation $rho<sqrt{alpha}$ where\u0000$alphaapprox 0.338$ is the Otter's constant. Our result suggests that several\u0000state-of-the-art algorithms on correlation detection and exact matching\u0000recovery may be essentially the best possible.","PeriodicalId":501330,"journal":{"name":"arXiv - MATH - Statistics Theory","volume":"73 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138526168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Local Landscape of Phase Retrieval Under Limited Samples 有限样本条件下相位检索的局部图景
Pub Date : 2023-11-26 DOI: arxiv-2311.15221
Kaizhao Liu, Zihao Wang, Lei Wu
In this paper, we provide a fine-grained analysis of the local landscape ofphase retrieval under the regime with limited samples. Our aim is to ascertainthe minimal sample size necessary to guarantee a benign local landscapesurrounding global minima in high dimensions. Let $n$ and $d$ denote the samplesize and input dimension, respectively. We first explore the local convexityand establish that when $n=o(dlog d)$, for almost every fixed point in thelocal ball, the Hessian matrix must have negative eigenvalues as long as $d$ issufficiently large. Consequently, the local landscape is highly non-convex. Wenext consider the one-point strong convexity and show that as long as$n=omega(d)$, with high probability, the landscape is one-point stronglyconvex in the local annulus: ${winmathbb{R}^d: o_d(1)leqslant|w-w^*|leqslant c}$, where $w^*$ is the ground truth and $c$ is an absoluteconstant. This implies that gradient descent initialized from any point in thisdomain can converge to an $o_d(1)$-loss solution exponentially fast.Furthermore, we show that when $n=o(dlog d)$, there is a radius of$widetildeThetaleft(sqrt{1/d}right)$ such that one-point convexity breaksin the corresponding smaller local ball. This indicates an impossibility toestablish a convergence to exact $w^*$ for gradient descent under limitedsamples by relying solely on one-point convexity.
在本文中,我们提供了一个细粒度的分析,在有限的样本制度下,相位检索的局部景观。我们的目标是确定必要的最小样本量,以保证一个良性的局部景观周围的高维全球最小值。设$n$和$d$分别表示样本量和输入维度。我们首先探索了局部凸性,并建立了当$n=o(dlog d)$时,对于几乎每个局部球上的不动点,只要$d$足够大,Hessian矩阵必须具有负特征值。因此,当地景观是高度非凸的。接下来考虑一点强凸性,并表明只要$n=omega(d)$,在高概率下,景观在局部环空是一点强凸的:${winmathbb{R}^d: o_d(1)leqslant|w-w^*|leqslant c}$,其中$w^*$是基本真理,$c$是绝对常数。这意味着从该域中任何点初始化的梯度下降都可以以指数级速度收敛到$o_d(1)$ -loss解。进一步,我们证明了当$n=o(dlog d)$时,存在一个半径$widetildeThetaleft(sqrt{1/d}right)$,使得相应的较小的局部球的一点凸性破裂。这表明,在有限样本下,仅依靠一点凸性,不可能建立精确$w^*$梯度下降的收敛性。
{"title":"The Local Landscape of Phase Retrieval Under Limited Samples","authors":"Kaizhao Liu, Zihao Wang, Lei Wu","doi":"arxiv-2311.15221","DOIUrl":"https://doi.org/arxiv-2311.15221","url":null,"abstract":"In this paper, we provide a fine-grained analysis of the local landscape of\u0000phase retrieval under the regime with limited samples. Our aim is to ascertain\u0000the minimal sample size necessary to guarantee a benign local landscape\u0000surrounding global minima in high dimensions. Let $n$ and $d$ denote the sample\u0000size and input dimension, respectively. We first explore the local convexity\u0000and establish that when $n=o(dlog d)$, for almost every fixed point in the\u0000local ball, the Hessian matrix must have negative eigenvalues as long as $d$ is\u0000sufficiently large. Consequently, the local landscape is highly non-convex. We\u0000next consider the one-point strong convexity and show that as long as\u0000$n=omega(d)$, with high probability, the landscape is one-point strongly\u0000convex in the local annulus: ${winmathbb{R}^d: o_d(1)leqslant\u0000|w-w^*|leqslant c}$, where $w^*$ is the ground truth and $c$ is an absolute\u0000constant. This implies that gradient descent initialized from any point in this\u0000domain can converge to an $o_d(1)$-loss solution exponentially fast.\u0000Furthermore, we show that when $n=o(dlog d)$, there is a radius of\u0000$widetildeThetaleft(sqrt{1/d}right)$ such that one-point convexity breaks\u0000in the corresponding smaller local ball. This indicates an impossibility to\u0000establish a convergence to exact $w^*$ for gradient descent under limited\u0000samples by relying solely on one-point convexity.","PeriodicalId":501330,"journal":{"name":"arXiv - MATH - Statistics Theory","volume":"55 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138526137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Characterization of valid auxiliary functions for representations of extreme value distributions and their max-domains of attraction 表示极值分布及其最大吸引域的有效辅助函数的表征
Pub Date : 2023-11-26 DOI: arxiv-2311.15355
Miriam Isabel Seifert
In this paper we study two important representations for extreme valuedistributions and their max-domains of attraction (MDA), namely von Misesrepresentation (vMR) and variation representation (VR), which are convenientways to gain limit results. Both VR and vMR are defined via so-called auxiliaryfunctions psi. Up to now, however, the set of valid auxiliary functions for vMRhas neither been characterized completely nor separated from those for VR. Wecontribute to the current literature by introducing ''universal'' auxiliaryfunctions which are valid for both VR and vMR representations for the entireMDA distribution families. Then we identify exactly the sets of valid auxiliaryfunctions for both VR and vMR. Moreover, we propose a method for findingappropriate auxiliary functions with analytically simple structure and providethem for several important distributions.
本文研究了极值分布及其最大吸引域(MDA)的两种重要表示,即von Misesrepresentation (vMR)和variation representation (VR),它们是获得极限结果的方便方法。VR和vMR都是通过所谓的辅助函数psi来定义的。然而,到目前为止,vmr的有效辅助功能集既没有完全表征,也没有与VR的有效辅助功能集分离开来。我们通过引入对整个remda分布家族的VR和vMR表示都有效的“通用”辅助函数来贡献当前的文献。然后,我们准确地确定了VR和vMR的有效辅助函数集。此外,我们还提出了一种寻找结构简单的辅助函数的方法,并为几个重要的分布提供了辅助函数。
{"title":"Characterization of valid auxiliary functions for representations of extreme value distributions and their max-domains of attraction","authors":"Miriam Isabel Seifert","doi":"arxiv-2311.15355","DOIUrl":"https://doi.org/arxiv-2311.15355","url":null,"abstract":"In this paper we study two important representations for extreme value\u0000distributions and their max-domains of attraction (MDA), namely von Mises\u0000representation (vMR) and variation representation (VR), which are convenient\u0000ways to gain limit results. Both VR and vMR are defined via so-called auxiliary\u0000functions psi. Up to now, however, the set of valid auxiliary functions for vMR\u0000has neither been characterized completely nor separated from those for VR. We\u0000contribute to the current literature by introducing ''universal'' auxiliary\u0000functions which are valid for both VR and vMR representations for the entire\u0000MDA distribution families. Then we identify exactly the sets of valid auxiliary\u0000functions for both VR and vMR. Moreover, we propose a method for finding\u0000appropriate auxiliary functions with analytically simple structure and provide\u0000them for several important distributions.","PeriodicalId":501330,"journal":{"name":"arXiv - MATH - Statistics Theory","volume":"25 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138526154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quickest Change Detection with Post-Change Density Estimation 最快的变化检测与后变化密度估计
Pub Date : 2023-11-25 DOI: arxiv-2311.15128
Yuchen Liang, Venugopal V. Veeravalli
The problem of quickest change detection in a sequence of independentobservations is considered. The pre-change distribution is assumed to be known,while the post-change distribution is unknown. Two tests based on post-changedensity estimation are developed for this problem, the window-limitednon-parametric generalized likelihood ratio (NGLR) CuSum test and thenon-parametric window-limited adaptive (NWLA) CuSum test. Both tests do notassume any knowledge of the post-change distribution, except that thepost-change density satisfies certain smoothness conditions that allows forefficient non-parametric estimation. Also, they do not require anypre-collected post-change training samples. Under certain convergenceconditions on the density estimator, it is shown that both tests arefirst-order asymptotically optimal, as the false alarm rate goes to zero. Theanalysis is validated through numerical results, where both tests are comparedwith baseline tests that have distributional knowledge.
研究了独立观测序列中最快速的变化检测问题。假设变更前的分布是已知的,而变更后的分布是未知的。针对这一问题,提出了基于变后密度估计的两种检验方法:限窗非参数广义似然比(NGLR) CuSum检验和非参数限窗自适应(NWLA) CuSum检验。这两个测试都不假设对变化后的分布有任何了解,除非变化后的密度满足一定的平滑条件,允许有效的非参数估计。此外,它们不需要预先收集任何更改后的训练样本。在密度估计量的一定收敛条件下,当虚警率趋于零时,两个检验都是一阶渐近最优的。通过数值结果验证了该分析,其中两个测试都与具有分布知识的基线测试进行了比较。
{"title":"Quickest Change Detection with Post-Change Density Estimation","authors":"Yuchen Liang, Venugopal V. Veeravalli","doi":"arxiv-2311.15128","DOIUrl":"https://doi.org/arxiv-2311.15128","url":null,"abstract":"The problem of quickest change detection in a sequence of independent\u0000observations is considered. The pre-change distribution is assumed to be known,\u0000while the post-change distribution is unknown. Two tests based on post-change\u0000density estimation are developed for this problem, the window-limited\u0000non-parametric generalized likelihood ratio (NGLR) CuSum test and the\u0000non-parametric window-limited adaptive (NWLA) CuSum test. Both tests do not\u0000assume any knowledge of the post-change distribution, except that the\u0000post-change density satisfies certain smoothness conditions that allows for\u0000efficient non-parametric estimation. Also, they do not require any\u0000pre-collected post-change training samples. Under certain convergence\u0000conditions on the density estimator, it is shown that both tests are\u0000first-order asymptotically optimal, as the false alarm rate goes to zero. The\u0000analysis is validated through numerical results, where both tests are compared\u0000with baseline tests that have distributional knowledge.","PeriodicalId":501330,"journal":{"name":"arXiv - MATH - Statistics Theory","volume":"83 6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138521329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Moment-Type Estimators for the Dirichlet and the Multivariate Gamma Distributions Dirichlet分布和多元伽玛分布的矩型估计
Pub Date : 2023-11-25 DOI: arxiv-2311.15025
Ioannis Oikonomidis, Samis Trevezas
This study presents new closed-form estimators for the Dirichlet and theMultivariate Gamma distribution families, whose maximum likelihood estimatorcannot be explicitly derived. The methodology builds upon the score-adjustedestimators for the Beta and Gamma distributions, extending their applicabilityto the Dirichlet and Multivariate Gamma distributions. Expressions for theasymptotic variance-covariance matrices are provided, demonstrating thesuperior performance of score-adjusted estimators over the traditional momentones. Leveraging well-established connections between Dirichlet andMultivariate Gamma distributions, a novel class of estimators for the latter isintroduced, referred to as "Dirichlet-based moment-type estimators". Thegeneral asymptotic variance-covariance matrix form for this estimator class isderived. To facilitate the application of these innovative estimators, an Rpackage called estimators is developed and made publicly available.
本文针对Dirichlet分布族和多元伽玛分布族的最大似然估计量不能显式导出的问题,提出了新的封闭估计量。该方法建立在Beta和Gamma分布的分数调整估计器上,扩展了它们对Dirichlet和多元Gamma分布的适用性。给出了渐近方差-协方差矩阵的表达式,证明了分数调整估计量优于传统矩量的性能。利用Dirichlet和多元伽玛分布之间建立的良好联系,介绍了后者的一类新的估计量,称为“基于Dirichlet的矩型估计量”。导出了该类估计量的一般渐近方差-协方差矩阵形式。为了促进这些创新估算器的应用,开发了一个名为estimators的Rpackage,并使其公开可用。
{"title":"Moment-Type Estimators for the Dirichlet and the Multivariate Gamma Distributions","authors":"Ioannis Oikonomidis, Samis Trevezas","doi":"arxiv-2311.15025","DOIUrl":"https://doi.org/arxiv-2311.15025","url":null,"abstract":"This study presents new closed-form estimators for the Dirichlet and the\u0000Multivariate Gamma distribution families, whose maximum likelihood estimator\u0000cannot be explicitly derived. The methodology builds upon the score-adjusted\u0000estimators for the Beta and Gamma distributions, extending their applicability\u0000to the Dirichlet and Multivariate Gamma distributions. Expressions for the\u0000asymptotic variance-covariance matrices are provided, demonstrating the\u0000superior performance of score-adjusted estimators over the traditional moment\u0000ones. Leveraging well-established connections between Dirichlet and\u0000Multivariate Gamma distributions, a novel class of estimators for the latter is\u0000introduced, referred to as \"Dirichlet-based moment-type estimators\". The\u0000general asymptotic variance-covariance matrix form for this estimator class is\u0000derived. To facilitate the application of these innovative estimators, an R\u0000package called estimators is developed and made publicly available.","PeriodicalId":501330,"journal":{"name":"arXiv - MATH - Statistics Theory","volume":"30 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138526184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Kernel-based measures of association between inputs and outputs based on ANOVA 基于方差分析的输入和输出之间关联的基于核的度量
Pub Date : 2023-11-25 DOI: arxiv-2311.14894
Matieyendou Lamboni
ANOVA decomposition of function with random input variables provides ANOVAfunctionals (AFs), which contain information about the contributions of theinput variables on the output variable(s). By embedding AFs into an appropriatereproducing kernel Hilbert space regarding their distributions, we propose anefficient statistical test of independence between the input variables andoutput variable(s). The resulting test statistic leads to new dependentmeasures of association between inputs and outputs that allow for i) dealingwith any distribution of AFs, including the Cauchy distribution, ii) accountingfor the necessary or desirable moments of AFs and the interactions among theinput variables. In uncertainty quantification for mathematical models, anumber of existing measures are special cases of this framework. We thenprovide unified and general global sensitivity indices and their consistentestimators, including asymptotic distributions. For Gaussian-distributed AFs,we obtain Sobol' indices and dependent generalized sensitivity indices usingquadratic kernels.
随机输入变量函数的方差分析分解提供了方差函数(AFs),其中包含有关输入变量对输出变量的贡献的信息。通过将AFs嵌入到有关其分布的适当再现核希尔伯特空间中,我们提出了输入变量和输出变量之间独立性的有效统计检验。由此产生的检验统计量导致输入和输出之间关联的新依赖度量,允许i)处理AFs的任何分布,包括柯西分布,ii)考虑AFs的必要或理想时刻以及输入变量之间的相互作用。在数学模型的不确定性量化中,现有测度的数量是该框架的特殊情况。然后,我们给出了统一的和通用的全局灵敏度指标及其一致性估计,包括渐近分布。对于高斯分布AFs,我们利用二次核得到Sobol指标和相关广义灵敏度指标。
{"title":"Kernel-based measures of association between inputs and outputs based on ANOVA","authors":"Matieyendou Lamboni","doi":"arxiv-2311.14894","DOIUrl":"https://doi.org/arxiv-2311.14894","url":null,"abstract":"ANOVA decomposition of function with random input variables provides ANOVA\u0000functionals (AFs), which contain information about the contributions of the\u0000input variables on the output variable(s). By embedding AFs into an appropriate\u0000reproducing kernel Hilbert space regarding their distributions, we propose an\u0000efficient statistical test of independence between the input variables and\u0000output variable(s). The resulting test statistic leads to new dependent\u0000measures of association between inputs and outputs that allow for i) dealing\u0000with any distribution of AFs, including the Cauchy distribution, ii) accounting\u0000for the necessary or desirable moments of AFs and the interactions among the\u0000input variables. In uncertainty quantification for mathematical models, a\u0000number of existing measures are special cases of this framework. We then\u0000provide unified and general global sensitivity indices and their consistent\u0000estimators, including asymptotic distributions. For Gaussian-distributed AFs,\u0000we obtain Sobol' indices and dependent generalized sensitivity indices using\u0000quadratic kernels.","PeriodicalId":501330,"journal":{"name":"arXiv - MATH - Statistics Theory","volume":"66 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138526142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Identification and Dimensionality Robust Test for Instrumental Variables Models 工具变量模型的识别和量纲稳健性检验
Pub Date : 2023-11-25 DOI: arxiv-2311.14892
Manu Navjeevan
I propose a new identification-robust test for the structural parameter in aheteroskedastic linear instrumental variables model. The proposed teststatistic is similar in spirit to a jackknife version of the K-statistic andthe resulting test has exact asymptotic size so long as an auxiliary parametercan be consistently estimated. This is possible under approximate sparsity evenwhen the number of instruments is much larger than the sample size. As thenumber of instruments is allowed, but not required, to be large, the limitingbehavior of the test statistic is difficult to examine via existing centrallimit theorems. Instead, I derive the asymptotic chi-squared distribution ofthe test statistic using a direct Gaussian approximation technique. To improvepower against certain alternatives, I propose a simple combination with thesup-score statistic of Belloni et al. (2012) based on a thresholding rule. Idemonstrate favorable size control and power properties in a simulation studyand apply the new methods to revisit the effect of social spillovers in movieconsumption.
本文提出了非方差线性工具变量模型中结构参数的一种新的识别鲁棒性检验方法。所提出的检验统计量在精神上类似于k统计量的折刀版本,并且只要可以一致地估计辅助参数,所得到的检验就具有精确的渐近大小。这在近似稀疏度下是可能的,即使仪器的数量远远大于样本量。由于仪器的数量是允许的,但不是必需的,很大,检验统计量的限制行为很难通过现有的中心极限定理来检验。相反,我使用直接高斯近似技术推导检验统计量的渐近卡方分布。为了提高对某些替代方案的能力,我提出了一个简单的结合Belloni等人(2012)基于阈值规则的up-score统计。在模拟研究中展示有利的尺寸控制和功率特性,并应用新方法重新审视电影消费中的社会溢出效应。
{"title":"An Identification and Dimensionality Robust Test for Instrumental Variables Models","authors":"Manu Navjeevan","doi":"arxiv-2311.14892","DOIUrl":"https://doi.org/arxiv-2311.14892","url":null,"abstract":"I propose a new identification-robust test for the structural parameter in a\u0000heteroskedastic linear instrumental variables model. The proposed test\u0000statistic is similar in spirit to a jackknife version of the K-statistic and\u0000the resulting test has exact asymptotic size so long as an auxiliary parameter\u0000can be consistently estimated. This is possible under approximate sparsity even\u0000when the number of instruments is much larger than the sample size. As the\u0000number of instruments is allowed, but not required, to be large, the limiting\u0000behavior of the test statistic is difficult to examine via existing central\u0000limit theorems. Instead, I derive the asymptotic chi-squared distribution of\u0000the test statistic using a direct Gaussian approximation technique. To improve\u0000power against certain alternatives, I propose a simple combination with the\u0000sup-score statistic of Belloni et al. (2012) based on a thresholding rule. I\u0000demonstrate favorable size control and power properties in a simulation study\u0000and apply the new methods to revisit the effect of social spillovers in movie\u0000consumption.","PeriodicalId":501330,"journal":{"name":"arXiv - MATH - Statistics Theory","volume":"EM-13 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138526172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - MATH - Statistics Theory
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1