首页 > 最新文献

arXiv - STAT - Statistics Theory最新文献

英文 中文
RandALO: Out-of-sample risk estimation in no time flat RandALO:快速进行样本外风险评估
Pub Date : 2024-09-15 DOI: arxiv-2409.09781
Parth T. Nobel, Daniel LeJeune, Emmanuel J. Candès
Estimating out-of-sample risk for models trained on large high-dimensionaldatasets is an expensive but essential part of the machine learning process,enabling practitioners to optimally tune hyperparameters. Cross-validation (CV)serves as the de facto standard for risk estimation but poorly trades off highbias ($K$-fold CV) for computational cost (leave-one-out CV). We propose arandomized approximate leave-one-out (RandALO) risk estimator that is not onlya consistent estimator of risk in high dimensions but also less computationallyexpensive than $K$-fold CV. We support our claims with extensive simulations onsynthetic and real data and provide a user-friendly Python package implementingRandALO available on PyPI as randalo and at https://github.com/cvxgrp/randalo.
估算在大型高维数据集上训练的模型的样本外风险是机器学习过程中一个昂贵但重要的部分,它使实践者能够优化调整超参数。交叉验证(CV)是风险估计的事实标准,但在高偏差($K$-fold CV)与计算成本(leave-one-out CV)之间的权衡并不理想。我们提出了随机化近似撇除(RandALO)风险估计器,它不仅是高维度风险的一致估计器,而且计算成本低于 K$-fold CV。我们在合成数据和真实数据上进行了大量模拟,为我们的主张提供了支持,并提供了一个实现 RandALO 的用户友好型 Python 软件包,可在 PyPI 上以 randalo 的形式下载,也可在 https://github.com/cvxgrp/randalo 上下载。
{"title":"RandALO: Out-of-sample risk estimation in no time flat","authors":"Parth T. Nobel, Daniel LeJeune, Emmanuel J. Candès","doi":"arxiv-2409.09781","DOIUrl":"https://doi.org/arxiv-2409.09781","url":null,"abstract":"Estimating out-of-sample risk for models trained on large high-dimensional\u0000datasets is an expensive but essential part of the machine learning process,\u0000enabling practitioners to optimally tune hyperparameters. Cross-validation (CV)\u0000serves as the de facto standard for risk estimation but poorly trades off high\u0000bias ($K$-fold CV) for computational cost (leave-one-out CV). We propose a\u0000randomized approximate leave-one-out (RandALO) risk estimator that is not only\u0000a consistent estimator of risk in high dimensions but also less computationally\u0000expensive than $K$-fold CV. We support our claims with extensive simulations on\u0000synthetic and real data and provide a user-friendly Python package implementing\u0000RandALO available on PyPI as randalo and at https://github.com/cvxgrp/randalo.","PeriodicalId":501379,"journal":{"name":"arXiv - STAT - Statistics Theory","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142267812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accuracy of the Ensemble Kalman Filter in the Near-Linear Setting 近线性环境下卡尔曼滤波器的精度
Pub Date : 2024-09-15 DOI: arxiv-2409.09800
Edoardo Calvello, Pierre Monmarché, Andrew M. Stuart, Urbain Vaes
The filtering distribution captures the statistics of the state of adynamical system from partial and noisy observations. Classical particlefilters provably approximate this distribution in quite general settings;however they behave poorly for high dimensional problems, suffering weightcollapse. This issue is circumvented by the ensemble Kalman filter which is anequal-weight interacting particle system. However, this finite particle systemis only proven to approximate the true filter in the linear Gaussian case. Inpractice, however, it is applied in much broader settings; as a result,establishing its approximation properties more generally is important. Therehas been recent progress in the theoretical analysis of the algorithm,establishing stability and error estimates in non-Gaussian settings, but theassumptions on the dynamics and observation models rule out the unboundedvector fields that arise in practice and the analysis applies only to the meanfield limit of the ensemble Kalman filter. The present work establishes errorbounds between the filtering distribution and the finite particle ensembleKalman filter when the model exhibits linear growth.
滤波分布能从部分和噪声观测中捕捉到动态系统的状态统计。经典粒子滤波器可以在相当普遍的情况下逼近这种分布,但在高维问题上表现不佳,会出现权重崩溃。集合卡尔曼滤波器是一个等权重交互粒子系统,它可以规避这个问题。然而,这种有限粒子系统只被证明在线性高斯情况下近似于真正的滤波器。然而,在实际应用中,卡尔曼滤波器的应用范围要广泛得多;因此,更普遍地建立卡尔曼滤波器的近似特性非常重要。最近在算法的理论分析方面取得了一些进展,建立了非高斯环境下的稳定性和误差估计,但对动力学和观测模型的假设排除了实践中出现的无界向量场,分析仅适用于集合卡尔曼滤波器的均值场极限。当模型呈现线性增长时,本研究建立了滤波分布与有限粒子集合卡尔曼滤波器之间的误差边界。
{"title":"Accuracy of the Ensemble Kalman Filter in the Near-Linear Setting","authors":"Edoardo Calvello, Pierre Monmarché, Andrew M. Stuart, Urbain Vaes","doi":"arxiv-2409.09800","DOIUrl":"https://doi.org/arxiv-2409.09800","url":null,"abstract":"The filtering distribution captures the statistics of the state of a\u0000dynamical system from partial and noisy observations. Classical particle\u0000filters provably approximate this distribution in quite general settings;\u0000however they behave poorly for high dimensional problems, suffering weight\u0000collapse. This issue is circumvented by the ensemble Kalman filter which is an\u0000equal-weight interacting particle system. However, this finite particle system\u0000is only proven to approximate the true filter in the linear Gaussian case. In\u0000practice, however, it is applied in much broader settings; as a result,\u0000establishing its approximation properties more generally is important. There\u0000has been recent progress in the theoretical analysis of the algorithm,\u0000establishing stability and error estimates in non-Gaussian settings, but the\u0000assumptions on the dynamics and observation models rule out the unbounded\u0000vector fields that arise in practice and the analysis applies only to the mean\u0000field limit of the ensemble Kalman filter. The present work establishes error\u0000bounds between the filtering distribution and the finite particle ensemble\u0000Kalman filter when the model exhibits linear growth.","PeriodicalId":501379,"journal":{"name":"arXiv - STAT - Statistics Theory","volume":"20 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142267810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Asymptotics for irregularly observed long memory processes 不规则观测长记忆过程的渐近线
Pub Date : 2024-09-14 DOI: arxiv-2409.09498
Mohamedou Ould-Haye, Anne Philippe
We study the effect of observing a stationary process at irregular timepoints via a renewal process. We establish a sharp difference in the asymptoticbehaviour of the self-normalized sample mean of the observed process dependingon the renewal process. In particular, we show that if the renewal process hasa moderate heavy tail distribution then the limit is a so-called NormalVariance Mixture (NVM) and we characterize the randomized variance part of thelimiting NVM as an integral function of a L'evy stable motion. Otherwise, thenormalized sample mean will be asymptotically normal.
我们研究了通过更新过程在不规则时间点观测静止过程的效果。我们发现,观测过程的自归一化样本平均值的渐近行为与更新过程有显著差异。特别是,我们证明了如果更新过程具有适度的重尾分布,那么极限就是所谓的正态方差混集(NVM),并且我们将极限 NVM 的随机方差部分表征为 L'evy 稳定运动的积分函数。否则,归一化样本平均数将是渐近正态的。
{"title":"Asymptotics for irregularly observed long memory processes","authors":"Mohamedou Ould-Haye, Anne Philippe","doi":"arxiv-2409.09498","DOIUrl":"https://doi.org/arxiv-2409.09498","url":null,"abstract":"We study the effect of observing a stationary process at irregular time\u0000points via a renewal process. We establish a sharp difference in the asymptotic\u0000behaviour of the self-normalized sample mean of the observed process depending\u0000on the renewal process. In particular, we show that if the renewal process has\u0000a moderate heavy tail distribution then the limit is a so-called Normal\u0000Variance Mixture (NVM) and we characterize the randomized variance part of the\u0000limiting NVM as an integral function of a L'evy stable motion. Otherwise, the\u0000normalized sample mean will be asymptotically normal.","PeriodicalId":501379,"journal":{"name":"arXiv - STAT - Statistics Theory","volume":"10 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142267813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Statistical Viewpoint on Differential Privacy: Hypothesis Testing, Representation and Blackwell's Theorem 关于差异隐私的统计学观点:假设检验、表征和布莱克韦尔定理
Pub Date : 2024-09-14 DOI: arxiv-2409.09558
Weijie J. Su
Differential privacy is widely considered the formal privacy forprivacy-preserving data analysis due to its robust and rigorous guarantees,with increasingly broad adoption in public services, academia, and industry.Despite originating in the cryptographic context, in this review paper we arguethat, fundamentally, differential privacy can be considered a textit{pure}statistical concept. By leveraging a theorem due to David Blackwell, our focusis to demonstrate that the definition of differential privacy can be formallymotivated from a hypothesis testing perspective, thereby showing thathypothesis testing is not merely convenient but also the right language forreasoning about differential privacy. This insight leads to the definition of$f$-differential privacy, which extends other differential privacy definitionsthrough a representation theorem. We review techniques that render$f$-differential privacy a unified framework for analyzing privacy bounds indata analysis and machine learning. Applications of this differential privacydefinition to private deep learning, private convex optimization, shuffledmechanisms, and U.S.~Census data are discussed to highlight the benefits ofanalyzing privacy bounds under this framework compared to existingalternatives.
差分隐私因其稳健而严格的保证而被广泛认为是保护隐私的数据分析的正式隐私,并在公共服务、学术界和工业界得到越来越广泛的应用。尽管差分隐私起源于密码学背景,但在这篇综述论文中,我们认为从根本上讲,差分隐私可以被视为一个(文本{纯}统计概念。通过利用大卫-布莱克韦尔(David Blackwell)提出的一个定理,我们的重点是证明差分隐私的定义可以从假设检验的角度进行正式推导,从而表明假设检验不仅方便,而且是推理差分隐私的正确语言。这一见解引出了$f$差分隐私的定义,它通过表示定理扩展了其他差分隐私的定义。我们回顾了一些技术,这些技术使f$差分隐私成为分析数据分析和机器学习中隐私边界的统一框架。我们讨论了这一差分隐私定义在隐私深度学习、隐私凸优化、洗牌机制和美国人口普查数据中的应用,以突出与现有替代方法相比,在这一框架下分析隐私边界的好处。
{"title":"A Statistical Viewpoint on Differential Privacy: Hypothesis Testing, Representation and Blackwell's Theorem","authors":"Weijie J. Su","doi":"arxiv-2409.09558","DOIUrl":"https://doi.org/arxiv-2409.09558","url":null,"abstract":"Differential privacy is widely considered the formal privacy for\u0000privacy-preserving data analysis due to its robust and rigorous guarantees,\u0000with increasingly broad adoption in public services, academia, and industry.\u0000Despite originating in the cryptographic context, in this review paper we argue\u0000that, fundamentally, differential privacy can be considered a textit{pure}\u0000statistical concept. By leveraging a theorem due to David Blackwell, our focus\u0000is to demonstrate that the definition of differential privacy can be formally\u0000motivated from a hypothesis testing perspective, thereby showing that\u0000hypothesis testing is not merely convenient but also the right language for\u0000reasoning about differential privacy. This insight leads to the definition of\u0000$f$-differential privacy, which extends other differential privacy definitions\u0000through a representation theorem. We review techniques that render\u0000$f$-differential privacy a unified framework for analyzing privacy bounds in\u0000data analysis and machine learning. Applications of this differential privacy\u0000definition to private deep learning, private convex optimization, shuffled\u0000mechanisms, and U.S.~Census data are discussed to highlight the benefits of\u0000analyzing privacy bounds under this framework compared to existing\u0000alternatives.","PeriodicalId":501379,"journal":{"name":"arXiv - STAT - Statistics Theory","volume":"56 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142267818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Random-effects Approach to Regression Involving Many Categorical Predictors and Their Interactions 涉及多个分类预测因子及其交互作用的随机效应回归方法
Pub Date : 2024-09-14 DOI: arxiv-2409.09355
Hanmei Sun, Jiangshan Zhang, Jiming Jiang
Linear model prediction with a large number of potential predictors is bothstatistically and computationally challenging. The traditional approaches arelargely based on shrinkage selection/estimation methods, which are applicableeven when the number of potential predictors is (much) larger than the samplesize. A situation of the latter scenario occurs when the candidate predictorsinvolve many binary indicators corresponding to categories of some categoricalpredictors as well as their interactions. We propose an alternative approach tothe shrinkage prediction methods in such a case based on mixed modelprediction, which effectively treats combinations of the categorical effects asrandom effects. We establish theoretical validity of the proposed method, anddemonstrate empirically its advantage over the shrinkage methods. We alsodevelop measures of uncertainty for the proposed method and evaluate theirperformance empirically. A real-data example is considered.
使用大量潜在预测因子进行线性模型预测,在统计和计算上都具有挑战性。传统方法主要基于收缩选择/估计方法,这些方法即使在潜在预测因子数量(远远)大于样本数量时也适用。当候选预测因子涉及许多与某些分类预测因子类别相对应的二进制指标以及它们之间的交互作用时,就会出现后一种情况。在这种情况下,我们提出了一种基于混合模型预测的收缩预测方法的替代方法,它能有效地将分类效应的组合视为随机效应。我们建立了所提方法的理论有效性,并通过实证证明了它相对于收缩预测方法的优势。我们还为所提出的方法制定了不确定性度量,并对其性能进行了实证评估。我们还考虑了一个真实数据示例。
{"title":"A Random-effects Approach to Regression Involving Many Categorical Predictors and Their Interactions","authors":"Hanmei Sun, Jiangshan Zhang, Jiming Jiang","doi":"arxiv-2409.09355","DOIUrl":"https://doi.org/arxiv-2409.09355","url":null,"abstract":"Linear model prediction with a large number of potential predictors is both\u0000statistically and computationally challenging. The traditional approaches are\u0000largely based on shrinkage selection/estimation methods, which are applicable\u0000even when the number of potential predictors is (much) larger than the sample\u0000size. A situation of the latter scenario occurs when the candidate predictors\u0000involve many binary indicators corresponding to categories of some categorical\u0000predictors as well as their interactions. We propose an alternative approach to\u0000the shrinkage prediction methods in such a case based on mixed model\u0000prediction, which effectively treats combinations of the categorical effects as\u0000random effects. We establish theoretical validity of the proposed method, and\u0000demonstrate empirically its advantage over the shrinkage methods. We also\u0000develop measures of uncertainty for the proposed method and evaluate their\u0000performance empirically. A real-data example is considered.","PeriodicalId":501379,"journal":{"name":"arXiv - STAT - Statistics Theory","volume":"105 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142267817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bounding the probability of causality under ordinal outcomes 限定序数结果下的因果关系概率
Pub Date : 2024-09-14 DOI: arxiv-2409.09297
Hanmei Sun, Chengfeng Shi, Qiang Zhao
The probability of causation (PC) is often used in liability assessments. Ina legal context, for example, where a patient suffered the side effect aftertaking a medication and sued the pharmaceutical company as a result, the valueof the PC can help assess the likelihood that the side effect was caused by themedication, in other words, how likely it is that the patient will win thecase. Beyond the issue of legal disputes, the PC plays an equally large rolewhen one wants to go about explaining causal relationships between events thathave already occurred in other areas. This article begins by reviewing thedefinitions and bounds of the probability of causality for binary outcomes,then generalizes them to ordinal outcomes. It demonstrates that incorporatingadditional mediator variable information in a complete mediation analysisprovides a more refined bound compared to the simpler scenario where onlyexposure and outcome variables are considered.
因果关系概率(PC)通常用于责任评估。例如,在法律方面,当病人服用某种药物后出现副作用并因此起诉制药公司时,因果概率的值可以帮助评估副作用是由药物引起的可能性,换句话说,病人胜诉的可能性有多大。除了法律纠纷问题之外,当人们想要解释其他领域已经发生的事件之间的因果关系时,PC 同样发挥着重要作用。本文首先回顾了二元结果因果关系概率的定义和界限,然后将其推广到序数结果。文章证明,与只考虑暴露变量和结果变量的简单情况相比,在完整的中介分析中加入额外的中介变量信息能提供更精细的界限。
{"title":"Bounding the probability of causality under ordinal outcomes","authors":"Hanmei Sun, Chengfeng Shi, Qiang Zhao","doi":"arxiv-2409.09297","DOIUrl":"https://doi.org/arxiv-2409.09297","url":null,"abstract":"The probability of causation (PC) is often used in liability assessments. In\u0000a legal context, for example, where a patient suffered the side effect after\u0000taking a medication and sued the pharmaceutical company as a result, the value\u0000of the PC can help assess the likelihood that the side effect was caused by the\u0000medication, in other words, how likely it is that the patient will win the\u0000case. Beyond the issue of legal disputes, the PC plays an equally large role\u0000when one wants to go about explaining causal relationships between events that\u0000have already occurred in other areas. This article begins by reviewing the\u0000definitions and bounds of the probability of causality for binary outcomes,\u0000then generalizes them to ordinal outcomes. It demonstrates that incorporating\u0000additional mediator variable information in a complete mediation analysis\u0000provides a more refined bound compared to the simpler scenario where only\u0000exposure and outcome variables are considered.","PeriodicalId":501379,"journal":{"name":"arXiv - STAT - Statistics Theory","volume":"49 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142267814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Asymptotics of Wide Remedians 宽补数的渐近性
Pub Date : 2024-09-14 DOI: arxiv-2409.09528
Philip T. Labo
The remedian uses a $ktimes b$ matrix to approximate the median of $nleqb^{k}$ streaming input values by recursively replacing buffers of $b$ valueswith their medians, thereby ignoring its $200(lceil b/2rceil / b)^{k}%$ mostextreme inputs. Rousseeuw & Bassett (1990) and Chao & Lin (1993); Chen & Chen(2005) study the remedian's distribution as $krightarrowinfty$ and as$k,brightarrowinfty$. The remedian's breakdown point vanishes as$krightarrowinfty$, but approaches $(1/2)^{k}$ as $brightarrowinfty$. Westudy the remedian's robust-regime distribution as $brightarrowinfty$,deriving a normal distribution for standardized (mean, median, remedian,remedian rank) as $brightarrowinfty$, thereby illuminating the remedian'saccuracy in approximating the sample median. We derive the asymptoticefficiency of the remedian relative to the mean and the median. Finally, wediscuss the estimation of more than one quantile at once, proposing anasymptotic distribution for the random vector that results when we applyremedian estimation in parallel to the components of i.i.d. random vectors.
remedian使用一个$k/times b$矩阵来近似$nleqb^{k}$流输入值的中值,方法是递归地将$b$值的缓冲区替换为它们的中值,从而忽略其$200(lceil b/2rceil / b)^{k}%$最极端的输入。Rousseeuw & Bassett(1990)、Chao & Lin(1993)、Chen & Chen(2005)分别研究了 $krightarrowinfty$ 和 $k,b/rightarrow/infty$时的remedian分布。当 $krightarrowinfty$ 时,remedian 的崩溃点消失,但当 $brightarrowinfty$ 时,remedian 的崩溃点接近 $(1/2)^{k}$。我们以 $brightarrowinfty$ 的形式研究了remedian的稳健-时间分布,并以 $brightarrowinfty$的形式推导出标准化(均值、中位数、remedian、remedian rank)的正态分布,从而揭示了remedian在逼近样本中位数方面的准确性。我们推导出相对于均值和中位数的再中值的渐近效率。最后,我们讨论了同时估计多个量级的问题,提出了当我们对 i.i.d. 随机向量的分量并行应用remedian估计时所产生的随机向量的渐近分布。
{"title":"The Asymptotics of Wide Remedians","authors":"Philip T. Labo","doi":"arxiv-2409.09528","DOIUrl":"https://doi.org/arxiv-2409.09528","url":null,"abstract":"The remedian uses a $ktimes b$ matrix to approximate the median of $nleq\u0000b^{k}$ streaming input values by recursively replacing buffers of $b$ values\u0000with their medians, thereby ignoring its $200(lceil b/2rceil / b)^{k}%$ most\u0000extreme inputs. Rousseeuw & Bassett (1990) and Chao & Lin (1993); Chen & Chen\u0000(2005) study the remedian's distribution as $krightarrowinfty$ and as\u0000$k,brightarrowinfty$. The remedian's breakdown point vanishes as\u0000$krightarrowinfty$, but approaches $(1/2)^{k}$ as $brightarrowinfty$. We\u0000study the remedian's robust-regime distribution as $brightarrowinfty$,\u0000deriving a normal distribution for standardized (mean, median, remedian,\u0000remedian rank) as $brightarrowinfty$, thereby illuminating the remedian's\u0000accuracy in approximating the sample median. We derive the asymptotic\u0000efficiency of the remedian relative to the mean and the median. Finally, we\u0000discuss the estimation of more than one quantile at once, proposing an\u0000asymptotic distribution for the random vector that results when we apply\u0000remedian estimation in parallel to the components of i.i.d. random vectors.","PeriodicalId":501379,"journal":{"name":"arXiv - STAT - Statistics Theory","volume":"7 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142267811","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Locally sharp goodness-of-fit testing in sup norm for high-dimensional counts 高维计数的超常规局部尖锐拟合优度测试
Pub Date : 2024-09-13 DOI: arxiv-2409.08871
Subhodh Kotekal, Julien Chhor, Chao Gao
We consider testing the goodness-of-fit of a distribution againstalternatives separated in sup norm. We study the twin settings ofPoisson-generated count data with a large number of categories andhigh-dimensional multinomials. In previous studies of different separationmetrics, it has been found that the local minimax separation rate exhibitssubstantial heterogeneity and is a complicated function of the nulldistribution; the rate-optimal test requires careful tailoring to the null. Inthe setting of sup norm, this remains the case and we establish that the localminimax separation rate is determined by the finer decay behavior of thecategory rates. The upper bound is obtained by a test involving the samplemaximum, and the lower bound argument involves reducing the originalheteroskedastic null to an auxiliary homoskedastic null determined by the decayof the rates. Further, in a particular asymptotic setup, the sharp constantsare identified.
我们考虑用超常规分离的替代变量来测试分布的拟合优度。我们研究的是具有大量类别和高维多项式的泊松计数数据。在以前对不同分离度量的研究中,我们发现局部最小分离率表现出很大的异质性,并且是空分布的一个复杂函数;最优分离率检验需要对空分布进行仔细调整。在 sup norm 的设置中,情况依然如此,我们确定局部最小分离率是由类别率更精细的衰减行为决定的。上界是通过涉及样本最大值的检验得到的,而下界的论证涉及将原来的异方差空值简化为由速率衰减决定的辅助同方差空值。此外,在一个特定的渐近设置中,确定了尖锐常数。
{"title":"Locally sharp goodness-of-fit testing in sup norm for high-dimensional counts","authors":"Subhodh Kotekal, Julien Chhor, Chao Gao","doi":"arxiv-2409.08871","DOIUrl":"https://doi.org/arxiv-2409.08871","url":null,"abstract":"We consider testing the goodness-of-fit of a distribution against\u0000alternatives separated in sup norm. We study the twin settings of\u0000Poisson-generated count data with a large number of categories and\u0000high-dimensional multinomials. In previous studies of different separation\u0000metrics, it has been found that the local minimax separation rate exhibits\u0000substantial heterogeneity and is a complicated function of the null\u0000distribution; the rate-optimal test requires careful tailoring to the null. In\u0000the setting of sup norm, this remains the case and we establish that the local\u0000minimax separation rate is determined by the finer decay behavior of the\u0000category rates. The upper bound is obtained by a test involving the sample\u0000maximum, and the lower bound argument involves reducing the original\u0000heteroskedastic null to an auxiliary homoskedastic null determined by the decay\u0000of the rates. Further, in a particular asymptotic setup, the sharp constants\u0000are identified.","PeriodicalId":501379,"journal":{"name":"arXiv - STAT - Statistics Theory","volume":"209 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142267819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-dimensional regression with a count response 计数响应的高维回归
Pub Date : 2024-09-13 DOI: arxiv-2409.08821
Or Zilberman, Felix Abramovich
We consider high-dimensional regression with a count response modeled byPoisson or negative binomial generalized linear model (GLM). We propose apenalized maximum likelihood estimator with a properly chosen complexitypenalty and establish its adaptive minimaxity across models of varioussparsity. To make the procedure computationally feasible for high-dimensionaldata we consider its LASSO and SLOPE convex surrogates. Their performance isillustrated through simulated and real-data examples.
我们考虑了以泊松或负二项广义线性模型(GLM)为模型的计数响应的高维回归。我们提出了一个具有适当复杂度惩罚的最大似然估计器,并确定了它在各种稀疏模型中的自适应最小性。为了使该程序在高维数据的计算上可行,我们考虑了其 LASSO 和 SLOPE 凸代理。我们通过模拟和真实数据实例来说明它们的性能。
{"title":"High-dimensional regression with a count response","authors":"Or Zilberman, Felix Abramovich","doi":"arxiv-2409.08821","DOIUrl":"https://doi.org/arxiv-2409.08821","url":null,"abstract":"We consider high-dimensional regression with a count response modeled by\u0000Poisson or negative binomial generalized linear model (GLM). We propose a\u0000penalized maximum likelihood estimator with a properly chosen complexity\u0000penalty and establish its adaptive minimaxity across models of various\u0000sparsity. To make the procedure computationally feasible for high-dimensional\u0000data we consider its LASSO and SLOPE convex surrogates. Their performance is\u0000illustrated through simulated and real-data examples.","PeriodicalId":501379,"journal":{"name":"arXiv - STAT - Statistics Theory","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142268001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Self-Organized State-Space Models with Artificial Dynamics 人工动力学自组织状态空间模型
Pub Date : 2024-09-13 DOI: arxiv-2409.08928
Yuan Chen, Mathieu Gerber, Christophe Andrieu, Randal Douc
In this paper we consider a state-space model (SSM) parametrized by someparameter $theta$, and our aim is to perform joint parameter and stateinference. A simple idea to perform this task, which almost dates back to theorigin of the Kalman filter, is to replace the static parameter $theta$ by aMarkov chain $(theta_t)_{tgeq 0}$ on the parameter space and then to apply astandard filtering algorithm to the extended, or self-organized SSM. However,the practical implementation of this idea in a theoretically justified way hasremained an open problem. In this paper we fill this gap by introducing variouspossible constructions of the Markov chain $(theta_t)_{tgeq 0}$ that ensurethe validity of the self-organized SSM (SO-SSM) for joint parameter and stateinference. Notably, we show that theoretically valid SO-SSMs can be definedeven if $|mathrm{Var}(theta_{t}|theta_{t-1})|$ converges to 0 slowly as$trightarrowinfty$. This result is important since, as illustrated in ournumerical experiments, such models can be efficiently approximated usingstandard particle filter algorithms. While the idea studied in this work wasfirst introduced for online inference in SSMs, it has also been proved to beuseful for computing the maximum likelihood estimator (MLE) of a given SSM,since iterated filtering algorithms can be seen as particle filters applied toSO-SSMs for which the target parameter value is the MLE of interest. Based onthis observation, we also derive constructions of $(theta_t)_{tgeq 0}$ andtheoretical results tailored to these specific applications of SO-SSMs, and asa result, we introduce new iterated filtering algorithms. From a practicalpoint of view, the algorithms introduced in this work have the merit of beingsimple to implement and only requiring minimal tuning to perform well.
在本文中,我们考虑了一个由某个参数$theta$参数化的状态空间模型(SSM),我们的目的是执行联合参数和状态推断。执行这项任务的一个简单想法(几乎可以追溯到卡尔曼滤波器的起源)是用参数空间上的马尔可夫链 $(theta_t)_{tgeq 0}$来替换静态参数 $theta$,然后将标准滤波算法应用于扩展的或自组织的 SSM。然而,如何以理论上合理的方式实际实现这一想法仍是一个悬而未决的问题。本文通过引入马尔可夫链$(theta_t)_{tgeq 0}$的各种可能构造来填补这一空白,从而确保自组织SSM(SO-SSM)在联合参数和状态推理中的有效性。值得注意的是,我们证明,即使 $mathrm{Var}(theta_{t}|theta_{t-1})|$ 随着 $trightarrowinfty$ 缓慢收敛到 0,也可以定义理论上有效的 SO-SSM。这一结果非常重要,因为正如我们的数值实验所说明的,这种模型可以用标准粒子滤波算法有效地逼近。虽然这项工作中研究的想法最初是针对 SSM 的在线推断提出的,但它也被证明对计算给定 SSM 的最大似然估计器(MLE)很有用,因为迭代滤波算法可以看作是应用于 SSM 的粒子滤波器,其目标参数值就是感兴趣的 MLE。基于这一观察,我们还推导出了$(theta_t)_{tgeq 0}$的构造以及针对这些SO-SSMs特定应用的理论结果,并由此引入了新的迭代滤波算法。从实用的角度来看,这项工作中引入的算法具有实现简单、只需极少调整即可运行良好的优点。
{"title":"Self-Organized State-Space Models with Artificial Dynamics","authors":"Yuan Chen, Mathieu Gerber, Christophe Andrieu, Randal Douc","doi":"arxiv-2409.08928","DOIUrl":"https://doi.org/arxiv-2409.08928","url":null,"abstract":"In this paper we consider a state-space model (SSM) parametrized by some\u0000parameter $theta$, and our aim is to perform joint parameter and state\u0000inference. A simple idea to perform this task, which almost dates back to the\u0000origin of the Kalman filter, is to replace the static parameter $theta$ by a\u0000Markov chain $(theta_t)_{tgeq 0}$ on the parameter space and then to apply a\u0000standard filtering algorithm to the extended, or self-organized SSM. However,\u0000the practical implementation of this idea in a theoretically justified way has\u0000remained an open problem. In this paper we fill this gap by introducing various\u0000possible constructions of the Markov chain $(theta_t)_{tgeq 0}$ that ensure\u0000the validity of the self-organized SSM (SO-SSM) for joint parameter and state\u0000inference. Notably, we show that theoretically valid SO-SSMs can be defined\u0000even if $|mathrm{Var}(theta_{t}|theta_{t-1})|$ converges to 0 slowly as\u0000$trightarrowinfty$. This result is important since, as illustrated in our\u0000numerical experiments, such models can be efficiently approximated using\u0000standard particle filter algorithms. While the idea studied in this work was\u0000first introduced for online inference in SSMs, it has also been proved to be\u0000useful for computing the maximum likelihood estimator (MLE) of a given SSM,\u0000since iterated filtering algorithms can be seen as particle filters applied to\u0000SO-SSMs for which the target parameter value is the MLE of interest. Based on\u0000this observation, we also derive constructions of $(theta_t)_{tgeq 0}$ and\u0000theoretical results tailored to these specific applications of SO-SSMs, and as\u0000a result, we introduce new iterated filtering algorithms. From a practical\u0000point of view, the algorithms introduced in this work have the merit of being\u0000simple to implement and only requiring minimal tuning to perform well.","PeriodicalId":501379,"journal":{"name":"arXiv - STAT - Statistics Theory","volume":"20 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142267996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - STAT - Statistics Theory
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1