首页 > 最新文献

Biometrika最新文献

英文 中文
Generalized kernel two-sample tests 广义核双样本测试
2区 数学 Q2 BIOLOGY Pub Date : 2023-11-14 DOI: 10.1093/biomet/asad068
Hoseung Song, Hao Chen
Summary Kernel two-sample tests have been widely used for multivariate data to test equality of distributions. However, existing tests based on mapping distributions into a reproducing kernel Hilbert space mainly target specific alternatives and do not work well for some scenarios when the dimension of the data is moderate to high due to the curse of dimensionality. We propose a new test statistic that makes use of a common pattern under moderate and high dimensions and achieves substantial power improvements over existing kernel two-sample tests for a wide range of alternatives. We also propose alternative testing procedures that maintain high power with low computational cost, offering easy off-the-shelf tools for large datasets. The new approaches are compared to other state-of-the-art tests under various settings and show good performance. We showcase the new approaches through two applications: the comparison of musks and non-musks using the shape of molecules, and the comparison of taxi trips starting from John F. Kennedy airport in consecutive months. All proposed methods are implemented in an R package kerTests.
核二样本检验被广泛用于多变量数据的分布是否相等。然而,现有的基于将分布映射到再现内核希尔伯特空间的测试主要针对特定的替代方案,并且由于维数的诅咒,当数据的维数从中等到高时,它不能很好地工作。我们提出了一种新的测试统计量,它利用了中等和高维下的通用模式,并在广泛的替代方案中实现了对现有内核双样本测试的实质性改进。我们还提出了替代测试程序,以低计算成本保持高功率,为大型数据集提供简单的现成工具。将新方法与其他最先进的测试方法在各种设置下进行了比较,并显示出良好的性能。我们通过两个应用程序展示了新方法:使用分子形状比较麝香和非麝香,以及比较从约翰肯尼迪机场连续几个月的出租车行程。所有建议的方法都在一个R包kerTests中实现。
{"title":"Generalized kernel two-sample tests","authors":"Hoseung Song, Hao Chen","doi":"10.1093/biomet/asad068","DOIUrl":"https://doi.org/10.1093/biomet/asad068","url":null,"abstract":"Summary Kernel two-sample tests have been widely used for multivariate data to test equality of distributions. However, existing tests based on mapping distributions into a reproducing kernel Hilbert space mainly target specific alternatives and do not work well for some scenarios when the dimension of the data is moderate to high due to the curse of dimensionality. We propose a new test statistic that makes use of a common pattern under moderate and high dimensions and achieves substantial power improvements over existing kernel two-sample tests for a wide range of alternatives. We also propose alternative testing procedures that maintain high power with low computational cost, offering easy off-the-shelf tools for large datasets. The new approaches are compared to other state-of-the-art tests under various settings and show good performance. We showcase the new approaches through two applications: the comparison of musks and non-musks using the shape of molecules, and the comparison of taxi trips starting from John F. Kennedy airport in consecutive months. All proposed methods are implemented in an R package kerTests.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"114 19","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134957329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Testing Serial Independence of Object-Valued Time Series 对象值时间序列序列独立性的检验
2区 数学 Q2 BIOLOGY Pub Date : 2023-11-11 DOI: 10.1093/biomet/asad069
Feiyu Jiang, Hanjia Gao, Xiaofeng Shao
Summary We propose a novel method for testing serial independence of object-valued time series in metric spaces, which is more general than Euclidean or Hilbert spaces. The proposed method is fully nonparametric, free of tuning parameters and can capture all nonlinear pairwise dependence. The key concept used in this paper is the distance covariance in metric spaces, which is extended to auto-distance covariance for object-valued time series. Furthermore, we propose a generalized spectral density function to account for pairwise dependence at all lags and construct a Cramér von-Mises type test statistic. New theoretical arguments are developed to establish the asymptotic behaviour of the test statistic. A wild bootstrap is also introduced to obtain the critical values of the nonpivotal limiting null distribution. Extensive numerical simulations and two real data applications on cumulative intraday returns and human mortality data are conducted to illustrate the effectiveness and versatility of our proposed test.
摘要本文提出了一种在度量空间中检验对象值时间序列序列独立性的新方法,该方法比欧几里得空间和希尔伯特空间更为普遍。该方法是完全非参数的,不需要调整参数,可以捕获所有的非线性两两依赖关系。本文使用的关键概念是度量空间中的距离协方差,并将其推广到对象值时间序列的自距离协方差。此外,我们提出了一个广义谱密度函数来解释所有滞后的两两依赖,并构造了一个cram - mises型检验统计量。提出了新的理论论据来建立检验统计量的渐近行为。为了得到非枢纽极限零分布的临界值,还引入了野自举法。广泛的数值模拟和两个实际数据应用累积日内收益和人类死亡率数据,以说明我们提出的测试的有效性和多功能性。
{"title":"Testing Serial Independence of Object-Valued Time Series","authors":"Feiyu Jiang, Hanjia Gao, Xiaofeng Shao","doi":"10.1093/biomet/asad069","DOIUrl":"https://doi.org/10.1093/biomet/asad069","url":null,"abstract":"Summary We propose a novel method for testing serial independence of object-valued time series in metric spaces, which is more general than Euclidean or Hilbert spaces. The proposed method is fully nonparametric, free of tuning parameters and can capture all nonlinear pairwise dependence. The key concept used in this paper is the distance covariance in metric spaces, which is extended to auto-distance covariance for object-valued time series. Furthermore, we propose a generalized spectral density function to account for pairwise dependence at all lags and construct a Cramér von-Mises type test statistic. New theoretical arguments are developed to establish the asymptotic behaviour of the test statistic. A wild bootstrap is also introduced to obtain the critical values of the nonpivotal limiting null distribution. Extensive numerical simulations and two real data applications on cumulative intraday returns and human mortality data are conducted to illustrate the effectiveness and versatility of our proposed test.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"6 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135087094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the optimality of score-driven models 分数驱动模型的最优性
2区 数学 Q2 BIOLOGY Pub Date : 2023-11-09 DOI: 10.1093/biomet/asad067
P Gorgi, C S A Lauria, A Luati
Summary Score-driven models have been recently introduced as a general framework to specify time-varying parameters of conditional densities. %The underlying idea is to specify a time-varying parameter as an autoregressive process with innovation given by the score of the associated log-likelihood. The score enjoys stochastic properties that make these models easy to implement and convenient to apply in several contexts, ranging from biostatistics to finance. Score-driven parameter updates have been shown to be optimal in terms of locally reducing a local version of the Kullback–Leibler divergence between the true conditional density and the postulated density of the model. A key limitation of such an optimality property is that it holds only locally both in the parameter space and sample space, yielding to a definition of local Kullback–Leibler divergence that is in fact not a divergence measure. The current paper shows that score-driven updates satisfy stronger optimality properties that are based on a global definition of Kullback–Leibler divergence. In particular, it is shown that score-driven updates reduce the distance between the expected updated parameter and the pseudo-true parameter. Furthermore, depending on the conditional density and the scaling of the score, the optimality result can hold globally over the parameter space, which can be viewed as a generalization of the monotonicity property of the stochastic gradient descent scheme. Several examples illustrate how the results derived in the paper apply to specific models under different easy-to-check assumptions, and provide a formal method to select the link-function and the scaling of the score.
摘要分数驱动模型最近被引入作为一个通用框架来指定条件密度的时变参数。基本思想是指定一个时变参数作为一个自回归过程,创新由相关的对数似然评分给出。分数具有随机特性,这使得这些模型易于实现,并且可以方便地应用于从生物统计学到金融等多种环境中。分数驱动的参数更新已被证明是最优的,因为它可以局部减少模型的真实条件密度和假设密度之间的Kullback-Leibler散度的局部版本。这种最优性的一个关键限制是,它只在局部参数空间和样本空间中成立,从而产生局部Kullback-Leibler散度的定义,实际上不是散度度量。当前的论文表明,分数驱动的更新满足基于Kullback-Leibler散度的全局定义的更强的最优性属性。特别是,分数驱动的更新减少了预期更新参数与伪真参数之间的距离。此外,根据条件密度和分数的缩放,最优性结果可以在参数空间上全局保持,这可以看作是随机梯度下降方案单调性的推广。几个例子说明了本文得出的结果如何应用于不同易于检查的假设下的特定模型,并提供了一种选择链接函数和评分标度的形式化方法。
{"title":"On the optimality of score-driven models","authors":"P Gorgi, C S A Lauria, A Luati","doi":"10.1093/biomet/asad067","DOIUrl":"https://doi.org/10.1093/biomet/asad067","url":null,"abstract":"Summary Score-driven models have been recently introduced as a general framework to specify time-varying parameters of conditional densities. %The underlying idea is to specify a time-varying parameter as an autoregressive process with innovation given by the score of the associated log-likelihood. The score enjoys stochastic properties that make these models easy to implement and convenient to apply in several contexts, ranging from biostatistics to finance. Score-driven parameter updates have been shown to be optimal in terms of locally reducing a local version of the Kullback–Leibler divergence between the true conditional density and the postulated density of the model. A key limitation of such an optimality property is that it holds only locally both in the parameter space and sample space, yielding to a definition of local Kullback–Leibler divergence that is in fact not a divergence measure. The current paper shows that score-driven updates satisfy stronger optimality properties that are based on a global definition of Kullback–Leibler divergence. In particular, it is shown that score-driven updates reduce the distance between the expected updated parameter and the pseudo-true parameter. Furthermore, depending on the conditional density and the scaling of the score, the optimality result can hold globally over the parameter space, which can be viewed as a generalization of the monotonicity property of the stochastic gradient descent scheme. Several examples illustrate how the results derived in the paper apply to specific models under different easy-to-check assumptions, and provide a formal method to select the link-function and the scaling of the score.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":" 28","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135291655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Graphical tools for selecting conditional instrumental sets 用于选择条件仪表组的图形工具
2区 数学 Q2 BIOLOGY Pub Date : 2023-11-03 DOI: 10.1093/biomet/asad066
Henckel, Leonard, Buttenschön, Martin, Maathuis, Marloes H.
Summary We consider the efficient estimation of total causal effects in the presence of unmeasured confounding using conditional instrumental sets. Specifically, we consider the two-stage least squares estimator in the setting of a linear structural equation model with correlated errors that is compatible with a known acyclic directed mixed graph. To set the stage for our results, we characterize the class of linearly valid conditional instrumental sets that yield consistent two-stage least squares estimators for the target total effect and derive a new asymptotic variance formula for these estimators. Equipped with these results, we provide three graphical tools for selecting more efficient linearly valid conditional instrumental sets. First, a graphical criterion that for certain pairs of linearly valid conditional instrumental sets identifies which of the two corresponding estimators has the smaller asymptotic variance. Second, an algorithm that greedily adds covariates that reduce the asymptotic variance to a given linearly valid conditional instrumental set. Third, a linearly valid conditional instrumental set for which the corresponding estimator has the smallest asymptotic variance that can be ensured with a graphical criterion.
我们考虑使用条件工具集在未测量的混杂存在下对总因果效应的有效估计。具体地说,我们考虑了与已知无环有向混合图相容的具有相关误差的线性结构方程模型的两阶段最小二乘估计。为了为我们的结果奠定基础,我们描述了一类线性有效的条件工具集,这些工具集对目标总效应产生一致的两阶段最小二乘估计,并为这些估计量导出了一个新的渐近方差公式。根据这些结果,我们提供了三种图形工具来选择更有效的线性有效条件工具集。首先,对于某些线性有效条件工具集对,一个图形准则确定两个相应的估计量中哪一个具有较小的渐近方差。其次,一种贪婪地添加协变量的算法,这些协变量可以减少给定线性有效条件工具集的渐近方差。第三,一个线性有效的条件工具集,其对应的估计量具有最小的渐近方差,可以用图形准则来保证。
{"title":"Graphical tools for selecting conditional instrumental sets","authors":"Henckel, Leonard, Buttenschön, Martin, Maathuis, Marloes H.","doi":"10.1093/biomet/asad066","DOIUrl":"https://doi.org/10.1093/biomet/asad066","url":null,"abstract":"Summary We consider the efficient estimation of total causal effects in the presence of unmeasured confounding using conditional instrumental sets. Specifically, we consider the two-stage least squares estimator in the setting of a linear structural equation model with correlated errors that is compatible with a known acyclic directed mixed graph. To set the stage for our results, we characterize the class of linearly valid conditional instrumental sets that yield consistent two-stage least squares estimators for the target total effect and derive a new asymptotic variance formula for these estimators. Equipped with these results, we provide three graphical tools for selecting more efficient linearly valid conditional instrumental sets. First, a graphical criterion that for certain pairs of linearly valid conditional instrumental sets identifies which of the two corresponding estimators has the smaller asymptotic variance. Second, an algorithm that greedily adds covariates that reduce the asymptotic variance to a given linearly valid conditional instrumental set. Third, a linearly valid conditional instrumental set for which the corresponding estimator has the smallest asymptotic variance that can be ensured with a graphical criterion.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"217 S694","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135775010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On inference in high-dimensional logistic regression models with separated data 分离数据的高维逻辑回归模型的推理
2区 数学 Q2 BIOLOGY Pub Date : 2023-11-02 DOI: 10.1093/biomet/asad065
R M Lewis, H S Battey
Abstract Direct use of the likelihood function typically produces severely biased estimates when the dimension of the parameter vector is large relative to the effective sample size. With linearly separable data generated from a logistic regression model, the loglikelihood function asymptotes and the maximum likelihood estimator does not exist. We show that an exact analysis for each regression coefficient produces half-infinite confidence sets for some parameters when the data are separable. Such conclusions are not vacuous, but an honest portrayal of the limitations of the data. Finite confidence sets are only achievable when additional, perhaps implicit, assumptions are made. Under a notional double-asymptotic regime in which the dimension of the logistic coefficient vector increases with the sample size, the present paper considers the implications of enforcing a natural constraint on the vector of logistic-transformed probabilities. We derive a relationship between the logistic coefficients and a notional parameter obtained as a probability limit of an ordinary least squares estimator. The latter exists even when the data are separable. Consistency is ascertained under weak conditions on the design matrix.
当参数向量的维数相对于有效样本量较大时,直接使用似然函数通常会产生严重的偏估计。对于由逻辑回归模型产生的线性可分数据,对数似然函数渐近且最大似然估计量不存在。我们表明,当数据可分离时,对每个回归系数的精确分析对某些参数产生半无限置信集。这样的结论不是空洞的,而是对数据局限性的诚实描述。有限的置信集只有在做出额外的(可能是隐含的)假设时才能实现。在逻辑系数向量的维数随样本量的增加而增加的概念双渐近状态下,本文考虑了对逻辑变换概率向量施加自然约束的含义。我们推导了逻辑系数与作为普通最小二乘估计的概率极限的一个概念参数之间的关系。即使数据是可分离的,后者也存在。在弱条件下确定了设计矩阵的一致性。
{"title":"On inference in high-dimensional logistic regression models with separated data","authors":"R M Lewis, H S Battey","doi":"10.1093/biomet/asad065","DOIUrl":"https://doi.org/10.1093/biomet/asad065","url":null,"abstract":"Abstract Direct use of the likelihood function typically produces severely biased estimates when the dimension of the parameter vector is large relative to the effective sample size. With linearly separable data generated from a logistic regression model, the loglikelihood function asymptotes and the maximum likelihood estimator does not exist. We show that an exact analysis for each regression coefficient produces half-infinite confidence sets for some parameters when the data are separable. Such conclusions are not vacuous, but an honest portrayal of the limitations of the data. Finite confidence sets are only achievable when additional, perhaps implicit, assumptions are made. Under a notional double-asymptotic regime in which the dimension of the logistic coefficient vector increases with the sample size, the present paper considers the implications of enforcing a natural constraint on the vector of logistic-transformed probabilities. We derive a relationship between the logistic coefficients and a notional parameter obtained as a probability limit of an ordinary least squares estimator. The latter exists even when the data are separable. Consistency is ascertained under weak conditions on the design matrix.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135975796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Nonparametric priors with full-range borrowing of information 具有全范围信息借用的非参数先验
2区 数学 Q2 BIOLOGY Pub Date : 2023-10-19 DOI: 10.1093/biomet/asad063
F Ascolani, B Franzolini, A Lijoi, I Prünster
Summary Modelling of the dependence structure across heterogeneous data is crucial for Bayesian inference since it directly impacts the borrowing of information. Despite the extensive advances over the last two decades, most available proposals only allow for nonnegative correlations. We derive a new class of dependent nonparametric priors that can induce correlations of any sign, thus introducing a new and more flexible idea of borrowing of information. This is achieved thanks to a novel concept, which we term hyper-tie, and represents a direct and simple measure of dependence. We investigate prior and posterior distributional properties of the model and develop algorithms to perform posterior inference. Illustrative examples on simulated and real data show that our proposal outperforms alternatives in terms of prediction and clustering.
跨异构数据的依赖结构建模对贝叶斯推理至关重要,因为它直接影响信息的借用。尽管在过去二十年中取得了广泛的进展,但大多数可用的建议只允许非负相关。我们导出了一类新的非参数依赖先验,它可以诱导任何符号的相关性,从而引入了一种新的更灵活的信息借用思想。这要归功于一个新颖的概念,我们称之为“超联系”,它代表了一种直接而简单的依赖度量。我们研究了模型的先验和后验分布特性,并开发了执行后验推理的算法。在模拟和真实数据上的示例表明,我们的建议在预测和聚类方面优于其他方案。
{"title":"Nonparametric priors with full-range borrowing of information","authors":"F Ascolani, B Franzolini, A Lijoi, I Prünster","doi":"10.1093/biomet/asad063","DOIUrl":"https://doi.org/10.1093/biomet/asad063","url":null,"abstract":"Summary Modelling of the dependence structure across heterogeneous data is crucial for Bayesian inference since it directly impacts the borrowing of information. Despite the extensive advances over the last two decades, most available proposals only allow for nonnegative correlations. We derive a new class of dependent nonparametric priors that can induce correlations of any sign, thus introducing a new and more flexible idea of borrowing of information. This is achieved thanks to a novel concept, which we term hyper-tie, and represents a direct and simple measure of dependence. We investigate prior and posterior distributional properties of the model and develop algorithms to perform posterior inference. Illustrative examples on simulated and real data show that our proposal outperforms alternatives in terms of prediction and clustering.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135729762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Likelihood-based Inference under Non-Convex Boundary Constraints 非凸边界约束下基于似然的推理
2区 数学 Q2 BIOLOGY Pub Date : 2023-10-19 DOI: 10.1093/biomet/asad062
J Y Wang, Z S YE, Y Chen
Summary Likelihood-based inference under nonconvex constraints on model parameters has become increasingly common in biomedical research. In this paper, we establish large-sample properties of the maximum likelihood estimator when the true parameter value lies at the boundary of a nonconvex parameter space. We further derive the asymptotic distribution of the likelihood ratio test statistic under nonconvex constraints on model parameters. A general Monte Carlo procedure for generating the limiting distribution is provided. The theoretical results are demonstrated by five examples in Anderson’s stereotype logistic regression model, genetic association studies, gene-environment interaction tests, cost-constrained linear regression and fairness-constrained linear regression.
基于模型参数非凸约束的似然推理在生物医学研究中越来越普遍。本文建立了真参数值位于非凸参数空间边界时最大似然估计量的大样本性质。进一步推导了模型参数非凸约束下似然比检验统计量的渐近分布。给出了生成极限分布的一般蒙特卡罗程序。通过安德森的刻板印象逻辑回归模型、遗传关联研究、基因-环境交互作用检验、成本约束线性回归和公平约束线性回归五个实例验证了理论结果。
{"title":"Likelihood-based Inference under Non-Convex Boundary Constraints","authors":"J Y Wang, Z S YE, Y Chen","doi":"10.1093/biomet/asad062","DOIUrl":"https://doi.org/10.1093/biomet/asad062","url":null,"abstract":"Summary Likelihood-based inference under nonconvex constraints on model parameters has become increasingly common in biomedical research. In this paper, we establish large-sample properties of the maximum likelihood estimator when the true parameter value lies at the boundary of a nonconvex parameter space. We further derive the asymptotic distribution of the likelihood ratio test statistic under nonconvex constraints on model parameters. A general Monte Carlo procedure for generating the limiting distribution is provided. The theoretical results are demonstrated by five examples in Anderson’s stereotype logistic regression model, genetic association studies, gene-environment interaction tests, cost-constrained linear regression and fairness-constrained linear regression.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135729962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On geometric convergence for MALA under simple conditions 简单条件下MALA的几何收敛性
2区 数学 Q2 BIOLOGY Pub Date : 2023-10-03 DOI: 10.1093/biomet/asad060
Alain Oliviero-Durmus, Éric Moulines
Summary While the Metropolis Adjusted Langevin Algorithm (MALA) is a popular and widely used Markov chain Monte Carlo method, very few papers derive conditions that ensure its convergence. In particular, to the authors' knowledge, assumptions that are both easy to verify and guarantee geometric convergence, are still missing. In this work, we establish V-uniformly geometric convergence for MALA under mild assumptions about the target distribution. Unlike previous work, we only consider tail and smoothness conditions for the potential associated with the target distribution. These conditions are quite common in the MCMC literature. Finally, we pay special attention to the dependence of the bounds we derive on the step size of the Euler-Maruyama discretization, which corresponds to the proposal Markov kernel of MALA.
Metropolis Adjusted Langevin Algorithm (MALA)是一种广泛应用的马尔可夫链蒙特卡罗算法,但很少有论文给出保证其收敛性的条件。特别是,据作者所知,那些既容易验证又保证几何收敛的假设仍然缺失。在对目标分布的温和假设下,我们建立了MALA的v -均匀几何收敛性。与以前的工作不同,我们只考虑与目标分布相关的势的尾部和平滑条件。这些情况在MCMC文献中很常见。最后,我们特别注意了我们得到的边界与Euler-Maruyama离散化的步长的依赖关系,这对应于MALA的建议马尔可夫核。
{"title":"On geometric convergence for MALA under simple conditions","authors":"Alain Oliviero-Durmus, Éric Moulines","doi":"10.1093/biomet/asad060","DOIUrl":"https://doi.org/10.1093/biomet/asad060","url":null,"abstract":"Summary While the Metropolis Adjusted Langevin Algorithm (MALA) is a popular and widely used Markov chain Monte Carlo method, very few papers derive conditions that ensure its convergence. In particular, to the authors' knowledge, assumptions that are both easy to verify and guarantee geometric convergence, are still missing. In this work, we establish V-uniformly geometric convergence for MALA under mild assumptions about the target distribution. Unlike previous work, we only consider tail and smoothness conditions for the potential associated with the target distribution. These conditions are quite common in the MCMC literature. Finally, we pay special attention to the dependence of the bounds we derive on the step size of the Euler-Maruyama discretization, which corresponds to the proposal Markov kernel of MALA.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135695601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient Evaluation of Natural Stochastic Policies in Offline Reinforcement Learning 离线强化学习中自然随机策略的有效评价
2区 数学 Q2 BIOLOGY Pub Date : 2023-09-27 DOI: 10.1093/biomet/asad059
Nathan Kallus, Masatoshi Uehara
We study the efficient off-policy evaluation of natural stochastic policies, which are defined in terms of deviations from the behavior policy. This is a departure from the literature on off-policy evaluation where most work consider the evaluation of explicitly specified policies. Crucially, offline reinforcement learning with natural stochastic policies can help alleviate issues of weak overlap, lead to policies that build upon current practice, and improve policies' implementability in practice. Compared with the classic case of a pre-specified evaluation policy, when evaluating natural stochastic policies, the efficiency bound, which measures the best-achievable estimation error, is inflated since the evaluation policy itself is unknown. In this paper, we derive the efficiency bounds of two major types of natural stochastic policies: tilting policies and modified treatment policies. We then propose efficient nonparametric estimators that attain the efficiency bounds under very lax conditions. These also enjoy a (partial) double robustness property.
我们研究了自然随机策略的有效离政策评估,自然随机策略是根据偏离行为策略来定义的。这与非政策评估的文献不同,在非政策评估中,大多数工作都考虑对明确规定的政策进行评估。至关重要的是,使用自然随机策略的离线强化学习可以帮助缓解弱重叠问题,产生基于当前实践的策略,并提高策略在实践中的可实施性。与经典的预先设定评估策略相比,在评估自然随机策略时,由于评估策略本身是未知的,衡量最佳可实现估计误差的效率界被夸大了。本文导出了两大类自然随机策略的效率界:倾斜策略和修正处理策略。然后我们提出有效的非参数估计,在非常宽松的条件下达到效率界。它们还具有(部分)双重鲁棒性。
{"title":"Efficient Evaluation of Natural Stochastic Policies in Offline Reinforcement Learning","authors":"Nathan Kallus, Masatoshi Uehara","doi":"10.1093/biomet/asad059","DOIUrl":"https://doi.org/10.1093/biomet/asad059","url":null,"abstract":"We study the efficient off-policy evaluation of natural stochastic policies, which are defined in terms of deviations from the behavior policy. This is a departure from the literature on off-policy evaluation where most work consider the evaluation of explicitly specified policies. Crucially, offline reinforcement learning with natural stochastic policies can help alleviate issues of weak overlap, lead to policies that build upon current practice, and improve policies' implementability in practice. Compared with the classic case of a pre-specified evaluation policy, when evaluating natural stochastic policies, the efficiency bound, which measures the best-achievable estimation error, is inflated since the evaluation policy itself is unknown. In this paper, we derive the efficiency bounds of two major types of natural stochastic policies: tilting policies and modified treatment policies. We then propose efficient nonparametric estimators that attain the efficiency bounds under very lax conditions. These also enjoy a (partial) double robustness property.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135471800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Selective machine learning of doubly robust functionals 双鲁棒函数的选择性机器学习
2区 数学 Q2 BIOLOGY Pub Date : 2023-09-26 DOI: 10.1093/biomet/asad055
Y Cui, E Tchetgen Tchetgen
Abstract While model selection is a well-studied topic in parametric and nonparametric regression or density estimation, selection of possibly high-dimensional nuisance parameters in semiparametric problems is far less developed. In this paper, we propose a selective machine learning framework for making inferences about a finite-dimensional functional defined on a semiparametric model, when the latter admits a doubly robust estimating function and several candidate machine learning algorithms are available for estimating the nuisance parameters. We introduce a new selection criterion aimed at bias reduction in estimating the functional of interest based on a novel definition of pseudo-risk inspired by the double robustness property. Intuitively, the proposed criterion selects a pair of learners with the smallest pseudo-risk, so that the estimated functional is least sensitive to perturbations of a nuisance parameter. We establish an oracle property for a multi-fold cross-validation version of the new selection criterion which states that our empirical criterion performs nearly as well as an oracle with a priori knowledge of the pseudo-risk for each pair of candidate learners. Finally, we apply the approach to model selection of a semiparametric estimator of average treatment effect given an ensemble of candidate machine learners to account for confounding in an observational study which we illustrate in simulations and a data application.
虽然模型选择是参数和非参数回归或密度估计中一个研究得很好的主题,但半参数问题中可能高维干扰参数的选择却远远不够发达。在本文中,我们提出了一种选择性机器学习框架,用于对半参数模型上定义的有限维泛函进行推断,当后者允许双鲁棒估计函数和几种候选机器学习算法可用于估计干扰参数。基于双重鲁棒性所启发的伪风险的新定义,我们引入了一种新的选择准则,目的是在估计利息泛函时减少偏差。直观地,该准则选择了一对伪风险最小的学习器,使得估计函数对干扰参数的扰动最不敏感。我们为新选择标准的多重交叉验证版本建立了一个oracle属性,该属性表明我们的经验标准几乎与每对候选学习者的伪风险先验知识的oracle一样好。最后,我们将该方法应用于给定候选机器学习者集合的平均治疗效果的半参数估计量的模型选择,以解释我们在模拟和数据应用中说明的观察性研究中的混淆。
{"title":"Selective machine learning of doubly robust functionals","authors":"Y Cui, E Tchetgen Tchetgen","doi":"10.1093/biomet/asad055","DOIUrl":"https://doi.org/10.1093/biomet/asad055","url":null,"abstract":"Abstract While model selection is a well-studied topic in parametric and nonparametric regression or density estimation, selection of possibly high-dimensional nuisance parameters in semiparametric problems is far less developed. In this paper, we propose a selective machine learning framework for making inferences about a finite-dimensional functional defined on a semiparametric model, when the latter admits a doubly robust estimating function and several candidate machine learning algorithms are available for estimating the nuisance parameters. We introduce a new selection criterion aimed at bias reduction in estimating the functional of interest based on a novel definition of pseudo-risk inspired by the double robustness property. Intuitively, the proposed criterion selects a pair of learners with the smallest pseudo-risk, so that the estimated functional is least sensitive to perturbations of a nuisance parameter. We establish an oracle property for a multi-fold cross-validation version of the new selection criterion which states that our empirical criterion performs nearly as well as an oracle with a priori knowledge of the pseudo-risk for each pair of candidate learners. Finally, we apply the approach to model selection of a semiparametric estimator of average treatment effect given an ensemble of candidate machine learners to account for confounding in an observational study which we illustrate in simulations and a data application.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134960403","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biometrika
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1