首页 > 最新文献

Journal of the Royal Statistical Society Series B-Statistical Methodology最新文献

英文 中文
Robust Detection of Watermarks for Large Language Models Under Human Edits. 人工编辑下大型语言模型水印的鲁棒检测。
IF 3.6 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2025-09-22 DOI: 10.1093/jrsssb/qkaf056
Xiang Li, Feng Ruan, Huiyuan Wang, Qi Long, Weijie J Su

Watermarking has offered an effective approach to distinguishing text generated by large language models (LLMs) from human-written text. However, the pervasive presence of human edits on LLM-generated text dilutes watermark signals, thereby significantly degrading detection performance of existing methods. In this paper, by modeling human edits through mixture model detection, we introduce a new method in the form of a truncated goodness-of-fit test for detecting watermarked text under human edits, which we refer to as Tr-GoF. We prove that the Tr-GoF test achieves optimality in robust detection of the Gumbel-max watermark in a certain asymptotic regime of substantial text modifications and vanishing watermark signals. Importantly, Tr-GoF achieves this optimality adaptively as it does not require precise knowledge of human edit levels or probabilistic specifications of the LLMs, in contrast to the optimal but impractical (Neyman-Pearson) likelihood ratio test. Moreover, we establish that the Tr-GoF test attains the highest detection efficiency rate in a certain regime of moderate text modifications. In stark contrast, we show that sum-based detection rules, as employed by existing methods, fail to achieve optimal robustness in both regimes because the additive nature of their statistics is less resilient to edit-induced noise. Finally, we demonstrate the competitive and sometimes superior empirical performance of the Tr-GoF test on both synthetic data and open-source LLMs in the OPT and LLaMA families.

水印技术提供了一种有效的方法来区分由大型语言模型(llm)生成的文本和人类编写的文本。然而,在llm生成的文本中普遍存在的人为编辑会稀释水印信号,从而大大降低现有方法的检测性能。本文通过混合模型检测对人为编辑进行建模,提出了一种截断拟合优度检验的方法来检测人为编辑下的水印文本,我们称之为Tr-GoF。我们证明了在大量文本修改和水印信号消失的一定渐近范围内,Tr-GoF检验在稳健检测Gumbel-max水印方面达到了最优性。重要的是,与最优但不切实际的(内曼-皮尔逊)似然比测试相比,Tr-GoF自适应地实现了这种最优性,因为它不需要对人类编辑水平或法学硕士的概率规格有精确的了解。此外,我们建立了Tr-GoF测试在适度文本修改的一定制度下达到最高的检测效率。与之形成鲜明对比的是,我们发现基于和的检测规则,正如现有方法所采用的那样,在这两种情况下都无法达到最优的鲁棒性,因为它们的统计数据的可加性对编辑引起的噪声的弹性较小。最后,我们证明了在OPT和LLaMA家族的合成数据和开源llm上,Tr-GoF测试具有竞争力,有时甚至更优的经验性能。
{"title":"Robust Detection of Watermarks for Large Language Models Under Human Edits.","authors":"Xiang Li, Feng Ruan, Huiyuan Wang, Qi Long, Weijie J Su","doi":"10.1093/jrsssb/qkaf056","DOIUrl":"10.1093/jrsssb/qkaf056","url":null,"abstract":"<p><p>Watermarking has offered an effective approach to distinguishing text generated by large language models (LLMs) from human-written text. However, the pervasive presence of human edits on LLM-generated text dilutes watermark signals, thereby significantly degrading detection performance of existing methods. In this paper, by modeling human edits through mixture model detection, we introduce a new method in the form of a truncated goodness-of-fit test for detecting watermarked text under human edits, which we refer to as Tr-GoF. We prove that the Tr-GoF test achieves optimality in robust detection of the Gumbel-max watermark in a certain asymptotic regime of substantial text modifications and vanishing watermark signals. Importantly, Tr-GoF achieves this optimality <i>adaptively</i> as it does not require precise knowledge of human edit levels or probabilistic specifications of the LLMs, in contrast to the optimal but impractical (Neyman-Pearson) likelihood ratio test. Moreover, we establish that the Tr-GoF test attains the highest detection efficiency rate in a certain regime of moderate text modifications. In stark contrast, we show that sum-based detection rules, as employed by existing methods, fail to achieve optimal robustness in both regimes because the additive nature of their statistics is less resilient to edit-induced noise. Finally, we demonstrate the competitive and sometimes superior empirical performance of the Tr-GoF test on both synthetic data and open-source LLMs in the OPT and LLaMA families.</p>","PeriodicalId":49982,"journal":{"name":"Journal of the Royal Statistical Society Series B-Statistical Methodology","volume":" ","pages":""},"PeriodicalIF":3.6,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12851586/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146088005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Covariate-assisted bounds on causal effects with instrumental variables. 工具变量因果效应的协变量辅助界。
IF 3.6 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2025-05-27 eCollection Date: 2025-11-01 DOI: 10.1093/jrsssb/qkaf028
Alexander W Levis, Matteo Bonvini, Zhenghao Zeng, Luke Keele, Edward H Kennedy

When an exposure of interest is confounded by unmeasured factors, an instrumental variable (IV) can be used to identify and estimate certain causal contrasts. Identification of the marginal average treatment effect (ATE) from IVs relies on strong untestable structural assumptions. When one is unwilling to assert such structure, IVs can nonetheless be used to construct bounds on the ATE. Famously, Alexander Balke and Judea Pearl proved tight bounds on the ATE for a binary outcome, in a randomized trial with noncompliance and no covariate information. We demonstrate how these bounds remain useful in observational settings with baseline confounders of the IV, as well as randomized trials with measured baseline covariates. The resulting bounds on the ATE are nonsmooth functionals, and thus standard nonparametric efficiency theory is not immediately applicable. To remedy this, we propose (1) under a novel margin condition, influence function-based estimators of the bounds that can attain parametric convergence rates when the nuisance functions are modelled flexibly, and (2) estimators of smooth approximations of these bounds. We propose extensions to continuous outcomes, explore finite sample properties in simulations, and illustrate the proposed estimators in an observational study targeting the effect of higher education on wages.

当感兴趣的暴露被未测量的因素混淆时,可以使用工具变量(IV)来识别和估计某些因果对比。从IVs中识别边际平均处理效果(ATE)依赖于强大的不可检验的结构假设。当人们不愿意断言这样的结构时,iv仍然可以用来构造ATE上的边界。著名的是,Alexander Balke和Judea Pearl在一项随机试验中证明了ATE对二元结果的严格限制,该试验没有依从性和协变量信息。我们证明了这些界限如何在具有IV基线混杂因素的观察设置中以及具有测量基线协变量的随机试验中仍然有用。所得的ATE边界是非光滑泛函,因此标准的非参数效率理论不能立即适用。为了弥补这一点,我们提出(1)在一种新的边界条件下,当讨厌函数被灵活建模时,可以获得参数收敛率的基于影响函数的界估计,以及(2)这些界的光滑近似估计。我们提出了对连续结果的扩展,在模拟中探索有限样本性质,并在一项针对高等教育对工资影响的观察性研究中说明了所提出的估计量。
{"title":"Covariate-assisted bounds on causal effects with instrumental variables.","authors":"Alexander W Levis, Matteo Bonvini, Zhenghao Zeng, Luke Keele, Edward H Kennedy","doi":"10.1093/jrsssb/qkaf028","DOIUrl":"10.1093/jrsssb/qkaf028","url":null,"abstract":"<p><p>When an exposure of interest is confounded by unmeasured factors, an instrumental variable (IV) can be used to identify and estimate certain causal contrasts. Identification of the marginal average treatment effect (ATE) from IVs relies on strong untestable structural assumptions. When one is unwilling to assert such structure, IVs can nonetheless be used to construct bounds on the ATE. Famously, Alexander Balke and Judea Pearl proved tight bounds on the ATE for a binary outcome, in a randomized trial with noncompliance and no covariate information. We demonstrate how these bounds remain useful in observational settings with baseline confounders of the IV, as well as randomized trials with measured baseline covariates. The resulting bounds on the ATE are nonsmooth functionals, and thus standard nonparametric efficiency theory is not immediately applicable. To remedy this, we propose (1) under a novel margin condition, influence function-based estimators of the bounds that can attain parametric convergence rates when the nuisance functions are modelled flexibly, and (2) estimators of smooth approximations of these bounds. We propose extensions to continuous outcomes, explore finite sample properties in simulations, and illustrate the proposed estimators in an observational study targeting the effect of higher education on wages.</p>","PeriodicalId":49982,"journal":{"name":"Journal of the Royal Statistical Society Series B-Statistical Methodology","volume":"87 5","pages":"1508-1527"},"PeriodicalIF":3.6,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12602419/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145507800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A statistical view of column subset selection. 列子集选择的统计视图。
IF 3.6 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2025-05-16 DOI: 10.1093/jrsssb/qkaf023
Anav Sood, Trevor Hastie

We consider the problem of selecting a small subset of representative variables from a large dataset. In the computer science literature, this dimensionality reduction problem is typically formalized as column subset selection (CSS). Meanwhile, the typical statistical formalization is to find an information-maximizing set of principal variables. This paper shows that these two approaches are equivalent, and moreover, both can be viewed as maximum-likelihood estimation within a certain semi-parametric model. Within this model, we establish suitable conditions under which the CSS estimate is consistent in high dimensions, specifically in the proportional asymptotic regime where the number of variables over the sample size converges to a constant. Using these connections, we show how to efficiently (1) perform CSS using only summary statistics from the original dataset; (2) perform CSS in the presence of missing and/or censored data; and (3) select the subset size for CSS in a hypothesis testing framework.

我们考虑从大型数据集中选择代表性变量的小子集的问题。在计算机科学文献中,这种降维问题通常形式化为列子集选择(CSS)。同时,典型的统计形式化是寻找一个信息最大化的主变量集。本文证明了这两种方法是等价的,并且都可以看作是某半参数模型内的极大似然估计。在该模型中,我们建立了适当的条件,在这些条件下,CSS估计在高维上是一致的,特别是在比例渐近状态下,其中变量的数量在样本大小上收敛到一个常数。使用这些连接,我们展示了如何高效地(1)仅使用原始数据集的汇总统计数据执行CSS;(2)在存在缺失和/或删除数据的情况下执行CSS;(3)在假设检验框架中选择CSS的子集大小。
{"title":"A statistical view of column subset selection.","authors":"Anav Sood, Trevor Hastie","doi":"10.1093/jrsssb/qkaf023","DOIUrl":"10.1093/jrsssb/qkaf023","url":null,"abstract":"<p><p>We consider the problem of selecting a small subset of representative variables from a large dataset. In the computer science literature, this dimensionality reduction problem is typically formalized as column subset selection (CSS). Meanwhile, the typical statistical formalization is to find an information-maximizing set of principal variables. This paper shows that these two approaches are equivalent, and moreover, both can be viewed as maximum-likelihood estimation within a certain semi-parametric model. Within this model, we establish suitable conditions under which the CSS estimate is consistent in high dimensions, specifically in the proportional asymptotic regime where the number of variables over the sample size converges to a constant. Using these connections, we show how to efficiently (1) perform CSS using only summary statistics from the original dataset; (2) perform CSS in the presence of missing and/or censored data; and (3) select the subset size for CSS in a hypothesis testing framework.</p>","PeriodicalId":49982,"journal":{"name":"Journal of the Royal Statistical Society Series B-Statistical Methodology","volume":" ","pages":""},"PeriodicalIF":3.6,"publicationDate":"2025-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12288642/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144734981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Product Centred Dirichlet Processes for Bayesian Multiview Clustering. 贝叶斯多视图聚类的产品中心Dirichlet过程。
IF 3.6 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2025-04-30 DOI: 10.1093/jrsssb/qkaf021
Alexander Dombowsky, David B Dunson

While there is an immense literature on Bayesian methods for clustering, the multiview case has received little attention. This problem focuses on obtaining distinct but statistically dependent clusterings in a common set of entities for different data types. For example, clustering patients into subgroups with subgroup membership varying according to the domain of the patient variables. A challenge is how to model the across-view dependence between the partitions of patients into subgroups. The complexities of the partition space make standard methods to model dependence, such as correlation, infeasible. In this article, we propose CLustering with Independence Centring (CLIC), a clustering prior that uses a single parameter to explicitly model dependence between clusterings across views. CLIC is induced by the product centred Dirichlet process (PCDP), a novel hierarchical prior that bridges between independent and equivalent partitions. We show appealing theoretic properties, provide a finite approximation and prove its accuracy, present a marginal Gibbs sampler for posterior computation, and derive closed form expressions for the marginal and joint partition distributions for the CLIC model. On synthetic data and in an application to epidemiology, CLIC accurately characterizes view-specific partitions while providing inference on the dependence level.

虽然有大量关于贝叶斯聚类方法的文献,但多视图情况很少受到关注。这个问题的重点是在一组不同数据类型的公共实体中获得不同的但统计上依赖的聚类。例如,将患者聚类到子组中,子组的成员资格根据患者变量的域而变化。一个挑战是如何在将患者划分为亚组之间建立跨视图依赖性模型。划分空间的复杂性使得对相关性等相关性建模的标准方法不可行。在本文中,我们提出了具有独立中心的聚类(CLIC),这是一种聚类先验,它使用单个参数来显式地模拟跨视图聚类之间的依赖关系。CLIC是由以产品为中心的狄利克雷过程(PCDP)引起的,PCDP是一种新颖的分层先验,在独立和等效分区之间建立了桥梁。我们展示了吸引人的理论性质,提供了一个有限近似并证明了它的准确性,给出了一个用于后验计算的边际Gibbs采样器,并导出了CLIC模型的边际和联合划分分布的封闭形式表达式。在合成数据和流行病学应用中,CLIC准确地描述了特定于视图的分区,同时提供了依赖程度的推断。
{"title":"Product Centred Dirichlet Processes for Bayesian Multiview Clustering.","authors":"Alexander Dombowsky, David B Dunson","doi":"10.1093/jrsssb/qkaf021","DOIUrl":"10.1093/jrsssb/qkaf021","url":null,"abstract":"<p><p>While there is an immense literature on Bayesian methods for clustering, the multiview case has received little attention. This problem focuses on obtaining distinct but statistically dependent clusterings in a common set of entities for different data types. For example, clustering patients into subgroups with subgroup membership varying according to the domain of the patient variables. A challenge is how to model the across-view dependence between the partitions of patients into subgroups. The complexities of the partition space make standard methods to model dependence, such as correlation, infeasible. In this article, we propose CLustering with Independence Centring (CLIC), a clustering prior that uses a single parameter to explicitly model dependence between clusterings across views. CLIC is induced by the product centred Dirichlet process (PCDP), a novel hierarchical prior that bridges between independent and equivalent partitions. We show appealing theoretic properties, provide a finite approximation and prove its accuracy, present a marginal Gibbs sampler for posterior computation, and derive closed form expressions for the marginal and joint partition distributions for the CLIC model. On synthetic data and in an application to epidemiology, CLIC accurately characterizes view-specific partitions while providing inference on the dependence level.</p>","PeriodicalId":49982,"journal":{"name":"Journal of the Royal Statistical Society Series B-Statistical Methodology","volume":" ","pages":""},"PeriodicalIF":3.6,"publicationDate":"2025-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12392789/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144976818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Two-phase rejective sampling and its asymptotic properties. 两相拒绝抽样及其渐近性质。
IF 3.6 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2025-02-10 DOI: 10.1093/jrsssb/qkaf002
Shu Yang, Peng Ding

Rejective sampling improves design and estimation efficiency of single-phase sampling when auxiliary information in a finite population is available. When such auxiliary information is unavailable, we propose to use two-phase rejective sampling (TPRS), which involves measuring auxiliary variables for the sample of units in the first phase, followed by the implementation of rejective sampling for the outcome in the second phase. We explore the asymptotic design properties of double expansion and regression estimators under TPRS. We show that TPRS enhances the efficiency of the double-expansion estimator, rendering it comparable to a regression estimator. We further refine the design to accommodate varying importance of covariates and extend it to multi-phase sampling. We start with the theory for the population mean and then extend the theory to parameters defined by general estimating equations. Our asymptotic results for TPRS immediately cover the existing single-phase rejective sampling, under which the asymptotic theory has not been fully established.

当辅助信息有限时,拒绝抽样提高了单相抽样的设计和估计效率。当这些辅助信息不可获得时,我们建议使用两阶段拒绝抽样(TPRS),其中包括在第一阶段测量单位样本的辅助变量,然后在第二阶段对结果实施拒绝抽样。研究了TPRS条件下双展开式和回归估计量的渐近设计性质。我们证明了TPRS提高了双展开估计器的效率,使其与回归估计器相当。我们进一步完善设计,以适应不同的协变量的重要性,并将其扩展到多相采样。我们从总体均值的理论开始,然后将理论推广到由一般估计方程定义的参数。我们对TPRS的渐近结果立即覆盖了现有的单相拒绝抽样,在这种情况下渐近理论尚未完全建立。
{"title":"Two-phase rejective sampling and its asymptotic properties.","authors":"Shu Yang, Peng Ding","doi":"10.1093/jrsssb/qkaf002","DOIUrl":"10.1093/jrsssb/qkaf002","url":null,"abstract":"<p><p>Rejective sampling improves design and estimation efficiency of single-phase sampling when auxiliary information in a finite population is available. When such auxiliary information is unavailable, we propose to use two-phase rejective sampling (TPRS), which involves measuring auxiliary variables for the sample of units in the first phase, followed by the implementation of rejective sampling for the outcome in the second phase. We explore the asymptotic design properties of double expansion and regression estimators under TPRS. We show that TPRS enhances the efficiency of the double-expansion estimator, rendering it comparable to a regression estimator. We further refine the design to accommodate varying importance of covariates and extend it to multi-phase sampling. We start with the theory for the population mean and then extend the theory to parameters defined by general estimating equations. Our asymptotic results for TPRS immediately cover the existing single-phase rejective sampling, under which the asymptotic theory has not been fully established.</p>","PeriodicalId":49982,"journal":{"name":"Journal of the Royal Statistical Society Series B-Statistical Methodology","volume":" ","pages":""},"PeriodicalIF":3.6,"publicationDate":"2025-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12355938/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144876438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Probabilistic Richardson extrapolation. 概率Richardson外推法。
IF 3.1 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2024-12-26 eCollection Date: 2025-04-01 DOI: 10.1093/jrsssb/qkae098
Chris J Oates, Toni Karvonen, Aretha L Teckentrup, Marina Strocchi, Steven A Niederer

For over a century, extrapolation methods have provided a powerful tool to improve the convergence order of a numerical method. However, these tools are not well-suited to modern computer codes, where multiple continua are discretized and convergence orders are not easily analysed. To address this challenge, we present a probabilistic perspective on Richardson extrapolation, a point of view that unifies classical extrapolation methods with modern multi-fidelity modelling, and handles uncertain convergence orders by allowing these to be statistically estimated. The approach is developed using Gaussian processes, leading to Gauss-Richardson Extrapolation. Conditions are established under which extrapolation using the conditional mean achieves a polynomial (or even an exponential) speed-up compared to the original numerical method. Further, the probabilistic formulation unlocks the possibility of experimental design, casting the selection of fidelities as a continuous optimization problem, which can then be (approximately) solved. A case study involving a computational cardiac model demonstrates that practical gains in accuracy can be achieved using the GRE method.

一个多世纪以来,外推法提供了一个强大的工具,以提高数值方法的收敛顺序。然而,这些工具不太适合现代计算机代码,因为在现代计算机代码中,多个连续序列是离散的,并且不容易分析收敛阶。为了应对这一挑战,我们提出了Richardson外推的概率观点,这种观点将经典外推方法与现代多保真度建模相结合,并通过允许统计估计来处理不确定的收敛顺序。该方法是使用高斯过程开发的,导致高斯-理查德森外推。与原始数值方法相比,建立了使用条件均值外推实现多项式(甚至指数)加速的条件。此外,概率公式解锁了实验设计的可能性,将保真度的选择作为一个连续的优化问题,然后可以(近似)解决。一个涉及计算心脏模型的案例研究表明,使用GRE方法可以实现精度的实际提高。
{"title":"Probabilistic Richardson extrapolation.","authors":"Chris J Oates, Toni Karvonen, Aretha L Teckentrup, Marina Strocchi, Steven A Niederer","doi":"10.1093/jrsssb/qkae098","DOIUrl":"https://doi.org/10.1093/jrsssb/qkae098","url":null,"abstract":"<p><p>For over a century, extrapolation methods have provided a powerful tool to improve the convergence order of a numerical method. However, these tools are not well-suited to modern computer codes, where multiple continua are discretized and convergence orders are not easily analysed. To address this challenge, we present a probabilistic perspective on Richardson extrapolation, a point of view that unifies classical extrapolation methods with modern multi-fidelity modelling, and handles uncertain convergence orders by allowing these to be statistically estimated. The approach is developed using Gaussian processes, leading to <i>Gauss-Richardson Extrapolation</i>. Conditions are established under which extrapolation using the conditional mean achieves a polynomial (or even an exponential) speed-up compared to the original numerical method. Further, the probabilistic formulation unlocks the possibility of experimental design, casting the selection of fidelities as a continuous optimization problem, which can then be (approximately) solved. A case study involving a computational cardiac model demonstrates that practical gains in accuracy can be achieved using the GRE method.</p>","PeriodicalId":49982,"journal":{"name":"Journal of the Royal Statistical Society Series B-Statistical Methodology","volume":"87 2","pages":"457-479"},"PeriodicalIF":3.1,"publicationDate":"2024-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11985099/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144058583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust evaluation of longitudinal surrogate markers with censored data. 用删节数据对纵向替代标记进行稳健评估。
IF 3.6 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2024-12-26 eCollection Date: 2025-07-01 DOI: 10.1093/jrsssb/qkae119
Denis Agniel, Layla Parast

The development of statistical methods to evaluate surrogate markers is an active area of research. In many clinical settings, the surrogate marker is not simply a single measurement but is instead a longitudinal trajectory of measurements over time, e.g. fasting plasma glucose measured every 6 months for 3 years. In general, available methods developed for the single-surrogate setting cannot accommodate a longitudinal surrogate marker. Furthermore, many of the methods have not been developed for use with primary outcomes that are time-to-event outcomes and/or subject to censoring. In this paper, we propose robust methods to evaluate a longitudinal surrogate marker in a censored time-to-event outcome setting. Specifically, we propose a method to define and estimate the proportion of the treatment effect on a censored primary outcome that is explained by the treatment effect on a longitudinal surrogate marker measured up to time t 0 . We accommodate both potential censoring of the primary outcome and of the surrogate marker. A simulation study demonstrates a good finite-sample performance of our proposed methods. We illustrate our procedures by examining repeated measures of fasting plasma glucose, a surrogate marker for diabetes diagnosis, using data from the diabetes prevention programme.

开发评估替代标记物的统计方法是一个活跃的研究领域。在许多临床环境中,替代指标不是简单的单一测量,而是随时间推移的纵向测量轨迹,例如,每6个月测量一次空腹血糖,持续3年。一般来说,为单代理设置开发的可用方法不能容纳纵向代理标记。此外,许多方法还没有开发出用于主要结果的时间-事件结果和/或受审查的结果。在本文中,我们提出了稳健的方法来评估一个纵向代理标记在一个审查的时间到事件的结果设置。具体而言,我们提出了一种方法来定义和估计治疗效果对审查的主要结局的比例,该比例由治疗对纵向替代标记物的影响解释,测量时间为t0。我们同时考虑了对主要结局和替代标记物的潜在审查。仿真研究证明了我们提出的方法具有良好的有限样本性能。我们通过检查空腹血糖(糖尿病诊断的替代标志物)的重复测量来说明我们的程序,使用来自糖尿病预防计划的数据。
{"title":"Robust evaluation of longitudinal surrogate markers with censored data.","authors":"Denis Agniel, Layla Parast","doi":"10.1093/jrsssb/qkae119","DOIUrl":"10.1093/jrsssb/qkae119","url":null,"abstract":"<p><p>The development of statistical methods to evaluate surrogate markers is an active area of research. In many clinical settings, the surrogate marker is not simply a single measurement but is instead a longitudinal trajectory of measurements over time, e.g. fasting plasma glucose measured every 6 months for 3 years. In general, available methods developed for the single-surrogate setting cannot accommodate a longitudinal surrogate marker. Furthermore, many of the methods have not been developed for use with primary outcomes that are time-to-event outcomes and/or subject to censoring. In this paper, we propose robust methods to evaluate a longitudinal surrogate marker in a censored time-to-event outcome setting. Specifically, we propose a method to define and estimate the proportion of the treatment effect on a censored primary outcome that is explained by the treatment effect on a longitudinal surrogate marker measured up to time <math><msub><mi>t</mi> <mn>0</mn></msub> </math> . We accommodate both potential censoring of the primary outcome and of the surrogate marker. A simulation study demonstrates a good finite-sample performance of our proposed methods. We illustrate our procedures by examining repeated measures of fasting plasma glucose, a surrogate marker for diabetes diagnosis, using data from the diabetes prevention programme.</p>","PeriodicalId":49982,"journal":{"name":"Journal of the Royal Statistical Society Series B-Statistical Methodology","volume":"87 3","pages":"891-907"},"PeriodicalIF":3.6,"publicationDate":"2024-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12256123/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144638525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust angle-based transfer learning in high dimensions. 基于角度的高维鲁棒迁移学习。
IF 3.6 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2024-12-03 eCollection Date: 2025-07-01 DOI: 10.1093/jrsssb/qkae111
Tian Gu, Yi Han, Rui Duan

Transfer learning improves target model performance by leveraging data from related source populations, especially when target data are scarce. This study addresses the challenge of training high-dimensional regression models with limited target data in the presence of heterogeneous source populations. We focus on a practical setting where only parameter estimates of pretrained source models are available, rather than individual-level source data. For a single source model, we propose a novel angle-based transfer learning (angleTL) method that leverages concordance between source and target model parameters. AngleTL adapts to the signal strength of the target model, unifies several benchmark methods, and mitigates negative transfer when between-population heterogeneity is large. We extend angleTL to incorporate multiple source models, accounting for varying levels of relevance among them. Our high-dimensional asymptotic analysis provides insights into when a source model benefits the target model and demonstrates the superiority of angleTL over other methods. Extensive simulations validate these findings and highlight the feasibility of applying angleTL to transfer genetic risk prediction models across multiple biobanks.

迁移学习通过利用来自相关源群体的数据来提高目标模型的性能,特别是在目标数据稀缺的情况下。本研究解决了在存在异质源种群的情况下,用有限的目标数据训练高维回归模型的挑战。我们关注的是一个实际的设置,其中只有预训练源模型的参数估计可用,而不是个人层面的源数据。对于单源模型,我们提出了一种新的基于角度的迁移学习(angleTL)方法,该方法利用源模型和目标模型参数之间的一致性。AngleTL适应目标模型的信号强度,统一几种基准方法,在种群间异质性较大时减轻负迁移。我们扩展了angleTL以合并多个源模型,并考虑了它们之间不同程度的相关性。我们的高维渐近分析提供了源模型何时对目标模型有利的见解,并证明了angleTL优于其他方法。大量的模拟验证了这些发现,并强调了应用angleTL在多个生物库之间转移遗传风险预测模型的可行性。
{"title":"Robust angle-based transfer learning in high dimensions.","authors":"Tian Gu, Yi Han, Rui Duan","doi":"10.1093/jrsssb/qkae111","DOIUrl":"10.1093/jrsssb/qkae111","url":null,"abstract":"<p><p>Transfer learning improves target model performance by leveraging data from related source populations, especially when target data are scarce. This study addresses the challenge of training high-dimensional regression models with limited target data in the presence of heterogeneous source populations. We focus on a practical setting where only parameter estimates of pretrained source models are available, rather than individual-level source data. For a single source model, we propose a novel angle-based transfer learning (angleTL) method that leverages concordance between source and target model parameters. AngleTL adapts to the signal strength of the target model, unifies several benchmark methods, and mitigates negative transfer when between-population heterogeneity is large. We extend angleTL to incorporate multiple source models, accounting for varying levels of relevance among them. Our high-dimensional asymptotic analysis provides insights into when a source model benefits the target model and demonstrates the superiority of angleTL over other methods. Extensive simulations validate these findings and highlight the feasibility of applying angleTL to transfer genetic risk prediction models across multiple biobanks.</p>","PeriodicalId":49982,"journal":{"name":"Journal of the Royal Statistical Society Series B-Statistical Methodology","volume":"87 3","pages":"723-745"},"PeriodicalIF":3.6,"publicationDate":"2024-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12256125/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144638524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Causal mediation analysis: selection with asymptotically valid inference. 因果中介分析:具有渐近有效推论的选择。
IF 3.6 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2024-11-28 eCollection Date: 2025-07-01 DOI: 10.1093/jrsssb/qkae109
Jeremiah Jones, Ashkan Ertefaie, Robert L Strawderman

Researchers are often interested in learning not only the effect of treatments on outcomes, but also the mechanisms that transmit these effects. A mediator is a variable that is affected by treatment and subsequently affects outcome. Existing methods for penalized mediation analyses may lead to ignoring important mediators and either assume that finite-dimensional linear models are sufficient to remove confounding bias, or perform no confounding control at all. In practice, these assumptions may not hold. We propose a method that considers the confounding functions as nuisance parameters to be estimated using data-adaptive methods. We then use a novel regularization method applied to this objective function to identify a set of important mediators. We consider natural direct and indirect effects as our target parameters. We then proceed to derive the asymptotic properties of our estimators and establish the oracle property under specific assumptions. Asymptotic results are also presented in a local setting, which contrast the proposal with the standard adaptive lasso. We also propose a perturbation bootstrap technique to provide asymptotically valid postselection inference for the mediated effects of interest. The performance of these methods will be discussed and demonstrated through simulation studies.

研究人员不仅对治疗对结果的影响感兴趣,而且对传递这些影响的机制也感兴趣。中介是一个受治疗影响并随后影响结果的变量。现有的惩罚中介分析方法可能会导致忽略重要的中介,或者假设有限维线性模型足以消除混淆偏差,或者根本不进行混淆控制。在实践中,这些假设可能并不成立。我们提出了一种方法,将混杂函数作为干扰参数,使用数据自适应方法进行估计。然后,我们使用一种新的正则化方法应用于该目标函数来识别一组重要的中介。我们考虑自然的直接和间接效应作为我们的目标参数。然后,我们推导了我们的估计量的渐近性质,并在特定的假设下建立了oracle性质。给出了局部环境下的渐近结果,并与标准自适应套索进行了比较。我们还提出了一种摄动自举技术,为兴趣介导效应提供渐近有效的后选择推理。这些方法的性能将通过仿真研究进行讨论和论证。
{"title":"Causal mediation analysis: selection with asymptotically valid inference.","authors":"Jeremiah Jones, Ashkan Ertefaie, Robert L Strawderman","doi":"10.1093/jrsssb/qkae109","DOIUrl":"10.1093/jrsssb/qkae109","url":null,"abstract":"<p><p>Researchers are often interested in learning not only the effect of treatments on outcomes, but also the mechanisms that transmit these effects. A mediator is a variable that is affected by treatment and subsequently affects outcome. Existing methods for penalized mediation analyses may lead to ignoring important mediators and either assume that finite-dimensional linear models are sufficient to remove confounding bias, or perform no confounding control at all. In practice, these assumptions may not hold. We propose a method that considers the confounding functions as nuisance parameters to be estimated using data-adaptive methods. We then use a novel regularization method applied to this objective function to identify a set of important mediators. We consider natural direct and indirect effects as our target parameters. We then proceed to derive the asymptotic properties of our estimators and establish the oracle property under specific assumptions. Asymptotic results are also presented in a local setting, which contrast the proposal with the standard adaptive lasso. We also propose a perturbation bootstrap technique to provide asymptotically valid postselection inference for the mediated effects of interest. The performance of these methods will be discussed and demonstrated through simulation studies.</p>","PeriodicalId":49982,"journal":{"name":"Journal of the Royal Statistical Society Series B-Statistical Methodology","volume":"87 3","pages":"678-700"},"PeriodicalIF":3.6,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12256126/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144638523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A focusing framework for testing bi-directional causal effects in Mendelian randomization. 孟德尔随机化中双向因果效应检验的聚焦框架。
IF 3.6 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2024-11-21 eCollection Date: 2025-04-01 DOI: 10.1093/jrsssb/qkae101
Sai Li, Ting Ye

Mendelian randomization (MR) is a powerful method that uses genetic variants as instrumental variables to infer the causal effect of a modifiable exposure on an outcome. We study inference for bi-directional causal relationships and causal directions with possibly pleiotropic genetic variants. We show that assumptions for common MR methods are often impossible or too stringent given the potential bi-directional relationships. We propose a new focusing framework for testing bi-directional causal effects and it can be coupled with many state-of-the-art MR methods. We provide theoretical guarantees for our proposal and demonstrate its performance using several simulated and real datasets.

孟德尔随机化(MR)是一种强大的方法,它使用遗传变异作为工具变量来推断可修改暴露对结果的因果关系。我们研究双向因果关系和因果方向的推断与可能的多效性遗传变异。我们表明,考虑到潜在的双向关系,普通MR方法的假设往往是不可能的或过于严格的。我们提出了一个新的聚焦框架来测试双向因果效应,它可以与许多最先进的MR方法相结合。我们为我们的建议提供了理论保证,并使用几个模拟和真实数据集证明了它的性能。
{"title":"A focusing framework for testing bi-directional causal effects in Mendelian randomization.","authors":"Sai Li, Ting Ye","doi":"10.1093/jrsssb/qkae101","DOIUrl":"10.1093/jrsssb/qkae101","url":null,"abstract":"<p><p>Mendelian randomization (MR) is a powerful method that uses genetic variants as instrumental variables to infer the causal effect of a modifiable exposure on an outcome. We study inference for bi-directional causal relationships and causal directions with possibly pleiotropic genetic variants. We show that assumptions for common MR methods are often impossible or too stringent given the potential bi-directional relationships. We propose a new focusing framework for testing bi-directional causal effects and it can be coupled with many state-of-the-art MR methods. We provide theoretical guarantees for our proposal and demonstrate its performance using several simulated and real datasets.</p>","PeriodicalId":49982,"journal":{"name":"Journal of the Royal Statistical Society Series B-Statistical Methodology","volume":"87 2","pages":"529-548"},"PeriodicalIF":3.6,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11985100/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144047748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the Royal Statistical Society Series B-Statistical Methodology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1