首页 > 最新文献

Statistical Papers最新文献

英文 中文
Multiple random change points in survival analysis with applications to clinical trials 生存分析中的多个随机变化点在临床试验中的应用
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-05-06 DOI: 10.1007/s00362-023-01507-z
Jianbo Xu

There is often a presence of random change points (RCPs) with varying timing of hazard rate change among patients in survival analysis within oncology trials. This is in contrast to fixed change points in piecewise constant hazard models, where the timing of hazard rate change remains the same for all subjects. However, currently there is a lack of appropriate statistical methods to effectively tackle this particular issue. This article presents novel statistical methods that aim to characterize these complex survival models. These methods allow for the estimation of important features such as the probability of an event occurring and being censored, and the expected number of events within the clinical trial, prior to any specific time, and within specific time intervals. They also derive expected survival time and parametric expected survival and hazard functions for subjects with any finite number of RCPs. Simulation studies validate these methods and demonstrate their reliability and effectiveness. Real clinical data from an oncology trial is also used to apply these methods. The applications of these methods in oncology trials are extensive, including estimating hazard rates and rate parameters of RCPs, assessing treatment switching, delayed onset of immunotherapy, and subsequent anticancer therapies. They also have value in clinical trial planning, monitoring, and sample size adjustment. The expected parametric survival and hazard functions provide a thorough understanding of the behaviors and effects of RCPs in complex survival models.

在肿瘤试验的生存分析中,经常会出现随机变化点(RCPs),不同患者的危险率变化时间各不相同。这与片断恒定危险模型中的固定变化点形成鲜明对比,在固定变化点中,所有受试者的危险率变化时间保持不变。然而,目前缺乏适当的统计方法来有效解决这一特殊问题。本文介绍了旨在描述这些复杂生存模型特征的新型统计方法。这些方法可估算重要特征,如事件发生和被删减的概率,以及临床试验中、任何特定时间之前和特定时间间隔内的预期事件数量。他们还推导出了具有任意有限数量 RCP 的受试者的预期生存时间以及参数预期生存和危险函数。模拟研究验证了这些方法,并证明了它们的可靠性和有效性。来自肿瘤试验的真实临床数据也被用来应用这些方法。这些方法在肿瘤试验中的应用非常广泛,包括估计 RCP 的危险率和速率参数、评估治疗转换、免疫疗法的延迟开始以及后续抗癌疗法。它们在临床试验规划、监测和样本量调整方面也有价值。预期参数生存和危险函数让我们对复杂生存模型中 RCP 的行为和影响有了透彻的了解。
{"title":"Multiple random change points in survival analysis with applications to clinical trials","authors":"Jianbo Xu","doi":"10.1007/s00362-023-01507-z","DOIUrl":"https://doi.org/10.1007/s00362-023-01507-z","url":null,"abstract":"<p>There is often a presence of random change points (RCPs) with varying timing of hazard rate change among patients in survival analysis within oncology trials. This is in contrast to fixed change points in piecewise constant hazard models, where the timing of hazard rate change remains the same for all subjects. However, currently there is a lack of appropriate statistical methods to effectively tackle this particular issue. This article presents novel statistical methods that aim to characterize these complex survival models. These methods allow for the estimation of important features such as the probability of an event occurring and being censored, and the expected number of events within the clinical trial, prior to any specific time, and within specific time intervals. They also derive expected survival time and parametric expected survival and hazard functions for subjects with any finite number of RCPs. Simulation studies validate these methods and demonstrate their reliability and effectiveness. Real clinical data from an oncology trial is also used to apply these methods. The applications of these methods in oncology trials are extensive, including estimating hazard rates and rate parameters of RCPs, assessing treatment switching, delayed onset of immunotherapy, and subsequent anticancer therapies. They also have value in clinical trial planning, monitoring, and sample size adjustment. The expected parametric survival and hazard functions provide a thorough understanding of the behaviors and effects of RCPs in complex survival models.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"26 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140883339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nested symmetrical Latin hypercube designs 嵌套对称拉丁超立方设计
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-05-06 DOI: 10.1007/s00362-024-01556-y
Xiaodi Wang, Hengzhen Huang

Symmetrical global sensitivity analysis (SGSA) can aid practitioners in reducing the model complexity by identifying symmetries within the model. In this paper, we propose a nested symmetrical Latin hypercube design (NSLHD) for implementing SGSA in a sequential manner. By combining the strengths of the nested Latin hypercube design and symmetrical design, the proposed design allows for the implementation of SGSA without the need to pre-determine the sample size of the experiment. We develop a random sampling procedure and an efficient sequential optimization algorithm to construct flexible NSLHDs in terms of runs and factors. Sampling properties of the constructed designs are studied. Numerical examples are given to demonstrate the effectiveness of the NSLHD for designing sequential sensitivity analysis.

对称全局敏感性分析(SGSA)可以通过识别模型内部的对称性来帮助从业人员降低模型的复杂性。在本文中,我们提出了一种嵌套对称拉丁超立方设计(NSLHD),用于以顺序方式实施 SGSA。通过结合嵌套拉丁超立方设计和对称设计的优势,所提出的设计可以在无需预先确定实验样本大小的情况下实施 SGSA。我们开发了一种随机抽样程序和一种高效的序列优化算法,以在运行和因子方面构建灵活的 NSLHD。我们研究了所构建设计的抽样特性。我们给出了数值示例,以证明 NSLHD 在设计顺序敏感性分析方面的有效性。
{"title":"Nested symmetrical Latin hypercube designs","authors":"Xiaodi Wang, Hengzhen Huang","doi":"10.1007/s00362-024-01556-y","DOIUrl":"https://doi.org/10.1007/s00362-024-01556-y","url":null,"abstract":"<p>Symmetrical global sensitivity analysis (SGSA) can aid practitioners in reducing the model complexity by identifying symmetries within the model. In this paper, we propose a nested symmetrical Latin hypercube design (NSLHD) for implementing SGSA in a sequential manner. By combining the strengths of the nested Latin hypercube design and symmetrical design, the proposed design allows for the implementation of SGSA without the need to pre-determine the sample size of the experiment. We develop a random sampling procedure and an efficient sequential optimization algorithm to construct flexible NSLHDs in terms of runs and factors. Sampling properties of the constructed designs are studied. Numerical examples are given to demonstrate the effectiveness of the NSLHD for designing sequential sensitivity analysis.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"112 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140883227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A two sample nonparametric test for variability via empirical likelihood methods 通过经验似然法对变异性进行双样本非参数检验
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-05-02 DOI: 10.1007/s00362-024-01555-z
Lisa Parveen, Ruhul Ali Khan, Murari Mitra

Comparison of variability or dispersion of two distributions is the major focus of this work. To this end, we consider a two sample testing problem for detecting dominance in dispersive order and develop a test based on U-statistic approach. We also explore a link between the two measures of variability, viz. dispersive order and Gini’s mean difference (GMD). We exploit methodologies based on jackknife empirical likelihood (JEL) and adjusted JEL in order to overcome certain practical difficulties. The performance of the proposed test is assessed by means of a simulation study. Finally, we apply our test in the context of several real life situations including medical studies and insurance data.

比较两个分布的变异性或分散性是这项工作的重点。为此,我们考虑了一个检测分散秩支配性的双样本检验问题,并开发了一种基于 U 统计的检验方法。我们还探讨了两种可变性测量之间的联系,即分散秩序和基尼均值差(GMD)。为了克服某些实际困难,我们利用了基于杰克刀经验似然法(JEL)和调整 JEL 的方法。我们通过模拟研究来评估所提出的检验方法的性能。最后,我们将我们的检验方法应用于包括医学研究和保险数据在内的几种实际情况中。
{"title":"A two sample nonparametric test for variability via empirical likelihood methods","authors":"Lisa Parveen, Ruhul Ali Khan, Murari Mitra","doi":"10.1007/s00362-024-01555-z","DOIUrl":"https://doi.org/10.1007/s00362-024-01555-z","url":null,"abstract":"<p>Comparison of variability or dispersion of two distributions is the major focus of this work. To this end, we consider a two sample testing problem for detecting dominance in dispersive order and develop a test based on <i>U</i>-statistic approach. We also explore a link between the two measures of variability, viz. dispersive order and Gini’s mean difference (GMD). We exploit methodologies based on jackknife empirical likelihood (JEL) and adjusted JEL in order to overcome certain practical difficulties. The performance of the proposed test is assessed by means of a simulation study. Finally, we apply our test in the context of several real life situations including medical studies and insurance data.\u0000</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"194 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140883340","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A trigamma-free approach for computing information matrices related to trigamma function 计算与三角函数相关的信息矩阵的无三角函数方法
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-04-20 DOI: 10.1007/s00362-024-01552-2
Zhou Yu, Niloufar Dousti Mousavi, Jie Yang

Negative binomial related distributions have been widely used in practice. The calculation of the corresponding Fisher information matrices involves the expectation of trigamma function values which can only be calculated numerically and approximately. In this paper, we propose a trigamma-free approach to approximate the expectations involving the trigamma function, along with theoretical upper bounds for approximation errors. We show by numerical studies that our approach is highly efficient and much more accurate than previous methods. We also apply our approach to compute the Fisher information matrices of zero-inflated negative binomial (ZINB) and beta negative binomial (ZIBNB) probabilistic models, as well as ZIBNB regression models.

负二项分布在实践中得到了广泛应用。相应的费雪信息矩阵的计算涉及三角函数值的期望,而三角函数值只能通过数值近似计算。在本文中,我们提出了一种无三角函数方法来近似计算涉及三角函数的期望值,并提出了近似误差的理论上限。我们通过数值研究表明,我们的方法效率很高,比以前的方法精确得多。我们还将我们的方法应用于计算零膨胀负二项(ZINB)和贝塔负二项(ZIBNB)概率模型以及 ZIBNB 回归模型的费雪信息矩阵。
{"title":"A trigamma-free approach for computing information matrices related to trigamma function","authors":"Zhou Yu, Niloufar Dousti Mousavi, Jie Yang","doi":"10.1007/s00362-024-01552-2","DOIUrl":"https://doi.org/10.1007/s00362-024-01552-2","url":null,"abstract":"<p>Negative binomial related distributions have been widely used in practice. The calculation of the corresponding Fisher information matrices involves the expectation of trigamma function values which can only be calculated numerically and approximately. In this paper, we propose a trigamma-free approach to approximate the expectations involving the trigamma function, along with theoretical upper bounds for approximation errors. We show by numerical studies that our approach is highly efficient and much more accurate than previous methods. We also apply our approach to compute the Fisher information matrices of zero-inflated negative binomial (ZINB) and beta negative binomial (ZIBNB) probabilistic models, as well as ZIBNB regression models.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"27 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140623859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On some stable linear functional regression estimators based on random projections 基于随机投影的若干稳定线性函数回归估计器
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-04-17 DOI: 10.1007/s00362-024-01554-0
Asma Ben Saber, Abderrazek Karoui

In this work, we develop two stable estimators for solving linear functional regression problems. It is well known that such a problem is an ill-posed stochastic inverse problem. Hence, a special interest has to be devoted to the stability issue in the design of an estimator for solving such a problem. Our proposed estimators are based on combining a stable least-squares technique and a random projection of the slope function (beta _0(cdot )in L^2(J),) where J is a compact interval. Moreover, these estimators have the advantage of having a fairly good convergence rate with reasonable computational load, since the involved random projections are generally performed over a fairly small dimensional subspace of (L^2(J).) More precisely, the first estimator is given as a least-squares solution of a regularized minimization problem over a finite dimensional subspace of (L^2(J).) In particular, we give an upper bound for the empirical risk error as well as the convergence rate of this estimator. The second proposed stable LFR estimator is based on combining the least-squares technique with a dyadic decomposition of the i.i.d. samples of the stochastic process, associated with the LFR model. In particular, we provide an (L^2)-risk error of this second LFR estimator. Finally, we provide some numerical simulations on synthetic as well as on real data that illustrate the results of this work. These results indicate that our proposed estimators are competitive with some existing and popular LFR estimators.

在这项工作中,我们开发了两个稳定的估计器,用于解决线性函数回归问题。众所周知,此类问题是一个难以解决的随机逆问题。因此,在设计用于解决此类问题的估计器时,必须特别关注稳定性问题。我们提出的估计器基于将稳定的最小二乘技术和斜率函数的随机投影(beta _0(cdot )in L^2(J),)相结合,其中 J 是一个紧凑的区间。此外,这些估计器还具有收敛速度快、计算量合理的优点,因为所涉及的随机投影通常是在(L^2(J).)的一个相当小的维度子空间上进行的。更准确地说,第一个估计器是在(L^2(J).)的有限维子空间上的正则化最小化问题的最小二乘法解中给出的。我们特别给出了经验风险误差的上限以及该估计器的收敛率。第二个稳定的 LFR 估计器是基于最小二乘技术与随机过程 i.i.d. 样本的二元分解相结合,与 LFR 模型相关联。特别是,我们提供了第二个 LFR 估计器的(L^2)风险误差。最后,我们提供了一些合成数据和真实数据的数值模拟,以说明这项工作的结果。这些结果表明,我们提出的估计器与现有的一些流行的 LFR 估计器相比具有竞争力。
{"title":"On some stable linear functional regression estimators based on random projections","authors":"Asma Ben Saber, Abderrazek Karoui","doi":"10.1007/s00362-024-01554-0","DOIUrl":"https://doi.org/10.1007/s00362-024-01554-0","url":null,"abstract":"<p>In this work, we develop two stable estimators for solving linear functional regression problems. It is well known that such a problem is an ill-posed stochastic inverse problem. Hence, a special interest has to be devoted to the stability issue in the design of an estimator for solving such a problem. Our proposed estimators are based on combining a stable least-squares technique and a random projection of the slope function <span>(beta _0(cdot )in L^2(J),)</span> where <i>J</i> is a compact interval. Moreover, these estimators have the advantage of having a fairly good convergence rate with reasonable computational load, since the involved random projections are generally performed over a fairly small dimensional subspace of <span>(L^2(J).)</span> More precisely, the first estimator is given as a least-squares solution of a regularized minimization problem over a finite dimensional subspace of <span>(L^2(J).)</span> In particular, we give an upper bound for the empirical risk error as well as the convergence rate of this estimator. The second proposed stable LFR estimator is based on combining the least-squares technique with a dyadic decomposition of the i.i.d. samples of the stochastic process, associated with the LFR model. In particular, we provide an <span>(L^2)</span>-risk error of this second LFR estimator. Finally, we provide some numerical simulations on synthetic as well as on real data that illustrate the results of this work. These results indicate that our proposed estimators are competitive with some existing and popular LFR estimators.\u0000</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"14 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140612027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Testing practical relevance of treatment effects 测试治疗效果的实用性
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-04-17 DOI: 10.1007/s00362-024-01549-x
Andrea Ongaro, Sonia Migliorati, Roberto Ascari, Enrico Ripamonti

Traditionally, common testing problems are formalized in terms of a precise null hypothesis representing an idealized situation such as absence of a certain “treatment effect”. However, in most applications the real purpose of the analysis is to assess evidence in favor of a practically relevant effect, rather than simply determining its presence/absence. This discrepancy leads to erroneous inferential conclusions, especially in case of moderate or large sample size. In particular, statistical significance, as commonly evaluated on the basis of a precise hypothesis low p value, bears little or no information on practical significance. This paper presents an innovative approach to the problem of testing the practical relevance of effects. This relies upon the proposal of a general method for modifying standard tests by making them suitable to deal with appropriate interval null hypotheses containing all practically irrelevant effect sizes. In addition, when it is difficult to specify exactly which effect sizes are irrelevant we provide the researcher with a benchmark value. Acceptance/rejection can be established purely by deciding on the (ir)relevance of this value. We illustrate our proposal in the context of many important testing setups, and we apply the proposed methods to two case studies in clinical medicine. First, we consider data on the evaluation of systolic blood pressure in a sample of adult participants at risk for nutritional deficit. Second, we focus on a study of the effects of remdesivir on patients hospitalized with COVID-19.

传统上,常见的检验问题都是通过一个精确的零假设来形式化的,它代表了一种理想化的情况,比如不存在某种 "治疗效果"。然而,在大多数应用中,分析的真正目的是评估有利于实际相关效应的证据,而不是简单地确定其存在/不存在。这种差异会导致错误的推断结论,尤其是在样本量适中或较大的情况下。特别是,通常根据精确假设的低 p 值来评估的统计意义,很少或根本没有关于实际意义的信息。本文针对效果的实际相关性测试问题提出了一种创新方法。这有赖于提出一种修改标准检验的通用方法,使其适用于包含所有实际无关效应大小的适当区间零假设。此外,当难以明确哪些效应大小不相关时,我们为研究人员提供了一个基准值。只需决定该值的(不)相关性,即可确定接受/拒绝。我们结合许多重要的测试设置来说明我们的提议,并将提议的方法应用到临床医学的两个案例研究中。首先,我们考虑了有营养缺乏风险的成年参与者样本中的收缩压评估数据。其次,我们重点研究了雷米替韦对 COVID-19 住院患者的影响。
{"title":"Testing practical relevance of treatment effects","authors":"Andrea Ongaro, Sonia Migliorati, Roberto Ascari, Enrico Ripamonti","doi":"10.1007/s00362-024-01549-x","DOIUrl":"https://doi.org/10.1007/s00362-024-01549-x","url":null,"abstract":"<p>Traditionally, common testing problems are formalized in terms of a precise null hypothesis representing an idealized situation such as absence of a certain “treatment effect”. However, in most applications the real purpose of the analysis is to assess evidence in favor of a practically relevant effect, rather than simply determining its presence/absence. This discrepancy leads to erroneous inferential conclusions, especially in case of moderate or large sample size. In particular, statistical significance, as commonly evaluated on the basis of a precise hypothesis low <i>p</i> value, bears little or no information on practical significance. This paper presents an innovative approach to the problem of testing the practical relevance of effects. This relies upon the proposal of a general method for modifying standard tests by making them suitable to deal with appropriate interval null hypotheses containing all practically irrelevant effect sizes. In addition, when it is difficult to specify exactly which effect sizes are irrelevant we provide the researcher with a benchmark value. Acceptance/rejection can be established purely by deciding on the (ir)relevance of this value. We illustrate our proposal in the context of many important testing setups, and we apply the proposed methods to two case studies in clinical medicine. First, we consider data on the evaluation of systolic blood pressure in a sample of adult participants at risk for nutritional deficit. Second, we focus on a study of the effects of remdesivir on patients hospitalized with COVID-19.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"190 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140612379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Supervised dimension reduction for functional time series 功能时间序列的监督降维
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-04-16 DOI: 10.1007/s00362-023-01505-1
Guochang Wang, Zengyao Wen, Shanming Jia, Shanshan Liang

Functional time series model has been the subject of the most research in recent years, and since functional data is infinite dimensional, dimension reduction is essential for functional time series. However, the majority of the existing dimension reduction methods such as the functional principal component and fixed basis expansion are unsupervised and typically result in information loss. Then, the functional time series model has an urgent need for a supervised dimension reduction method. The functional sufficient dimension reduction method is a supervised technique that adequately exploits the regression structure information, resulting in minimal information loss. Functional sliced inverse regression (FSIR) is the most popular functional sufficient dimension reduction method, but it cannot be applied directly to functional time series model. In this paper, we examine a functional time series model in which the response is a scalar time series and the explanatory variable is functional time series. We propose a novel supervised dimension reduction technique for the regression model by combining the FSIR and blind source separation methods. Furthermore, we propose innovative strategies for selecting the dimensionality of dimension reduction space and the lags of the functional time series. Numerical studies, including simulation studies and a real data analysis are show the effectiveness of the proposed methods.

函数时间序列模型是近年来研究最多的主题,由于函数数据是无限维的,因此降维对函数时间序列至关重要。然而,现有的大多数降维方法,如函数主成分和定基扩展,都是无监督的,通常会导致信息丢失。因此,函数时间序列模型迫切需要一种有监督的降维方法。函数充分降维方法是一种有监督的技术,它能充分挖掘回归结构信息,使信息损失最小。函数切分反回归(FSIR)是最流行的函数充分降维方法,但它不能直接应用于函数时间序列模型。在本文中,我们研究了一个函数时间序列模型,其中响应是标量时间序列,解释变量是函数时间序列。我们结合 FSIR 和盲源分离方法,为回归模型提出了一种新颖的监督降维技术。此外,我们还提出了选择降维空间维度和函数时间序列滞后期的创新策略。包括模拟研究和真实数据分析在内的数值研究表明了所提方法的有效性。
{"title":"Supervised dimension reduction for functional time series","authors":"Guochang Wang, Zengyao Wen, Shanming Jia, Shanshan Liang","doi":"10.1007/s00362-023-01505-1","DOIUrl":"https://doi.org/10.1007/s00362-023-01505-1","url":null,"abstract":"<p>Functional time series model has been the subject of the most research in recent years, and since functional data is infinite dimensional, dimension reduction is essential for functional time series. However, the majority of the existing dimension reduction methods such as the functional principal component and fixed basis expansion are unsupervised and typically result in information loss. Then, the functional time series model has an urgent need for a supervised dimension reduction method. The functional sufficient dimension reduction method is a supervised technique that adequately exploits the regression structure information, resulting in minimal information loss. Functional sliced inverse regression (FSIR) is the most popular functional sufficient dimension reduction method, but it cannot be applied directly to functional time series model. In this paper, we examine a functional time series model in which the response is a scalar time series and the explanatory variable is functional time series. We propose a novel supervised dimension reduction technique for the regression model by combining the FSIR and blind source separation methods. Furthermore, we propose innovative strategies for selecting the dimensionality of dimension reduction space and the lags of the functional time series. Numerical studies, including simulation studies and a real data analysis are show the effectiveness of the proposed methods.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"16 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140612442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical inferences for missing response problems based on modified empirical likelihood 基于修正的经验似然法的缺失响应问题统计推断
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-04-16 DOI: 10.1007/s00362-024-01553-1
Sima Sharghi, Kevin Stoll, Wei Ning

In this paper, we advance the application of empirical likelihood (EL) for missing response problems. Inspired by remedies for the shortcomings of EL for parameter hypothesis testing, we modify the EL approach used for statistical inference on the mean response when the response is subject to missing behavior. We propose consistent mean estimators, and associated confidence intervals. We extend the approach to estimate the average treatment effect in causal inference settings. We detail the analogous estimators for average treatment effect, prove their consistency, and example their use in estimating the average effect of smoking on renal function of the patients with atherosclerotic renal-artery stenosis and elevated blood pressure, chronic kidney disease, or both. Our proposed estimators outperform the historical mean estimators under missing responses and causal inference settings in terms of simulated relative RMSE and coverage probability on average.

在本文中,我们推进了经验似然法(EL)在缺失响应问题上的应用。受 EL 用于参数假设检验的缺陷补救措施的启发,我们对 EL 方法进行了修改,使之适用于当响应存在缺失行为时对平均响应的统计推断。我们提出了一致的平均估计值和相关的置信区间。我们将该方法扩展到因果推断设置中的平均治疗效果估计。我们详细介绍了平均治疗效果的类似估计器,证明了它们的一致性,并举例说明了它们在估计吸烟对动脉粥样硬化性肾动脉狭窄、血压升高、慢性肾病或两者兼有的患者的肾功能的平均影响时的应用。在缺失反应和因果推理设置下,我们提出的估计器在模拟相对均方根误差和平均覆盖概率方面优于历史平均估计器。
{"title":"Statistical inferences for missing response problems based on modified empirical likelihood","authors":"Sima Sharghi, Kevin Stoll, Wei Ning","doi":"10.1007/s00362-024-01553-1","DOIUrl":"https://doi.org/10.1007/s00362-024-01553-1","url":null,"abstract":"<p>In this paper, we advance the application of empirical likelihood (EL) for missing response problems. Inspired by remedies for the shortcomings of EL for parameter hypothesis testing, we modify the EL approach used for statistical inference on the mean response when the response is subject to missing behavior. We propose consistent mean estimators, and associated confidence intervals. We extend the approach to estimate the average treatment effect in causal inference settings. We detail the analogous estimators for average treatment effect, prove their consistency, and example their use in estimating the average effect of smoking on renal function of the patients with atherosclerotic renal-artery stenosis and elevated blood pressure, chronic kidney disease, or both. Our proposed estimators outperform the historical mean estimators under missing responses and causal inference settings in terms of simulated relative RMSE and coverage probability on average.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"2015 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140573740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A high-dimensional single-index regression for interactions between treatment and covariates 治疗与协变因素之间交互作用的高维单指数回归
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-04-13 DOI: 10.1007/s00362-024-01546-0
Hyung Park, Thaddeus Tarpey, Eva Petkova, R. Todd Ogden

This paper explores a methodology for dimension reduction in regression models for a treatment outcome, specifically to capture covariates’ moderating impact on the treatment-outcome association. The motivation behind this stems from the field of precision medicine, where a comprehensive understanding of the interactions between a treatment variable and pretreatment covariates is essential for developing individualized treatment regimes (ITRs). We provide a review of sufficient dimension reduction methods suitable for capturing treatment-covariate interactions and establish connections with linear model-based approaches for the proposed model. Within the framework of single-index regression models, we introduce a sparse estimation method for a dimension reduction vector to tackle the challenges posed by high-dimensional covariate data. Our methods offer insights into dimension reduction techniques specifically for interaction analysis, by providing a semiparametric framework for approximating the minimally sufficient subspace for interactions.

本文探讨了在治疗结果回归模型中降低维度的方法,特别是捕捉协变量对治疗-结果关联的调节作用。这背后的动机源于精准医疗领域,在该领域,全面了解治疗变量与治疗前协变量之间的相互作用对于制定个体化治疗方案(ITR)至关重要。我们对适用于捕捉治疗-协变量交互作用的充分降维方法进行了综述,并为拟议模型建立了与基于线性模型方法的联系。在单指标回归模型的框架内,我们引入了降维向量的稀疏估计方法,以应对高维协变量数据带来的挑战。我们的方法提供了一个半参数框架,用于逼近相互作用的最小充分子空间,从而为专门用于相互作用分析的降维技术提供了见解。
{"title":"A high-dimensional single-index regression for interactions between treatment and covariates","authors":"Hyung Park, Thaddeus Tarpey, Eva Petkova, R. Todd Ogden","doi":"10.1007/s00362-024-01546-0","DOIUrl":"https://doi.org/10.1007/s00362-024-01546-0","url":null,"abstract":"<p>This paper explores a methodology for dimension reduction in regression models for a treatment outcome, specifically to capture covariates’ moderating impact on the treatment-outcome association. The motivation behind this stems from the field of precision medicine, where a comprehensive understanding of the interactions between a treatment variable and pretreatment covariates is essential for developing individualized treatment regimes (ITRs). We provide a review of sufficient dimension reduction methods suitable for capturing treatment-covariate interactions and establish connections with linear model-based approaches for the proposed model. Within the framework of single-index regression models, we introduce a sparse estimation method for a dimension reduction vector to tackle the challenges posed by high-dimensional covariate data. Our methods offer insights into dimension reduction techniques specifically for interaction analysis, by providing a semiparametric framework for approximating the minimally sufficient subspace for interactions.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"12 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140573951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Flexible-dimensional L-statistic for mean estimation of symmetric distributions 用于对称分布均值估计的灵活维度 L 统计量
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-04-06 DOI: 10.1007/s00362-024-01547-z
Juan Baz, Diego García-Zamora, Irene Díaz, Susana Montes, Luis Martínez

Estimating the mean of a population is a recurrent topic in statistics because of its multiple applications. If previous data is available, or the distribution of the deviation between the measurements and the mean is known, it is possible to perform such estimation by using L-statistics, whose optimal linear coefficients, typically referred to as weights, are derived from a minimization of the mean squared error. However, such optimal weights can only manage sample sizes equal to the one used to derive them, while in real-world scenarios this size might slightly change. Therefore, this paper proposes a method to overcome such a limitation and derive approximations of flexible-dimensional optimal weights. To do so, a parametric family of functions based on extreme value reductions and amplifications is proposed to be adjusted to the cumulative optimal weights of a given sample from a symmetric distribution. Then, the application of Yager’s method to derive weights for ordered weighted average (OWA) operators allows computing the approximate optimal weights for sample sizes close to the original one. This method is justified from the theoretical point of view by proving a convergence result regarding the cumulative weights obtained for different sample sizes. Finally, the practical performance of the theoretical results is shown for several classical symmetric distributions.

由于应用广泛,估计群体的平均值是统计学中经常出现的话题。如果有以前的数据,或已知测量值与均值之间偏差的分布,就可以使用 L 统计法进行估计,其最优线性系数(通常称为权重)是通过最小化均方误差得出的。然而,这种最优权重只能管理与用于推导权重的样本大小相等的样本,而在现实世界中,样本大小可能会略有变化。因此,本文提出了一种方法来克服这种限制,并推导出灵活维度最优权重的近似值。为此,本文提出了一个基于极值还原和放大的参数函数族,用于调整对称分布中给定样本的累积最优权重。然后,应用雅格方法得出有序加权平均算子(OWA)的权重,从而计算出接近原始样本大小的近似最优权重。通过证明不同样本量下获得的累积权重的收敛结果,从理论角度证明了这一方法的合理性。最后,针对几个经典的对称分布,展示了理论结果的实际表现。
{"title":"Flexible-dimensional L-statistic for mean estimation of symmetric distributions","authors":"Juan Baz, Diego García-Zamora, Irene Díaz, Susana Montes, Luis Martínez","doi":"10.1007/s00362-024-01547-z","DOIUrl":"https://doi.org/10.1007/s00362-024-01547-z","url":null,"abstract":"<p>Estimating the mean of a population is a recurrent topic in statistics because of its multiple applications. If previous data is available, or the distribution of the deviation between the measurements and the mean is known, it is possible to perform such estimation by using L-statistics, whose optimal linear coefficients, typically referred to as weights, are derived from a minimization of the mean squared error. However, such optimal weights can only manage sample sizes equal to the one used to derive them, while in real-world scenarios this size might slightly change. Therefore, this paper proposes a method to overcome such a limitation and derive approximations of flexible-dimensional optimal weights. To do so, a parametric family of functions based on extreme value reductions and amplifications is proposed to be adjusted to the cumulative optimal weights of a given sample from a symmetric distribution. Then, the application of Yager’s method to derive weights for ordered weighted average (OWA) operators allows computing the approximate optimal weights for sample sizes close to the original one. This method is justified from the theoretical point of view by proving a convergence result regarding the cumulative weights obtained for different sample sizes. Finally, the practical performance of the theoretical results is shown for several classical symmetric distributions.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"7 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140573737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Statistical Papers
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1