首页 > 最新文献

Statistical Theory and Related Fields最新文献

英文 中文
Discussion on “on studying extreme values and systematic risks with nonlinear time series models and tail dependence measures” 关于“用非线性时间序列模型和尾部相关测度研究极值和系统风险”的讨论
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2021-01-02 DOI: 10.1080/24754269.2021.1895528
Wen Xu, Huixia Judy Wang
Extreme value theory provides essential mathematical foundations for modelling tail risks and has wide applications. The emerging of big and heterogeneous data calls for the development of new extreme value theory and methods. For studying high-dimensional extremes and extreme clusters in time series, an important problem is how to measure and test for tail dependence between random variables. Section 3.1 of Dr. Zhang’s paper discusses some newly proposed tail dependence measures. In the era of big data, a timely and challenging question is how to study data from heterogeneous populations, e.g. from different sources. Section 3.2 reviews some new developments of extreme value theory for maxima of maxima. The theory and methods in Sections 3.1 and 2.3 set the foundations for modelling extremes of multivariate and heterogeneous data, and we believe they have wide applicability. We will discuss two possible directions: (1) measuring and testing of partial tail dependence; (2) application of the extreme value theory for maxima of maxima in highdimensional inference.
极值理论为尾部风险建模提供了必要的数学基础,具有广泛的应用前景。大数据和异构数据的出现呼唤新的极值理论和方法的发展。对于研究时间序列中的高维极值和极值簇,一个重要的问题是如何测量和检验随机变量之间的尾部相关性。张博士论文的3.1节讨论了一些新提出的尾部依赖度量。在大数据时代,如何研究来自异质人群(例如来自不同来源的数据)是一个及时且具有挑战性的问题。第3.2节回顾了最大值的最大值极值理论的一些新进展。3.1节和2.3节中的理论和方法为多元和异构数据的极值建模奠定了基础,我们认为它们具有广泛的适用性。我们将讨论两个可能的方向:(1)部分尾相关性的测量和检验;(2)极值理论在高维推理中的应用。
{"title":"Discussion on “on studying extreme values and systematic risks with nonlinear time series models and tail dependence measures”","authors":"Wen Xu, Huixia Judy Wang","doi":"10.1080/24754269.2021.1895528","DOIUrl":"https://doi.org/10.1080/24754269.2021.1895528","url":null,"abstract":"Extreme value theory provides essential mathematical foundations for modelling tail risks and has wide applications. The emerging of big and heterogeneous data calls for the development of new extreme value theory and methods. For studying high-dimensional extremes and extreme clusters in time series, an important problem is how to measure and test for tail dependence between random variables. Section 3.1 of Dr. Zhang’s paper discusses some newly proposed tail dependence measures. In the era of big data, a timely and challenging question is how to study data from heterogeneous populations, e.g. from different sources. Section 3.2 reviews some new developments of extreme value theory for maxima of maxima. The theory and methods in Sections 3.1 and 2.3 set the foundations for modelling extremes of multivariate and heterogeneous data, and we believe they have wide applicability. We will discuss two possible directions: (1) measuring and testing of partial tail dependence; (2) application of the extreme value theory for maxima of maxima in highdimensional inference.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"5 1","pages":"26 - 30"},"PeriodicalIF":0.5,"publicationDate":"2021-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24754269.2021.1895528","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43522771","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Covariance estimation via fiducial inference. 基于基准推理的协方差估计。
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2021-01-01 Epub Date: 2021-02-15 DOI: 10.1080/24754269.2021.1877950
W Jenny Shi, Jan Hannig, Randy C S Lai, Thomas C M Lee

As a classical problem, covariance estimation has drawn much attention from the statistical community for decades. Much work has been done under the frequentist and the Bayesian frameworks. Aiming to quantify the uncertainty of the estimators without having to choose a prior, we have developed a fiducial approach to the estimation of covariance matrix. Built upon the Fiducial Berstein-von Mises Theorem (Sonderegger and Hannig 2014), we show that the fiducial distribution of the covariate matrix is consistent under our framework. Consequently, the samples generated from this fiducial distribution are good estimators to the true covariance matrix, which enable us to define a meaningful confidence region for the covariance matrix. Lastly, we also show that the fiducial approach can be a powerful tool for identifying clique structures in covariance matrices.

协方差估计作为一个经典问题,几十年来一直受到统计学界的关注。在频率论和贝叶斯框架下已经做了很多工作。为了在不选择先验的情况下量化估计量的不确定性,我们开发了一种估计协方差矩阵的基准方法。基于Fiducial Berstein-von Mises定理(Sonderegger and Hannig 2014),我们证明了协变量矩阵的Fiducial分布在我们的框架下是一致的。因此,由该基准分布生成的样本是真实协方差矩阵的良好估计,这使我们能够为协方差矩阵定义一个有意义的置信区域。最后,我们还证明了基准方法可以成为识别协方差矩阵中团结构的有力工具。
{"title":"Covariance estimation via fiducial inference.","authors":"W Jenny Shi,&nbsp;Jan Hannig,&nbsp;Randy C S Lai,&nbsp;Thomas C M Lee","doi":"10.1080/24754269.2021.1877950","DOIUrl":"https://doi.org/10.1080/24754269.2021.1877950","url":null,"abstract":"<p><p>As a classical problem, covariance estimation has drawn much attention from the statistical community for decades. Much work has been done under the frequentist and the Bayesian frameworks. Aiming to quantify the uncertainty of the estimators without having to choose a prior, we have developed a fiducial approach to the estimation of covariance matrix. Built upon the Fiducial Berstein-von Mises Theorem (Sonderegger and Hannig 2014), we show that the fiducial distribution of the covariate matrix is consistent under our framework. Consequently, the samples generated from this fiducial distribution are good estimators to the true covariance matrix, which enable us to define a meaningful confidence region for the covariance matrix. Lastly, we also show that the fiducial approach can be a powerful tool for identifying clique structures in covariance matrices.</p>","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"5 4","pages":"316-331"},"PeriodicalIF":0.5,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24754269.2021.1877950","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33442561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
On studying extreme values and systematic risks with nonlinear time series models and tail dependence measures 用非线性时间序列模型和尾部相关测度研究极值和系统风险
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2020-12-23 DOI: 10.1080/24754269.2020.1856590
Zhengjun Zhang
ABSTRACT This review paper discusses advances of statistical inference in modeling extreme observations from multiple sources and heterogeneous populations. The paper starts briefly reviewing classical univariate/multivariate extreme value theory, tail equivalence, and tail (in)dependence. New extreme value theory for heterogeneous populations is then introduced. Time series models for maxima and extreme observations are the focus of the review. These models naturally form a new system with similar structures. They can be used as alternatives to the widely used ARMA models and GARCH models. Applications of these time series models can be in many fields. The paper discusses two important applications: systematic risks and extreme co-movements/large scale contagions.
摘要本文综述了统计推断在多源和异质群体极端观测建模中的进展。本文首先简要回顾了经典的单变量/多变量极值理论、尾部等价和尾部依赖。然后介绍了新的异质种群极值理论。极大值和极值观测的时间序列模型是综述的重点。这些模型自然形成了一个结构相似的新系统。它们可以作为广泛使用的ARMA模型和GARCH模型的替代品。这些时间序列模型可以应用于许多领域。本文讨论了两个重要的应用:系统性风险和极端联合运动/大规模传染。
{"title":"On studying extreme values and systematic risks with nonlinear time series models and tail dependence measures","authors":"Zhengjun Zhang","doi":"10.1080/24754269.2020.1856590","DOIUrl":"https://doi.org/10.1080/24754269.2020.1856590","url":null,"abstract":"ABSTRACT This review paper discusses advances of statistical inference in modeling extreme observations from multiple sources and heterogeneous populations. The paper starts briefly reviewing classical univariate/multivariate extreme value theory, tail equivalence, and tail (in)dependence. New extreme value theory for heterogeneous populations is then introduced. Time series models for maxima and extreme observations are the focus of the review. These models naturally form a new system with similar structures. They can be used as alternatives to the widely used ARMA models and GARCH models. Applications of these time series models can be in many fields. The paper discusses two important applications: systematic risks and extreme co-movements/large scale contagions.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"5 1","pages":"1 - 25"},"PeriodicalIF":0.5,"publicationDate":"2020-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24754269.2020.1856590","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47147542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
New extreme value theory for maxima of maxima 极大值的极大值的新极值理论
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2020-12-20 DOI: 10.1080/24754269.2020.1846115
Wenzhi Cao, Zhengjun Zhang
Although advanced statistical models have been proposed to fit complex data better, the advances of science and technology have generated more complex data, e.g., Big Data, in which existing probability theory and statistical models find their limitations. This work establishes probability foundations for studying extreme values of data generated from a mixture process with the mixture pattern depending on the sample length and data generating sources. In particular, we show that the limit distribution, termed as the accelerated max-stable distribution, of the maxima of maxima of sequences of random variables with the above mixture pattern is a product of three types of extreme value distributions. As a result, our theoretical results are more general than the classical extreme value theory and can be applicable to research problems related to Big Data. Examples are provided to give intuitions of the new distribution family. We also establish mixing conditions for a sequence of random variables to have the limit distributions. The results for the associated independent sequence and the maxima over arbitrary intervals are also developed. We use simulations to demonstrate the advantages of our newly established maxima of maxima extreme value theory.
尽管已经提出了先进的统计模型来更好地拟合复杂数据,但科学技术的进步已经产生了更复杂的数据,例如大数据,现有的概率论和统计模型在其中发现了它们的局限性。这项工作为研究混合过程中产生的数据的极值奠定了概率基础,混合模式取决于样本长度和数据产生源。特别地,我们证明了具有上述混合模式的随机变量序列的最大值的极限分布,称为加速最大稳定分布,是三种极值分布的乘积。因此,我们的理论结果比经典的极值理论更具一般性,可以应用于大数据相关的研究问题。举例说明了新分布族的直观性。我们还建立了一系列随机变量具有极限分布的混合条件。还得到了相关独立序列和任意区间上的最大值的结果。我们使用模拟来证明我们新建立的极大值极值理论的优势。
{"title":"New extreme value theory for maxima of maxima","authors":"Wenzhi Cao, Zhengjun Zhang","doi":"10.1080/24754269.2020.1846115","DOIUrl":"https://doi.org/10.1080/24754269.2020.1846115","url":null,"abstract":"Although advanced statistical models have been proposed to fit complex data better, the advances of science and technology have generated more complex data, e.g., Big Data, in which existing probability theory and statistical models find their limitations. This work establishes probability foundations for studying extreme values of data generated from a mixture process with the mixture pattern depending on the sample length and data generating sources. In particular, we show that the limit distribution, termed as the accelerated max-stable distribution, of the maxima of maxima of sequences of random variables with the above mixture pattern is a product of three types of extreme value distributions. As a result, our theoretical results are more general than the classical extreme value theory and can be applicable to research problems related to Big Data. Examples are provided to give intuitions of the new distribution family. We also establish mixing conditions for a sequence of random variables to have the limit distributions. The results for the associated independent sequence and the maxima over arbitrary intervals are also developed. We use simulations to demonstrate the advantages of our newly established maxima of maxima extreme value theory.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"5 1","pages":"232 - 252"},"PeriodicalIF":0.5,"publicationDate":"2020-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24754269.2020.1846115","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47671905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Nonlinear prediction via Hermite transformation 基于Hermite变换的非线性预测
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2020-12-17 DOI: 10.1080/24754269.2020.1856589
T. McElroy, Srinjoy Das
ABSTRACT General prediction formulas involving Hermite polynomials are developed for time series expressed as a transformation of a Gaussian process. The prediction gains over linear predictors are examined numerically, demonstrating the improvement of nonlinear prediction.
摘要针对高斯过程的变换时间序列,提出了包含埃尔米特多项式的一般预测公式。数值检验了线性预测因子的预测增益,证明了非线性预测的改进。
{"title":"Nonlinear prediction via Hermite transformation","authors":"T. McElroy, Srinjoy Das","doi":"10.1080/24754269.2020.1856589","DOIUrl":"https://doi.org/10.1080/24754269.2020.1856589","url":null,"abstract":"ABSTRACT General prediction formulas involving Hermite polynomials are developed for time series expressed as a transformation of a Gaussian process. The prediction gains over linear predictors are examined numerically, demonstrating the improvement of nonlinear prediction.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"5 1","pages":"49 - 54"},"PeriodicalIF":0.5,"publicationDate":"2020-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24754269.2020.1856589","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44941464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonignorable item nonresponse in panel data 面板数据中不可忽略的无响应项
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2020-12-17 DOI: 10.1080/24754269.2020.1856591
Sijing Li, J. Shao
To estimate unknown population parameters based on panel data having nonignorable item nonresponse, we propose an innovative data grouping approach according to the number of observed components in the multivariate outcome when the joint distribution of and associated covariate is nonparametric and the nonresponse probability conditional on and has a parametric form. To deal with the identifiability issue, we utilise a nonresponse instrument , an auxiliary variable related to but not related to the nonresponse probability conditional on and . We apply a modified generalised method of moments to obtain estimators of the parameters in the nonresponse probability, and a generalised regression estimation to utilise covariate information for efficient estimation of population parameters. Consistency and asymptotic normality of the proposed estimators of the population parameters are established. Simulation and real data results are presented.
为了基于具有不可忽略项目无响应的面板数据来估计未知的总体参数,我们提出了一种创新的数据分组方法,当和相关协变量的联合分布是非参数的,且无响应概率以参数形式为条件时,根据多变量结果中观察到的分量的数量。为了处理可识别性问题,我们使用了无响应工具,这是一个与以和为条件的无响应概率相关但不相关的辅助变量。我们应用改进的广义矩方法来获得无响应概率中参数的估计量,并应用广义回归估计来利用协变量信息来有效估计总体参数。建立了种群参数估计量的一致性和渐近正态性。给出了仿真和实际数据结果。
{"title":"Nonignorable item nonresponse in panel data","authors":"Sijing Li, J. Shao","doi":"10.1080/24754269.2020.1856591","DOIUrl":"https://doi.org/10.1080/24754269.2020.1856591","url":null,"abstract":"To estimate unknown population parameters based on panel data having nonignorable item nonresponse, we propose an innovative data grouping approach according to the number of observed components in the multivariate outcome when the joint distribution of and associated covariate is nonparametric and the nonresponse probability conditional on and has a parametric form. To deal with the identifiability issue, we utilise a nonresponse instrument , an auxiliary variable related to but not related to the nonresponse probability conditional on and . We apply a modified generalised method of moments to obtain estimators of the parameters in the nonresponse probability, and a generalised regression estimation to utilise covariate information for efficient estimation of population parameters. Consistency and asymptotic normality of the proposed estimators of the population parameters are established. Simulation and real data results are presented.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"6 1","pages":"58 - 71"},"PeriodicalIF":0.5,"publicationDate":"2020-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24754269.2020.1856591","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42460662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Research on three-step accelerated gradient algorithm in deep learning 深度学习中三步加速梯度算法的研究
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2020-11-23 DOI: 10.1080/24754269.2020.1846414
Yongqiang Lian, Yincai Tang, Shirong Zhou
Gradient descent (GD) algorithm is the widely used optimisation method in training machine learning and deep learning models. In this paper, based on GD, Polyak's momentum (PM), and Nesterov accelerated gradient (NAG), we give the convergence of the algorithms from an initial value to the optimal value of an objective function in simple quadratic form. Based on the convergence property of the quadratic function, two sister sequences of NAG's iteration and parallel tangent methods in neural networks, the three-step accelerated gradient (TAG) algorithm is proposed, which has three sequences other than two sister sequences. To illustrate the performance of this algorithm, we compare the proposed algorithm with the three other algorithms in quadratic function, high-dimensional quadratic functions, and nonquadratic function. Then we consider to combine the TAG algorithm to the backpropagation algorithm and the stochastic gradient descent algorithm in deep learning. For conveniently facilitate the proposed algorithms, we rewite the R package ‘neuralnet’ and extend it to ‘supneuralnet’. All kinds of deep learning algorithms in this paper are included in ‘supneuralnet’ package. Finally, we show our algorithms are superior to other algorithms in four case studies.
梯度下降算法(GD)是一种广泛应用于机器学习和深度学习模型训练的优化方法。本文基于GD、Polyak动量(PM)和Nesterov加速梯度(NAG),给出了算法从目标函数的初始值到最优值的简单二次型收敛性。基于神经网络中二次函数、NAG迭代的两个姊妹序列和并行切线方法的收敛性,提出了三步加速梯度(TAG)算法,该算法具有除两个姊妹序列外的三个序列。为了说明该算法的性能,我们将该算法与其他三种算法在二次函数、高维二次函数和非二次函数方面进行了比较。然后,我们考虑将TAG算法与深度学习中的反向传播算法和随机梯度下降算法相结合。为了方便所提出的算法,我们重写了R包' neuralnet '并将其扩展为' supneuralnet '。本文中的各种深度学习算法都包含在“supneuralnet”包中。最后,我们在四个案例研究中证明了我们的算法优于其他算法。
{"title":"Research on three-step accelerated gradient algorithm in deep learning","authors":"Yongqiang Lian, Yincai Tang, Shirong Zhou","doi":"10.1080/24754269.2020.1846414","DOIUrl":"https://doi.org/10.1080/24754269.2020.1846414","url":null,"abstract":"Gradient descent (GD) algorithm is the widely used optimisation method in training machine learning and deep learning models. In this paper, based on GD, Polyak's momentum (PM), and Nesterov accelerated gradient (NAG), we give the convergence of the algorithms from an initial value to the optimal value of an objective function in simple quadratic form. Based on the convergence property of the quadratic function, two sister sequences of NAG's iteration and parallel tangent methods in neural networks, the three-step accelerated gradient (TAG) algorithm is proposed, which has three sequences other than two sister sequences. To illustrate the performance of this algorithm, we compare the proposed algorithm with the three other algorithms in quadratic function, high-dimensional quadratic functions, and nonquadratic function. Then we consider to combine the TAG algorithm to the backpropagation algorithm and the stochastic gradient descent algorithm in deep learning. For conveniently facilitate the proposed algorithms, we rewite the R package ‘neuralnet’ and extend it to ‘supneuralnet’. All kinds of deep learning algorithms in this paper are included in ‘supneuralnet’ package. Finally, we show our algorithms are superior to other algorithms in four case studies.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"6 1","pages":"40 - 57"},"PeriodicalIF":0.5,"publicationDate":"2020-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24754269.2020.1846414","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44709557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Target toxicity design for phase I dose-finding I期剂量测定的靶毒性设计
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2020-08-13 DOI: 10.1080/24754269.2020.1800331
Wenchuan Guo, B. Zhong
We propose a new two-/three-stage dose-finding design called Target Toxicity (TT) for phase I clinical trials, where we link the decision rules in the dose-finding process with the conclusions from a hypothesis test. The power to detect excessive toxicity is also given. This solves the problem of why the minimal number of patients is needed for the selected dose level. Our method provides a statistical explanation of traditional ‘3+3’ design using frequentist framework. The proposed method is very flexible and it incorporates other interval-based decision rules through different parameter settings. We provide the decision tables to guide investigators when to decrease, increase or repeat a dose for next cohort of subjects. Simulation experiments were conducted to compare the performance of the proposed method with other dose-finding designs. A free open source R package tsdf is available on CRAN. It is dedicated to deriving two-/three-stage design decision tables and perform dose-finding simulations.
我们提出了一种新的两阶段/三阶段剂量发现设计,称为目标毒性(TT),用于I期临床试验,其中我们将剂量发现过程中的决策规则与假设检验的结论联系起来。并给出了检测过度毒性的能力。这就解决了为什么在选定的剂量水平下需要最少数量的病人的问题。我们的方法利用频率论框架对传统的“3+3”设计进行了统计解释。该方法非常灵活,并通过不同的参数设置融合了其他基于区间的决策规则。我们提供决策表来指导研究者何时减少、增加或重复下一组受试者的剂量。模拟实验比较了该方法与其他剂量检测设计的性能。CRAN上有一个免费的开源R包tsdf。它致力于推导两阶段/三阶段设计决策表并执行剂量查找模拟。
{"title":"Target toxicity design for phase I dose-finding","authors":"Wenchuan Guo, B. Zhong","doi":"10.1080/24754269.2020.1800331","DOIUrl":"https://doi.org/10.1080/24754269.2020.1800331","url":null,"abstract":"We propose a new two-/three-stage dose-finding design called Target Toxicity (TT) for phase I clinical trials, where we link the decision rules in the dose-finding process with the conclusions from a hypothesis test. The power to detect excessive toxicity is also given. This solves the problem of why the minimal number of patients is needed for the selected dose level. Our method provides a statistical explanation of traditional ‘3+3’ design using frequentist framework. The proposed method is very flexible and it incorporates other interval-based decision rules through different parameter settings. We provide the decision tables to guide investigators when to decrease, increase or repeat a dose for next cohort of subjects. Simulation experiments were conducted to compare the performance of the proposed method with other dose-finding designs. A free open source R package tsdf is available on CRAN. It is dedicated to deriving two-/three-stage design decision tables and perform dose-finding simulations.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"5 1","pages":"149 - 161"},"PeriodicalIF":0.5,"publicationDate":"2020-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24754269.2020.1800331","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46466841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Covariate balancing based on kernel density estimates for controlled experiments 基于核密度估计的受控实验协变量平衡
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2020-08-12 DOI: 10.1080/24754269.2021.1878742
Yiou Li, Lulu Kang, Xiao Huang
ABSTRACT Controlled experiments are widely used in many applications to investigate the causal relationship between input factors and experimental outcomes. A completely randomised design is usually used to randomly assign treatment levels to experimental units. When covariates of the experimental units are available, the experimental design should achieve covariate balancing among the treatment groups, such that the statistical inference of the treatment effects is not confounded with any possible effects of covariates. However, covariate imbalance often exists, because the experiment is carried out based on a single realisation of the complete randomisation. It is more likely to occur and worsen when the size of the experimental units is small or moderate. In this paper, we introduce a new covariate balancing criterion, which measures the differences between kernel density estimates of the covariates of treatment groups. To achieve covariate balance before the treatments are randomly assigned, we partition the experimental units by minimising the criterion, then randomly assign the treatment levels to the partitioned groups. Through numerical examples, we show that the proposed partition approach can improve the accuracy of the difference-in-mean estimator and outperforms the complete randomisation and rerandomisation approaches.
摘要对照实验被广泛用于研究输入因素与实验结果之间的因果关系。完全随机设计通常用于将处理水平随机分配给实验单位。当实验单元的协变量可用时,实验设计应在治疗组之间实现协变量平衡,这样治疗效果的统计推断不会与任何可能的协变量影响相混淆。然而,协变量不平衡经常存在,因为实验是基于完全随机化的单一实现进行的。当实验单位规模较小或中等时,更容易发生和恶化。在本文中,我们引入了一个新的协变量平衡准则,它测量了处理组协变量核密度估计之间的差异。为了在随机分配治疗之前实现协变量平衡,我们通过最小化标准来划分实验单元,然后将治疗水平随机分配到划分的组中。通过数值算例,我们证明了所提出的分割方法可以提高均值差估计器的精度,并且优于完全随机化和再随机化方法。
{"title":"Covariate balancing based on kernel density estimates for controlled experiments","authors":"Yiou Li, Lulu Kang, Xiao Huang","doi":"10.1080/24754269.2021.1878742","DOIUrl":"https://doi.org/10.1080/24754269.2021.1878742","url":null,"abstract":"ABSTRACT Controlled experiments are widely used in many applications to investigate the causal relationship between input factors and experimental outcomes. A completely randomised design is usually used to randomly assign treatment levels to experimental units. When covariates of the experimental units are available, the experimental design should achieve covariate balancing among the treatment groups, such that the statistical inference of the treatment effects is not confounded with any possible effects of covariates. However, covariate imbalance often exists, because the experiment is carried out based on a single realisation of the complete randomisation. It is more likely to occur and worsen when the size of the experimental units is small or moderate. In this paper, we introduce a new covariate balancing criterion, which measures the differences between kernel density estimates of the covariates of treatment groups. To achieve covariate balance before the treatments are randomly assigned, we partition the experimental units by minimising the criterion, then randomly assign the treatment levels to the partitioned groups. Through numerical examples, we show that the proposed partition approach can improve the accuracy of the difference-in-mean estimator and outperforms the complete randomisation and rerandomisation approaches.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"5 1","pages":"102 - 113"},"PeriodicalIF":0.5,"publicationDate":"2020-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24754269.2021.1878742","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41800210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A three-parameter logistic regression model 一个三参数逻辑回归模型
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2020-07-24 DOI: 10.1080/24754269.2020.1796098
Xiaoli Yu, Shaoting Li, Jiahua Chen
Dose–response experiments and data analyses are often carried out according to an optimal design under a model assumption. A two-parameter logistic model is often used because of its nice mathematical properties and plausible stochastic response mechanisms. There is an extensive literature on its optimal designs and data analysis strategies. However, a model is at best a good approximation in a real-world application, and researchers must be aware of the risk of model mis-specification. In this paper, we investigate the effectiveness of the sequential ED-design, the D-optimal design, and the up-and-down design under the three-parameter logistic regression model, and we develop a numerical method for the parameter estimation. Simulations show that the combination of the proposed model and the data analysis strategy performs well. When the logistic model is correct, this more complex model has hardly any efficiency loss. The three-parameter logistic model works better than the two-parameter logistic model in the presence of model mis-specification.
剂量反应实验和数据分析通常根据模型假设下的最佳设计进行。双参数逻辑模型由于其良好的数学性质和合理的随机响应机制而经常被使用。关于它的优化设计和数据分析策略,有大量的文献。然而,在现实世界的应用中,模型充其量是一个很好的近似值,研究人员必须意识到模型错误规范的风险。在本文中,我们在三参数逻辑回归模型下研究了顺序ED设计、D最优设计和上下设计的有效性,并开发了一种参数估计的数值方法。仿真结果表明,该模型与数据分析策略的结合效果良好。当逻辑模型正确时,这种更复杂的模型几乎没有任何效率损失。在存在模型错误规范的情况下,三参数逻辑模型比两参数逻辑模型工作得更好。
{"title":"A three-parameter logistic regression model","authors":"Xiaoli Yu, Shaoting Li, Jiahua Chen","doi":"10.1080/24754269.2020.1796098","DOIUrl":"https://doi.org/10.1080/24754269.2020.1796098","url":null,"abstract":"Dose–response experiments and data analyses are often carried out according to an optimal design under a model assumption. A two-parameter logistic model is often used because of its nice mathematical properties and plausible stochastic response mechanisms. There is an extensive literature on its optimal designs and data analysis strategies. However, a model is at best a good approximation in a real-world application, and researchers must be aware of the risk of model mis-specification. In this paper, we investigate the effectiveness of the sequential ED-design, the D-optimal design, and the up-and-down design under the three-parameter logistic regression model, and we develop a numerical method for the parameter estimation. Simulations show that the combination of the proposed model and the data analysis strategy performs well. When the logistic model is correct, this more complex model has hardly any efficiency loss. The three-parameter logistic model works better than the two-parameter logistic model in the presence of model mis-specification.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"5 1","pages":"265 - 274"},"PeriodicalIF":0.5,"publicationDate":"2020-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/24754269.2020.1796098","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46183255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Statistical Theory and Related Fields
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1