Pub Date: 2016-12-01 | DOI: 10.1016/j.stamet.2016.10.002
Mohamed Chaouch , Naâmane Laïb , Elias Ould Saïd
The present paper deals with nonparametric M-estimation for a right censored regression model with stationary ergodic data. Defined as an implicit function, a kernel-type estimator of a family of robust regression functions is considered when the covariate takes its values in R^d (d >= 1) and the data are sampled from a stationary ergodic process. The strong consistency (with rate) and the asymptotic distribution of the estimator are established under mild assumptions. Moreover, a usable confidence interval is provided which does not depend on any unknown quantity. Our results hold without any mixing condition and do not require the existence of marginal densities. A comparison study based on simulated data is also provided.
Title: "Nonparametric M-estimation for right censored regression model with stationary ergodic data" (Statistical Methodology, vol. 33, pp. 234–255).
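A rough sketch of the idea (not the authors' estimator, and ignoring censoring): a local Huber M-estimate of the regression function at a point, defined implicitly by a kernel-weighted score equation and solved by fixed-point iteration. The data-generating model, bandwidth and Huber constant below are assumptions for illustration only.

```python
import math
import random

def huber_psi(u, k=1.345):
    # Huber score function: identity in the middle, clipped in the tails
    return max(-k, min(k, u))

def local_m_estimate(xs, ys, x0, h, k=1.345, iters=100):
    # Gaussian kernel weights centred at x0
    w = [math.exp(-0.5 * ((x - x0) / h) ** 2) for x in xs]
    theta = sum(wi * yi for wi, yi in zip(w, ys)) / sum(w)  # Nadaraya-Watson start
    for _ in range(iters):
        # Fixed-point step for the implicit equation sum_i w_i psi(y_i - theta) = 0
        step = sum(wi * huber_psi(yi - theta, k) for wi, yi in zip(w, ys)) / sum(w)
        theta += step
        if abs(step) < 1e-10:
            break
    return theta

random.seed(0)
xs = [random.uniform(0.0, 1.0) for _ in range(400)]
ys = [math.sin(2 * math.pi * x) + random.gauss(0.0, 0.2) for x in xs]
for i in range(0, 400, 20):          # contaminate 5% of responses with gross outliers
    ys[i] += 10.0
theta_hat = local_m_estimate(xs, ys, x0=0.25, h=0.10)
```

Despite the contamination, the bounded score keeps the local estimate near the true regression value sin(pi/2) = 1, which is the robustness the abstract refers to.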
Pub Date: 2016-12-01 | DOI: 10.1016/j.stamet.2016.08.006
Sudipta Das, Anup Dewanji, Debasis Sengupta
In many situations, multiple copies of a software system are tested in parallel with different test cases as input, and the errors detected in a particular round of testing are debugged together. In this article, we discuss a discrete-time model of software reliability for such a scenario of periodic debugging. We propose likelihood-based inference of the model parameters, including the initial number of errors, under the assumption that all errors are equally likely to be detected. The proposed method is used to estimate the reliability of the software. We establish asymptotic normality of the estimated model parameters. The performance of the proposed method is evaluated through a simulation study, and its use is illustrated through the analysis of a dataset obtained from testing of a real-time flight control software. We also consider a more general model, in which different errors have different probabilities of detection.
Title: "Discrete time software reliability modeling with periodic debugging schedule" (Statistical Methodology, vol. 33, pp. 147–159).
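A hypothetical sketch of the kind of likelihood the abstract describes (not the authors' exact model): N initial errors, each error still present at round j is detected with probability p, and detected errors are debugged before the next round. For fixed N the MLE of p has a closed form, so the profile likelihood can be maximized directly over integer N; the detection counts below are made up.

```python
import math

def profile_loglik(N, detections):
    # For fixed N, the MLE of the common detection probability p is
    # (total detected) / (total error-rounds at risk); plug it back in.
    remaining, trials = N, 0
    for d in detections:
        if d > remaining:
            return float("-inf"), 0.0
        trials += remaining
        remaining -= d
    p = sum(detections) / trials
    ll, remaining = 0.0, N
    for d in detections:
        # Binomial(remaining, p) log-probability of detecting d errors this round
        ll += (math.lgamma(remaining + 1) - math.lgamma(d + 1)
               - math.lgamma(remaining - d + 1)
               + d * math.log(p) + (remaining - d) * math.log1p(-p))
        remaining -= d
    return ll, p

def mle_initial_errors(detections, N_max=1000):
    best_ll, best_N, best_p = float("-inf"), None, None
    for N in range(sum(detections), N_max + 1):
        ll, p = profile_loglik(N, detections)
        if ll > best_ll:
            best_ll, best_N, best_p = ll, N, p
    return best_N, best_p

detections = [40, 24, 15, 9, 5]   # hypothetical per-round detection counts
N_hat, p_hat = mle_initial_errors(detections)
```

The geometric-looking decay of the counts is what identifies both the detection probability and the initial number of errors.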
Pub Date: 2016-12-01 | DOI: 10.1016/j.stamet.2016.07.001
Sun-Hee Kim , Lei Li , Christos Faloutsos , Hyung-Jeong Yang , Seong-Whan Lee
Acute hypotensive episodes (AHEs) are serious clinical events in intensive care units (ICUs), and require immediate treatment to prevent patient injury. Reducing the risks associated with an AHE requires effective and efficient mining of data generated from multiple physiological time series. We propose HeartCast, a model that extracts essential features from such data to effectively predict AHE. HeartCast combines a non-linear support vector machine with best-feature extraction via analysis of the baseline threshold, quartile parameters, and window size of the physiological signals. Our approach has the following benefits: (a) it extracts the most relevant features; (b) it provides the best results for identification of an AHE event; (c) it is fast and scales with linear complexity over the length of the window; and (d) it can manage missing values and noise/outliers by using a best-feature extraction method. We performed experiments on data continuously captured from physiological time series of ICU patients (roughly 3 GB of processed data). HeartCast was found to outperform other state-of-the-art methods found in the literature with a 13.7% improvement in classification accuracy.
Title: "HeartCast: Predicting acute hypotensive episodes in intensive care units" (Statistical Methodology, vol. 33, pp. 1–13).
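A minimal sketch of the kind of per-window feature extraction the abstract mentions (baseline threshold and quartile parameters). The baseline value and the simulated signal are assumptions; the actual HeartCast features are more elaborate.

```python
import random
import statistics

def window_features(window, baseline=60.0):
    # Per-window summary: extremes, quartiles, and the fraction of samples
    # falling below the baseline threshold
    q1, med, q3 = statistics.quantiles(window, n=4)
    below = sum(1 for v in window if v < baseline) / len(window)
    return [min(window), q1, med, q3, max(window), below]

random.seed(3)
# hypothetical mean arterial pressure window, in mmHg
window = [random.gauss(72.0, 8.0) for _ in range(120)]
feats = window_features(window)
```

Quartile-type summaries are robust to the noise and outliers that raw physiological samples carry, which is one reason such features pair well with an SVM classifier.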
Pub Date: 2016-12-01 | DOI: 10.1016/j.stamet.2016.09.001
Gregory E. Wilding, Mark C. Baker
The testing of equality of several Pearson correlations arises in a number of scientific fields. We surmise that in many such cases the alternatives of interest in practice are, indeed, order-restricted, and the researcher is therefore best served by testing procedures developed for those specific alternatives. In this note we introduce a collection of tests for testing equality of k correlation coefficients against ordered alternatives, with an emphasis on the simple order. Specifically, we propose likelihood ratio tests and contrast tests based on the well-known Fisher Z transformation, as well as tests which make use of generalized variable methodologies. The proposed procedures are compared empirically with regard to type I and type II error rates via Monte Carlo simulation studies, and their use is illustrated with an example. These tests are found to be vastly superior to tests for the general alternative, and the contrast tests based on the Fisher Z transformation are recommended for practice on the basis of the observed test properties and their simplicity.
Title: "Inference procedures about population correlations under order restrictions" (Statistical Methodology, vol. 33, pp. 203–216).
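The two standard Fisher Z constructions underlying such tests can be sketched directly: a chi-square homogeneity statistic for the unordered alternative, and a centred-score contrast statistic aimed at an increasing simple order. These are textbook versions, not the exact procedures compared in the paper, and the sample correlations below are made up.

```python
import math

def fisher_z_homogeneity(rs, ns):
    # z_i = atanh(r_i) is approximately N(atanh(rho_i), 1/(n_i - 3));
    # the weighted dispersion of the z_i is chi-square with k - 1 df under H0
    zs = [math.atanh(r) for r in rs]
    ws = [n - 3 for n in ns]
    zbar = sum(w * z for w, z in zip(ws, zs)) / sum(ws)
    return sum(w * (z - zbar) ** 2 for w, z in zip(ws, zs))

def fisher_z_trend(rs, ns, scores):
    # Centred-score contrast in the z_i, approximately N(0, 1) under H0;
    # large positive values support an increasing simple order
    zs = [math.atanh(r) for r in rs]
    ws = [n - 3 for n in ns]
    cbar = sum(scores) / len(scores)
    cs = [c - cbar for c in scores]
    num = sum(c * z for c, z in zip(cs, zs))
    den = math.sqrt(sum(c * c / w for c, w in zip(cs, ws)))
    return num / den

stat = fisher_z_homogeneity([0.30, 0.45, 0.60], [50, 60, 55])   # vs chi-square(2)
t = fisher_z_trend([0.30, 0.45, 0.60], [50, 60, 55], [1, 2, 3])  # vs N(0, 1)
```

On these data the one-sided contrast is close to the 5% normal cut-off while the omnibus chi-square statistic is not, which illustrates why order-aware tests can dominate tests for the general alternative.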
Pub Date: 2016-12-01 | DOI: 10.1016/j.stamet.2016.07.003
Elvan Ceyhan
We consider two types of graphs based on a family of proximity catch digraphs (PCDs) and study their edge density. In particular, the PCDs we use are a parameterized digraph family called proportional-edge (PE) PCDs, and the two associated graph types are the "underlying graphs" and the newly introduced "reflexivity graphs" based on the PE-PCDs. These graphs are extensions of random geometric graphs in which distance is replaced with a dissimilarity measure and the threshold is not fixed but depends on the location of the points. PCDs and the associated graphs are constructed from data points of two classes, say X and Y, where one class (say class X) forms the vertices of the PCD and the Delaunay tessellation of the other class (i.e., class Y) yields the (Delaunay) cells which serve as the support of the class X points. We demonstrate that the edge density of these graphs is a U-statistic, and hence obtain its asymptotic normality for data from any distribution that satisfies mild regularity conditions. The rate of convergence to asymptotic normality is sharper for the edge density of the reflexivity and underlying graphs than for the arc density of the PE-PCDs. For uniform data in the Euclidean plane, where the Delaunay cells are triangles, we demonstrate that the distribution of the edge density is geometry invariant (i.e., independent of the shape of the triangular support). Utilizing this geometry invariance property, we compute the explicit forms of the asymptotic normal distribution for uniform data in one Delaunay triangle in the Euclidean plane. We also provide various versions of the edge density in the multiple-triangle case. The approach presented here can also be extended to data in higher dimensions.
Title: "Edge density of new graph types based on a random digraph family" (Statistical Methodology, vol. 33, pp. 31–54).
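For intuition, the edge density of a plain random geometric graph (fixed threshold, unlike the location-dependent thresholds of the PCD-based graphs) is already a U-statistic: an average of a symmetric pairwise kernel. A minimal sketch, with an arbitrary threshold:

```python
import math
import random

def edge_density(points, r):
    # Edge density as a U-statistic: average of the symmetric kernel
    # h(u, v) = 1{ dist(u, v) <= r } over all unordered pairs
    n = len(points)
    edges = sum(1 for i in range(n) for j in range(i + 1, n)
                if math.dist(points[i], points[j]) <= r)
    return edges / (n * (n - 1) // 2)

random.seed(4)
pts = [(random.random(), random.random()) for _ in range(200)]
rho = edge_density(pts, r=0.1)
```

Because it is a pairwise average of bounded kernels, standard U-statistic theory gives its asymptotic normality, which is the route the paper takes for the PCD-based graphs.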
Pub Date: 2016-12-01 | DOI: 10.1016/j.stamet.2016.08.001
Michael Brimacombe
A general diagnostic approach to the evaluation of asymptotic approximation in likelihood-based models is developed and applied to logistic regression. The expected asymptotic and observed log-likelihood functions are compared using a chi distribution in a directional Bayesian setting. This provides a general approach to assessing and visualizing non-convergence in higher-dimensional models. Several well-known examples from the logistic regression literature are discussed.
Title: "Large sample convergence diagnostics for likelihood based inference: Logistic regression" (Statistical Methodology, vol. 33, pp. 114–130).
Pub Date: 2016-12-01 | DOI: 10.1016/j.stamet.2016.08.005
Félix Almendra-Arao , José Juan Castro-Alva , Hortensia Reyes-Cervantes
In both statistical non-inferiority (NI) and superiority (S) tests, the critical region must be a Barnard convex set, for two main reasons. The first is computational: calculating test sizes is a computationally intensive problem due to the presence of a nuisance parameter, but the calculation is considerably reduced when the critical region is a Barnard convex set. The second is that, for NI/S statistical tests to make sense, their critical regions must be Barnard convex sets. While it is indeed possible for the critical region of an NI/S test not to be a Barnard convex set, for the reasons stated above it is desirable that it is. It is therefore important to generate, from a given NI/S test, a test whose critical regions are guaranteed to be Barnard convex sets. We propose a method by which, from a given NI/S test, we construct another NI/S test whose critical regions are Barnard convex sets, and we illustrate the construction through examples. This work is theoretical in that the developments refer to the general framework of NI/S testing for two independent binomial proportions, and applied in that statistical tests that do not ensure that their critical regions are Barnard convex sets do appear in practice, particularly in the area of clinical trials.
Title: "Constructing tests to compare two proportions whose critical regions guarantee to be Barnard convex sets" (Statistical Methodology, vol. 33, pp. 160–171).
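A small illustration, under one assumed monotonicity convention (conventions differ with the direction of the alternative): a critical region is Barnard convex if it is closed under moves that make the outcome more extreme, and any region can be enlarged to its Barnard convex closure. This is a sketch, not the paper's exact construction; note the closure can only enlarge the region, so the size of the modified test has to be re-checked.

```python
def is_barnard_convex(region, n, m):
    # region: set of outcomes (x, y), x successes out of n in one arm,
    # y out of m in the other. Assumed convention: larger x and smaller y
    # both favour the alternative, so the region must be closed under
    # x -> x + 1 and y -> y - 1.
    for (x, y) in region:
        if x < n and (x + 1, y) not in region:
            return False
        if y > 0 and (x, y - 1) not in region:
            return False
    return True

def barnard_closure(region, n, m):
    # Smallest Barnard convex set (under the convention above) that
    # contains the given region
    closed = set()
    for (x, y) in region:
        for xp in range(x, n + 1):
            for yp in range(y + 1):
                closed.add((xp, yp))
    return closed

crit = {(2, 1)}
closed = barnard_closure(crit, n=3, m=2)
```

Here the single outcome (2, 1) is not a Barnard convex region on its own, but its closure {(2, 1), (3, 1), (2, 0), (3, 0)} is.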
Pub Date: 2016-12-01 | DOI: 10.1016/j.stamet.2016.09.004
Rasul A. Khan
The problem of estimating the number of trials n in the binomial distribution B(n, p) is revisited by considering the large-sample model N(μ, cμ) and the associated maximum likelihood estimator (MLE), along with some sequential procedures. Asymptotic properties of the MLE of n via the normal model N(μ, cμ) are briefly described. Beyond the asymptotic properties, our main focus is on the sequential estimation of n. Let X1, X2, …, Xm, … be iid N(μ, cμ) (c > 0) random variables with an unknown mean μ = 1, 2, … and variance cμ, where c is known. The sequential estimation of μ is explored by a method initiated by Robbins (1970) and further pursued by Khan (1973). Various properties of the procedure, including the error probability and the expected sample size, are determined. An asymptotic optimality of the procedure is given. Sequential interval estimation and point estimation are also briefly discussed.
Title: "Estimating the integer mean of a normal model related to binomial distribution" (Statistical Methodology, vol. 33, pp. 192–202).
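The non-sequential part is easy to sketch: with c known, the N(μ, cμ) log-likelihood can simply be maximized over the integers μ = 1, 2, …. The simulation settings below are arbitrary.

```python
import math
import random

def integer_mle(xs, c, mu_max=500):
    # Maximize the N(mu, c * mu) log-likelihood over the integers mu = 1, 2, ...
    # (note the variance depends on mu, so this is not just rounding the mean)
    def loglik(mu):
        var = c * mu
        return sum(-0.5 * math.log(2 * math.pi * var) - (x - mu) ** 2 / (2 * var)
                   for x in xs)
    return max(range(1, mu_max + 1), key=loglik)

random.seed(1)
true_mu, c = 25, 2.0
xs = [random.gauss(true_mu, math.sqrt(c * true_mu)) for _ in range(200)]
mu_hat = integer_mle(xs, c)
```

Because the parameter space is a lattice, the estimator is exactly correct with probability tending to one, which is what makes sequential stopping rules with controlled error probability attractive here.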
Pub Date: 2016-12-01 | DOI: 10.1016/j.stamet.2016.07.002
David Stibůrek
For statistical inference on the drift parameter θ in the process X_t = θ·a(t) + ∫_0^t b(s) dW_s, where a(t) and b(t) are known, deterministic functions, a large number of options are available. We may, for example, base this inference on the normality of the differences between the observed values of the process at discrete times. Although such methods are very simple, it turns out to be more appropriate to use sequential methods. For testing hypotheses about the drift parameter θ, it is more proper to standardize the observed process and to use sequential methods based on the first exit time of the observed process from a pre-specified interval before some given time. These methods can be generalized to the case where the random part is a symmetric Itô integral or a continuous symmetric martingale.
Title: "Sequential testing of hypotheses about drift for Gaussian diffusions" (Statistical Methodology, vol. 33, pp. 14–30).
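A Monte Carlo sketch of the first-exit idea: once the observed process is standardized to a Brownian motion under the null, the test rejects if the path leaves a pre-specified interval (−a, a) before a given time. The boundary value below is an assumed approximation to the two-sided 5% level, not a value from the paper.

```python
import math
import random

def exits_before(T, a, n_steps=1000):
    # Simulate a standard Brownian motion on [0, T] on a grid and report
    # whether it leaves the interval (-a, a) before time T
    dt = T / n_steps
    w = 0.0
    for _ in range(n_steps):
        w += random.gauss(0.0, math.sqrt(dt))
        if abs(w) >= a:
            return True
    return False

random.seed(2)
a = 2.24   # assumed approximate two-sided 5% exit boundary for T = 1
alpha_hat = sum(exits_before(1.0, a) for _ in range(2000)) / 2000
```

The estimated rejection rate under the null lands near 5%, slightly low because the discrete grid can miss excursions between steps.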
Pub Date: 2016-12-01 | DOI: 10.1016/j.stamet.2016.08.004
Clécio S. Ferreira , Víctor H. Lachos
Normal nonlinear regression models are applied in several areas of science and engineering to explain or describe the phenomena under study. However, it is well known that many phenomena are not well represented by the normal model, due to lack of symmetry or to the presence of heavy- and light-tailed distributions related to the normal law in the data. This paper proposes an extension of nonlinear regression models using the skew-scale mixtures of normal (SSMN) distributions proposed by Ferreira et al. (2011). This class of models provides a useful generalization of symmetrical nonlinear regression models, since the random-term distributions cover both asymmetric and heavy-tailed distributions, such as the skew-t-normal, skew-slash and skew-contaminated normal, among others. An expectation-maximization (EM) algorithm for maximum likelihood (ML) estimation is presented, and the observed information matrix is derived analytically. Simulation studies examine the performance of the proposed methods with respect to the robustness and asymptotic properties of the ML estimates. Finally, the method is illustrated on a dataset previously analyzed under normal and skew-normal (SN) nonlinear regression models. The main conclusion is that the ML estimates from the heavy-tailed SSMN nonlinear models are more robust against outlying observations than the corresponding SN estimates.
Title: "Nonlinear regression models under skew scale mixtures of normal distributions" (Statistical Methodology, vol. 33, pp. 131–146).
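The SSMN machinery is involved, but the EM idea is visible in a much simpler symmetric special case: a contaminated normal with fixed contamination fraction ε and variance inflation κ. The E-step computes posterior contamination probabilities and the M-step is a precision-weighted update, which is what makes the fit robust to outliers; all numerical settings are assumptions for the sketch.

```python
import math
import random

def em_contaminated_normal(xs, eps=0.1, kappa=9.0, iters=100):
    # EM for location/scale in the model
    #   X ~ (1 - eps) N(mu, s2) + eps N(mu, kappa * s2),  eps and kappa fixed
    mu = sum(xs) / len(xs)
    s2 = sum((x - mu) ** 2 for x in xs) / len(xs)
    for _ in range(iters):
        # E-step: posterior probability each point came from the wide component
        gam = []
        for x in xs:
            d2 = (x - mu) ** 2
            narrow = (1 - eps) * math.exp(-d2 / (2 * s2)) / math.sqrt(s2)
            wide = eps * math.exp(-d2 / (2 * kappa * s2)) / math.sqrt(kappa * s2)
            gam.append(wide / (narrow + wide))
        # M-step: precision weights 1 for the narrow, 1/kappa for the wide component
        u = [(1 - g) + g / kappa for g in gam]
        mu = sum(ui * x for ui, x in zip(u, xs)) / sum(u)
        s2 = sum(ui * (x - mu) ** 2 for ui, x in zip(u, xs)) / len(xs)
    return mu, s2

random.seed(5)
xs = [random.gauss(5.0, 1.0) for _ in range(180)] + \
     [random.gauss(25.0, 1.0) for _ in range(20)]
mu_hat, s2_hat = em_contaminated_normal(xs)
```

With 10% gross outliers, the sample mean is pulled to about 7, while the EM location estimate stays near the bulk of the data at 5, mirroring the robustness conclusion of the abstract.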