首页 > 最新文献

Statistical Methodology最新文献

英文 中文
Nonparametric M-estimation for right censored regression model with stationary ergodic data 平稳遍历数据右截尾回归模型的非参数m估计
Q Mathematics Pub Date : 2016-12-01 DOI: 10.1016/j.stamet.2016.10.002
Mohamed Chaouch , Naâmane Laïb , Elias Ould Saïd

The present paper deals with a nonparametric M-estimation for right censored regression model with stationary ergodic data. Defined as an implicit function, a kernel-type estimator of a family of robust regression is considered when the covariate takes its values in Rd (d1) and the data are sampled from a stationary ergodic process. The strong consistency (with rate) and the asymptotic distribution of the estimator are established under mild assumptions. Moreover, a usable confidence interval is provided which does not depend on any unknown quantity. Our results hold without any mixing condition and do not require the existence of marginal densities. A comparison study based on simulated data is also provided.

研究具有平稳遍历数据的右截尾回归模型的非参数m估计。当协变量在Rd (d≥1)中取其值并且数据从平稳遍历过程中采样时,将鲁棒回归族的核型估计量定义为隐函数。在温和的假设条件下,建立了估计量的强相合性和渐近分布。此外,提供了一个可用的置信区间,它不依赖于任何未知量。我们的结果不需要任何混合条件,也不需要存在边际密度。并在模拟数据的基础上进行了对比研究。
{"title":"Nonparametric M-estimation for right censored regression model with stationary ergodic data","authors":"Mohamed Chaouch ,&nbsp;Naâmane Laïb ,&nbsp;Elias Ould Saïd","doi":"10.1016/j.stamet.2016.10.002","DOIUrl":"10.1016/j.stamet.2016.10.002","url":null,"abstract":"<div><p>The present paper deals with a nonparametric <span><math><mi>M</mi></math></span><span><span>-estimation for right censored regression model with stationary ergodic data. Defined as an implicit function, a kernel-type estimator of a family of robust regression is considered when the </span>covariate takes its values in </span><span><math><msup><mrow><mi>R</mi></mrow><mrow><mi>d</mi></mrow></msup></math></span> (<span><math><mi>d</mi><mo>≥</mo><mn>1</mn></math></span>) and the data are sampled from a <em>stationary ergodic process</em><span>. The strong consistency (with rate) and the asymptotic distribution of the estimator are established under mild assumptions. Moreover, a usable confidence interval is provided which does not depend on any unknown quantity. Our results hold without any mixing condition and do not require the existence of marginal densities. A comparison study based on simulated data is also provided.</span></p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"33 ","pages":"Pages 234-255"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.10.002","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125822228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Discrete time software reliability modeling with periodic debugging schedule 具有周期性调试计划的离散时间软件可靠性建模
Q Mathematics Pub Date : 2016-12-01 DOI: 10.1016/j.stamet.2016.08.006
Sudipta Das, Anup Dewanji, Debasis Sengupta

In many situations, multiple copies of a software are tested in parallel with different test cases as input, and the detected errors from a particular round of testing are debugged together. In this article, we discuss a discrete time model of software reliability for such a scenario of periodic debugging. We propose likelihood based inference of the model parameters, including the initial number of errors, under the assumption that all errors are equally likely to be detected. The proposed method is used to estimate the reliability of the software. We establish asymptotic normality of the estimated model parameters. The performance of the proposed method is evaluated through a simulation study and its use is illustrated through the analysis of a dataset obtained from testing of a real-time flight control software. We also consider a more general model, in which different errors have different probabilities of detection.

在许多情况下,软件的多个副本以不同的测试用例作为输入并行地进行测试,并且从特定一轮测试中检测到的错误被一起调试。在本文中,我们讨论了这种周期性调试场景下软件可靠性的离散时间模型。我们提出了基于似然的模型参数推理,包括错误的初始数量,假设所有错误都是同样可能被检测到的。采用该方法对软件的可靠性进行了评估。我们建立了估计模型参数的渐近正态性。通过仿真研究评估了该方法的性能,并通过对实时飞行控制软件测试数据集的分析说明了该方法的应用。我们还考虑了一个更一般的模型,其中不同的错误具有不同的检测概率。
{"title":"Discrete time software reliability modeling with periodic debugging schedule","authors":"Sudipta Das,&nbsp;Anup Dewanji,&nbsp;Debasis Sengupta","doi":"10.1016/j.stamet.2016.08.006","DOIUrl":"https://doi.org/10.1016/j.stamet.2016.08.006","url":null,"abstract":"<div><p>In many situations, multiple copies of a software are tested in parallel with different test cases as input, and the detected errors from a particular round of testing are debugged together. In this article, we discuss a discrete time model of software reliability for such a scenario of periodic debugging. We propose likelihood based inference of the model parameters, including the initial number of errors, under the assumption that all errors are equally likely to be detected. The proposed method is used to estimate the reliability of the software. We establish asymptotic normality<span> of the estimated model parameters<span>. The performance of the proposed method is evaluated through a simulation study and its use is illustrated through the analysis of a dataset obtained from testing of a real-time flight control software. We also consider a more general model, in which different errors have different probabilities of detection.</span></span></p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"33 ","pages":"Pages 147-159"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.08.006","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136837489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
HeartCast: Predicting acute hypotensive episodes in intensive care units 心脏预测:预测重症监护病房的急性低血压发作
Q Mathematics Pub Date : 2016-12-01 DOI: 10.1016/j.stamet.2016.07.001
Sun-Hee Kim , Lei Li , Christos Faloutsos , Hyung-Jeong Yang , Seong-Whan Lee

Acute hypotensive episodes (AHEs) are serious clinical events in intensive care units (ICUs), and require immediate treatment to prevent patient injury. Reducing the risks associated with an AHE requires effective and efficient mining of data generated from multiple physiological time series. We propose HeartCast, a model that extracts essential features from such data to effectively predict AHE. HeartCast combines a non-linear support vector machine with best-feature extraction via analysis of the baseline threshold, quartile parameters, and window size of the physiological signals. Our approach has the following benefits: (a) it extracts the most relevant features; (b) it provides the best results for identification of an AHE event; (c) it is fast and scales with linear complexity over the length of the window; and (d) it can manage missing values and noise/outliers by using a best-feature extraction method. We performed experiments on data continuously captured from physiological time series of ICU patients (roughly 3 GB of processed data). HeartCast was found to outperform other state-of-the-art methods found in the literature with a 13.7% improvement in classification accuracy.

急性低血压发作(ahs)是重症监护病房(icu)的严重临床事件,需要立即治疗以防止患者受伤。降低与AHE相关的风险需要对多个生理时间序列产生的数据进行有效和高效的挖掘。我们提出了一个从这些数据中提取基本特征以有效预测AHE的模型HeartCast。HeartCast结合了非线性支持向量机和最佳特征提取,通过分析基线阈值、四分位数参数和生理信号的窗口大小。我们的方法有以下好处:(a)它提取了最相关的特征;(b)为识别AHE事件提供最佳结果;(c)它速度快,并且随窗口长度的线性复杂度缩放;(d)利用最佳特征提取方法对缺失值和噪声/异常值进行管理。我们对从ICU患者的生理时间序列中连续捕获的数据(大约3gb的处理数据)进行了实验。研究发现,HeartCast优于文献中发现的其他最先进的方法,分类准确率提高了13.7%。
{"title":"HeartCast: Predicting acute hypotensive episodes in intensive care units","authors":"Sun-Hee Kim ,&nbsp;Lei Li ,&nbsp;Christos Faloutsos ,&nbsp;Hyung-Jeong Yang ,&nbsp;Seong-Whan Lee","doi":"10.1016/j.stamet.2016.07.001","DOIUrl":"https://doi.org/10.1016/j.stamet.2016.07.001","url":null,"abstract":"<div><p>Acute hypotensive episodes (AHEs) are serious clinical events in intensive care units (ICUs), and require immediate treatment to prevent patient injury. Reducing the risks associated with an AHE requires effective and efficient mining of data generated from multiple physiological time series. We propose HeartCast, a model that extracts essential features from such data to effectively predict AHE. HeartCast combines a non-linear support vector machine with best-feature extraction via analysis of the baseline threshold, quartile parameters, and window size of the physiological signals. Our approach has the following benefits: (a) it extracts the most relevant features; (b) it provides the best results for identification of an AHE event; (c) it is fast and scales with linear complexity over the length of the window; and (d) it can manage missing values and noise/outliers by using a best-feature extraction method. We performed experiments on data continuously captured from physiological time series of ICU patients (roughly 3 GB of processed data). HeartCast was found to outperform other state-of-the-art methods found in the literature with a 13.7% improvement in classification accuracy.</p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"33 ","pages":"Pages 1-13"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.07.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136837496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Inference procedures about population correlations under order restrictions 序约束下种群相关性的推理过程
Q Mathematics Pub Date : 2016-12-01 DOI: 10.1016/j.stamet.2016.09.001
Gregory E. Wilding, Mark C. Baker

The testing of equality of several Pearson correlations can be found in a number of scientific fields. We surmise in many such cases that the alternatives of interest in practice are, in deed, order restricted, and therefore the researcher is best served by use of testing procedures developed for those specific alternatives. In this note we introduce a collection of tests for use in testing equality of k correlation coefficients against order alternatives, with an emphasis on simple order. Specifically, we propose likelihood ratio tests and contrast tests based on the well known Fisher Z transformation as well as tests which make use of generalized variable methodologies. The proposed procedures are empirically compared with regard to type I and II error rates via Monte Carlo simulations studies, and the use of the approaches is illustrated using an example. These tests are found to be vastly superior to tests for the general alternative, and the contrast tests based on the Fisher Z transformation are recommended for practice based on the observed test properties and simplicity.

在许多科学领域都可以找到对几个皮尔逊相关的相等性的检验。我们推测,在许多这样的情况下,在实践中感兴趣的替代方案,实际上,顺序限制,因此,研究人员最好使用测试程序开发的那些特定的替代方案。在本文中,我们介绍了一组测试,用于测试k相关系数对顺序选择的等式,重点是简单顺序。具体来说,我们提出了基于著名的Fisher Z变换的似然比检验和对比检验,以及使用广义变量方法的检验。通过蒙特卡罗模拟研究,对所提出的程序进行了关于I型和II型错误率的经验比较,并通过一个例子说明了这些方法的使用。这些测试被发现远远优于一般替代测试,并且根据观察到的测试特性和简单性,推荐基于Fisher Z变换的对比测试用于实践。
{"title":"Inference procedures about population correlations under order restrictions","authors":"Gregory E. Wilding,&nbsp;Mark C. Baker","doi":"10.1016/j.stamet.2016.09.001","DOIUrl":"https://doi.org/10.1016/j.stamet.2016.09.001","url":null,"abstract":"<div><p><span>The testing of equality of several Pearson correlations can be found in a number of scientific fields. We surmise in many such cases that the alternatives of interest in practice are, in deed, order restricted, and therefore the researcher is best served by use of testing procedures developed for those specific alternatives. In this note we introduce a collection of tests for use in testing equality of </span><span><math><mi>k</mi></math></span><span><span> correlation coefficients against order alternatives, with an emphasis on simple order. Specifically, we propose </span>likelihood ratio tests<span> and contrast tests based on the well known Fisher Z transformation as well as tests which make use of generalized variable methodologies. The proposed procedures are empirically compared with regard to type I and II error rates via Monte Carlo simulations studies, and the use of the approaches is illustrated using an example. These tests are found to be vastly superior to tests for the general alternative, and the contrast tests based on the Fisher Z transformation are recommended for practice based on the observed test properties and simplicity.</span></span></p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"33 ","pages":"Pages 203-216"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.09.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136837553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Edge density of new graph types based on a random digraph family 基于随机有向图族的新图类型的边密度
Q Mathematics Pub Date : 2016-12-01 DOI: 10.1016/j.stamet.2016.07.003
Elvan Ceyhan

We consider two types of graphs based on a family of proximity catch digraphs (PCDs) and study their edge density. In particular, the PCDs we use are a parameterized digraph family called proportional-edge (PE) PCDs and the two associated graph types are the “underlying graphs” and the newly introduced “reflexivity graphs” based on the PE-PCDs. These graphs are extensions of random geometric graphs where distance is replaced with a dissimilarity measure and the threshold is not fixed but depends on the location of the points. PCDs and the associated graphs are constructed based on data points from two classes, say X and Y, where one class (say class X) forms the vertices of the PCD and the Delaunay tessellation of the other class (i.e., class Y) yields the (Delaunay) cells which serve as the support of class X points. We demonstrate that edge density of these graphs is a U-statistic, hence obtain the asymptotic normality of it for data from any distribution that satisfies mild regulatory conditions. The rate of convergence to asymptotic normality is sharper for the edge density of the reflexivity and underlying graphs compared to the arc density of the PE-PCDs. For uniform data in Euclidean plane where Delaunay cells are triangles, we demonstrate that the distribution of the edge density is geometry invariant (i.e., independent of the shape of the triangular support). We compute the explicit forms of the asymptotic normal distribution for uniform data in one Delaunay triangle in the Euclidean plane utilizing this geometry invariance property. We also provide various versions of edge density in the multiple triangle case. The approach presented here can also be extended for application to data in higher dimensions.

我们考虑了两种基于邻近捕获有向图(PCDs)的图,并研究了它们的边缘密度。具体来说,我们使用的是一种被称为比例边缘(PE)的参数化有向图族,相关的两种图类型是“底层图”和基于PE- pcd的新引入的“自反性图”。这些图是随机几何图的扩展,其中距离用不相似度量代替,阈值不是固定的,而是取决于点的位置。PCD和相关图形是基于来自两个类的数据点构建的,比如X和Y,其中一个类(比如X类)形成PCD的顶点,另一个类(比如Y类)的Delaunay镶嵌产生(Delaunay)单元,作为X类点的支持。我们证明了这些图的边密度是一个u统计量,从而得到了它对于任何满足温和调节条件的分布的数据的渐近正态性。与PE-PCDs的弧密度相比,反射率和底层图的边缘密度收敛到渐近正态的速度更快。对于欧几里得平面中Delaunay单元为三角形的均匀数据,我们证明了边缘密度的分布是几何不变的(即与三角形支撑的形状无关)。利用这一几何不变性,我们计算了欧几里得平面上一个Delaunay三角形均匀数据的渐近正态分布的显式形式。在多重三角形的情况下,我们还提供了各种版本的边缘密度。这里介绍的方法也可以扩展到应用于更高维度的数据。
{"title":"Edge density of new graph types based on a random digraph family","authors":"Elvan Ceyhan","doi":"10.1016/j.stamet.2016.07.003","DOIUrl":"https://doi.org/10.1016/j.stamet.2016.07.003","url":null,"abstract":"<div><p>We consider two types of graphs based on a family of proximity catch digraphs (PCDs) and study their edge density. In particular, the PCDs we use are a parameterized digraph family called proportional-edge (PE) PCDs and the two associated graph types are the “underlying graphs” and the newly introduced “reflexivity graphs” based on the PE-PCDs. These graphs are extensions of random geometric graphs where distance is replaced with a dissimilarity measure and the threshold is not fixed but depends on the location of the points. PCDs and the associated graphs are constructed based on data points from two classes, say <span><math><mi>X</mi></math></span> and <span><math><mi>Y</mi></math></span>, where one class (say class <span><math><mi>X</mi></math></span>) forms the vertices of the PCD and the Delaunay tessellation of the other class (i.e., class <span><math><mi>Y</mi></math></span>) yields the (Delaunay) cells which serve as the support of class <span><math><mi>X</mi></math></span> points. We demonstrate that edge density of these graphs is a <span><math><mi>U</mi></math></span><span>-statistic, hence obtain the asymptotic normality<span><span> of it for data from any distribution that satisfies mild regulatory conditions. The rate of convergence to asymptotic normality is sharper for the edge density of the reflexivity and underlying graphs compared to the arc density of the PE-PCDs. For uniform data in </span>Euclidean plane<span> where Delaunay cells are triangles, we demonstrate that the distribution of the edge density is geometry invariant (i.e., independent of the shape of the triangular support). We compute the explicit forms of the asymptotic normal distribution<span><span> for uniform data in one Delaunay triangle in the Euclidean plane utilizing this geometry invariance property. We also provide various versions of edge density in the multiple triangle case. The approach presented here can also be extended for application to data in </span>higher dimensions.</span></span></span></span></p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"33 ","pages":"Pages 31-54"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.07.003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136837485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Large sample convergence diagnostics for likelihood based inference: Logistic regression 基于似然推理的大样本收敛诊断:逻辑回归
Q Mathematics Pub Date : 2016-12-01 DOI: 10.1016/j.stamet.2016.08.001
Michael Brimacombe

A general diagnostic approach to the evaluation of asymptotic approximation in likelihood based models is developed and applied to logistic regression. The expected asymptotic and observed log-likelihood functions are compared using a chi distribution in a directional Bayesian setting. This provides a general approach to assessing and visualizing non-convergence in higher dimensional models. Several well-known examples from the logistic regression literature are discussed.

提出了一种基于似然模型的渐近逼近评估的一般诊断方法,并将其应用于逻辑回归。期望的渐近和观察到的对数似然函数在一个方向贝叶斯设置中使用chi分布进行比较。这为评估和可视化高维模型中的非收敛性提供了一种通用方法。讨论了逻辑回归文献中几个著名的例子。
{"title":"Large sample convergence diagnostics for likelihood based inference: Logistic regression","authors":"Michael Brimacombe","doi":"10.1016/j.stamet.2016.08.001","DOIUrl":"https://doi.org/10.1016/j.stamet.2016.08.001","url":null,"abstract":"<div><p>A general diagnostic approach to the evaluation of asymptotic approximation in likelihood based models is developed and applied to logistic regression. The expected asymptotic and observed log-likelihood functions are compared using a chi distribution in a directional Bayesian setting. This provides a general approach to assessing and visualizing non-convergence in higher dimensional models. Several well-known examples from the logistic regression literature are discussed.</p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"33 ","pages":"Pages 114-130"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.08.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136837487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Constructing tests to compare two proportions whose critical regions guarantee to be Barnard convex sets 构造检验来比较临界区域保证为Barnard凸集的两个比例
Q Mathematics Pub Date : 2016-12-01 DOI: 10.1016/j.stamet.2016.08.005
Félix Almendra-Arao , José Juan Castro-Alva , Hortensia Reyes-Cervantes

In both statistical non-inferiority (NI) and superiority (S) tests, the critical region must be a Barnard convex set for two main reasons. One, being computational in nature, based on the fact that calculating test sizes is a computationally intensive problem due to the presence of a nuisance parameter. However, this calculation is considerably reduced when the critical region is a Barnard convex set. The other reason is that in order for the NI/S statistical tests to make sense, its critical regions must be Barnard convex sets. While it is indeed possible for NI/S tests’ critical regions to not be Barnard convex sets, for the reasons stated above, it is desirable that they are. Therefore, it is important to generate, from a given NI/S test, a test which guarantees that the critical regions are Barnard convex sets. We propose a method by which, from a given NI/S test, we construct another NI/S test, ensuring that the critical regions corresponding to the modified test are Barnard convex sets, we illustrate this through examples. This work is theoretical because the type of developments refers to the general framework of NI/S testing for two independent binomial proportions and it is applied because statistical tests that do not ensure that their critical regions are Barnard convex sets may appear in practice, particularly in the clinical trials area.

在统计非劣效性(NI)和优越性(S)检验中,关键区域必须是Barnard凸集,主要有两个原因。其一,本质上是计算性的,基于这样一个事实,即计算测试大小是一个计算密集的问题,因为存在一个讨厌的参数。然而,当临界区域是Barnard凸集时,这种计算大大减少。另一个原因是,为了使NI/S统计检验有意义,它的临界区域必须是Barnard凸集。虽然NI/S测试的关键区域确实有可能不是Barnard凸集,但由于上述原因,它们是可取的。因此,从给定的NI/S测试中生成一个保证关键区域是Barnard凸集的测试是很重要的。我们提出了一种方法,通过该方法,我们从给定的NI/S测试中构造另一个NI/S测试,确保修改后的测试对应的临界区域是Barnard凸集,我们通过实例说明了这一点。这项工作是理论性的,因为发展类型涉及两个独立二项比例的NI/S测试的一般框架,它被应用是因为不确保其关键区域是巴纳德凸集的统计测试可能出现在实践中,特别是在临床试验领域。
{"title":"Constructing tests to compare two proportions whose critical regions guarantee to be Barnard convex sets","authors":"Félix Almendra-Arao ,&nbsp;José Juan Castro-Alva ,&nbsp;Hortensia Reyes-Cervantes","doi":"10.1016/j.stamet.2016.08.005","DOIUrl":"https://doi.org/10.1016/j.stamet.2016.08.005","url":null,"abstract":"<div><p>In both statistical non-inferiority (NI) and superiority (S) tests, the critical region must be a Barnard convex set<span> for two main reasons. One, being computational in nature, based on the fact that calculating test sizes is a computationally intensive problem due to the presence of a nuisance parameter<span>. However, this calculation is considerably reduced when the critical region is a Barnard convex set. The other reason is that in order for the NI/S statistical tests to make sense, its critical regions must be Barnard convex sets. While it is indeed possible for NI/S tests’ critical regions to not be Barnard convex sets, for the reasons stated above, it is desirable that they are. Therefore, it is important to generate, from a given NI/S test, a test which guarantees that the critical regions are Barnard convex sets. We propose a method by which, from a given NI/S test, we construct another NI/S test, ensuring that the critical regions corresponding to the modified test are Barnard convex sets, we illustrate this through examples. This work is theoretical because the type of developments refers to the general framework of NI/S testing for two independent binomial proportions and it is applied because statistical tests that do not ensure that their critical regions are Barnard convex sets may appear in practice, particularly in the clinical trials area.</span></span></p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"33 ","pages":"Pages 160-171"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.08.005","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136837552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating the integer mean of a normal model related to binomial distribution 估计与二项分布有关的正态模型的整数平均值
Q Mathematics Pub Date : 2016-12-01 DOI: 10.1016/j.stamet.2016.09.004
Rasul A. Khan

A problem for estimating the number of trials n in the binomial distribution B(n,p), is revisited by considering the large sample model N(μ,cμ) and the associated maximum likelihood estimator (MLE) and some sequential procedures. Asymptotic properties of the MLE of n via the normal model N(μ,cμ) are briefly described. Beyond the asymptotic properties, our main focus is on the sequential estimation of n. Let X1,X2,,Xm, be iid N(μ,cμ)(c>0) random variables with an unknown mean μ=1,2, and variance cμ, where c is known. The sequential estimation of μ is explored by a method initiated by Robbins (1970) and further pursued by Khan (1973). Various properties of the procedure including the error probability and the expected sample size are determined. An asymptotic optimality of the procedure is given. Sequential interval estimation and point estimation are also briefly discussed.

通过考虑大样本模型n (μ,cμ)和相关极大似然估计量(MLE)以及一些顺序过程,重新讨论了二项分布B(n,p)中试验数n的估计问题。通过正态模型n (μ,cμ),简要描述了n的最大似然函数的渐近性质。除了渐近性质之外,我们的主要重点是对n的顺序估计。设X1,X2,…,Xm,…为n (μ,cμ)(c>0)个随机变量,平均值μ=1,2,…,方差cμ,其中c是已知的。μ的序贯估计由Robbins(1970)提出,Khan(1973)进一步研究。该过程的各种特性,包括误差概率和期望样本量被确定。给出了该方法的一个渐近最优性。对序列区间估计和点估计也作了简要的讨论。
{"title":"Estimating the integer mean of a normal model related to binomial distribution","authors":"Rasul A. Khan","doi":"10.1016/j.stamet.2016.09.004","DOIUrl":"https://doi.org/10.1016/j.stamet.2016.09.004","url":null,"abstract":"<div><p>A problem for estimating the number of trials <span><math><mi>n</mi></math></span><span> in the binomial distribution </span><span><math><mi>B</mi><mrow><mo>(</mo><mi>n</mi><mo>,</mo><mi>p</mi><mo>)</mo></mrow></math></span>, is revisited by considering the large sample model <span><math><mi>N</mi><mrow><mo>(</mo><mi>μ</mi><mo>,</mo><mi>c</mi><mi>μ</mi><mo>)</mo></mrow></math></span><span><span> and the associated maximum likelihood estimator (MLE) and some sequential procedures. </span>Asymptotic properties of the MLE of </span><span><math><mi>n</mi></math></span> via the normal model <span><math><mi>N</mi><mrow><mo>(</mo><mi>μ</mi><mo>,</mo><mi>c</mi><mi>μ</mi><mo>)</mo></mrow></math></span> are briefly described. Beyond the asymptotic properties, our main focus is on the sequential estimation of <span><math><mi>n</mi></math></span>. Let <span><math><msub><mrow><mi>X</mi></mrow><mrow><mn>1</mn></mrow></msub><mo>,</mo><msub><mrow><mi>X</mi></mrow><mrow><mn>2</mn></mrow></msub><mo>,</mo><mo>…</mo><mo>,</mo><msub><mrow><mi>X</mi></mrow><mrow><mi>m</mi></mrow></msub><mo>,</mo><mo>…</mo></math></span> be iid <span><math><mi>N</mi><mrow><mo>(</mo><mi>μ</mi><mo>,</mo><mi>c</mi><mi>μ</mi><mo>)</mo></mrow></math></span><span><math><mrow><mo>(</mo><mi>c</mi><mo>&gt;</mo><mn>0</mn><mo>)</mo></mrow></math></span> random variables with an unknown mean <span><math><mi>μ</mi><mo>=</mo><mn>1</mn><mo>,</mo><mn>2</mn><mo>,</mo><mo>…</mo></math></span> and variance <span><math><mi>c</mi><mspace></mspace><mi>μ</mi></math></span>, where <span><math><mi>c</mi></math></span> is known. The sequential estimation of <span><math><mi>μ</mi></math></span><span> is explored by a method initiated by Robbins (1970) and further pursued by Khan (1973). Various properties of the procedure including the error probability<span> and the expected sample size are determined. An asymptotic optimality<span> of the procedure is given. Sequential interval estimation and point estimation are also briefly discussed.</span></span></span></p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"33 ","pages":"Pages 192-202"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.09.004","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136837554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sequential testing of hypotheses about drift for Gaussian diffusions 高斯扩散漂移假设的序贯检验
Q Mathematics Pub Date : 2016-12-01 DOI: 10.1016/j.stamet.2016.07.002
David Stibůrek

In statistical inference on the drift parameter θ in the process Xt=θa(t)+0tb(s)dWs, where a(t) and b(t) are known, deterministic functions, there is known a large number of options how to do it. We may, for example, base this inference on the differences between the observed values of the process at discrete times and their normality. Although such methods are very simple, it turns out that it is more appropriate to use sequential methods. For the hypotheses testing about the drift parameter θ, it is more proper to standardize the observed process and to use sequential methods based on the first exit time of the observed process of a pre-specified interval until some given time. These methods can be generalized to the case of random part being a symmetric Itô integral or continuous symmetric martingale.

在对漂移参数θ的统计推断过程中,Xt=θa(t)+∫0tb(s)dWs,其中a(t)和b(t)是已知的确定性函数,有大量已知的选择方法。例如,我们可以根据该过程在离散时间的观测值与其正态性之间的差异来作出这种推断。虽然这些方法非常简单,但事实证明,使用顺序方法更合适。对于漂移参数θ的假设检验,采用对观测过程进行标准化,并根据观测过程在预定区间内的第一次退出时间到某一给定时间的顺序方法更为合适。这些方法可以推广到随机部分为对称Itô积分或连续对称鞅的情况。
{"title":"Sequential testing of hypotheses about drift for Gaussian diffusions","authors":"David Stibůrek","doi":"10.1016/j.stamet.2016.07.002","DOIUrl":"https://doi.org/10.1016/j.stamet.2016.07.002","url":null,"abstract":"<div><p><span>In statistical inference<span> on the drift parameter </span></span><span><math><mi>θ</mi></math></span> in the process <span><math><msub><mrow><mi>X</mi></mrow><mrow><mi>t</mi></mrow></msub><mo>=</mo><mi>θ</mi><mi>a</mi><mrow><mo>(</mo><mi>t</mi><mo>)</mo></mrow><mo>+</mo><msubsup><mrow><mo>∫</mo></mrow><mrow><mn>0</mn></mrow><mrow><mi>t</mi></mrow></msubsup><mi>b</mi><mrow><mo>(</mo><mi>s</mi><mo>)</mo></mrow><mstyle><mi>d</mi></mstyle><msub><mrow><mi>W</mi></mrow><mrow><mi>s</mi></mrow></msub></math></span>, where <span><math><mi>a</mi><mrow><mo>(</mo><mi>t</mi><mo>)</mo></mrow></math></span> and <span><math><mi>b</mi><mrow><mo>(</mo><mi>t</mi><mo>)</mo></mrow></math></span><span><span> are known, deterministic functions, there is known a large number of options how to do it. We may, for example, base this inference on the differences between the observed values of the process at discrete times and their normality. Although such methods are very simple, it turns out that it is more appropriate to use sequential methods. For the </span>hypotheses testing about the drift parameter </span><span><math><mi>θ</mi></math></span><span>, it is more proper to standardize the observed process and to use sequential methods based on the first exit time of the observed process of a pre-specified interval until some given time. These methods can be generalized to the case of random part being a symmetric Itô integral or continuous symmetric martingale.</span></p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"33 ","pages":"Pages 14-30"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.07.002","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136837495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonlinear regression models under skew scale mixtures of normal distributions 正态分布混合偏尺度下的非线性回归模型
Q Mathematics Pub Date : 2016-12-01 DOI: 10.1016/j.stamet.2016.08.004
Clécio S. Ferreira , Víctor H. Lachos

Normal nonlinear regression models are applied in some areas of the sciences and engineering to explain or describe the phenomena under study. However, it is well known that several phenomena are not always represented by the normal model due to lack of symmetry or the presence of heavy- and light-tailed distributions related to the normal law in the data. This paper proposes an extension of nonlinear regression models using the skew-scale mixtures of normal (SSMN) distributions proposed by Ferreira et al. (2011). This class of models provides a useful generalization of the symmetrical nonlinear regression models since the random term distributions cover both asymmetric and heavy-tailed distributions, such as the skew-t-normal, skew-slash and skew-contaminated normal, among others. An expectation–maximization (EM) algorithm for maximum likelihood (ML) estimates is presented and the observed information matrix is derived analytically. Some simulation studies are presented to examine the performance of the proposed methods, with relation to robustness and asymptotic properties of the ML estimates. Finally, an illustration of the method is presented considering a dataset previously analyzed under normal and skew-normal (SN) nonlinear regression models. The main conclusion is that the ML estimates from the heavy tails SSMN nonlinear models are more robust against outlying observations compared to the corresponding SN estimates.

正态非线性回归模型被应用于科学和工程的某些领域来解释或描述所研究的现象。然而,众所周知,由于缺乏对称性或数据中存在与正态律相关的重尾和轻尾分布,一些现象并不总是用正态模型来表示。本文利用Ferreira et al.(2011)提出的斜尺度混合正态分布(SSMN)对非线性回归模型进行了扩展。这类模型提供了对称非线性回归模型的有用推广,因为随机项分布涵盖了不对称和重尾分布,如斜t正态、斜斜线和斜污染正态等。提出了一种最大似然估计的期望最大化算法,并解析导出了观测到的信息矩阵。提出了一些仿真研究来检验所提出的方法的性能,以及与ML估计的鲁棒性和渐近特性的关系。最后,以正态和偏态正态(SN)非线性回归模型下分析的数据集为例说明了该方法。主要结论是,与相应的SN估计相比,来自重尾SSMN非线性模型的ML估计对外围观测值更具鲁棒性。
{"title":"Nonlinear regression models under skew scale mixtures of normal distributions","authors":"Clécio S. Ferreira ,&nbsp;Víctor H. Lachos","doi":"10.1016/j.stamet.2016.08.004","DOIUrl":"https://doi.org/10.1016/j.stamet.2016.08.004","url":null,"abstract":"<div><p><span>Normal nonlinear regression models are applied in some areas of the sciences and engineering to explain or describe the phenomena under study. However, it is well known that several phenomena are not always represented by the normal model due to lack of symmetry or the presence of heavy- and light-tailed distributions related to the normal law in the data. This paper proposes an extension of nonlinear regression models using the skew-scale mixtures of normal (SSMN) distributions proposed by Ferreira et al. (2011). This class of models provides a useful generalization of the symmetrical nonlinear regression models since the random term distributions cover both asymmetric and heavy-tailed distributions, such as the skew-</span><span><math><mi>t</mi></math></span><span>-normal, skew-slash and skew-contaminated normal, among others. An expectation–maximization (EM) algorithm for maximum likelihood (ML) estimates is presented and the observed information matrix is derived analytically. Some simulation studies are presented to examine the performance of the proposed methods, with relation to robustness and asymptotic properties<span> of the ML estimates. Finally, an illustration of the method is presented considering a dataset previously analyzed under normal and skew-normal (SN) nonlinear regression models. The main conclusion is that the ML estimates from the heavy tails SSMN nonlinear models are more robust against outlying observations compared to the corresponding SN estimates.</span></span></p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"33 ","pages":"Pages 131-146"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.08.004","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136837486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
Statistical Methodology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1