Statistical Methodology最新文献

英文中文

Bayesian predictive inference under a Dirichlet process with sensitivity to the normal baseline Dirichlet过程下对正基线敏感的贝叶斯预测推理

Q Mathematics

Statistical Methodology

Pub Date : 2016-01-01 DOI: 10.1016/j.stamet.2015.07.003

Balgobin Nandram, Jiani Yin

It is well known that the Dirichlet process (DP) model and Dirichlet process mixture (DPM) model are sensitive to the specifications of the baseline distribution. Given a sample from a finite population, we perform Bayesian predictive inference about a finite population quantity (e.g., mean) using a DP model. Generally, in many applications a normal distribution is used for the baseline distribution. Therefore, our main objective is empirical and we show the extent of the sensitivity of inference about the finite population mean with respect to six distributions (normal, lognormal, gamma, inverse Gaussian, a two-component normal mixture and a skewed normal). We have compared the DP model using these baselines with the Polya posterior (fully nonparametric) and the Bayesian bootstrap (sampling with a Haldane prior). We used two examples, one on income data and the other on body mass index data, to compare the performance of these three procedures. These examples show some differences among the six baseline distributions, the Polya posterior and the Bayesian bootstrap, indicating that the normal baseline model cannot be used automatically. Therefore, we consider a simulation study to assess this issue further, and we show how to solve this problem using a leave-one-out kernel baseline. Because the leave-one-out kernel baseline cannot be easily applied to the DPM, we show theoretically how one can solve the sensitivity problem for the DPM as well.

众所周知，Dirichlet过程(DP)模型和Dirichlet过程混合(DPM)模型对基线分布的规格很敏感。给定来自有限总体的样本，我们使用DP模型对有限总体数量(例如，平均值)执行贝叶斯预测推断。通常，在许多应用程序中，正态分布用于基线分布。因此，我们的主要目标是经验的，我们展示了关于六个分布(正态，对数正态，伽马，逆高斯，双分量正态混合和偏斜正态)的有限总体均值推理的敏感性程度。我们将使用这些基线的DP模型与Polya后验(完全非参数)和贝叶斯bootstrap(使用Haldane先验抽样)进行了比较。我们用两个例子，一个是收入数据，另一个是身体质量指数数据，来比较这三种方法的效果。这些例子显示了六种基线分布、Polya后验和贝叶斯bootstrap之间的一些差异，表明正常基线模型不能自动使用。因此，我们考虑一个模拟研究来进一步评估这个问题，我们展示了如何使用留一个内核基线来解决这个问题。由于“留一”核基线不能很容易地应用于DPM，因此我们从理论上说明如何解决DPM的灵敏度问题。

{"title":"Bayesian predictive inference under a Dirichlet process with sensitivity to the normal baseline","authors":"Balgobin Nandram, Jiani Yin","doi":"10.1016/j.stamet.2015.07.003","DOIUrl":"10.1016/j.stamet.2015.07.003","url":null,"abstract":"<div><p>It is well known that the Dirichlet process (DP) model and Dirichlet process mixture (DPM) model are sensitive to the specifications of the baseline distribution. Given a sample from a finite population, we perform Bayesian predictive inference about a finite population quantity (e.g., mean) using a DP model. Generally, in many applications a normal distribution is used for the baseline distribution. Therefore, our main objective is empirical and we show the extent of the sensitivity of inference about the finite population mean with respect to six distributions (normal, lognormal, gamma, inverse Gaussian, a two-component normal mixture and a skewed normal). We have compared the DP model using these baselines with the Polya posterior (fully nonparametric) and the Bayesian bootstrap (sampling with a Haldane prior). We used two examples, one on income data and the other on body mass index data, to compare the performance of these three procedures. These examples show some differences among the six baseline distributions, the Polya posterior and the Bayesian bootstrap, indicating that the normal baseline model cannot be used automatically. Therefore, we consider a simulation study to assess this issue further, and we show how to solve this problem using a leave-one-out kernel baseline. Because the leave-one-out kernel baseline cannot be easily applied to the DPM, we show theoretically how one can solve the sensitivity problem for the DPM as well.</p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2015.07.003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55093089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

The step-stress tampered failure rate model under interval monitoring 区间监测下的阶跃应力扰动故障率模型

Q Mathematics

Statistical Methodology

Pub Date : 2015-11-01 DOI: 10.1016/j.stamet.2015.06.002

Panayiotis Bobotas, Maria Kateri

A step-stress accelerated life testing model is constructed that deals with type-I censored experiments for which a continuous monitoring of the tested items is infeasible and only their inspection at particular time points is possible, producing thus grouped data. A general scale family of distributions is considered for the underlying lifetimes, which allows for flexible modeling by permitting different lifetime distributions for different stress levels. The maximum likelihood estimators of its parameters and their density functions are derived explicitly only when the inspection points coincide with the points of stress-level change. In case of additional inspection points, the estimates are obtained numerically. Asymptotic, exact (whenever possible) and bootstrap confidence intervals (CIs) are considered. For the bootstrap CIs a smoothing-modification is introduced, accounting for the categorical nature of the data.

构建了一种阶跃应力加速寿命试验模型，该模型处理了一类删节实验，其中对被测项目的连续监测是不可实现的，只能在特定时间点对其进行检查，从而产生了分组数据。底层生命周期考虑了一个通用的尺度分布族，通过允许不同应力水平的不同生命周期分布来实现灵活的建模。只有当测点与应力水平变化点重合时，其参数及其密度函数的极大似然估计才会得到明确的推导。如果有额外的检查点，则用数值方法进行估计。渐近，精确(只要可能)和自举置信区间(ci)被考虑。对于自举ci，考虑到数据的分类性质，引入了平滑修正。

引用次数: 6

Cluster-based L2 re-weighted regression 基于聚类的L2重加权回归

Q Mathematics

Statistical Methodology

Pub Date : 2015-11-01 DOI: 10.1016/j.stamet.2015.05.005

Ekele Alih, Hong Choon Ong

A simple robust $L 2$ -regression estimator is presented. The proposed method blends a minimum covariance determinant $(M C D)$ concentration algorithm with a controlled ordinary least squares regression phase. A hierarchical cluster analysis then partitions the data into main cluster of “half set” and a minor cluster of one or more groups. An initial least squares regression estimate arises from the main cluster of “half set”. Thereafter, a group-additive difference in fit statistic is used to activate the minor cluster and a controlled re-weighted least squares regression yields a robust efficient estimator with high breakdown value. Simulation experiment shows the advantage of the proposed method over the popular robust regression techniques in terms of robustness of coefficients, and blending outlier diagnostic procedure with parameter estimation.

给出了一种简单的鲁棒l2 -回归估计。该方法将最小协方差行列式(MCD)集中算法与受控普通最小二乘回归相结合。然后，分层聚类分析将数据划分为“半集”的主聚类和一个或多个组的小聚类。初始最小二乘回归估计由“半集”的主簇产生。然后，使用拟合统计量中的组相加性差异来激活小簇，并使用受控的再加权最小二乘回归产生具有高分解值的鲁棒有效估计器。仿真实验表明，该方法在系数的鲁棒性和将异常值诊断过程与参数估计相结合方面优于常用的鲁棒回归方法。

引用次数: 0

Double acceptance sampling plan based on truncated life tests for half exponential power distribution 基于半指数功率分布截尾寿命试验的双重验收抽样方案

Q Mathematics

Statistical Methodology

Pub Date : 2015-11-01 DOI: 10.1016/j.stamet.2015.07.002

Wenhao Gui , Meiping Xu

In this paper, we develop a double acceptance sampling plan for half exponential power distribution when the lifetime experiment is truncated at a prefixed time. The zero and one failure schemes are considered. We obtain the minimum sample sizes of the first and second samples necessary to ensure the specified mean life at the given consumer’s confidence level. The operating characteristic values and the minimum ratios of the mean life to the specified life are also analyzed. Numerical example is provided to illustrate the double acceptance sampling plan.

本文提出了寿命实验在预定时间被截断时半指数功率分布的双接受抽样方案。考虑了零失效方案和一失效方案。我们获得了在给定的消费者置信水平上确保规定的平均寿命所需的第一个和第二个样本的最小样本量。并对其工作特性值和平均寿命与规定寿命的最小比值进行了分析。通过数值算例说明了双验收抽样方案。

引用次数: 12

Multivariate wavelet-based density estimation with size-biased data 基于多变量小波的尺寸偏倚数据密度估计

Q Mathematics

Statistical Methodology

Pub Date : 2015-11-01 DOI: 10.1016/j.stamet.2015.05.002

Esmaeil Shirazi , Hassan Doosti

In this paper, we employ wavelet method to propose a multivariate density estimator based on a biased sample. We investigate the asymptotic rate of convergence of the proposed estimator over a large class of densities in the Besov space, $B_{p q}^{s}$ . Moreover, we prove the consistency of our estimator when the expectation of weight function is unknown. This paper is an extension of results in Ramirez and Vidakovic (2010) and Chesneau et al. (2012) to the multivariate case.

本文利用小波变换方法提出了一种基于有偏样本的多元密度估计方法。我们研究了Besov空间(Bpqs)中一大类密度上所提估计量的渐近收敛率。此外，在权函数期望未知的情况下，证明了估计量的相合性。本文将Ramirez and Vidakovic(2010)和Chesneau et al.(2012)的结果推广到多元情况。

引用次数: 12

Multivariate discrete scalar hazard rate 多元离散标量危险率

Q Mathematics

Statistical Methodology

Pub Date : 2015-11-01 DOI: 10.1016/j.stamet.2015.05.003

N. Unnikrishnan Nair, P.G. Sankaran

In the present paper, we study the properties of the multivariate discrete scalar hazard rate. Its continuous analogue introduced in the early seventies did not attract much attention because it could not be used to identify the corresponding life distribution. We find the conditions under which an $n$ -variate discrete scalar hazard rate can determine the distribution uniquely. Several other properties of this hazard rate which can be employed in modelling lifetime data are discussed. Some ageing classes based on the scalar hazard function are suggested.

本文研究了多元离散标量危险率的性质。它的连续模拟是在70年代初引入的，由于不能用来识别相应的寿命分布而没有引起太多的注意。我们找到了n变量离散标量风险率唯一确定分布的条件。文中还讨论了可用于寿命数据建模的危险率的其他几个性质。提出了基于标量危害函数的老化分类。

引用次数: 1

Statistical inference on partial linear additive models with distortion measurement errors 具有畸变测量误差的部分线性加性模型的统计推断

Q Mathematics

Statistical Methodology

Pub Date : 2015-11-01 DOI: 10.1016/j.stamet.2015.05.004

Yujie Gai , Jun Zhang , Gaorong Li , Xinchao Luo

We consider statistical inference for partial linear additive models (PLAMs) when the linear covariates are measured with errors and distorted by unknown functions of commonly observable confounding variables. A semiparametric profile least squares estimation procedure is proposed to estimate unknown parameter under unrestricted and restricted conditions. Asymptotic properties for the estimators are established. To test a hypothesis on the parametric components, a test statistic based on the difference between the residual sums of squares under the null and alternative hypotheses is proposed, and we further show that its limiting distribution is a weighted sum of independent standard chi-squared distributions. A bootstrap procedure is further proposed to calculate critical values. Simulation studies are conducted to demonstrate the performance of the proposed procedure and a real example is analyzed for an illustration.

我们考虑了部分线性加性模型(PLAMs)的统计推断，当线性协变量测量有误差，并被常见可观察的混杂变量的未知函数扭曲时。提出了一种半参数轮廓最小二乘估计方法，用于在无限制和受限条件下估计未知参数。建立了估计量的渐近性质。为了检验参数分量上的假设，提出了一个基于零假设和备选假设下的残差平方和之差的检验统计量，并进一步证明了它的极限分布是独立标准卡方分布的加权和。进一步提出了一种计算临界值的自举程序。通过仿真研究验证了该方法的有效性，并对一个实例进行了分析。

引用次数: 3

Dynamic Bayesian analysis of generalized odds ratios assuming multivariate skew-normal distribution for the error terms in the system equation 系统方程误差项多元偏正态分布下广义比值比的动态贝叶斯分析

Q Mathematics

Statistical Methodology

Pub Date : 2015-11-01 DOI: 10.1016/j.stamet.2015.05.001

S.K. Ghoreishi , M.R. Meshkani

In this paper, we develop a methodology for the dynamic Bayesian analysis of generalized odds ratios in contingency tables. It is a standard practice to assume a normal distribution for the random effects in the dynamic system equations. Nevertheless, the normality assumption may be unrealistic in some applications and hence the validity of inferences can be dubious. Therefore, we assume a multivariate skew-normal distribution for the error terms in the system equation at each step. Moreover, we introduce a moving average approach to elicit the hyperparameters. Both simulated data and real data are analyzed to illustrate the application of this methodology.

本文提出了列联表中广义优势比的动态贝叶斯分析方法。假设动力系统方程中的随机效应为正态分布是一种标准做法。然而，正态性假设在某些应用中可能是不现实的，因此推断的有效性可能是可疑的。因此，我们假设每一步系统方程中的误差项是多元偏态正态分布。此外，我们还引入了一种移动平均方法来推导超参数。通过对仿真数据和实际数据的分析，说明了该方法的应用。

引用次数: 1

Navigating choices when applying multiple imputation in the presence of multi-level categorical interaction effects 在存在多层次分类交互效应的情况下应用多重输入时的导航选择

Q Mathematics

Statistical Methodology

Pub Date : 2015-11-01 DOI: 10.1016/j.stamet.2015.06.001

Aya A. Mitani , Allison W. Kurian , Amar K. Das , Manisha Desai

Multiple imputation (MI) is an appealing option for handling missing data. When implementing MI, however, users need to make important decisions to obtain estimates with good statistical properties. One such decision involves the choice of imputation model–the joint modeling (JM) versus fully conditional specification (FCS) approach. Another involves the choice of method to handle interactions. These include imputing the interaction term as any other variable (active imputation), or imputing the main effects and then deriving the interaction (passive imputation). Our study investigates the best approach to perform MI in the presence of interaction effects involving two categorical variables. Such effects warrant special attention as they involve multiple correlated parameters that are handled differently under JM and FCS modeling. Through an extensive simulation study, we compared active, passive and an improved passive approach under FCS, as JM precludes passive imputation. We additionally compared JM and FCS techniques using active imputation. Performance between active and passive imputation was comparable. The improved passive approach proved superior to the other two particularly when the number of parameters corresponding to the interaction was large. JM without rounding and FCS using active imputation were also mostly comparable, with JM outperforming FCS when the number of parameters was large. In a direct comparison of JM active and FCS improved passive, the latter was the clear winner. We recommend improved passive imputation under FCS along with sensitivity analyses to handle multi-level interaction terms.

多重输入(Multiple imputation, MI)是处理缺失数据的一种很有吸引力的选择。然而，在实现MI时，用户需要做出重要的决策，以获得具有良好统计特性的估计。其中一个决策涉及到对输入模型的选择——联合建模(JM)和完全条件规范(FCS)方法。另一个涉及处理交互的方法的选择。这些方法包括将交互项推定为任何其他变量(主动推定)，或推定主要效应，然后推导交互(被动推定)。我们的研究探讨了在涉及两个分类变量的相互作用效应存在的情况下执行MI的最佳方法。这些影响需要特别注意，因为它们涉及多个相关参数，这些参数在JM和FCS建模中处理方式不同。通过广泛的仿真研究，我们比较了FCS下的主动、被动和改进的被动方法，因为JM排除了被动imputation。此外，我们比较了JM和FCS技术使用主动插入。主动和被动插补之间的性能具有可比性。改进后的被动方法优于其他两种方法，特别是当相互作用对应的参数数量较大时。不舍入的JM和使用主动插值的FCS也具有很大的可比性，当参数数量较大时，JM优于FCS。在JM主动和FCS改进被动的直接比较中，后者是明显的赢家。我们建议改进FCS下的被动归算和敏感性分析，以处理多层次的相互作用项。

{"title":"Navigating choices when applying multiple imputation in the presence of multi-level categorical interaction effects","authors":"Aya A. Mitani , Allison W. Kurian , Amar K. Das , Manisha Desai","doi":"10.1016/j.stamet.2015.06.001","DOIUrl":"10.1016/j.stamet.2015.06.001","url":null,"abstract":"<div><p>Multiple imputation (MI) is an appealing option for handling missing data. When implementing MI, however, users need to make important decisions to obtain estimates with good statistical properties. One such decision involves the choice of imputation model–the joint modeling (JM) versus fully conditional specification (FCS) approach. Another involves the choice of method to handle interactions. These include imputing the interaction term as any other variable (active imputation), or imputing the main effects and then deriving the interaction (passive imputation). Our study investigates the best approach to perform MI in the presence of interaction effects involving two categorical variables. Such effects warrant special attention as they involve multiple correlated parameters that are handled differently under JM and FCS modeling. Through an extensive simulation study, we compared active, passive and an improved passive approach under FCS, as JM precludes passive imputation. We additionally compared JM and FCS techniques using active imputation. Performance between active and passive imputation was comparable. The improved passive approach proved superior to the other two particularly when the number of parameters corresponding to the interaction was large. JM without rounding and FCS using active imputation were also mostly comparable, with JM outperforming FCS when the number of parameters was large. In a direct comparison of JM active and FCS improved passive, the latter was the clear winner. We recommend improved passive imputation under FCS along with sensitivity analyses to handle multi-level interaction terms.</p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2015.06.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55093038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

Directionally collapsible parameterizations of multivariate binary distributions 多元二元分布的方向可折叠参数化

Q Mathematics

Statistical Methodology

Pub Date : 2015-11-01 DOI: 10.1016/j.stamet.2015.07.001

Tamás Rudas

Odds ratios and log-linear parameters are not collapsible, which means that including a variable into the analysis or omitting one from it, may change the strength of association among the remaining variables. Even the direction of association may be reversed, a fact that is often discussed under the name of Simpson’s paradox. A parameter of association is directionally collapsible, if this reversal cannot occur. The paper investigates the existence of parameters of association which are directionally collapsible. It is shown, that subject to two simple assumptions, no parameter of association, which depends only on the conditional distributions, like the odds ratio does, can be directionally collapsible. The main result is that every directionally collapsible parameter of association gives the same direction of association as a linear contrast of the cell probabilities does. The implication for dealing with Simpson’s paradox is that there is exactly one way to associate direction with the association in any table, so that the paradox never occurs.

比值比和对数线性参数是不可折叠的，这意味着在分析中包括一个变量或从中省略一个变量，可能会改变剩余变量之间的关联强度。甚至连联想的方向也可能被逆转，这一事实经常被称为辛普森悖论。如果这种反转不能发生，关联参数在方向上是可折叠的。研究了方向可折叠的关联参数的存在性。结果表明，在两个简单的假设条件下，没有任何关联参数可以定向折叠，而关联参数只取决于条件分布，就像比值比一样。主要结果是，每个方向可折叠的关联参数都给出了与单元格概率线性对比相同的关联方向。处理辛普森悖论的含义是，只有一种方法可以将任何表中的方向与关联关联起来，这样悖论就永远不会发生。

引用次数: 4

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Statistical Methodology

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀