首页 > 最新文献

Statistical Modeling最新文献

英文 中文
Bayesian spatial models for small area estimation of proportions 小面积比例估计的贝叶斯空间模型
Pub Date : 2002-10-01 DOI: 10.1191/1471082x02st032oa
Fas Moura
This article presents a logistic hierarchical model approach for small area prediction of proportions, taking into account both possible spatial and unstructured heterogeneity effects. The posterior distributions of the proportion predictors are obtained via Markov Chain Monte Carlo methods. This automatically takes into account the extra uncertainty associated with the hyperparameters. The procedures are applied to a real data set and comparisons are made under several settings, including a quite general logistic hierarchical model with spatial structure plus unstructured heterogeneity for small area effects. A model selection criterion based on the Expected Prediction Deviance is proposed. Its utility for selecting among competitive models in the small area prediction context is examined.
本文提出了一种考虑可能的空间和非结构异质性效应的小面积比例预测逻辑层次模型方法。通过马尔可夫链蒙特卡罗方法得到了比例预测因子的后验分布。这将自动考虑与超参数相关的额外不确定性。这些程序应用于实际数据集,并在几种设置下进行了比较,包括具有空间结构和小区域效应的非结构化异质性的相当一般的逻辑分层模型。提出了一种基于预期预测偏差的模型选择准则。研究了它在小区域预测环境下对竞争模型进行选择的效用。
{"title":"Bayesian spatial models for small area estimation of proportions","authors":"Fas Moura","doi":"10.1191/1471082x02st032oa","DOIUrl":"https://doi.org/10.1191/1471082x02st032oa","url":null,"abstract":"This article presents a logistic hierarchical model approach for small area prediction of proportions, taking into account both possible spatial and unstructured heterogeneity effects. The posterior distributions of the proportion predictors are obtained via Markov Chain Monte Carlo methods. This automatically takes into account the extra uncertainty associated with the hyperparameters. The procedures are applied to a real data set and comparisons are made under several settings, including a quite general logistic hierarchical model with spatial structure plus unstructured heterogeneity for small area effects. A model selection criterion based on the Expected Prediction Deviance is proposed. Its utility for selecting among competitive models in the small area prediction context is examined.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122524343","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
Semiparametric modelling of spatial binary observations 空间二元观测的半参数建模
Pub Date : 2002-07-01 DOI: 10.1191/1471082x02st023oa
M. Alfò, P. Postiglione
In the past decade various attempts have been made to extend standard random effects models to the analysis of spatial observations. This extension is a source of theoretical difficulty due to the multidirectional dependence among nearest observations; much of the previous work was based on parametric assumptions about the random effects distribution. To avoid any restriction, we propose a conditional model for spatial binary responses, without assuming a parametric distribution for the random effects. The model parameters are estimated using the EM algorithm for nonparametric maximum likelihood estimation of a mixing distribution. To illustrate the proposed approach, the model is applied to a remote sensed image of the Nebrodi Mountains (Italy).
在过去十年中,人们进行了各种尝试,将标准随机效应模型扩展到空间观测的分析中。由于最近观测之间的多向依赖,这种扩展是理论困难的来源;以前的大部分工作都是基于随机效应分布的参数假设。为了避免任何限制,我们提出了一个空间二元响应的条件模型,而不假设随机效应的参数分布。利用EM算法对混合分布进行非参数极大似然估计,对模型参数进行估计。为了说明所提出的方法,将该模型应用于Nebrodi山脉(意大利)的遥感图像。
{"title":"Semiparametric modelling of spatial binary observations","authors":"M. Alfò, P. Postiglione","doi":"10.1191/1471082x02st023oa","DOIUrl":"https://doi.org/10.1191/1471082x02st023oa","url":null,"abstract":"In the past decade various attempts have been made to extend standard random effects models to the analysis of spatial observations. This extension is a source of theoretical difficulty due to the multidirectional dependence among nearest observations; much of the previous work was based on parametric assumptions about the random effects distribution. To avoid any restriction, we propose a conditional model for spatial binary responses, without assuming a parametric distribution for the random effects. The model parameters are estimated using the EM algorithm for nonparametric maximum likelihood estimation of a mixing distribution. To illustrate the proposed approach, the model is applied to a remote sensed image of the Nebrodi Mountains (Italy).","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"323 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133712694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Improving financial risk assessment through dependency 通过依赖改进财务风险评估
Pub Date : 2002-07-01 DOI: 10.1191/1471082x02st028oa
Beatriz Mendes, A. Moretti
Understanding dependency between financial markets is crucial when measuring globally integrated exposures to risk. To this end the first step may be the investigation of the joint behaviour of their most representative indexes. We fit by parametric and nonparametric methods bivariate extreme value models on the component wise maxima and minima computed monthly from several pairs of indexes representing the North American, Latin American, and Emerging markets. We analyse the role of the asymmetric models, finding which market drives the dependency, and express the degrees of dependence using measures of linear and nonlinear dependency such as the linear correlation coefficient ρ and the measure τ based on the dependence function. We discuss the interpretation of τ as a conditional probability that a crash occurs in a market given that a catastrophic event has occurred in some other market. We assess risks by computing probabilities associated with joint extreme events and by computing joint risk measures. We show empirically that the joint Value-at-Risk may be severely under-estimated if independence is assumed between markets. To take into account the clustering of extreme events we compute the bivariate extremal index and incorporate this information in the analysis.
在衡量全球一体化风险敞口时,了解金融市场之间的依赖关系至关重要。为此,第一步可能是调查它们最具代表性的指数的共同行为。我们通过参数和非参数方法对代表北美、拉丁美洲和新兴市场的几对指数每月计算的分量最大和最小值的二元极值模型进行拟合。我们分析了不对称模型的作用,找出了驱动依赖的市场,并使用线性和非线性依赖度量(如基于依赖函数的线性相关系数ρ和度量τ)来表示依赖程度。我们讨论将τ解释为假设在其他市场发生灾难性事件的情况下,市场发生崩溃的条件概率。我们通过计算与联合极端事件相关的概率和计算联合风险度量来评估风险。我们的经验表明,如果假设市场之间的独立性,联合风险价值可能被严重低估。为了考虑到极端事件的聚类,我们计算了二元极值指数,并将该信息纳入分析。
{"title":"Improving financial risk assessment through dependency","authors":"Beatriz Mendes, A. Moretti","doi":"10.1191/1471082x02st028oa","DOIUrl":"https://doi.org/10.1191/1471082x02st028oa","url":null,"abstract":"Understanding dependency between financial markets is crucial when measuring globally integrated exposures to risk. To this end the first step may be the investigation of the joint behaviour of their most representative indexes. We fit by parametric and nonparametric methods bivariate extreme value models on the component wise maxima and minima computed monthly from several pairs of indexes representing the North American, Latin American, and Emerging markets. We analyse the role of the asymmetric models, finding which market drives the dependency, and express the degrees of dependence using measures of linear and nonlinear dependency such as the linear correlation coefficient ρ and the measure τ based on the dependence function. We discuss the interpretation of τ as a conditional probability that a crash occurs in a market given that a catastrophic event has occurred in some other market. We assess risks by computing probabilities associated with joint extreme events and by computing joint risk measures. We show empirically that the joint Value-at-Risk may be severely under-estimated if independence is assumed between markets. To take into account the clustering of extreme events we compute the bivariate extremal index and incorporate this information in the analysis.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124824301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Genetic analysis of cause of death in a mixture model of bivariate lifetime data 双变量寿命数据混合模型中死因的遗传分析
Pub Date : 2002-07-01 DOI: 10.1191/1471082x02st030oa
A. Wienke, K. Christensen, A. Skytthe, A. Yashin
A mixture model in multivariate survival analysis is presented, whereby heterogeneity among subjects creates divergent paths for the individual’s risk of experiencing an event (i.e., disease), as well as for the associated length of survival. Dependence among competing risks is included and rendered testable. This method is an extension of the bivariate correlated gamma-frailty model. It is applied to a data set on Danish twins, for whom cause-specific mortality is known. The use of multivariate data solves the identifiability problem which is inherent in the competing risk model of univariate lifetimes. We analyse the influence of genetic and environmental factors on frailty. Using a sample of 1470 monozygotic (MZ) and 2730 dizygotic (DZ) female twin pairs, we apply five genetic models to the associated mortality data, focusing particularly on death from coronary heart disease (CHD). Using the best fitting model, the inheritance risk of death from CHD was 0.39 (standard error 0.13). The results from this model are compared with the results from earlier analysis that used the restricted model, where the independence of competing risks was assumed. Comparing both cases, it turns out, that heritability of frailty on mortality due to CHD change substantially. Despite the inclusion of dependence, analysis confirms the significant genetic component to an individual’s risk of mortality from CHD. Whether dependence or independence is assumed, the best model for analysis with regard to CHD mortality risks is a model assuming that additive factors are responsible for heritability in susceptibility to CHD. The paper ends with a discussion of limitations and possible further extensions to the model presented.
提出了多变量生存分析中的混合模型,其中受试者之间的异质性为个体经历事件(即疾病)的风险以及相关的生存时间长度创造了不同的路径。相互竞争的风险之间的依赖关系被包括在内,并呈现为可测试的。该方法是二元相关γ -脆弱性模型的扩展。它被应用于丹麦双胞胎的数据集,其中特定原因的死亡率是已知的。多变量数据的使用解决了单变量寿命竞争风险模型固有的可识别性问题。我们分析了遗传和环境因素对脆弱的影响。使用1470对单卵(MZ)和2730对异卵(DZ)女性双胞胎的样本,我们应用五种遗传模型来分析相关的死亡率数据,特别关注冠心病(CHD)的死亡。采用最佳拟合模型,冠心病死亡遗传风险为0.39(标准误差0.13)。该模型的结果与先前使用受限模型的分析结果进行了比较,其中假设竞争风险的独立性。比较这两种情况,结果表明,由于冠心病导致的虚弱和死亡率的遗传性发生了很大的变化。尽管存在依赖性,但分析证实了个体冠心病死亡风险的重要遗传因素。无论假设依赖性还是独立性,关于冠心病死亡风险分析的最佳模型是假设累加性因素对冠心病易感性的遗传性负责的模型。文章最后讨论了所提出模型的局限性和可能的进一步扩展。
{"title":"Genetic analysis of cause of death in a mixture model of bivariate lifetime data","authors":"A. Wienke, K. Christensen, A. Skytthe, A. Yashin","doi":"10.1191/1471082x02st030oa","DOIUrl":"https://doi.org/10.1191/1471082x02st030oa","url":null,"abstract":"A mixture model in multivariate survival analysis is presented, whereby heterogeneity among subjects creates divergent paths for the individual’s risk of experiencing an event (i.e., disease), as well as for the associated length of survival. Dependence among competing risks is included and rendered testable. This method is an extension of the bivariate correlated gamma-frailty model. It is applied to a data set on Danish twins, for whom cause-specific mortality is known. The use of multivariate data solves the identifiability problem which is inherent in the competing risk model of univariate lifetimes. We analyse the influence of genetic and environmental factors on frailty. Using a sample of 1470 monozygotic (MZ) and 2730 dizygotic (DZ) female twin pairs, we apply five genetic models to the associated mortality data, focusing particularly on death from coronary heart disease (CHD). Using the best fitting model, the inheritance risk of death from CHD was 0.39 (standard error 0.13). The results from this model are compared with the results from earlier analysis that used the restricted model, where the independence of competing risks was assumed. Comparing both cases, it turns out, that heritability of frailty on mortality due to CHD change substantially. Despite the inclusion of dependence, analysis confirms the significant genetic component to an individual’s risk of mortality from CHD. Whether dependence or independence is assumed, the best model for analysis with regard to CHD mortality risks is a model assuming that additive factors are responsible for heritability in susceptibility to CHD. The paper ends with a discussion of limitations and possible further extensions to the model presented.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"2975 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127456412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
Stationary space-time Gaussian fields and their time autoregressive representation 静止时空高斯场及其时间自回归表示
Pub Date : 2002-07-01 DOI: 10.1191/1471082x02st029oa
G. Storvik, A. Frigessi, D. Hirst
We compare two different modelling strategies for continuous space discrete time data. The first strategy is in the spirit of Gaussian kriging. The model is a general stationary space-time Gaussian field where the key point is the choice of a parametric form for the covariance function. In the main, covariance functions that are used are separable in space and time. Nonseparable covariance functions are useful in many applications, but construction of these is not easy. The second strategy is to model the time evolution of the process more directly. We consider models of the autoregressive type where the process at time t is obtained by convolving the process at time t − 1 and adding spatially correlated noise. Under specific conditions, the two strategies describe two different formulations of the same stochastic process. We show how the two representations look in different cases. Furthermore, by transforming time-dynamic convolution models to Gaussian fields we can obtain new covariance functions and by writing a Gaussian field as a time-dynamic convolution model, interesting properties are discovered. The computational aspects of the two strategies are discussed through experiments on a dataset of daily UK temperatures. Although algorithms for performing estimation, simulation, and so on are easy to do for the first strategy, more computer-efficient algorithms based on the second strategy can be constructed.
我们比较了连续空间离散时间数据的两种不同建模策略。第一种策略是基于高斯克里格的精神。该模型是一个一般的平稳时空高斯场,关键是协方差函数参数形式的选择。总的来说,所使用的协方差函数在空间和时间上是可分离的。不可分离协方差函数在许多应用中都很有用,但是构造它们并不容易。第二种策略是更直接地对过程的时间演化进行建模。我们考虑自回归类型的模型,其中时间t的过程是通过卷积时间t−1的过程并添加空间相关噪声来获得的。在特定条件下,这两种策略描述了同一随机过程的两种不同表述。我们将展示这两种表示在不同情况下的表现。此外,通过将时间动态卷积模型转换为高斯场,我们可以得到新的协方差函数,并且通过将高斯场写成时间动态卷积模型,我们发现了一些有趣的性质。通过对英国每日温度数据集的实验,讨论了这两种策略的计算方面。虽然对于第一种策略,执行估计、模拟等的算法很容易做到,但是基于第二种策略,可以构建更高效的计算机算法。
{"title":"Stationary space-time Gaussian fields and their time autoregressive representation","authors":"G. Storvik, A. Frigessi, D. Hirst","doi":"10.1191/1471082x02st029oa","DOIUrl":"https://doi.org/10.1191/1471082x02st029oa","url":null,"abstract":"We compare two different modelling strategies for continuous space discrete time data. The first strategy is in the spirit of Gaussian kriging. The model is a general stationary space-time Gaussian field where the key point is the choice of a parametric form for the covariance function. In the main, covariance functions that are used are separable in space and time. Nonseparable covariance functions are useful in many applications, but construction of these is not easy. The second strategy is to model the time evolution of the process more directly. We consider models of the autoregressive type where the process at time t is obtained by convolving the process at time t − 1 and adding spatially correlated noise. Under specific conditions, the two strategies describe two different formulations of the same stochastic process. We show how the two representations look in different cases. Furthermore, by transforming time-dynamic convolution models to Gaussian fields we can obtain new covariance functions and by writing a Gaussian field as a time-dynamic convolution model, interesting properties are discovered. The computational aspects of the two strategies are discussed through experiments on a dataset of daily UK temperatures. Although algorithms for performing estimation, simulation, and so on are easy to do for the first strategy, more computer-efficient algorithms based on the second strategy can be constructed.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125513153","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 51
Semiparametric methods in applied econometrics: do the models fit the data? 应用计量经济学中的半参数方法:模型是否与数据拟合?
Pub Date : 2002-04-01 DOI: 10.1191/1471082x02st024oa
J. Horowitz, S. Lee
Much empirical research in economics and other fields is concerned with estimating the mean of a random variable conditional on one or more explanatory variables (conditional mean function). The most frequently used estimation methods assume that the conditional mean function is known up to a finite number of parameters, but the resulting estimates can be highly misleading if the assumed parametric model is incorrect. This paper reviews several semiparametric methods for estimating conditional mean functions. These methods are more flexible than parametric methods and offer greater estimation precision than do fully nonparametric methods. The various estimation methods are illustrated by applying them to data on the salaries of professional baseball players in the USA. We find that a parametric model and several simple semiparametric models fail to capture important features of the data. However, a sufficiently rich semiparametric model fits the data well. We conclude that semiparametric models can achieve their aim of providing flexible representations of conditional mean functions, but care is needed in choosing the semiparametric specification. Our analysis also provides some suggestions for further research on semiparametric estimation.
经济学和其他领域的许多实证研究都关注于估计一个随机变量的均值,该随机变量的均值取决于一个或多个解释变量(条件均值函数)。最常用的估计方法假设条件平均函数已知有限数量的参数,但如果假设的参数模型不正确,结果估计可能会产生很大的误导。本文综述了几种估计条件平均函数的半参数方法。这些方法比参数方法更灵活,并且比完全非参数方法提供更高的估计精度。通过将各种估计方法应用于美国职业棒球运动员的工资数据来说明各种估计方法。我们发现一个参数模型和几个简单的半参数模型不能捕捉数据的重要特征。然而,一个足够丰富的半参数模型可以很好地拟合数据。我们得出结论,半参数模型可以实现提供条件平均函数的灵活表示的目的,但在选择半参数规范时需要注意。本文的分析也为半参数估计的进一步研究提供了一些建议。
{"title":"Semiparametric methods in applied econometrics: do the models fit the data?","authors":"J. Horowitz, S. Lee","doi":"10.1191/1471082x02st024oa","DOIUrl":"https://doi.org/10.1191/1471082x02st024oa","url":null,"abstract":"Much empirical research in economics and other fields is concerned with estimating the mean of a random variable conditional on one or more explanatory variables (conditional mean function). The most frequently used estimation methods assume that the conditional mean function is known up to a finite number of parameters, but the resulting estimates can be highly misleading if the assumed parametric model is incorrect. This paper reviews several semiparametric methods for estimating conditional mean functions. These methods are more flexible than parametric methods and offer greater estimation precision than do fully nonparametric methods. The various estimation methods are illustrated by applying them to data on the salaries of professional baseball players in the USA. We find that a parametric model and several simple semiparametric models fail to capture important features of the data. However, a sufficiently rich semiparametric model fits the data well. We conclude that semiparametric models can achieve their aim of providing flexible representations of conditional mean functions, but care is needed in choosing the semiparametric specification. Our analysis also provides some suggestions for further research on semiparametric estimation.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123216892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
Fitting exponential family mixed models 拟合指数族混合模型
Pub Date : 2002-04-01 DOI: 10.1191/1471082x02st025oa
J. Palmgren, S. Ripatti
The generalized linear model (McCullagh and Nelder, 1972) and the semiparametric multiplicative hazard model (Cox, 1972) have significantly influenced the way in which statistical modelling is taught and practiced. Common for the two model families is the assumption that conditionally on covariate information (including time) the observations are independent. The obvious difficulty in identifying and measuring all relevant covariates has pushed for methods that can jointly handle both mean and dependence structures. The early 1990s saw a myriad of approaches for dealing with multivariate generalized linear models. More recently, the hazard models have been extended to multivariate settings. Here we review (i) penalized likelihood, (ii) Monte Carlo EM, and (iii) Bayesian Markov chain Monte Carlo methods for fitting the generalized linear mixed models and the frailty models, and we discuss the rationale for choosing between the methods. The similarities of the toolboxes for these two multivariate model families open up for a new level of generality both in teaching and applied research. Two examples are used for illustration, involving censored failure time responses and Poisson responses, respectively.
广义线性模型(McCullagh and Nelder, 1972)和半参数乘法风险模型(Cox, 1972)对统计建模的教学和实践方式产生了重大影响。这两种模型族的共同点是假设观测值在协变量信息(包括时间)的条件下是独立的。识别和测量所有相关协变量的明显困难推动了可以联合处理均值和依赖结构的方法。20世纪90年代初出现了无数处理多元广义线性模型的方法。最近,风险模型已扩展到多变量设置。在这里,我们回顾了(i)惩罚似然,(ii)蒙特卡罗EM和(iii)贝叶斯马尔可夫链蒙特卡罗方法拟合广义线性混合模型和脆弱性模型,并讨论了在方法之间进行选择的基本原理。这两个多元模型族工具箱的相似性为教学和应用研究的通用性开辟了一个新的水平。用两个例子来说明,分别涉及截尾失效时间响应和泊松响应。
{"title":"Fitting exponential family mixed models","authors":"J. Palmgren, S. Ripatti","doi":"10.1191/1471082x02st025oa","DOIUrl":"https://doi.org/10.1191/1471082x02st025oa","url":null,"abstract":"The generalized linear model (McCullagh and Nelder, 1972) and the semiparametric multiplicative hazard model (Cox, 1972) have significantly influenced the way in which statistical modelling is taught and practiced. Common for the two model families is the assumption that conditionally on covariate information (including time) the observations are independent. The obvious difficulty in identifying and measuring all relevant covariates has pushed for methods that can jointly handle both mean and dependence structures. The early 1990s saw a myriad of approaches for dealing with multivariate generalized linear models. More recently, the hazard models have been extended to multivariate settings. Here we review (i) penalized likelihood, (ii) Monte Carlo EM, and (iii) Bayesian Markov chain Monte Carlo methods for fitting the generalized linear mixed models and the frailty models, and we discuss the rationale for choosing between the methods. The similarities of the toolboxes for these two multivariate model families open up for a new level of generality both in teaching and applied research. Two examples are used for illustration, involving censored failure time responses and Poisson responses, respectively.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133650489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
The common structure of several models for non-ignorable dropout 几种不可忽略辍学模型的共同结构
Pub Date : 2002-04-01 DOI: 10.1191/1471082x02st022oa
R. Crouchley, M. Ganjali
This paper presents a multivariate generalization of the classical Heckman selection model and applies it to non-ignorable dropout in repeated continuous responses. Many of the recent models for dropout in repeated continuous responses can be written as special forms of this generalized Heckman model. To illustrate this, we present the parameterizations needed to obtain the form of dropout model that occurs when (1) the separate models for the response and dropout are linked by common random parameters, (2) the dropout model is an explicit function of the previous responses and the possibly unobserved current response, (3) the dropout model is both a function of the current response and a common random parameter, and (4) there is a covariance between the stochastic disturbances of the response and dropout processes. We present the joint likelihood of the generalized Heckman model and a residual for the responses. We contrast two of the dropout models in a simulation study. We compare the results obtained from several dropout models on the well known mastitis data.
本文对经典Heckman选择模型进行了多元推广,并将其应用于重复连续响应中的不可忽略dropout。最近的许多重复连续响应的辍学模型都可以写成这种广义Heckman模型的特殊形式。为了说明这一点,我们提出了获得dropout模型形式所需的参数化,当(1)响应和dropout的独立模型由共同随机参数连接时,(2)dropout模型是先前响应和可能未观察到的当前响应的显式函数,(3)dropout模型既是当前响应的函数,也是共同随机参数。(4)响应的随机扰动与dropout过程之间存在协方差。我们给出了广义Heckman模型的联合似然和响应的残差。我们在模拟研究中对比了两种辍学模型。我们比较了在已知的乳腺炎数据上从几个dropout模型获得的结果。
{"title":"The common structure of several models for non-ignorable dropout","authors":"R. Crouchley, M. Ganjali","doi":"10.1191/1471082x02st022oa","DOIUrl":"https://doi.org/10.1191/1471082x02st022oa","url":null,"abstract":"This paper presents a multivariate generalization of the classical Heckman selection model and applies it to non-ignorable dropout in repeated continuous responses. Many of the recent models for dropout in repeated continuous responses can be written as special forms of this generalized Heckman model. To illustrate this, we present the parameterizations needed to obtain the form of dropout model that occurs when (1) the separate models for the response and dropout are linked by common random parameters, (2) the dropout model is an explicit function of the previous responses and the possibly unobserved current response, (3) the dropout model is both a function of the current response and a common random parameter, and (4) there is a covariance between the stochastic disturbances of the response and dropout processes. We present the joint likelihood of the generalized Heckman model and a residual for the responses. We contrast two of the dropout models in a simulation study. We compare the results obtained from several dropout models on the well known mastitis data.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126622537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
A life table approach to small area health need profiling 小区域健康需求分析的生命表方法
Pub Date : 2002-04-01 DOI: 10.1191/1471082x02st026oa
Peter Congdon
Recent developments in health outcome models for small areas have found benefits from pooling information over areas to produce smoothed estimates of mortality and morbidity rates. Such indices serve as proxies for the need for health care and are often used in allocating health care resources. The present paper adopts a full life table approach to such outcomes, which includes the joint modelling of mortality and health variation between small areas. A further feature of the approach here is random effects modelling of age-specific death and wellness rates, so pooling strength in estimating life table parameters for areas, such as healthy and total life expectancies, which may be based on small event counts. The basic model involves exchangeable random effects for age and area. However, structured forms of variation considered include correlations between mortality and health, spatial correlation in these outcomes, and interrelatedness in age effects. A case study illustration uses deaths and long-term illness data to develop small area life tables for two London boroughs, and includes a temporal perspective on deaths. It then considers the utility of area life table measures in predicting health activity, providing a form of validation in addition to formal statistical cross-validation.
小地区健康结果模型的最新发展发现,汇集各地区的信息有助于对死亡率和发病率进行平滑估计。这些指数可作为卫生保健需求的代表,经常用于分配卫生保健资源。本文件对这种结果采用完整的生命表方法,其中包括小地区之间死亡率和健康差异的联合建模。该方法的另一个特点是对特定年龄的死亡率和健康率进行随机效应建模,因此在估计健康和总预期寿命等领域的生命表参数方面具有综合优势,这些参数可能基于小事件计数。基本模型涉及年龄和地区的可交换随机效应。然而,考虑的结构性变化形式包括死亡率和健康之间的相关性,这些结果的空间相关性以及年龄影响的相互关联性。一个案例研究说明使用死亡和长期疾病数据为两个伦敦行政区开发小区域生命表,并包括死亡的时间视图。然后考虑区域生命表测量在预测健康活动中的效用,除了正式的统计交叉验证之外,还提供了一种验证形式。
{"title":"A life table approach to small area health need profiling","authors":"Peter Congdon","doi":"10.1191/1471082x02st026oa","DOIUrl":"https://doi.org/10.1191/1471082x02st026oa","url":null,"abstract":"Recent developments in health outcome models for small areas have found benefits from pooling information over areas to produce smoothed estimates of mortality and morbidity rates. Such indices serve as proxies for the need for health care and are often used in allocating health care resources. The present paper adopts a full life table approach to such outcomes, which includes the joint modelling of mortality and health variation between small areas. A further feature of the approach here is random effects modelling of age-specific death and wellness rates, so pooling strength in estimating life table parameters for areas, such as healthy and total life expectancies, which may be based on small event counts. The basic model involves exchangeable random effects for age and area. However, structured forms of variation considered include correlations between mortality and health, spatial correlation in these outcomes, and interrelatedness in age effects. A case study illustration uses deaths and long-term illness data to develop small area life tables for two London boroughs, and includes a temporal perspective on deaths. It then considers the utility of area life table measures in predicting health activity, providing a form of validation in addition to formal statistical cross-validation.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123379992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Likelihood and Bayesian analysis of mixtures 混合物的似然和贝叶斯分析
Pub Date : 2001-12-01 DOI: 10.1177/1471082X0100100404
M. Aitkin
This paper compares likelihood and Bayesian analyses of finite mixture distributions, and expresses reservations about the latter. In particular, the role of prior assumptions in the full Monte Carlo Markov chain Bayes analysis is obscure, yet these assumptions clearly play a major role in the conclusions. These issues are illustrated with a detailed discussion of the well-known galaxy data.
本文比较了有限混合分布的似然分析和贝叶斯分析,并对后者提出了保留意见。特别是,先验假设在完整的蒙特卡洛马尔可夫链贝叶斯分析中的作用是模糊的,但这些假设显然在结论中起着重要作用。通过对众所周知的星系数据的详细讨论来说明这些问题。
{"title":"Likelihood and Bayesian analysis of mixtures","authors":"M. Aitkin","doi":"10.1177/1471082X0100100404","DOIUrl":"https://doi.org/10.1177/1471082X0100100404","url":null,"abstract":"This paper compares likelihood and Bayesian analyses of finite mixture distributions, and expresses reservations about the latter. In particular, the role of prior assumptions in the full Monte Carlo Markov chain Bayes analysis is obscure, yet these assumptions clearly play a major role in the conclusions. These issues are illustrated with a detailed discussion of the well-known galaxy data.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121825349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 61
期刊
Statistical Modeling
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1