首页 > 最新文献

Statistical Modeling最新文献

英文 中文
GLMM approach to study the spatial and temporal evolution of spikes in the small intestine GLMM方法研究小肠内刺突的时空演化
Pub Date : 2006-12-01 DOI: 10.1177/1471082006071851
C. Faes, M. Aerts, H. Geys, L. Bijnens, L. Ver Donck, W. Lammers
Mixed models can be applied in a wide range of settings. Probably, they are most commonly used to handle grouping in the data. In addition, mixed models can be used for smoothing purposes as well. When dealing with non-normal data, the use of smoothing methods within the generalized linear mixed models (GLMM) framework is less familiar. We explore the use of GLMM for smoothing purposes in both spatial and longitudinal dimensions. The methodology is illustrated by analysis of spike potentials in the small intestine of different cats. Spatio-temporal models that use two-dimensional smoothing splines across the spatial dimension and random effects to account for the correlations during successive slow-waves are developed. A major advantage of the mixed-model approach is that it can handle smoothing together with grouping (or other types of correlations) in a unified model. In this way, areas with high spike incidence compared with other areas can be detected. Also, the temporal and spatial characteristics of spikes during successive slow-waves can be identified.
混合模型可以应用于广泛的环境。可能,它们最常用于处理数据中的分组。此外,混合模型也可以用于平滑目的。在处理非正态数据时,在广义线性混合模型(GLMM)框架内使用平滑方法是不太熟悉的。我们探索了GLMM在空间和纵向维度上的平滑目的。该方法通过分析不同猫小肠的尖峰电位来说明。时空模型使用二维平滑样条跨越空间维度和随机效应来解释连续慢波期间的相关性。混合模型方法的一个主要优点是,它可以在统一模型中处理平滑和分组(或其他类型的相关性)。通过这种方法,可以检测出与其他区域相比尖峰发生率高的区域。此外,在连续的慢波期间的峰值的时间和空间特征可以确定。
{"title":"GLMM approach to study the spatial and temporal evolution of spikes in the small intestine","authors":"C. Faes, M. Aerts, H. Geys, L. Bijnens, L. Ver Donck, W. Lammers","doi":"10.1177/1471082006071851","DOIUrl":"https://doi.org/10.1177/1471082006071851","url":null,"abstract":"Mixed models can be applied in a wide range of settings. Probably, they are most commonly used to handle grouping in the data. In addition, mixed models can be used for smoothing purposes as well. When dealing with non-normal data, the use of smoothing methods within the generalized linear mixed models (GLMM) framework is less familiar. We explore the use of GLMM for smoothing purposes in both spatial and longitudinal dimensions. The methodology is illustrated by analysis of spike potentials in the small intestine of different cats. Spatio-temporal models that use two-dimensional smoothing splines across the spatial dimension and random effects to account for the correlations during successive slow-waves are developed. A major advantage of the mixed-model approach is that it can handle smoothing together with grouping (or other types of correlations) in a unified model. In this way, areas with high spike incidence compared with other areas can be detected. Also, the temporal and spatial characteristics of spikes during successive slow-waves can be identified.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126784337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Comparing nonparametric surfaces 比较非参数曲面
Pub Date : 2006-12-01 DOI: 10.1177/1471082006071848
A. Bowman
There is a wide variety of problems where the object of primary interest is a surface. Environmental studies in particular, where data often have a spatial structure, provide many examples where estimation of a surface is a central component of analysis. In these settings, the surfaces are often not well described by simple parametric models. Nonparametric regression therefore offers a convenient means of constructing surface estimates in a straightforward manner. In this paper, the issues associated with comparing such regression surfaces across different groups of data are discussed. Formal methods for assessing the equality of a collection of surfaces, or the suitability of a set of parallel surfaces, are described. These not only extend existing methods of nonparametric analysis of covariance but also allow the commonly occurring case of correlated errors to be incorporated. Graphical methods to provide insight into the sources of departure from a candidate model are also proposed. Several applications are provided to illustrate and explore the proposals.
在各种各样的问题中,主要感兴趣的对象是一个表面。特别是环境研究,其中的数据往往具有空间结构,提供了许多例子,其中对表面的估计是分析的中心组成部分。在这些情况下,表面通常不能用简单的参数化模型很好地描述。因此,非参数回归提供了一种以直接方式构造曲面估计的方便方法。在本文中,讨论了与跨不同数据组比较这种回归曲面相关的问题。描述了评估一组曲面的相等性或一组平行曲面的适宜性的形式化方法。这些方法不仅扩展了现有的协方差非参数分析方法,而且允许将常见的相关误差纳入其中。还提出了图形方法,以提供对偏离候选模型的来源的洞察。提供了几个应用来说明和探讨这些建议。
{"title":"Comparing nonparametric surfaces","authors":"A. Bowman","doi":"10.1177/1471082006071848","DOIUrl":"https://doi.org/10.1177/1471082006071848","url":null,"abstract":"There is a wide variety of problems where the object of primary interest is a surface. Environmental studies in particular, where data often have a spatial structure, provide many examples where estimation of a surface is a central component of analysis. In these settings, the surfaces are often not well described by simple parametric models. Nonparametric regression therefore offers a convenient means of constructing surface estimates in a straightforward manner. In this paper, the issues associated with comparing such regression surfaces across different groups of data are discussed. Formal methods for assessing the equality of a collection of surfaces, or the suitability of a set of parallel surfaces, are described. These not only extend existing methods of nonparametric analysis of covariance but also allow the commonly occurring case of correlated errors to be incorporated. Graphical methods to provide insight into the sources of departure from a candidate model are also proposed. Several applications are provided to illustrate and explore the proposals.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129623103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Use of auxiliary data in semi-parametric spatial regression with nonignorable missing responses 具有不可忽略缺失响应的半参数空间回归中辅助数据的使用
Pub Date : 2006-12-01 DOI: 10.1177/1471082006071849
M. Geraci, M. Bottai
We propose a method for reducing the error of the prediction of a quantity of interest when the outcome has missing values that are suspected to be nonignorable and the data are correlated in space. We develop a maximum likelihood approach for the parameter estimation of semi-parametric regressions in a mixed model framework. We apply the proposed method to phytoplankton data collected at fixed stations in the Chesapeake Bay, for which chlorophyll data coming from remote sensing are available. A simulation study is also performed. The availability of a variable correlated to the response allows us to achieve a substantial reduction of the prediction error of the expected value of the smoother, without having to specify a nonignorable model.
我们提出了一种方法,当结果具有被怀疑是不可忽略的缺失值并且数据在空间中相关时,可以减少对感兴趣数量的预测的误差。本文提出了一种混合模型框架下半参数回归参数估计的极大似然方法。我们将所提出的方法应用于切萨皮克湾固定站点收集的浮游植物数据,其中叶绿素数据来自遥感。并进行了仿真研究。与响应相关的变量的可用性使我们能够大大减少平滑器期望值的预测误差,而不必指定不可忽略的模型。
{"title":"Use of auxiliary data in semi-parametric spatial regression with nonignorable missing responses","authors":"M. Geraci, M. Bottai","doi":"10.1177/1471082006071849","DOIUrl":"https://doi.org/10.1177/1471082006071849","url":null,"abstract":"We propose a method for reducing the error of the prediction of a quantity of interest when the outcome has missing values that are suspected to be nonignorable and the data are correlated in space. We develop a maximum likelihood approach for the parameter estimation of semi-parametric regressions in a mixed model framework. We apply the proposed method to phytoplankton data collected at fixed stations in the Chesapeake Bay, for which chlorophyll data coming from remote sensing are available. A simulation study is also performed. The availability of a variable correlated to the response allows us to achieve a substantial reduction of the prediction error of the expected value of the smoother, without having to specify a nonignorable model.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124064676","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Bayesian modeling for genetic association in case-control studies: accounting for unknown population substructure 病例对照研究中遗传关联的贝叶斯模型:考虑未知的群体亚结构
Pub Date : 2006-12-01 DOI: 10.1177/1471082006071841
Li Zhang, B. Mukherjee, M. Ghosh, R. Wu
A two-stage parametric Bayesian method is proposed to examine the association between a candidate gene and the occurrence of a disease after accounting for population substructure. This procedure, implemented via a Markov chain Monte Carlo numerical integration technique, first estimates the posterior probability of different unknown population substructures and then integrates this information into a disease-gene association model through the technique of Bayesian model averaging. The model relaxes certain assumptions of previous analyses and provides a unified computational framework to obtain an estimate of the log odds ratio parameter corresponding to the genetic factor after allowing for the allele frequencies to vary across subpopulations. The uncertainty in estimating the population substructure is taken into account while providing credible intervals for parameters in the disease-gene association model. Simulations on unmatched case-control studies that mimic an admixed Argentinean population are performed to demonstrate the statistical properties of our model. The method is also applied to a real data set coming from a genetic association study on obesity.
在考虑种群亚结构后,提出了一种两阶段参数贝叶斯方法来检验候选基因与疾病发生之间的关系。该程序通过马尔可夫链蒙特卡罗数值积分技术实现,首先估计不同未知种群子结构的后验概率,然后通过贝叶斯模型平均技术将该信息整合到疾病基因关联模型中。该模型放宽了先前分析的某些假设,并提供了一个统一的计算框架,在允许等位基因频率在亚种群中变化后,获得与遗传因素对应的对数比值比参数的估计。在为疾病-基因关联模型的参数提供可信区间的同时,考虑了种群子结构估计的不确定性。在模拟混合阿根廷人口的无与伦比的病例对照研究中进行了模拟,以证明我们模型的统计特性。该方法也适用于来自肥胖遗传关联研究的真实数据集。
{"title":"Bayesian modeling for genetic association in case-control studies: accounting for unknown population substructure","authors":"Li Zhang, B. Mukherjee, M. Ghosh, R. Wu","doi":"10.1177/1471082006071841","DOIUrl":"https://doi.org/10.1177/1471082006071841","url":null,"abstract":"A two-stage parametric Bayesian method is proposed to examine the association between a candidate gene and the occurrence of a disease after accounting for population substructure. This procedure, implemented via a Markov chain Monte Carlo numerical integration technique, first estimates the posterior probability of different unknown population substructures and then integrates this information into a disease-gene association model through the technique of Bayesian model averaging. The model relaxes certain assumptions of previous analyses and provides a unified computational framework to obtain an estimate of the log odds ratio parameter corresponding to the genetic factor after allowing for the allele frequencies to vary across subpopulations. The uncertainty in estimating the population substructure is taken into account while providing credible intervals for parameters in the disease-gene association model. Simulations on unmatched case-control studies that mimic an admixed Argentinean population are performed to demonstrate the statistical properties of our model. The method is also applied to a real data set coming from a genetic association study on obesity.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125566753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Analyzing the emergence times of permanent teeth: an example of modeling the covariance matrix with interval-censored data 恒牙出现次数分析:用间隔截尾数据建模协方差矩阵的一个例子
Pub Date : 2006-12-01 DOI: 10.1177/1471082006071844
S. Cecere, A. Jara, E. Lesaffre
Based on a data set obtained in a large dental longitudinal study, conducted in Flanders (Belgium), the joint emergence distribution of seven teeth was modeled as a function of gender and caries experience on primary teeth. Besides establishing the marginal dependence of emergence on the covariates, there was also interest in examining the impact of the covariates on the association among emergence times. This allows the establishment of the preferred rankings of emergence and their dependence on covariates. To this end, the covariance matrix was modeled as a function of covariates. Modeling the covariance matrix in this way needs to ensure the positive definiteness of the covariance matrix and it is preferable that the regression parameters of the model are interpretable. The modified Cholesky decomposition of the covariance matrix, as suggested by Pourahmadi, splits up the covariance matrix into two parts where the parameters can be interpreted, given a natural ranking of the responses. This approach was used here taking into account that the emergence times are interval-censored. Hence, we opted for a Bayesian implementation of the data augmentation algorithm.
根据在比利时法兰德斯进行的一项大型牙齿纵向研究中获得的数据集,将七颗牙齿的联合出牙分布建模为性别和乳牙龋齿经历的函数。除了确定出现对协变量的边际依赖性外,还对检查协变量对出现时间之间关联的影响感兴趣。这允许建立出现的首选排名及其对协变量的依赖。为此,将协方差矩阵建模为协变量的函数。以这种方式对协方差矩阵进行建模,需要保证协方差矩阵的正确定性,并且模型的回归参数最好是可解释的。Pourahmadi提出的协方差矩阵的修正Cholesky分解将协方差矩阵分成两部分,其中给出响应的自然排序,可以解释参数。这里使用这种方法是考虑到出现时间是间隔审查的。因此,我们选择了数据增强算法的贝叶斯实现。
{"title":"Analyzing the emergence times of permanent teeth: an example of modeling the covariance matrix with interval-censored data","authors":"S. Cecere, A. Jara, E. Lesaffre","doi":"10.1177/1471082006071844","DOIUrl":"https://doi.org/10.1177/1471082006071844","url":null,"abstract":"Based on a data set obtained in a large dental longitudinal study, conducted in Flanders (Belgium), the joint emergence distribution of seven teeth was modeled as a function of gender and caries experience on primary teeth. Besides establishing the marginal dependence of emergence on the covariates, there was also interest in examining the impact of the covariates on the association among emergence times. This allows the establishment of the preferred rankings of emergence and their dependence on covariates. To this end, the covariance matrix was modeled as a function of covariates. Modeling the covariance matrix in this way needs to ensure the positive definiteness of the covariance matrix and it is preferable that the regression parameters of the model are interpretable. The modified Cholesky decomposition of the covariance matrix, as suggested by Pourahmadi, splits up the covariance matrix into two parts where the parameters can be interpreted, given a natural ranking of the responses. This approach was used here taking into account that the emergence times are interval-censored. Hence, we opted for a Bayesian implementation of the data augmentation algorithm.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131686432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Generalization of the Weibull distribution: the odd Weibull family 威布尔分布的推广:奇威布尔族
Pub Date : 2006-10-01 DOI: 10.1191/1471082X06st116oa
Kahadawala Cooray
A three-parameter generalization of the Weibull distribution is presented to deal with general situations in modeling survival process with various shapes in the hazard function. This generalized Weibull distribution will be referred to as the odd Weibull family, as it is derived by considering the distributions of the odds of the Weibull and inverse Weibull families. As a result, the odd Weibull family is not only useful for testing goodness-of-fit of the Weibull and inverse Weibull as submodels, but it is also convenient for modeling and fitting different data sets, especially in the presence of censoring. The model parameters for uncensored data are estimated in two different ways because of the fact that the inverse transformation of the odd Weibull family does not change its density function. Adequacy of the model for the given uncensored data is illustrated by using the plot of scaled fitted total time on test (TTT) transforms. Furthermore, simulation studies are conducted to measure the discrepancy between empirical and fitted TTT transforms by using a previously proposed test statistic. Three different examples are, respectively, providedbasedondatafromsurvival, reliabilityandenvironmentalsciencestoillustrateincreasing, bathtub and unimodal failure rates.
提出了一种威布尔分布的三参数泛化方法,以处理在危险函数中具有不同形状的生存过程建模中的一般情况。这个广义威布尔分布将被称为奇威布尔族,因为它是通过考虑威布尔族和逆威布尔族的概率分布推导出来的。因此,奇威布尔族不仅可以用于测试威布尔和逆威布尔作为子模型的拟合优度,而且还可以方便地建模和拟合不同的数据集,特别是在存在审查的情况下。由于奇威布尔族的逆变换不改变其密度函数,因此采用两种不同的方法估计未删减数据的模型参数。对于给定的未删节数据,模型的充分性通过使用缩放拟合的总测试时间(TTT)变换图来说明。此外,通过使用先前提出的检验统计量进行模拟研究,以测量经验和拟合的TTT变换之间的差异。基于生存、可靠性和环境科学的数据,分别提供了三个不同的例子来说明不断增加的浴缸和单峰故障率。
{"title":"Generalization of the Weibull distribution: the odd Weibull family","authors":"Kahadawala Cooray","doi":"10.1191/1471082X06st116oa","DOIUrl":"https://doi.org/10.1191/1471082X06st116oa","url":null,"abstract":"A three-parameter generalization of the Weibull distribution is presented to deal with general situations in modeling survival process with various shapes in the hazard function. This generalized Weibull distribution will be referred to as the odd Weibull family, as it is derived by considering the distributions of the odds of the Weibull and inverse Weibull families. As a result, the odd Weibull family is not only useful for testing goodness-of-fit of the Weibull and inverse Weibull as submodels, but it is also convenient for modeling and fitting different data sets, especially in the presence of censoring. The model parameters for uncensored data are estimated in two different ways because of the fact that the inverse transformation of the odd Weibull family does not change its density function. Adequacy of the model for the given uncensored data is illustrated by using the plot of scaled fitted total time on test (TTT) transforms. Furthermore, simulation studies are conducted to measure the discrepancy between empirical and fitted TTT transforms by using a previously proposed test statistic. Three different examples are, respectively, providedbasedondatafromsurvival, reliabilityandenvironmentalsciencestoillustrateincreasing, bathtub and unimodal failure rates.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130797377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 87
Using the Box-Cox t distribution in GAMLSS to model skewness and kurtosis 使用GAMLSS中的Box-Cox t分布对偏度和峰度进行建模
Pub Date : 2006-10-01 DOI: 10.1191/1471082X06st122oa
Robert A Rigby, D. Stasinopoulos
The Box-Cox t (BCT) distribution is presented as a model for a dependent variable Y exhibiting both skewness and leptokurtosis. The distribution is defined by a power transformation Y v having a shifted and scaled (truncated) t distribution with degrees of freedom parameter τ. The distribution has four parameters and is denoted by BCT(μ, σ,ν, τ). The parameters μ, σ,ν and τ may be interpreted as relating to location (median), scale (centile-based coefficient of variation), skewness (power transformation to symmetry) and kurtosis (degrees of freedom), respectively. The generalized additive model for location, scale and shape (GAMLSS) is extended to allow each of the parameters of the distribution to be modelled as linear and/or non-linear parametric and/or smooth non-parametric functions of explanatory variables. A Fisher scoring algorithm is used to fit the model by maximizing a (penalized) likelihood. The first and expected second and cross derivatives of the likelihood with respect to μ, σ,ν and τ, required for the algorithm, are provided. The use of the BCT distribution is illustrated by two data applications.
Box-Cox t (BCT)分布是因变量Y的模型,同时显示偏度和细峰态。该分布由一个功率变换Y v定义,它具有一个移位和缩放(截断)的t分布,自由度参数为τ。分布有4个参数,用BCT(μ, σ,ν, τ)表示。参数μ、σ、ν和τ可以分别解释为与位置(中位数)、尺度(基于百分位的变异系数)、偏度(向对称的幂变换)和峰度(自由度)有关。对位置、尺度和形状的广义加性模型(GAMLSS)进行了扩展,允许将分布的每个参数建模为解释变量的线性和/或非线性参数和/或光滑非参数函数。使用Fisher评分算法通过最大化(惩罚)似然来拟合模型。给出了算法所需的似然函数对μ、σ、ν和τ的一阶导数和期望二阶导数和交叉导数。通过两个数据应用程序说明了BCT分布的使用。
{"title":"Using the Box-Cox t distribution in GAMLSS to model skewness and kurtosis","authors":"Robert A Rigby, D. Stasinopoulos","doi":"10.1191/1471082X06st122oa","DOIUrl":"https://doi.org/10.1191/1471082X06st122oa","url":null,"abstract":"The Box-Cox t (BCT) distribution is presented as a model for a dependent variable Y exhibiting both skewness and leptokurtosis. The distribution is defined by a power transformation Y v having a shifted and scaled (truncated) t distribution with degrees of freedom parameter τ. The distribution has four parameters and is denoted by BCT(μ, σ,ν, τ). The parameters μ, σ,ν and τ may be interpreted as relating to location (median), scale (centile-based coefficient of variation), skewness (power transformation to symmetry) and kurtosis (degrees of freedom), respectively. The generalized additive model for location, scale and shape (GAMLSS) is extended to allow each of the parameters of the distribution to be modelled as linear and/or non-linear parametric and/or smooth non-parametric functions of explanatory variables. A Fisher scoring algorithm is used to fit the model by maximizing a (penalized) likelihood. The first and expected second and cross derivatives of the likelihood with respect to μ, σ,ν and τ, required for the algorithm, are provided. The use of the BCT distribution is illustrated by two data applications.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116763691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 170
Quantile regression with monotonicity restrictions using P-splines and the L1-norm 使用p样条和l1范数的单调限制分位数回归
Pub Date : 2006-10-01 DOI: 10.1191/1471082X06st118oa
K. Bollaerts, P. Eilers, M. Aerts
Quantile regression is an alternative to OLS regression. In quantile regression, the sum of absolute deviations or the L1-norm is minimized, whereas the sum of squared deviations or the L2-norm is minimized in OLS regression. Quantile regression has the advantage over OLS-regression of being more robust to outlying observations. Furthermore, quantile regression provides information complementing the information provided by OLS-regression. In this study, a non-parametric approach to quantile regression is presented, which constrains the estimated-quantile function to be monotone increasing. In particular, P-splines with an additional asymmetric penalty enforcing monotonicity are used within an L1-framework. This can be translated into a linear programming problem, which will be solved using an interior point algorithm. As an illustration, the presented approach will be applied to estimate quantile growth curves and quantile antibody levels as a function of age.
分位数回归是OLS回归的一种替代方法。在分位数回归中,绝对偏差的总和或l1 -范数被最小化,而在OLS回归中,平方偏差的总和或l2 -范数被最小化。分位数回归比ols回归的优点是对离群观测值更加稳健。此外,分位数回归提供的信息与ols回归提供的信息相补充。本文提出了一种非参数的分位数回归方法,该方法约束了估计的分位数函数是单调递增的。特别地,在l1框架中使用带有额外非对称惩罚的p样条来强制单调性。这可以转化为一个线性规划问题,它将使用内点算法来解决。作为一个例子,所提出的方法将被应用于估计分位数生长曲线和分位数抗体水平作为年龄的函数。
{"title":"Quantile regression with monotonicity restrictions using P-splines and the L1-norm","authors":"K. Bollaerts, P. Eilers, M. Aerts","doi":"10.1191/1471082X06st118oa","DOIUrl":"https://doi.org/10.1191/1471082X06st118oa","url":null,"abstract":"Quantile regression is an alternative to OLS regression. In quantile regression, the sum of absolute deviations or the L1-norm is minimized, whereas the sum of squared deviations or the L2-norm is minimized in OLS regression. Quantile regression has the advantage over OLS-regression of being more robust to outlying observations. Furthermore, quantile regression provides information complementing the information provided by OLS-regression. In this study, a non-parametric approach to quantile regression is presented, which constrains the estimated-quantile function to be monotone increasing. In particular, P-splines with an additional asymmetric penalty enforcing monotonicity are used within an L1-framework. This can be translated into a linear programming problem, which will be solved using an interior point algorithm. As an illustration, the presented approach will be applied to estimate quantile growth curves and quantile antibody levels as a function of age.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129510353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 57
A Bayesian approach to inequality constrained linear mixed models: estimation and model selection 不等式约束线性混合模型的贝叶斯方法:估计和模型选择
Pub Date : 2006-10-01 DOI: 10.1191/1471082X06st119oa
B. Kato, H. Hoijtink
Constrained parameter problems arise in a wide variety of applications. This article deals with estimation and model selection in linear mixed models with inequality constraints on the parameters. It is shown that different theories can be translated into statistical models by putting constraints on the model parameters yielding a set of competing models. A new approach based on the principle of encompassing priors is proposed and used to compute Bayes factors and subsequently posterior model probabilities. Model selection is based on posterior model probabilities. The approach is illustrated using a longitudinal data set.
约束参数问题出现在各种各样的应用中。本文研究了参数具有不等式约束的线性混合模型的估计和模型选择问题。研究表明,不同的理论可以通过对模型参数施加约束而转化为统计模型,从而产生一组相互竞争的模型。提出了一种基于包含先验原理的新方法,并将其用于计算贝叶斯因子和后验模型概率。模型选择基于后验模型概率。该方法使用纵向数据集进行说明。
{"title":"A Bayesian approach to inequality constrained linear mixed models: estimation and model selection","authors":"B. Kato, H. Hoijtink","doi":"10.1191/1471082X06st119oa","DOIUrl":"https://doi.org/10.1191/1471082X06st119oa","url":null,"abstract":"Constrained parameter problems arise in a wide variety of applications. This article deals with estimation and model selection in linear mixed models with inequality constraints on the parameters. It is shown that different theories can be translated into statistical models by putting constraints on the model parameters yielding a set of competing models. A new approach based on the principle of encompassing priors is proposed and used to compute Bayes factors and subsequently posterior model probabilities. Model selection is based on posterior model probabilities. The approach is illustrated using a longitudinal data set.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"205 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123253507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Modelling repeated ordinal reports from multiple informants 模拟重复来自多个线人的顺序报告
Pub Date : 2006-10-01 DOI: 10.1191/1471082X06st121oa
I. Plewis, F. Vitaro, R. Tremblay
Cross-informant associations tend to be low for reports of children’s behaviours at one point in time. The paper extends the literature on multiple informants using data from a well-known longitudinal study of Quebec, Canada, boys to show how to estimate associations between repeated teachers′ and self-reports of aggressive behaviour. These associations, for both level and change, are derived from multilevel models for repeated measures of variables best treated as ordered categories. The ordering is represented by sets of continuation ratios, change by linear and quadratic functions of age, and the multivariate models are estimated using penalized quasi-likelihood. The analyses also incorporate a risk variable: socio-economic status (SES). The correlations between estimates of the growth parameters for the two sets of reports tend to be rather small and smaller than the cross-informant associations for levels. SES is associated with levels of aggression, more so for teacher reports than for self-reports, but not with the decline in aggression with age.
在某一时间点儿童行为的报告中,交叉信息提供者的关联往往较低。这篇论文利用加拿大魁北克一项著名的男孩纵向研究的数据,扩展了关于多个信息提供者的文献,以展示如何估计反复教师和自我报告的攻击行为之间的联系。这些关联,无论是水平还是变化,都是从重复测量变量的多层模型中得出的,这些变量最好被视为有序类别。排序由连续比集合表示,变化由年龄的线性和二次函数表示,多元模型使用惩罚拟似然估计。分析还纳入了一个风险变量:社会经济地位(SES)。两组报告的增长参数估计值之间的相关性往往相当小,小于水平的交叉信息提供者关联。社会经济地位与攻击性水平有关,在教师报告中比在自我报告中更明显,但与攻击性随着年龄的增长而下降无关。
{"title":"Modelling repeated ordinal reports from multiple informants","authors":"I. Plewis, F. Vitaro, R. Tremblay","doi":"10.1191/1471082X06st121oa","DOIUrl":"https://doi.org/10.1191/1471082X06st121oa","url":null,"abstract":"Cross-informant associations tend to be low for reports of children’s behaviours at one point in time. The paper extends the literature on multiple informants using data from a well-known longitudinal study of Quebec, Canada, boys to show how to estimate associations between repeated teachers′ and self-reports of aggressive behaviour. These associations, for both level and change, are derived from multilevel models for repeated measures of variables best treated as ordered categories. The ordering is represented by sets of continuation ratios, change by linear and quadratic functions of age, and the multivariate models are estimated using penalized quasi-likelihood. The analyses also incorporate a risk variable: socio-economic status (SES). The correlations between estimates of the growth parameters for the two sets of reports tend to be rather small and smaller than the cross-informant associations for levels. SES is associated with levels of aggression, more so for teacher reports than for self-reports, but not with the decline in aggression with age.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122582078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
Statistical Modeling
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1