首页 > 最新文献

Canadian Journal of Statistics-Revue Canadienne De Statistique最新文献

英文 中文
Censored autoregressive regression models with Student-t innovations 带有 Student-t 创新值的剔除自回归模型
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-02-21 DOI: 10.1002/cjs.11804
Katherine A. L. Valeriano, Fernanda L. Schumacher, Christian E. Galarza, Larissa A. Matos

Data collected over time are common in applications and may contain censored or missing observations, making it difficult to use standard statistical procedures. This article proposes an algorithm to estimate the parameters of a censored linear regression model with errors serially correlated and innovations following a Student-t distribution. This distribution is widely used in the statistical modelling of data containing outliers because its longer-than-normal tails provide a robust approach to handling such data. The maximum likelihood estimates of the proposed model are obtained through a stochastic approximation of the EM algorithm. The methods are applied to an environmental dataset regarding ammonia-nitrogen concentration, which is subject to a limit of detection (left censoring) and contains missing observations. Additionally, two simulation studies are conducted to examine the asymptotic properties of the estimates and the robustness of the model. The proposed algorithm and methods are implemented in the R package ARCensReg.

长期收集的数据在应用中很常见,可能包含删减或缺失的观测值,因此很难使用标准的统计程序。本文提出了一种算法,用于估计误差序列相关且创新值遵循 Student- 分布的删减线性回归模型参数。这种分布被广泛用于含有异常值的数据的统计建模,因为它的尾部比正态分布长,为处理这类数据提供了一种稳健的方法。拟议模型的最大似然估计值是通过 EM 算法的随机近似值获得的。这些方法被应用于一个有关氨氮浓度的环境数据集,该数据集受到检测极限(左删减)的限制,并包含缺失观测值。此外,还进行了两次模拟研究,以检验估计值的渐近特性和模型的稳健性。提出的算法和方法在 R 软件包 ARCensReg 中实现。
{"title":"Censored autoregressive regression models with Student-t innovations","authors":"Katherine A. L. Valeriano,&nbsp;Fernanda L. Schumacher,&nbsp;Christian E. Galarza,&nbsp;Larissa A. Matos","doi":"10.1002/cjs.11804","DOIUrl":"10.1002/cjs.11804","url":null,"abstract":"<p>Data collected over time are common in applications and may contain censored or missing observations, making it difficult to use standard statistical procedures. This article proposes an algorithm to estimate the parameters of a censored linear regression model with errors serially correlated and innovations following a Student-<span></span><math>\u0000 <mrow>\u0000 <mi>t</mi>\u0000 </mrow></math> distribution. This distribution is widely used in the statistical modelling of data containing outliers because its longer-than-normal tails provide a robust approach to handling such data. The maximum likelihood estimates of the proposed model are obtained through a stochastic approximation of the EM algorithm. The methods are applied to an environmental dataset regarding ammonia-nitrogen concentration, which is subject to a limit of detection (left censoring) and contains missing observations. Additionally, two simulation studies are conducted to examine the asymptotic properties of the estimates and the robustness of the model. The proposed algorithm and methods are implemented in the R package <span>ARCensReg</span>.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.8,"publicationDate":"2024-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11804","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140055886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Acknowledgement of referees' services remerciements aux membres des jurys 感谢推荐人提供的服务 remerciements aux membres des jurys
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2024-02-15 DOI: 10.1002/cjs.11806
{"title":"Acknowledgement of referees' services remerciements aux membres des jurys","authors":"","doi":"10.1002/cjs.11806","DOIUrl":"https://doi.org/10.1002/cjs.11806","url":null,"abstract":"","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2024-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140123773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semiparametric estimation for the functional additive hazards model 功能加性危害模型的半参数估计
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-01-11 DOI: 10.1002/cjs.11805
Meiling Hao, Kin-yat Liu, Wen Su, Xingqiu Zhao

We propose a new functional additive hazards model to investigate the potential effects of functional and scalar predictors on mortality risks, and develop a penalized least squares estimation method for model parameters based on a pseudoscore estimating equation. A reproducing kernel Hilbert space approach is used to establish the consistency, convergence rate, and joint asymptotic distribution of the resulting estimators for finite-dimensional and infinite-dimensional parameters. Our simulation studies demonstrate that the proposed estimation procedure performs well. For illustration, we apply the proposed method to the Medical Information Mart for Intensive Care III dataset.

我们提出了一种新的功能加性危害模型来研究功能和标量预测因子对死亡风险的潜在影响,并开发了一种基于伪core估计方程的模型参数惩罚性最小二乘估计方法。我们使用重现核希尔伯特空间方法来确定有限维和无限维参数估计结果的一致性、收敛率和联合渐近分布。我们的模拟研究表明,所提出的估计程序性能良好。为说明起见,我们将提议的方法应用于重症监护医疗信息市场 III 数据集。
{"title":"Semiparametric estimation for the functional additive hazards model","authors":"Meiling Hao,&nbsp;Kin-yat Liu,&nbsp;Wen Su,&nbsp;Xingqiu Zhao","doi":"10.1002/cjs.11805","DOIUrl":"10.1002/cjs.11805","url":null,"abstract":"<p>We propose a new functional additive hazards model to investigate the potential effects of functional and scalar predictors on mortality risks, and develop a penalized least squares estimation method for model parameters based on a pseudoscore estimating equation. A reproducing kernel Hilbert space approach is used to establish the consistency, convergence rate, and joint asymptotic distribution of the resulting estimators for finite-dimensional and infinite-dimensional parameters. Our simulation studies demonstrate that the proposed estimation procedure performs well. For illustration, we apply the proposed method to the Medical Information Mart for Intensive Care III dataset.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.8,"publicationDate":"2024-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139421542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Clustering spatial functional data using a geographically weighted Dirichlet process 使用地理加权 Dirichlet 过程对空间功能数据进行聚类
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-01-05 DOI: 10.1002/cjs.11803
Tianyu Pan, Weining Shen, Guanyu Hu

We propose a Bayesian nonparametric clustering approach to study the spatial heterogeneity effect for functional data observed at spatially correlated locations. We consider a geographically weighted Chinese restaurant process equipped with a conditional autoregressive prior to capture fully the spatial correlation of function curves. To sample efficiently from our model, we customize a prior called Quadratic Gamma, which ensures conjugacy. We design a Markov chain Monte Carlo algorithm to infer simultaneously the posterior distributions of the number of groups and the grouping configurations. The superior numerical performance of the proposed method over competing methods is demonstrated using simulated examples and a U.S. annual precipitation study.

我们提出了一种贝叶斯非参数聚类方法,用于研究在空间相关地点观测到的函数数据的空间异质性效应。我们考虑了一个地理加权的中餐馆过程,该过程配备了一个条件自回归先验,以充分捕捉功能曲线的空间相关性。为了从我们的模型中有效采样,我们定制了一种名为 Quadratic Gamma 的先验,它能确保共轭性。我们设计了一种马尔科夫链蒙特卡洛算法,以同时推断分组数和分组配置的后验分布。通过模拟实例和美国年降水量研究,证明了所提出的方法在数值性能上优于其他竞争方法。
{"title":"Clustering spatial functional data using a geographically weighted Dirichlet process","authors":"Tianyu Pan,&nbsp;Weining Shen,&nbsp;Guanyu Hu","doi":"10.1002/cjs.11803","DOIUrl":"10.1002/cjs.11803","url":null,"abstract":"<p>We propose a Bayesian nonparametric clustering approach to study the spatial heterogeneity effect for functional data observed at spatially correlated locations. We consider a geographically weighted Chinese restaurant process equipped with a conditional autoregressive prior to capture fully the spatial correlation of function curves. To sample efficiently from our model, we customize a prior called Quadratic Gamma, which ensures conjugacy. We design a Markov chain Monte Carlo algorithm to infer simultaneously the posterior distributions of the number of groups and the grouping configurations. The superior numerical performance of the proposed method over competing methods is demonstrated using simulated examples and a U.S. annual precipitation study.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.8,"publicationDate":"2024-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140055778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian Model Selection via Composite Likelihood for High-dimensional Data Integration 通过复合似然进行贝叶斯模型选择,实现高维数据整合
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-01-05 DOI: 10.1002/cjs.11800
Guanlin Zhang, Yuehua Wu, Xin Gao

We consider data integration problems where correlated data are collected from multiple platforms. Within each platform, there are linear relationships between the responses and a collection of predictors. We extend the linear models to include random errors coming from a much wider family of sub-Gaussian and subexponential distributions. The goal is to select important predictors across multiple platforms, where the number of predictors and the number of observations both increase to infinity. We combine the marginal densities of the responses obtained from different platforms to form a composite likelihood and propose a model selection criterion based on Bayesian composite posterior probabilities. Under some regularity conditions, we prove that the model selection criterion is consistent to recover the union support of the predictors with divergent true model size.

我们考虑了从多个平台收集相关数据的数据整合问题。在每个平台中,响应与一系列预测因子之间存在线性关系。我们对线性模型进行了扩展,纳入了来自更广泛的亚高斯分布和亚指数分布的随机误差。我们的目标是在多个平台上选择重要的预测因子,在这些平台上,预测因子的数量和观测数据的数量都会增加到无穷大。我们将从不同平台获得的响应边际密度结合起来,形成一个复合似然,并提出一种基于贝叶斯复合后验概率的模型选择准则。在一些规则性条件下,我们证明了模型选择准则在恢复具有不同真实模型大小的预测因子的联合支持方面是一致的。
{"title":"Bayesian Model Selection via Composite Likelihood for High-dimensional Data Integration","authors":"Guanlin Zhang,&nbsp;Yuehua Wu,&nbsp;Xin Gao","doi":"10.1002/cjs.11800","DOIUrl":"10.1002/cjs.11800","url":null,"abstract":"<p>We consider data integration problems where correlated data are collected from multiple platforms. Within each platform, there are linear relationships between the responses and a collection of predictors. We extend the linear models to include random errors coming from a much wider family of sub-Gaussian and subexponential distributions. The goal is to select important predictors across multiple platforms, where the number of predictors and the number of observations both increase to infinity. We combine the marginal densities of the responses obtained from different platforms to form a composite likelihood and propose a model selection criterion based on Bayesian composite posterior probabilities. Under some regularity conditions, we prove that the model selection criterion is consistent to recover the union support of the predictors with divergent true model size.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.8,"publicationDate":"2024-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140056288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modelling occurrence and quantity of longitudinal semicontinuous data simultaneously with nonparametric unobserved heterogeneity 利用非参数非观测异质性同时对纵向半连续数据的发生和数量进行建模
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-12-09 DOI: 10.1002/cjs.11801
Guohua Yan, Renjun Ma

Semicontinuous data frequently occur in longitudinal studies. The popular two-part modelling approach deals with longitudinal semicontinuous data by analyzing the occurrence of positive values and the intensity of positive values separately; however, this separation may break down the natural sequence of semicontinuous data within a subject and destroy its serial dependence structure. In this article, we introduce a Tweedie compound Poisson mixed model to study the occurrence of positive values and the quantity of the semicontinuous response simultaneously. In our approach, covariate effects on the semicontinuous response are assessed directly. The correlation within a subject and the unobserved heterogeneity are incorporated with serially correlated nonparametric random effects. Our model unifies subject-specific and population-averaged interpretations. We illustrate the approach with applications to a Brief Symptom Inventory study and an infants' fluoride intake study.

半连续数据经常出现在纵向研究中。流行的两部分建模方法通过分别分析正值的出现和正值的强度来处理纵向半连续数据;然而,这种分离可能会打破半连续数据在一个研究对象中的自然序列,破坏其序列依赖结构。在本文中,我们引入了一个 Tweedie 复合泊松混合模型来同时研究正值的出现和半连续反应的数量。在我们的方法中,协变量对半连续反应的影响是直接评估的。受试者内部的相关性和未观察到的异质性被纳入了序列相关的非参数随机效应。我们的模型统一了特定受试者和人群平均的解释。我们将这一方法应用于一项简明症状量表研究和一项婴儿氟摄入量研究。
{"title":"Modelling occurrence and quantity of longitudinal semicontinuous data simultaneously with nonparametric unobserved heterogeneity","authors":"Guohua Yan,&nbsp;Renjun Ma","doi":"10.1002/cjs.11801","DOIUrl":"10.1002/cjs.11801","url":null,"abstract":"<p>Semicontinuous data frequently occur in longitudinal studies. The popular two-part modelling approach deals with longitudinal semicontinuous data by analyzing the occurrence of positive values and the intensity of positive values separately; however, this separation may break down the natural sequence of semicontinuous data within a subject and destroy its serial dependence structure. In this article, we introduce a Tweedie compound Poisson mixed model to study the occurrence of positive values and the quantity of the semicontinuous response simultaneously. In our approach, covariate effects on the semicontinuous response are assessed directly. The correlation within a subject and the unobserved heterogeneity are incorporated with serially correlated nonparametric random effects. Our model unifies subject-specific and population-averaged interpretations. We illustrate the approach with applications to a Brief Symptom Inventory study and an infants' fluoride intake study.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.8,"publicationDate":"2023-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138561336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient multiply robust imputation in the presence of influential units in surveys 在调查中有影响的单位存在的情况下,有效的乘法稳健估算
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-11-22 DOI: 10.1002/cjs.11802
Sixia Chen, David Haziza, Victoire Michal

Item nonresponse is a common issue in surveys. Because unadjusted estimators may be biased in the presence of nonresponse, it is common practice to impute the missing values with the objective of reducing the nonresponse bias as much as possible. However, commonly used imputation procedures may lead to unstable estimators of population totals/means when influential units are present in the set of respondents. In this article, we consider the class of multiply robust imputation procedures that provide some protection against the failure of underlying model assumptions. We develop an efficient version of multiply robust estimators based on the concept of conditional bias, a measure of influence. We present the results of a simulation study to show the benefits of our proposed method in terms of bias and efficiency.

项目无反应是调查中常见的问题。由于未调整的估计量在无响应的情况下可能会有偏差,因此通常的做法是将丢失的值归为尽可能减少无响应偏差的目标。然而,当有影响力的单位出现在应答者集合中时,常用的归算程序可能导致人口总数/平均值的估计不稳定。在本文中,我们考虑了一类多重鲁棒imputation过程,这些过程提供了一些防止潜在模型假设失败的保护。我们开发了一个有效的版本的多重稳健估计基于条件偏差的概念,影响的量度。我们提出了一个模拟研究的结果,以显示我们提出的方法在偏差和效率方面的好处。
{"title":"Efficient multiply robust imputation in the presence of influential units in surveys","authors":"Sixia Chen,&nbsp;David Haziza,&nbsp;Victoire Michal","doi":"10.1002/cjs.11802","DOIUrl":"10.1002/cjs.11802","url":null,"abstract":"<p>Item nonresponse is a common issue in surveys. Because unadjusted estimators may be biased in the presence of nonresponse, it is common practice to impute the missing values with the objective of reducing the nonresponse bias as much as possible. However, commonly used imputation procedures may lead to unstable estimators of population totals/means when influential units are present in the set of respondents. In this article, we consider the class of multiply robust imputation procedures that provide some protection against the failure of underlying model assumptions. We develop an efficient version of multiply robust estimators based on the concept of conditional bias, a measure of influence. We present the results of a simulation study to show the benefits of our proposed method in terms of bias and efficiency.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.8,"publicationDate":"2023-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11802","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138544108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Football group draw probabilities and corrections 足球小组抽签概率和修正
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-11-10 DOI: 10.1002/cjs.11798
Gareth O. Roberts, Jeffrey S. Rosenthal

This article considers the challenge of designing football group draw mechanisms, which have a uniform distribution over all valid draw assignments, but are also entertaining, practical and transparent. Although this problem is trivial in completely symmetric problems, it becomes challenging when there are draw constraints that are not exchangeable across each of the competing teams, so that symmetry breaks down. We explain how to simulate the FIFA sequential draw method and compute the nonuniformity of its draws by comparison with a uniform rejection sampler. We then propose several practical methods of achieving the uniform distribution while still using balls and bowls in a way which is suitable for a televised draw. The solutions can also be carried out interactively. The general methodology we provide can readily be transported to different competition draws and is not restricted to football events.

本文探讨了设计足球小组抽签机制所面临的挑战,这种机制既要在所有有效的抽签分配中实现均匀分布,又要具有娱乐性、实用性和透明度。虽然这个问题在完全对称的问题中微不足道,但当存在抽签约束时,它就变得具有挑战性,因为这些抽签约束在每个参赛队之间都是不可交换的,这样对称性就被打破了。我们解释了如何模拟国际足联的顺序抽签法,并通过与均匀剔除采样器的比较计算其抽签的不均匀性。然后,我们提出了几种实现均匀分布的实用方法,同时还以适合电视直播抽签的方式使用球和碗。这些解决方案也可以交互式进行。我们提供的一般方法可以很容易地应用到不同的比赛抽签中,并不局限于足球比赛。
{"title":"Football group draw probabilities and corrections","authors":"Gareth O. Roberts,&nbsp;Jeffrey S. Rosenthal","doi":"10.1002/cjs.11798","DOIUrl":"10.1002/cjs.11798","url":null,"abstract":"<p>This article considers the challenge of designing football group draw mechanisms, which have a uniform distribution over all valid draw assignments, but are also entertaining, practical and transparent. Although this problem is trivial in completely symmetric problems, it becomes challenging when there are draw constraints that are not exchangeable across each of the competing teams, so that symmetry breaks down. We explain how to simulate the FIFA sequential draw method and compute the nonuniformity of its draws by comparison with a uniform rejection sampler. We then propose several practical methods of achieving the uniform distribution while still using balls and bowls in a way which is suitable for a televised draw. The solutions can also be carried out interactively. The general methodology we provide can readily be transported to different competition draws and is not restricted to football events.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.8,"publicationDate":"2023-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135141646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonparametric estimation of a survival function in the presence of measurement errors on the failure time of interest 在相关故障时间存在测量误差的情况下,对生存函数进行非参数估计
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-11-10 DOI: 10.1002/cjs.11799
Shaojia Jin, Yanyan Liu, Guangcai Mao, Jianguo Sun, Yuanshan Wu

This article discusses nonparametric estimation of a survival function in the presence of measurement errors on the observation of the failure time of interest. One situation where such issues arise would be clinical studies of chronic diseases where the observation on the time to the failure event of interest such as the onset of the disease relies on patient recall or chart review of electronic medical records. It is easy to see that both situations can be subject to measurement errors. To resolve this problem, we propose a simulation extrapolation approach to correct the bias induced by the measurement error. To overcome potential computational difficulties, we use spline regression to approximate the unspecified extrapolated coefficient function of time, and establish the asymptotic properties of our proposed estimator. The proposed method is applied to nonparametric estimation based on interval-censored data. Extensive numerical experiments involving both simulated and actual study datasets demonstrate the feasibility of this proposed estimation procedure.

这篇文章讨论了在对相关失效时间的观测存在测量误差的情况下,对生存函数进行非参数估计的问题。出现此类问题的一种情况是慢性疾病的临床研究,其中对相关失效事件(如发病)发生时间的观察依赖于患者回忆或电子病历的图表审查。不难看出,这两种情况都可能存在测量误差。为了解决这个问题,我们提出了一种模拟外推法来纠正测量误差引起的偏差。为了克服潜在的计算困难,我们使用样条回归来逼近未指定的时间外推系数函数,并建立了我们提出的估计器的渐近特性。我们将所提出的方法应用于基于区间删失数据的非参数估计。涉及模拟数据集和实际研究数据集的大量数值实验证明了所提估计程序的可行性。
{"title":"Nonparametric estimation of a survival function in the presence of measurement errors on the failure time of interest","authors":"Shaojia Jin,&nbsp;Yanyan Liu,&nbsp;Guangcai Mao,&nbsp;Jianguo Sun,&nbsp;Yuanshan Wu","doi":"10.1002/cjs.11799","DOIUrl":"10.1002/cjs.11799","url":null,"abstract":"<p>This article discusses nonparametric estimation of a survival function in the presence of measurement errors on the observation of the failure time of interest. One situation where such issues arise would be clinical studies of chronic diseases where the observation on the time to the failure event of interest such as the onset of the disease relies on patient recall or chart review of electronic medical records. It is easy to see that both situations can be subject to measurement errors. To resolve this problem, we propose a simulation extrapolation approach to correct the bias induced by the measurement error. To overcome potential computational difficulties, we use spline regression to approximate the unspecified extrapolated coefficient function of time, and establish the asymptotic properties of our proposed estimator. The proposed method is applied to nonparametric estimation based on interval-censored data. Extensive numerical experiments involving both simulated and actual study datasets demonstrate the feasibility of this proposed estimation procedure.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.8,"publicationDate":"2023-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135141149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fused mean structure learning in data integration with dependence 具有依赖性的数据整合中的融合均值结构学习
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-10-27 DOI: 10.1002/cjs.11797
Emily C. Hector

Motivated by image-on-scalar regression with data aggregated across multiple sites, we consider a setting in which multiple independent studies each collect multiple dependent vector outcomes, with potential mean model parameter homogeneity between studies and outcome vectors. To determine the validity of a joint analysis of these data sources, we must learn which of them share mean model parameters. We propose a new model fusion approach that delivers improved flexibility and statistical performance over existing methods. Our proposed approach specifies a quadratic inference function within each data source and fuses mean model parameter vectors in their entirety based on a new formulation of a pairwise fusion penalty. We establish theoretical properties of our estimator and propose an asymptotically equivalent weighted oracle meta-estimator that is more computationally efficient. Simulations and an application to the ABIDE neuroimaging consortium highlight the flexibility of the proposed approach. An R package is provided for ease of implementation.

受图像-尺度回归与多站点数据汇总的启发,我们考虑了这样一种情况,即多项独立研究各自收集多个因变向量结果,而研究与结果向量之间可能存在平均模型参数同质性。为了确定对这些数据源进行联合分析的有效性,我们必须了解其中哪些数据源共享平均模型参数。我们提出了一种新的模型融合方法,与现有方法相比,这种方法具有更好的灵活性和统计性能。我们提出的方法在每个数据源中指定了一个二次推理函数,并根据成对融合罚则的新表述融合了整个平均模型参数向量。我们建立了估计器的理论属性,并提出了一种计算效率更高的渐进等效加权甲骨文元估计器。模拟和在 ABIDE 神经成像联盟中的应用凸显了所提方法的灵活性。为了便于实施,我们还提供了一个 R 软件包。
{"title":"Fused mean structure learning in data integration with dependence","authors":"Emily C. Hector","doi":"10.1002/cjs.11797","DOIUrl":"10.1002/cjs.11797","url":null,"abstract":"<p>Motivated by image-on-scalar regression with data aggregated across multiple sites, we consider a setting in which multiple independent studies each collect multiple dependent vector outcomes, with potential mean model parameter homogeneity between studies and outcome vectors. To determine the validity of a joint analysis of these data sources, we must learn which of them share mean model parameters. We propose a new model fusion approach that delivers improved flexibility and statistical performance over existing methods. Our proposed approach specifies a quadratic inference function within each data source and fuses mean model parameter vectors in their entirety based on a new formulation of a pairwise fusion penalty. We establish theoretical properties of our estimator and propose an asymptotically equivalent weighted oracle meta-estimator that is more computationally efficient. Simulations and an application to the ABIDE neuroimaging consortium highlight the flexibility of the proposed approach. An <span>R</span> package is provided for ease of implementation.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.8,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11797","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136317032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Canadian Journal of Statistics-Revue Canadienne De Statistique
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1