首页 > 最新文献

Canadian Journal of Statistics-Revue Canadienne De Statistique最新文献

英文 中文
Robust reflections 健壮的倒影
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-07-04 DOI: 10.1002/cjs.11709
David Andrews, Chris Field

Two senior statisticians/data scientists reflect on the challenges arising from the analysis of increasingly complex data using robustness. They include some thoughts on the types of robust analysis that will be needed in the future, while cognizant of our very limited ability to successfully predict the future.

两位资深统计学家/数据科学家反思了使用鲁棒性分析日益复杂的数据所带来的挑战。其中包括一些关于未来需要的稳健分析类型的想法,同时认识到我们成功预测未来的能力非常有限。
{"title":"Robust reflections","authors":"David Andrews,&nbsp;Chris Field","doi":"10.1002/cjs.11709","DOIUrl":"10.1002/cjs.11709","url":null,"abstract":"<p>Two senior statisticians/data scientists reflect on the challenges arising from the analysis of increasingly complex data using robustness. They include some thoughts on the types of robust analysis that will be needed in the future, while cognizant of our very limited ability to successfully predict the future.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11709","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44229400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Canadian Statistical Sciences Institute 2003–2022 加拿大统计科学研究所2003-2022
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-07-04 DOI: 10.1002/cjs.11716
Mary Thompson, Nancy Reid, Don Estep

This article describes the founding and growth of the Canadian Statistical Sciences Institute (CANSSI), starting from its early roots and continuing through to establishment as a mature research enterprise. The goal is to present a historical record of events and activities that were important in the development of CANSSI.

本文描述了加拿大统计科学研究所(CANSSI)的成立和成长,从它早期的根源开始,并继续通过建立作为一个成熟的研究企业。目标是呈现在CANSSI发展过程中重要的事件和活动的历史记录。
{"title":"The Canadian Statistical Sciences Institute 2003–2022","authors":"Mary Thompson,&nbsp;Nancy Reid,&nbsp;Don Estep","doi":"10.1002/cjs.11716","DOIUrl":"10.1002/cjs.11716","url":null,"abstract":"<p>This article describes the founding and growth of the Canadian Statistical Sciences Institute (CANSSI), starting from its early roots and continuing through to establishment as a mature research enterprise. The goal is to present a historical record of events and activities that were important in the development of CANSSI.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11716","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42193083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pseudo empirical likelihood inference for nonprobability survey samples 非概率调查样本的伪经验似然推理
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-07-04 DOI: 10.1002/cjs.11708
Yilin Chen, Pengfei Li, J. N. K. Rao, Changbao Wu

In this article, we first provide an overview of two major developments on complex survey data analysis: the empirical likelihood methods and statistical inference with nonprobability survey samples. We highlight the important research contributions to the field of survey sampling in general and the two topics in particular by Canadian survey statisticians. We then propose new inferential procedures for analyzing nonprobability survey samples through the pseudo empirical likelihood approach. The proposed methods lead to point estimators asymptotically equivalent to those discussed in the recent literature but with more desirable features on confidence intervals such as range-respecting and data-driven orientation. Results from a simulation study demonstrate the superiority of the proposed methods in dealing with binary response variables.

在本文中,我们首先概述了复杂调查数据分析的两个主要发展:经验似然方法和非概率调查样本的统计推断。我们强调调查抽样领域的重要研究贡献,特别是加拿大调查统计学家的两个主题。然后,我们提出了新的推理程序,通过伪经验似然方法来分析非概率调查样本。所提出的方法导致点估计渐近等价于最近文献中讨论的那些,但具有更理想的置信区间特征,如范围尊重和数据驱动的方向。仿真研究结果表明,所提方法在处理二元响应变量方面具有优越性。
{"title":"Pseudo empirical likelihood inference for nonprobability survey samples","authors":"Yilin Chen,&nbsp;Pengfei Li,&nbsp;J. N. K. Rao,&nbsp;Changbao Wu","doi":"10.1002/cjs.11708","DOIUrl":"10.1002/cjs.11708","url":null,"abstract":"<p>In this article, we first provide an overview of two major developments on complex survey data analysis: the empirical likelihood methods and statistical inference with nonprobability survey samples. We highlight the important research contributions to the field of survey sampling in general and the two topics in particular by Canadian survey statisticians. We then propose new inferential procedures for analyzing nonprobability survey samples through the pseudo empirical likelihood approach. The proposed methods lead to point estimators asymptotically equivalent to those discussed in the recent literature but with more desirable features on confidence intervals such as range-respecting and data-driven orientation. Results from a simulation study demonstrate the superiority of the proposed methods in dealing with binary response variables.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11708","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48329998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Reflections on Bayesian inference and Markov chain Monte Carlo 关于贝叶斯推理和马尔可夫链蒙特卡罗的思考
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-07-04 DOI: 10.1002/cjs.11707
Radu V. Craiu, Paul Gustafson, Jeffrey S. Rosenthal

Bayesian inference and Markov chain Monte Carlo methods are vigorous areas of statistical research. Here we reflect on some recent developments and future directions in these fields.

贝叶斯推理和马尔可夫链蒙特卡罗方法是统计学研究的热点。在这里,我们回顾一下这些领域的一些最新发展和未来方向。
{"title":"Reflections on Bayesian inference and Markov chain Monte Carlo","authors":"Radu V. Craiu,&nbsp;Paul Gustafson,&nbsp;Jeffrey S. Rosenthal","doi":"10.1002/cjs.11707","DOIUrl":"10.1002/cjs.11707","url":null,"abstract":"<p>Bayesian inference and Markov chain Monte Carlo methods are vigorous areas of statistical research. Here we reflect on some recent developments and future directions in these fields.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11707","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42447477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Distributed estimation with empirical likelihood 基于经验似然的分布估计
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-06-21 DOI: 10.1002/cjs.11706
Qianqian Liu, Zhouping Li

With the development of science and technology, massive datasets stored in multiple machines are increasingly prevalent. It is known that traditional statistical methods may be infeasible for analyzing large datasets owing to excessive computing time, memory limitations, communication costs, and privacy concerns. This article develops divide-and-conquer empirical likelihood (DEL) and divide-and-conquer exponentially tilted empirical likelihood (DETEL) methods for the distributed computing setting. We investigate the theoretical properties of the DEL and DETEL estimators. In particular, we derive upper bounds for the mean squared errors of the DEL and DETEL estimators, and, under some mild conditions, we prove the consistency and the asymptotic normality of the proposed estimators. Simulation studies and a real data analysis are carried out to demonstrate the finite-sample performance of the proposed methods.

随着科学技术的发展,存储在多台机器中的海量数据集越来越普遍。众所周知,由于计算时间过长、内存限制、通信成本和隐私问题,传统的统计方法可能不适用于分析大型数据集。本文为分布式计算环境开发了分治经验似然(DEL)和分治指数倾斜经验似然(DETEL)方法。我们研究了DEL和DETEL估计量的理论性质。特别地,我们导出了DEL和DETEL估计量的均方误差的上界,并且在一些温和的条件下,我们证明了所提出的估计量的一致性和渐近正态性。进行了仿真研究和实际数据分析,以证明所提出方法的有限样本性能。
{"title":"Distributed estimation with empirical likelihood","authors":"Qianqian Liu,&nbsp;Zhouping Li","doi":"10.1002/cjs.11706","DOIUrl":"10.1002/cjs.11706","url":null,"abstract":"<p>With the development of science and technology, massive datasets stored in multiple machines are increasingly prevalent. It is known that traditional statistical methods may be infeasible for analyzing large datasets owing to excessive computing time, memory limitations, communication costs, and privacy concerns. This article develops divide-and-conquer empirical likelihood (DEL) and divide-and-conquer exponentially tilted empirical likelihood (DETEL) methods for the distributed computing setting. We investigate the theoretical properties of the DEL and DETEL estimators. In particular, we derive upper bounds for the mean squared errors of the DEL and DETEL estimators, and, under some mild conditions, we prove the consistency and the asymptotic normality of the proposed estimators. Simulation studies and a real data analysis are carried out to demonstrate the finite-sample performance of the proposed methods.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41412417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Reducing bias due to misclassified exposures using instrumental variables 减少由于使用工具变量对暴露进行错误分类而产生的偏差
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-06-20 DOI: 10.1002/cjs.11705
Christopher Manuel, Samiran Sinha, Suojin Wang

Exposures are often misclassified in observational studies. Any analysis that does not make proper adjustments for misclassification may result in biased estimates of model parameters, resulting in distorted inference. Settings where a multicategory exposure variable has more than two nominal categories or where no validation data are available to assess misclassification probabilities are common in practice but seldom considered in the literature. This article presents a novel method of analyzing cohort data with a misclassified, multicategory exposure variable and a binary response variable that uses instrumental variables in lieu of a validation dataset. First, a sufficient condition is obtained for model identifiability. Then, methods for model estimation and inference are proposed after adopting a sufficient condition for identifiability. We consider a variational Bayesian inference procedure aided by automatic differentiation along with Markov chain Monte Carlo-based computation. Operating characteristics of the proposed methods are assessed through simulation studies. For the purpose of illustration, the proposed Bayesian methods are applied to the U.S. breast cancer mortality data sampled from the Surveillance Epidemiology and End Results database, where reported treatment therapy is the misclassified multicategory exposure variable.

在观察性研究中,暴露常常被错误地分类。任何没有对错误分类进行适当调整的分析都可能导致模型参数的估计有偏差,从而导致推理失真。多类别暴露变量具有两个以上名义类别或没有可用验证数据来评估误分类概率的设置在实践中很常见,但在文献中很少考虑。本文提出了一种分析队列数据的新方法,该方法使用错误分类,多类别暴露变量和使用工具变量代替验证数据集的二元响应变量。首先,得到了模型可辨识性的充分条件。然后,采用可辨识性的充分条件,提出了模型估计和推理的方法。我们考虑了一个由自动微分辅助的变分贝叶斯推理过程以及基于马尔可夫链的蒙特卡罗计算。通过仿真研究评估了所提出方法的工作特性。为了说明这一点,我们将提出的贝叶斯方法应用于从监测流行病学和最终结果数据库中抽样的美国乳腺癌死亡率数据,其中报告的治疗方法是错误分类的多类别暴露变量。
{"title":"Reducing bias due to misclassified exposures using instrumental variables","authors":"Christopher Manuel,&nbsp;Samiran Sinha,&nbsp;Suojin Wang","doi":"10.1002/cjs.11705","DOIUrl":"10.1002/cjs.11705","url":null,"abstract":"<p>Exposures are often misclassified in observational studies. Any analysis that does not make proper adjustments for misclassification may result in biased estimates of model parameters, resulting in distorted inference. Settings where a multicategory exposure variable has more than two nominal categories or where no validation data are available to assess misclassification probabilities are common in practice but seldom considered in the literature. This article presents a novel method of analyzing cohort data with a misclassified, multicategory exposure variable and a binary response variable that uses instrumental variables in lieu of a validation dataset. First, a sufficient condition is obtained for model identifiability. Then, methods for model estimation and inference are proposed after adopting a sufficient condition for identifiability. We consider a variational Bayesian inference procedure aided by automatic differentiation along with Markov chain Monte Carlo-based computation. Operating characteristics of the proposed methods are assessed through simulation studies. For the purpose of illustration, the proposed Bayesian methods are applied to the U.S. breast cancer mortality data sampled from the Surveillance Epidemiology and End Results database, where reported treatment therapy is the misclassified multicategory exposure variable.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48218452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Likelihood identifiability and parameter estimation with nonignorable missing data 不可忽略缺失数据的似然可辨识性和参数估计
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-05-27 DOI: 10.1002/cjs.11704
Siming Zheng, Juan Zhang, Yong Zhou

We identify sufficient conditions to resolve the identification problem under nonignorable missingness, especially the identifiability of the observed likelihood when some of the covariate values are missing not at random, or, simultaneously, the response is also missing not at random. It is more difficult to tackle these cases than the nonignorable nonresponse case, and, to the best of our knowledge, the simultaneously missing case has never been discussed before. Under these conditions, we propose some parameter estimation methods. As an illustration, when some of the covariate values are missing not at random, we adopt a semiparametric logistic model with a tilting parameter to model the missingness mechanism and use an imputed estimating equation based on the generalized method of moments to estimate the parameters of interest and the tilting parameter simultaneously. This approach avoids the requirement for other independent surveys or a validation sample to estimate the unknown tilting parameter. The asymptotic properties of our proposed estimators are derived, and the proofs can be modified to show that our methods of estimation, which are based on inverse probability weighting, augmented inverse probability weighting, and estimating equation projection, have the same asymptotic efficiency when the tilting parameter is either known or unknown but estimated by some other method. In simulation studies, we compare our methods with various alternative approaches and find that our methods are more robust and effective.

我们确定了解决不可忽略缺失下的识别问题的充分条件,特别是当一些协变量值非随机缺失时,或者同时,响应也非随机缺失时,观测似然的可识别性。处理这些情况比处理不可忽视的无反应情况要困难得多,而且,据我们所知,同时失踪的情况以前从未讨论过。在这种情况下,我们提出了一些参数估计方法。举例说明,当某些协变量值存在非随机缺失时,我们采用带倾斜参数的半参数逻辑模型来模拟缺失机制,并使用基于广义矩量法的估算方程同时估计感兴趣的参数和倾斜参数。这种方法避免了对其他独立调查或验证样本的需求来估计未知的倾斜参数。推导了所提估计量的渐近性质,并对证明进行了修正,证明了在倾斜参数已知或未知但采用其他方法估计时,基于逆概率加权、增广逆概率加权和估计方程投影的估计方法具有相同的渐近效率。在仿真研究中,我们将我们的方法与各种替代方法进行了比较,发现我们的方法更加鲁棒和有效。
{"title":"Likelihood identifiability and parameter estimation with nonignorable missing data","authors":"Siming Zheng,&nbsp;Juan Zhang,&nbsp;Yong Zhou","doi":"10.1002/cjs.11704","DOIUrl":"10.1002/cjs.11704","url":null,"abstract":"<p>We identify sufficient conditions to resolve the identification problem under nonignorable missingness, especially the identifiability of the observed likelihood when some of the covariate values are missing not at random, or, simultaneously, the response is also missing not at random. It is more difficult to tackle these cases than the nonignorable nonresponse case, and, to the best of our knowledge, the simultaneously missing case has never been discussed before. Under these conditions, we propose some parameter estimation methods. As an illustration, when some of the covariate values are missing not at random, we adopt a semiparametric logistic model with a tilting parameter to model the missingness mechanism and use an imputed estimating equation based on the generalized method of moments to estimate the parameters of interest and the tilting parameter simultaneously. This approach avoids the requirement for other independent surveys or a validation sample to estimate the unknown tilting parameter. The asymptotic properties of our proposed estimators are derived, and the proofs can be modified to show that our methods of estimation, which are based on inverse probability weighting, augmented inverse probability weighting, and estimating equation projection, have the same asymptotic efficiency when the tilting parameter is either known or unknown but estimated by some other method. In simulation studies, we compare our methods with various alternative approaches and find that our methods are more robust and effective.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47145609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic treatment regimes with interference 有干扰的动态治疗方案
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-05-24 DOI: 10.1002/cjs.11702
Cong Jiang, Michael P. Wallace, Mary E. Thompson

Precision medicine describes health care where patient-level data are used to inform treatment decisions. Within this framework, dynamic treatment regimes (DTRs) are sequences of decision rules that take individual patient information as input data and then output treatment recommendations. DTR estimation from observational data typically relies on the assumption of no interference: i.e., the outcome of one individual is unaffected by the treatment assignment of others. However, in many social network contexts, such as friendship or family networks, and for many health concerns, such as infectious diseases, this assumption is questionable. We investigate the DTR estimation method of dynamic weighted ordinary least squares (dWOLS), which boasts of easy implementation and the so-called double-robustness property, but relies on the assumption of no interference. We define a network propensity function and build on it to establish an implementation of dWOLS that remains doubly robust under interference associated with network links. The method's properties are demonstrated via simulation and applied to data from the Population Assessment of Tobacco and Health (PATH) study to investigate cigarette dependence within two-person household networks.

精准医学描述了使用患者水平数据来为治疗决策提供信息的医疗保健。在这个框架中,动态治疗方案(DTRs)是一系列的决策规则,将个体患者信息作为输入数据,然后输出治疗建议。从观察数据估计DTR通常依赖于无干扰的假设:即,一个人的结果不受其他人的治疗分配的影响。然而,在许多社会网络环境中,如友谊或家庭网络,以及许多健康问题,如传染病,这种假设是值得怀疑的。本文研究了动态加权普通最小二乘(dWOLS)的DTR估计方法,该方法具有易于实现和所谓的双鲁棒性,但依赖于无干扰假设。我们定义了一个网络倾向函数,并在此基础上建立了一个在与网络链接相关的干扰下保持双重鲁棒性的dWOLS实现。通过模拟证明了该方法的特性,并将其应用于烟草与健康人口评估(PATH)研究的数据,该研究旨在调查两人家庭网络中的卷烟依赖。
{"title":"Dynamic treatment regimes with interference","authors":"Cong Jiang,&nbsp;Michael P. Wallace,&nbsp;Mary E. Thompson","doi":"10.1002/cjs.11702","DOIUrl":"10.1002/cjs.11702","url":null,"abstract":"<p>Precision medicine describes health care where patient-level data are used to inform treatment decisions. Within this framework, dynamic treatment regimes (DTRs) are sequences of decision rules that take individual patient information as input data and then output treatment recommendations. DTR estimation from observational data typically relies on the assumption of no interference: i.e., the outcome of one individual is unaffected by the treatment assignment of others. However, in many social network contexts, such as friendship or family networks, and for many health concerns, such as infectious diseases, this assumption is questionable. We investigate the DTR estimation method of dynamic weighted ordinary least squares (dWOLS), which boasts of easy implementation and the so-called double-robustness property, but relies on the assumption of no interference. We define a network propensity function and build on it to establish an implementation of dWOLS that remains doubly robust under interference associated with network links. The method's properties are demonstrated via simulation and applied to data from the Population Assessment of Tobacco and Health (PATH) study to investigate cigarette dependence within two-person household networks.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47847365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Missing data analysis with sufficient dimension reduction 缺失的数据分析与足够的降维
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-05-11 DOI: 10.1002/cjs.11700
Siming Zheng, Alan T. K. Wan, Yong Zhou

This article develops a two-step procedure for estimating the unknown parameters in a model that contains a fixed but large number of covariates, more moment conditions than unknown parameters, and responses that are missing at random. We propose a sufficient dimension reduction method to be implemented in the first step and prove that the method is asymptotically valid. In the second step, we apply three well-known missing data handling mechanisms together with the generalized method of moments to the reduced-dimensional subspace to obtain estimates of unknown parameters. We investigate the theoretical properties of the proposed methods, including the effects of dimension reduction on the asymptotic distributions of the estimators. Our results refute a claim in an earlier study that dimension reduction yields the same asymptotic distributions of estimators as when the reduced-dimensional structure is the true structure. We illustrate our method by way of a simulation study and a real clinical trial data example.

本文开发了一个两步程序,用于估计模型中的未知参数,该模型包含固定但大量的协变量,比未知参数更多的力矩条件,以及随机丢失的响应。我们提出了一个在第一步中实现的充分降维方法,并证明了该方法的渐近有效性。在第二步中,我们将三种已知的缺失数据处理机制与广义矩方法一起应用于降维子空间,以获得未知参数的估计。我们研究了所提方法的理论性质,包括降维对估计量渐近分布的影响。我们的结果驳斥了先前研究中的一个说法,即当降维结构是真实结构时,降维产生的估计量的渐近分布与降维结构相同。我们通过一个模拟研究和一个真实的临床试验数据例子来说明我们的方法。
{"title":"Missing data analysis with sufficient dimension reduction","authors":"Siming Zheng,&nbsp;Alan T. K. Wan,&nbsp;Yong Zhou","doi":"10.1002/cjs.11700","DOIUrl":"10.1002/cjs.11700","url":null,"abstract":"<p>This article develops a two-step procedure for estimating the unknown parameters in a model that contains a fixed but large number of covariates, more moment conditions than unknown parameters, and responses that are missing at random. We propose a sufficient dimension reduction method to be implemented in the first step and prove that the method is asymptotically valid. In the second step, we apply three well-known missing data handling mechanisms together with the generalized method of moments to the reduced-dimensional subspace to obtain estimates of unknown parameters. We investigate the theoretical properties of the proposed methods, including the effects of dimension reduction on the asymptotic distributions of the estimators. Our results refute a claim in an earlier study that dimension reduction yields the same asymptotic distributions of estimators as when the reduced-dimensional structure is the true structure. We illustrate our method by way of a simulation study and a real clinical trial data example.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44037737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Composite bias-reduced L p -quantile-based estimators of extreme quantiles and expectiles 基于组合偏倚减少的Lp分位数的极端分位数和期望分位数估计量
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-04-15 DOI: 10.1002/cjs.11703
Gilles Stupfler, Antoine Usseglio-Carleve

Quantiles are a fundamental concept in extreme value theory. They can be obtained from a minimization framework using an asymmetric absolute error loss criterion. The companion notion of expectiles, based on asymmetric squared rather than asymmetric absolute error loss minimization, has received substantial attention from the fields of actuarial science, finance, and econometrics over the last decade. Quantiles and expectiles can be embedded in a common framework of Lp-quantiles, whose extreme value properties have been explored very recently. Although this generalized notion of quantiles has shown potential for the estimation of extreme quantiles and expectiles, available estimators remain quite difficult to use: they suffer from substantial bias, and the question of the choice of the tuning parameter p remains open. In this article, we work in a context of heavy tails and construct composite bias-reduced estimators of extreme quantiles and expectiles based on Lp-quantiles. We provide a discussion of the data-driven choice of p and of the anchor Lp-quantile level in practice. The proposed methodology is compared with existing approaches on simulated data and real data.

分位数是极值理论中的一个基本概念。它们可以从使用非对称绝对误差损失准则的最小化框架中得到。在过去的十年中,基于非对称平方而非非对称绝对误差损失最小化的期望球的伴生概念受到了精算科学、金融和计量经济学领域的大量关注。分位数和期望位数可以嵌入到Lp -分位数的共同框架中,其极值属性最近已经被探索。虽然这种广义的分位数概念已经显示出估计极端分位数和预期位数的潜力,但可用的估计器仍然很难使用:它们存在很大的偏差,并且选择调谐参数p的问题仍然是开放的。在本文中,我们在重尾的背景下工作,并基于Lp -分位数构建了极端分位数和预期位数的复合偏倚减少估计器。我们提供了在实践中数据驱动的p和锚点Lp分位水平的选择的讨论。并在仿真数据和实际数据上与现有方法进行了比较。
{"title":"Composite bias-reduced \u0000 \u0000 \u0000 \u0000 L\u0000 \u0000 \u0000 p\u0000 \u0000 \u0000 -quantile-based estimators of extreme quantiles and expectiles","authors":"Gilles Stupfler,&nbsp;Antoine Usseglio-Carleve","doi":"10.1002/cjs.11703","DOIUrl":"10.1002/cjs.11703","url":null,"abstract":"<p>Quantiles are a fundamental concept in extreme value theory. They can be obtained from a minimization framework using an asymmetric absolute error loss criterion. The companion notion of expectiles, based on asymmetric squared rather than asymmetric absolute error loss minimization, has received substantial attention from the fields of actuarial science, finance, and econometrics over the last decade. Quantiles and expectiles can be embedded in a common framework of <math>\u0000 <mrow>\u0000 <msup>\u0000 <mrow>\u0000 <mi>L</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mi>p</mi>\u0000 </mrow>\u0000 </msup>\u0000 </mrow></math>-quantiles, whose extreme value properties have been explored very recently. Although this generalized notion of quantiles has shown potential for the estimation of extreme quantiles and expectiles, available estimators remain quite difficult to use: they suffer from substantial bias, and the question of the choice of the tuning parameter <math>\u0000 <mrow>\u0000 <mi>p</mi>\u0000 </mrow></math> remains open. In this article, we work in a context of heavy tails and construct composite bias-reduced estimators of extreme quantiles and expectiles based on <math>\u0000 <mrow>\u0000 <msup>\u0000 <mrow>\u0000 <mi>L</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mi>p</mi>\u0000 </mrow>\u0000 </msup>\u0000 </mrow></math>-quantiles. We provide a discussion of the data-driven choice of <math>\u0000 <mrow>\u0000 <mi>p</mi>\u0000 </mrow></math> and of the anchor <math>\u0000 <mrow>\u0000 <msup>\u0000 <mrow>\u0000 <mi>L</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mi>p</mi>\u0000 </mrow>\u0000 </msup>\u0000 </mrow></math>-quantile level in practice. The proposed methodology is compared with existing approaches on simulated data and real data.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48049797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Canadian Journal of Statistics-Revue Canadienne De Statistique
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1