首页 > 最新文献

Canadian Journal of Statistics-Revue Canadienne De Statistique最新文献

英文 中文
Testing for equality between conditional copulas given discretized conditioning events 给定离散条件事件条件连接词之间相等性的检验
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-11-15 DOI: 10.1002/cjs.11742
Alexis Derumigny, Jean-David Fermanian, Aleksey Min

Several procedures have been recently proposed to test the simplifying assumption for conditional copulas. Instead of considering pointwise conditioning events, we study the constancy of the conditional dependence structure when some covariates belong to general Borel conditioning subsets. We introduce several test statistics based on the equality of conditional Kendall's taus and derive their asymptotic distributions under the null hypothesis. In settings where such conditioning events are not fixed ex ante, we propose a data-driven procedure to recursively build such relevant subsets. This procedure is based on decision trees that maximize the differences between the conditional Kendall's taus, which correspond to the leaves of the trees. Empirical results for such tests are illustrated in the Supplementary Material. Moreover, a study of the conditional dependence between financial stock returns is presented and highlights specific contagion effects of past returns. The last application deals with conditional dependence between coverage amounts in an insurance dataset.

最近提出了几个程序来检验条件copula的简化假设。在不考虑点条件事件的情况下,研究了协变量属于一般玻耳条件子集时条件依赖结构的恒常性。介绍了几种基于条件肯德尔τ等式的检验统计量,并推导了它们在零下的渐近分布。当这些条件事件事先不固定时,我们提出了一个数据驱动的过程来递归地构建这些相关子集。它基于决策树,使与树的叶子相对应的条件肯德尔函数之间的差异最大化。仿真实验说明了这些测试的性能。此外,研究了金融股票收益之间的条件依赖关系,给定了它们过去的一些值的聚类。最后一个应用程序处理保险数据集中保险金额之间的条件依赖关系。
{"title":"Testing for equality between conditional copulas given discretized conditioning events","authors":"Alexis Derumigny,&nbsp;Jean-David Fermanian,&nbsp;Aleksey Min","doi":"10.1002/cjs.11742","DOIUrl":"10.1002/cjs.11742","url":null,"abstract":"<p>Several procedures have been recently proposed to test the simplifying assumption for conditional copulas. Instead of considering pointwise conditioning events, we study the constancy of the conditional dependence structure when some covariates belong to general Borel conditioning subsets. We introduce several test statistics based on the equality of conditional Kendall's taus and derive their asymptotic distributions under the null hypothesis. In settings where such conditioning events are not fixed ex ante, we propose a data-driven procedure to recursively build such relevant subsets. This procedure is based on decision trees that maximize the differences between the conditional Kendall's taus, which correspond to the leaves of the trees. Empirical results for such tests are illustrated in the Supplementary Material. Moreover, a study of the conditional dependence between financial stock returns is presented and highlights specific contagion effects of past returns. The last application deals with conditional dependence between coverage amounts in an insurance dataset.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11742","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44500073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
General minimum lower-order confounding three-level split-plot designs when the whole plot factors are important 当整个地块因素很重要时,一般最小低阶混淆三级分割地块设计
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-11-15 DOI: 10.1002/cjs.11744
Tao Sun, Shengli Zhao

Three-level fractional factorial split-plot (FFSP) designs with the whole plot (WP) factors being more important than the subplot factors are considered in the article. An aliased component-number pattern of type WP (WP-ACNP) is introduced for ranking such designs. The criterion of general minimum lower-order confounding of type WP (WP-GMC) is proposed based on WP-ACNP. The expressions of the key components in WP-ACNP are given via complementary sets. Some necessary conditions for FFSP designs to be WP-GMC FFSP designs are given and some three-level WP-GMC FFSP designs are constructed and tabulated.

本文考虑了整体因子比次要因子更重要的三水平分数因子分割图(FFSP)设计。引入了WP类型的别名组件数模式(WP- acnp)对此类设计进行排序。提出了基于WP- acnp的WP型一般最小低阶混杂判据(WP- gmc)。利用互补集给出了WP-ACNP中关键分量的表达式。给出了FFSP设计成为WP-GMC型FFSP设计的必要条件,并构造了一些三级WP-GMC型FFSP设计。
{"title":"General minimum lower-order confounding three-level split-plot designs when the whole plot factors are important","authors":"Tao Sun,&nbsp;Shengli Zhao","doi":"10.1002/cjs.11744","DOIUrl":"10.1002/cjs.11744","url":null,"abstract":"<p>Three-level fractional factorial split-plot (FFSP) designs with the whole plot (WP) factors being more important than the subplot factors are considered in the article. An aliased component-number pattern of type WP (WP-ACNP) is introduced for ranking such designs. The criterion of general minimum lower-order confounding of type WP (WP-GMC) is proposed based on WP-ACNP. The expressions of the key components in WP-ACNP are given via complementary sets. Some necessary conditions for FFSP designs to be WP-GMC FFSP designs are given and some three-level WP-GMC FFSP designs are constructed and tabulated.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41692296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Canadian contributions to environmetrics 加拿大对环境的贡献
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-11-09 DOI: 10.1002/cjs.11743
Charmaine B. Dean, Abdel H. El-Shaarawi, Sylvia R. Esterby, Joanna Mills Flemming, Richard D. Routledge, Stephen W. Taylor, Douglas G. Woolford, James V. Zidek, Francis W. Zwiers

This article focuses on the importance of collaboration in statistics by Canadian researchers and highlights the contributions that Canadian statisticians have made to many research areas in environmetrics. We provide a discussion about different vehicles that have been developed for collaboration by Canadians in the environmetrics context as well as specific scientific areas that are focused on environmetrics research in Canada including climate science, forestry, and fisheries, which are areas of importance for natural resources in Canada.

本文重点介绍了加拿大研究人员在统计领域合作的重要性,并强调了加拿大统计学家对环境计量学许多研究领域所做的贡献。我们讨论了加拿大人在环境计量学背景下为合作开发的不同工具,以及加拿大专注于环境计量学研究的特定科学领域,包括气候科学、林业和渔业,这些领域对加拿大的自然资源至关重要。
{"title":"Canadian contributions to environmetrics","authors":"Charmaine B. Dean,&nbsp;Abdel H. El-Shaarawi,&nbsp;Sylvia R. Esterby,&nbsp;Joanna Mills Flemming,&nbsp;Richard D. Routledge,&nbsp;Stephen W. Taylor,&nbsp;Douglas G. Woolford,&nbsp;James V. Zidek,&nbsp;Francis W. Zwiers","doi":"10.1002/cjs.11743","DOIUrl":"10.1002/cjs.11743","url":null,"abstract":"<p>This article focuses on the importance of collaboration in statistics by Canadian researchers and highlights the contributions that Canadian statisticians have made to many research areas in environmetrics. We provide a discussion about different vehicles that have been developed for collaboration by Canadians in the environmetrics context as well as specific scientific areas that are focused on environmetrics research in Canada including climate science, forestry, and fisheries, which are areas of importance for natural resources in Canada.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11743","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48528686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Pretest and shrinkage estimators in generalized partially linear models with application to real data 广义部分线性模型的预检验和收缩估计及其在实际数据中的应用
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-11-06 DOI: 10.1002/cjs.11732
Shakhawat Hossain, Saumen Mandal, Le An Lac

Semiparametric models hold promise to address many challenges to statistical inference that arise from real-world applications, but their novelty and theoretical complexity create challenges for estimation. Taking advantage of the broad applicability of semiparametric models, we propose some novel and improved methods to estimate the regression coefficients of generalized partially linear models (GPLM). This model extends the generalized linear model by adding a nonparametric component. Like in parametric models, variable selection is important in the GPLM to single out the inactive covariates for the response. Instead of deleting inactive covariates, our approach uses them as auxiliary information in the estimation procedure. We then define two models, one that includes all the covariates and another that includes the active covariates only. We then combine these two model estimators optimally to form the pretest and shrinkage estimators. Asymptotic properties are studied to derive the asymptotic biases and risks of the proposed estimators. We show that if the shrinkage dimension exceeds two, the asymptotic risks of the shrinkage estimators are strictly less than those of the full model estimators. Extensive Monte Carlo simulation studies are conducted to examine the finite-sample performance of the proposed estimation methods. We then apply our proposed methods to two real data sets. Our simulation and real data results show that the proposed estimators perform with higher accuracy and lower variability in the estimation of regression parameters for GPLM compared with competing estimation methods.

半参数模型有望解决来自现实世界应用的统计推断的许多挑战,但它们的新颖性和理论复杂性给估计带来了挑战。利用半参数模型的广泛适用性,提出了几种新的改进的广义部分线性模型(GPLM)回归系数估计方法。该模型通过加入非参数分量对广义线性模型进行了扩展。与参数模型一样,变量选择在GPLM中很重要,可以为响应挑选出不活跃的协变量。我们的方法不是删除不活跃的协变量,而是将它们作为估计过程中的辅助信息。然后我们定义了两个模型,一个包括所有协变量,另一个只包括活动协变量。然后,我们将这两个模型估计器最优地组合起来,形成预测试和收缩估计器。研究了渐近性质,得到了所提估计量的渐近偏差和风险。我们证明,如果收缩维数超过2,收缩估计量的渐近风险严格小于全模型估计量的渐近风险。进行了广泛的蒙特卡罗模拟研究,以检验所提出的估计方法的有限样本性能。然后,我们将我们提出的方法应用于两个真实的数据集。仿真和实际数据结果表明,与竞争对手的估计方法相比,所提出的估计方法在估计GPLM回归参数方面具有更高的精度和更低的变异性。
{"title":"Pretest and shrinkage estimators in generalized partially linear models with application to real data","authors":"Shakhawat Hossain,&nbsp;Saumen Mandal,&nbsp;Le An Lac","doi":"10.1002/cjs.11732","DOIUrl":"10.1002/cjs.11732","url":null,"abstract":"<p>Semiparametric models hold promise to address many challenges to statistical inference that arise from real-world applications, but their novelty and theoretical complexity create challenges for estimation. Taking advantage of the broad applicability of semiparametric models, we propose some novel and improved methods to estimate the regression coefficients of generalized partially linear models (GPLM). This model extends the generalized linear model by adding a nonparametric component. Like in parametric models, variable selection is important in the GPLM to single out the inactive covariates for the response. Instead of deleting inactive covariates, our approach uses them as auxiliary information in the estimation procedure. We then define two models, one that includes all the covariates and another that includes the active covariates only. We then combine these two model estimators optimally to form the pretest and shrinkage estimators. Asymptotic properties are studied to derive the asymptotic biases and risks of the proposed estimators. We show that if the shrinkage dimension exceeds two, the asymptotic risks of the shrinkage estimators are strictly less than those of the full model estimators. Extensive Monte Carlo simulation studies are conducted to examine the finite-sample performance of the proposed estimation methods. We then apply our proposed methods to two real data sets. Our simulation and real data results show that the proposed estimators perform with higher accuracy and lower variability in the estimation of regression parameters for GPLM compared with competing estimation methods.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42817936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Complex statistical modelling for phylogenetic inference 系统发育推断的复杂统计模型
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-10-29 DOI: 10.1002/cjs.11741
Edward Susko

Molecular sequence data are a primary source of information about evolutionary relationships. Over the past few decades, there have been dramatic increases in the sizes of data available. Consequently, focus has shifted towards increasingly complex models that are less prone to the biases that are a consequence of model misspecification. At the same time, the computational challenges, which were always substantial, have become greater due to these increasingly complex models and larger data sizes. In this article, we review phylogenetic inference using sequence data and some recent advances in phylogenetic modelling. We discuss strategies for dealing with complex models, future challenges and paths forward.

分子序列数据是进化关系信息的主要来源。在过去的几十年里,可用的数据量急剧增加。因此,焦点已经转移到越来越复杂的模型上,这些模型不太容易产生由模型错误说明导致的偏差。与此同时,由于这些日益复杂的模型和更大的数据规模,计算挑战变得越来越大,这一直是实质性的。在这篇文章中,我们回顾了利用序列数据的系统发育推断和系统发育模型的一些最新进展。我们讨论了处理复杂模型的策略、未来的挑战和前进的道路。
{"title":"Complex statistical modelling for phylogenetic inference","authors":"Edward Susko","doi":"10.1002/cjs.11741","DOIUrl":"10.1002/cjs.11741","url":null,"abstract":"<p>Molecular sequence data are a primary source of information about evolutionary relationships. Over the past few decades, there have been dramatic increases in the sizes of data available. Consequently, focus has shifted towards increasingly complex models that are less prone to the biases that are a consequence of model misspecification. At the same time, the computational challenges, which were always substantial, have become greater due to these increasingly complex models and larger data sizes. In this article, we review phylogenetic inference using sequence data and some recent advances in phylogenetic modelling. We discuss strategies for dealing with complex models, future challenges and paths forward.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11741","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42664013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sparse estimation of historical functional linear models with a nested group bridge approach 基于嵌套群桥方法的历史函数线性模型稀疏估计
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-10-28 DOI: 10.1002/cjs.11747
Xiaolei Xun, Tianyu Guan, Jiguo Cao

The conventional historical functional linear model relates the current value of the functional response at time t to all past values of the functional covariate up to time t. Motivated by situations where it is more reasonable to assume that only recent, instead of all, past values of the functional covariate have an impact on the functional response, in this work we investigate the historical functional linear model with an unknown forward time lag into the history. In addition to estimating the bivariate regression coefficient function, we also aim to identify the historical time lag from the data, which is important in many applications. To this end, we propose an estimation procedure that uses the finite element method to conform naturally to the trapezoidal domain of the bivariate coefficient function. We use a nested group bridge penalty to facilitate simultaneous estimation of the bivariate coefficient function and the historical lag, and show that our proposed estimators are consistent. We demonstrate this method of estimation in a real data example investigating the effect of muscle activation recorded via the noninvasive electromyography (EMG) method on lip acceleration during speech production. In addition, we examine the finite sample performance of our proposed method in comparison with the conventional approach to estimation via simulation studies.

传统的历史函数线性模型将函数响应在时间t的当前值与函数协变量直到时间t的所有过去值相关联。在这种情况下,更合理的假设是,只有函数协变量的最近而不是所有过去的值对函数响应有影响,在这项工作中,我们研究了历史上具有未知前向时滞的历史函数线性模型。除了估计二元回归系数函数外,我们还旨在从数据中识别历史时滞,这在许多应用中都很重要。为此,我们提出了一种估计程序,该程序使用有限元方法自然地符合二元系数函数的梯形域。我们使用嵌套的群桥惩罚来促进对二元系数函数和历史滞后的同时估计,并表明我们提出的估计是一致的。我们在一个真实数据示例中演示了这种估计方法,该示例研究了通过非侵入性肌电图(EMG)方法记录的肌肉激活对语音产生过程中嘴唇加速度的影响。此外,我们通过模拟研究,与传统的估计方法相比,检验了我们提出的方法的有限样本性能。
{"title":"Sparse estimation of historical functional linear models with a nested group bridge approach","authors":"Xiaolei Xun,&nbsp;Tianyu Guan,&nbsp;Jiguo Cao","doi":"10.1002/cjs.11747","DOIUrl":"10.1002/cjs.11747","url":null,"abstract":"<p>The conventional historical functional linear model relates the current value of the functional response at time <math>\u0000 <mrow>\u0000 <mi>t</mi>\u0000 </mrow></math> to all past values of the functional covariate up to time <math>\u0000 <mrow>\u0000 <mi>t</mi>\u0000 </mrow></math>. Motivated by situations where it is more reasonable to assume that only recent, instead of all, past values of the functional covariate have an impact on the functional response, in this work we investigate the historical functional linear model with an unknown forward time lag into the history. In addition to estimating the bivariate regression coefficient function, we also aim to identify the historical time lag from the data, which is important in many applications. To this end, we propose an estimation procedure that uses the finite element method to conform naturally to the trapezoidal domain of the bivariate coefficient function. We use a nested group bridge penalty to facilitate simultaneous estimation of the bivariate coefficient function and the historical lag, and show that our proposed estimators are consistent. We demonstrate this method of estimation in a real data example investigating the effect of muscle activation recorded via the noninvasive electromyography (EMG) method on lip acceleration during speech production. In addition, we examine the finite sample performance of our proposed method in comparison with the conventional approach to estimation via simulation studies.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11747","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48140759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Asymptotic theory in bipartite graph models with a growing number of parameters 具有越来越多参数的二部图模型的渐近理论
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-10-25 DOI: 10.1002/cjs.11735
Yifan Fan, Binyan Jiang, Ting Yan, Yuan Zhang

Affiliation networks contain a set of actors and a set of events, where edges denote the affiliation relationships between actors and events. Here, we introduce a class of affiliation network models for modelling the degree heterogeneity, where two sets of degree parameters are used to measure the activeness of actors and the popularity of events, respectively. We develop the moment method to infer these degree parameters. We establish a unified theoretical framework in which the consistency and asymptotic normality of the moment estimator hold as the numbers of actors and events both go to infinity. We apply our results to several popular models with weighted edges, including generalized β-, Poisson and Rayleigh models. Simulation studies and a realistic example that involves the Poisson model provide concrete evidence that supports our theoretical findings.

关联网络包含一组参与者和一组事件,其中的边表示参与者和事件之间的关联关系。在此,我们引入了一类隶属关系网络模型来建模程度异质性,其中两组程度参数分别用于衡量行为者的活跃度和事件的受欢迎程度。我们发展了矩量法来推断度参数。我们建立了一个统一的理论框架,其中矩估计量的相合性和渐近正态性在参与者和事件的数量都趋于无穷时保持不变。我们将结果应用于几种常用的带加权边的模型,包括广义β模型、泊松模型和瑞利模型。我们还在泊松模型下进行了仿真和实际数据应用来验证理论结果。
{"title":"Asymptotic theory in bipartite graph models with a growing number of parameters","authors":"Yifan Fan,&nbsp;Binyan Jiang,&nbsp;Ting Yan,&nbsp;Yuan Zhang","doi":"10.1002/cjs.11735","DOIUrl":"10.1002/cjs.11735","url":null,"abstract":"<p>Affiliation networks contain a set of actors and a set of events, where edges denote the affiliation relationships between actors and events. Here, we introduce a class of affiliation network models for modelling the degree heterogeneity, where two sets of degree parameters are used to measure the activeness of actors and the popularity of events, respectively. We develop the moment method to infer these degree parameters. We establish a unified theoretical framework in which the consistency and asymptotic normality of the moment estimator hold as the numbers of actors and events both go to infinity. We apply our results to several popular models with weighted edges, including generalized <math>\u0000 <mrow>\u0000 <mi>β</mi>\u0000 </mrow></math>-, Poisson and Rayleigh models. Simulation studies and a realistic example that involves the Poisson model provide concrete evidence that supports our theoretical findings.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11735","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44315581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Variation pattern classification of functional data 功能数据的变异模式分类
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-10-25 DOI: 10.1002/cjs.11738
Shuhao Jiao, Ron D. Frostig, Hernando Ombao

A new classification method for functional data is proposed in this article. This work is motivated by the need to identify features that discriminate between neurological conditions on which local field potentials (LFPs) were recorded. Regardless of the condition, these LFPs have zero mean, and thus the first moments of these random processes do not have discriminating power. We propose the variation pattern classification (VPC) method which employs the second-moment structure as the discriminating feature and uses the Hilbert–Schmidt norm to measure the discrepancy between the second-moment structure of different groups. The proposed VPC method is demonstrated to be sensitive to the discrepancy, potentially leading to a higher rate of classification. One important innovation lies in the dimension reduction where the VPC method adaptively determines the basis functions (discriminative feature functions) that account for the major discrepancy. In addition, the selected discriminative feature functions provide insights into the discrepancy between different groups because they reveal the features of variation pattern that differentiate groups. Consistency properties are established and, furthermore, simulation studies and the analysis of rat brain LFP trajectories empirically demonstrate the advantages and effectiveness of the proposed method.

针对不同组或类别的函数具有相似的均值函数但可能存在不同的秒矩的情况,提出了一种新的函数数据分类方法。提出的基于二阶矩的功能分类器(SMFC)方法使用Hilbert-Schmidt范数来度量不同组的二阶矩结构之间的差异。结果表明,该方法对二阶矩结构的差异非常敏感,因此与竞争对手的方法相比,产生了更低的误分类率。一个重要的创新在于降维步骤,其中SMFC方法数据自适应地确定占大部分差异的基函数。因此,误分类率降低了,因为它删除了功能数据中只有弱歧视的成分。此外,所选择的判别基函数可以揭示群体间的差异,因为基函数揭示了群体差异的变异模式特征。建立了一致性特性,并对音素和大鼠大脑活动轨迹进行了模拟研究和分析,经验证明了该方法的优越性。
{"title":"Variation pattern classification of functional data","authors":"Shuhao Jiao,&nbsp;Ron D. Frostig,&nbsp;Hernando Ombao","doi":"10.1002/cjs.11738","DOIUrl":"10.1002/cjs.11738","url":null,"abstract":"<p>A new classification method for functional data is proposed in this article. This work is motivated by the need to identify features that discriminate between neurological conditions on which local field potentials (LFPs) were recorded. Regardless of the condition, these LFPs have zero mean, and thus the first moments of these random processes do not have discriminating power. We propose the variation pattern classification (VPC) method which employs the second-moment structure as the discriminating feature and uses the Hilbert–Schmidt norm to measure the discrepancy between the second-moment structure of different groups. The proposed VPC method is demonstrated to be sensitive to the discrepancy, potentially leading to a higher rate of classification. One important innovation lies in the dimension reduction where the VPC method adaptively determines the basis functions (discriminative feature functions) that account for the major discrepancy. In addition, the selected discriminative feature functions provide insights into the discrepancy between different groups because they reveal the features of variation pattern that differentiate groups. Consistency properties are established and, furthermore, simulation studies and the analysis of rat brain LFP trajectories empirically demonstrate the advantages and effectiveness of the proposed method.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47579526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Automatic structure recovery for generalized additive models 广义加性模型的结构自动恢复
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-10-18 DOI: 10.1002/cjs.11739
Kai Shen, Yichao Wu

In this article, we propose an automatic structure recovery method for generalized additive models (GAMs) by extending Wu and Stefanski's approach. In a similar vein, the proposed method is based on a local scoring algorithm coupled with local polynomial smoothing, along with a kernel-based variable selection approach. Given a specific degree M, the goal is to identify predictors contributing polynomially at different degrees up to M and predictors that contribute beyond degree M. By focusing on two GAMs, logistic regression and Poisson regression, we illustrate the performance of the proposed method using Monte Carlo simulation studies and two real data examples.

本文在推广Wu和Stefanski方法的基础上,提出了一种广义加性模型(GAMs)的自动结构恢复方法。在类似的情况下,所提出的方法是基于局部评分算法,结合局部多项式平滑,以及基于核的变量选择方法。给定一个特定的度M,目标是识别在不同程度上多项式地贡献到M的预测因子和贡献超过度M的预测因子。以逻辑回归和泊松回归这两种GAMs为例,通过蒙特卡罗模拟研究和两个实际数据示例来说明所提出方法的性能。
{"title":"Automatic structure recovery for generalized additive models","authors":"Kai Shen,&nbsp;Yichao Wu","doi":"10.1002/cjs.11739","DOIUrl":"10.1002/cjs.11739","url":null,"abstract":"<p>In this article, we propose an automatic structure recovery method for generalized additive models (GAMs) by extending Wu and Stefanski's approach. In a similar vein, the proposed method is based on a local scoring algorithm coupled with local polynomial smoothing, along with a kernel-based variable selection approach. Given a specific degree <math>\u0000 <mrow>\u0000 <mi>M</mi>\u0000 </mrow></math>, the goal is to identify predictors contributing polynomially at different degrees up to <math>\u0000 <mrow>\u0000 <mi>M</mi>\u0000 </mrow></math> and predictors that contribute beyond degree <math>\u0000 <mrow>\u0000 <mi>M</mi>\u0000 </mrow></math>. By focusing on two GAMs, logistic regression and Poisson regression, we illustrate the performance of the proposed method using Monte Carlo simulation studies and two real data examples.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11739","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43267711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A high-dimensional inverse norm sign test for two-sample location problems 两样本定位问题的高维逆范数符号检验
IF 0.6 4区 数学 Q4 Mathematics Pub Date : 2022-10-17 DOI: 10.1002/cjs.11731
Xifen Huang, Binghui Liu, Qin Zhou, Long Feng

In this article, we focus on the two-sample location testing problem for high-dimensional data, where the data dimension is potentially much larger than the sample sizes. First, we construct a general class of weighted spatial sign tests for the two-sample location problem, which can include some existing high-dimensional nonparametric tests. Then, in this article, we find a locally most powerful test by choosing the inverse norm weight function, named the two-sample inverse norm sign test (tINST). The proposed test can be viewed as an extension of the inverse norm sign test devised for the one-sample problem. We establish the asymptotic properties of the proposed test, which indicate that it is consistent and has greater power than competing tests that belong to the proposed class of weighted spatial sign tests for two-sample location problems. Finally, a large number of numerical investigations and a practical biomedical example demonstrate the power and robustness advantages of the proposed test.

在本文中,我们关注高维数据的双样本位置测试问题,其中数据维度可能比样本大小大得多。首先,针对两样本定位问题,我们构造了一类广义的加权空间符号检验,它可以包含一些现有的高维非参数检验。然后,在本文中,我们通过选择逆范数权重函数来找到一个局部最强大的检验,称为双样本逆范数符号检验(tINST)。所提出的检验可以看作是针对单样本问题设计的逆范数检验的扩展。我们建立了所提出的检验的渐近性质,这表明它是一致的,并且比属于所提出的两样本定位问题加权空间符号检验类的竞争检验具有更大的幂。最后,大量的数值研究和一个实际的生物医学例子证明了所提出的测试的强大和鲁棒性优势。
{"title":"A high-dimensional inverse norm sign test for two-sample location problems","authors":"Xifen Huang,&nbsp;Binghui Liu,&nbsp;Qin Zhou,&nbsp;Long Feng","doi":"10.1002/cjs.11731","DOIUrl":"10.1002/cjs.11731","url":null,"abstract":"<p>In this article, we focus on the two-sample location testing problem for high-dimensional data, where the data dimension is potentially much larger than the sample sizes. First, we construct a general class of weighted spatial sign tests for the two-sample location problem, which can include some existing high-dimensional nonparametric tests. Then, in this article, we find a locally most powerful test by choosing the inverse norm weight function, named the two-sample inverse norm sign test (tINST). The proposed test can be viewed as an extension of the inverse norm sign test devised for the one-sample problem. We establish the asymptotic properties of the proposed test, which indicate that it is consistent and has greater power than competing tests that belong to the proposed class of weighted spatial sign tests for two-sample location problems. Finally, a large number of numerical investigations and a practical biomedical example demonstrate the power and robustness advantages of the proposed test.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45508907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Canadian Journal of Statistics-Revue Canadienne De Statistique
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1