Estimation of an unstructured covariance matrix is difficult because of the challenges posed by parameter space dimensionality and the positive‐definiteness constraint that estimates should satisfy. We consider a general nonparametric covariance estimation framework for longitudinal data using the Cholesky decomposition of a positive‐definite matrix. The covariance matrix of time‐ordered measurements is diagonalized by a lower triangular matrix with unconstrained entries that are statistically interpretable as parameters for a varying coefficient autoregressive model. Using this dual interpretation of the Cholesky decomposition and allowing for irregular sampling time points, we treat covariance estimation as bivariate smoothing and cast it in a regularization framework for desired forms of simplicity in covariance models. Viewing stationarity as a form of simplicity or parsimony in covariance, we model the varying coefficient function with components depending on time lag and its orthogonal direction separately and penalize the components that capture the nonstationarity in the fitted function. We demonstrate construction of a covariance estimator using the smoothing spline framework. Simulation studies establish the advantage of our approach over alternative estimators proposed in the longitudinal data setting. We analyze a longitudinal dataset to illustrate application of the methodology and compare our estimates to those resulting from alternative models.
Blake, T. A., & Lee, Y. (2020). Nonparametric covariance estimation with shrinkage toward stationary models. Wiley Interdisciplinary Reviews: Computational Statistics. https://doi.org/10.1002/wics.1507
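The dual interpretation of the Cholesky decomposition described in the abstract can be made concrete with a small numerical sketch. This is only an illustration of the decomposition itself, not the authors' smoothing-spline estimator; the AR(1) covariance and all variable names are hypothetical.

```python
import numpy as np

# Toy covariance of p time-ordered measurements with AR(1) structure
# (hypothetical example; the paper handles general, irregularly sampled data).
p = 5
times = np.arange(p)
Sigma = 0.8 ** np.abs(times[:, None] - times[None, :])

# Modified Cholesky decomposition: Sigma = L D L^T, L unit lower triangular.
C = np.linalg.cholesky(Sigma)        # ordinary Cholesky factor
d = np.diag(C) ** 2                  # innovation (prediction error) variances
L = C / np.diag(C)                   # unit lower-triangular factor

# T = L^{-1} diagonalizes Sigma: T Sigma T^T = D. Its negated below-diagonal
# entries are the autoregressive coefficients phi[t, s] in the model
# y_t = sum_{s < t} phi[t, s] * y_s + eps_t, with Var(eps_t) = d[t].
T = np.linalg.inv(L)
phi = -np.tril(T, k=-1)

D = T @ Sigma @ T.T                  # diagonal matrix with entries d
```

For this Markov example the only nonzero coefficients are phi[t, t-1] = 0.8, which illustrates the sense in which the unconstrained entries of the triangular factor are statistically interpretable.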
In this article, we present a review of existing methods and trends in survival analysis and frailty models. For each topic, the relevant background is presented so that the discussion flows from the original methods to more advanced ones. The article also surveys the various current methodologies for survival and frailty models, and the advantages and disadvantages of the more recent methodologies are presented and discussed.
Govindarajulu, U., & D'Agostino, R. (2020). Review of current advances in survival analysis and frailty models. Wiley Interdisciplinary Reviews: Computational Statistics, 12(1). https://doi.org/10.1002/wics.1504
We review the literature on spatial and spatiotemporal models based on spatial multiscale factorizations. Specifically, we review models based on wavelets and Kolaczyk–Huang factorizations for Gaussian and Poisson data. These multiscale models decompose spatial and spatiotemporal datasets into many small components, called multiscale coefficients, at multiple levels of spatial resolution. Then analysis proceeds independently for each multiscale coefficient. After that, aggregation equations are used to coherently combine the analyses from the multiple multiscale coefficients to obtain a statistical analysis at the original resolution level. The computational cost of such analysis grows linearly with sample size. Furthermore, computations for these models are scalable, parallelizable, and fast. Therefore, these multiscale models are tremendously useful for the analysis of massive spatial and spatiotemporal datasets.
Ferreira, M. A. R. (2020). Bayesian spatial and spatiotemporal models based on multiscale factorizations. Wiley Interdisciplinary Reviews: Computational Statistics. https://doi.org/10.1002/wics.1509
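The decompose-analyze-aggregate recipe behind these models can be sketched in one dimension. Below is a minimal toy of a Kolaczyk–Huang-style factorization for counts, assuming a 1-D grid whose length is a power of two; the reviewed models operate on spatial and spatiotemporal grids and place priors on the split proportions, which this sketch omits.

```python
import numpy as np

def multiscale_decompose(counts):
    """Pair up neighboring cells level by level; store each parent total
    together with its left child's count (the multiscale coefficient is the
    left child's share of the parent total). Linear cost in sample size."""
    levels = []
    x = np.asarray(counts, dtype=np.int64)
    while x.size > 1:
        parents = x[0::2] + x[1::2]
        levels.append((parents, x[0::2]))
        x = parents
    return int(x[0]), levels            # grand total plus per-level splits

def multiscale_reconstruct(total, levels):
    """Aggregation equations: rebuild the fine-resolution counts."""
    x = np.array([total], dtype=np.int64)
    for parents, left in reversed(levels):
        children = np.empty(2 * parents.size, dtype=np.int64)
        children[0::2] = left
        children[1::2] = parents - left
        x = children
    return x

counts = np.array([3, 1, 4, 1, 5, 9, 2, 6])
total, levels = multiscale_decompose(counts)
recon = multiscale_reconstruct(total, levels)
```

Because each level's coefficients can be analyzed independently before the aggregation step, the per-coefficient analyses parallelize naturally, which is the source of the scalability noted above.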
Statistical models for animal movement provide tools that help ecologists and biologists learn how animals interact with their environment and each other. Efforts to develop increasingly realistic, implementable, and scientifically valuable methods for analyzing remotely observed trajectories have provided practitioners with a wide selection of models to help them understand animal behavior. Increasingly, researchers are interested in studying multiple animals jointly, which requires methods that can account for dependence across individuals. Dependence can arise for many reasons, including shared behavioral tendencies, familial relationships, and direct interactions on the landscape. We provide a synopsis of recent statistical methods for animal movement data applicable to settings in which inference is desired across multiple individuals. Highlights of these approaches include the ability to infer shared behavioral traits across a group of individuals and the ability to infer unobserved social networks summarizing dynamic relationships that manifest themselves in movement decisions.
Scharf, H., & Buderman, F. (2020). Animal movement models for multiple individuals. Wiley Interdisciplinary Reviews: Computational Statistics. https://doi.org/10.1002/wics.1506
Forecasting, especially high-dimensional forecasting, is increasingly sought after, particularly as computing resources grow in both size and speed. Flow field forecasting is a general-purpose, regression-based forecasting method that has recently been extended to high-dimensional settings. In this article, we provide an overview of the flow field forecasting methodology, with particular emphasis on settings where the number of candidate predictor variables is large, potentially larger than the number of observations.
Caudle, K. A., Fleming, P. S., & Hoover, R. (2020). A review of flow field forecasting: A high-dimensional forecasting procedure. Wiley Interdisciplinary Reviews: Computational Statistics. https://doi.org/10.1002/wics.1505
During the last 20–30 years, there has been remarkable growth in interest in approaches for stationary count time series. We consider popular classes of models for such time series, including thinning-based models, conditional regression models, and hidden Markov models. We review and compare important members of these model families with regard to stochastic properties such as the dispersion and autocorrelation structure. Our survey covers univariate and multivariate count data, as well as unbounded and bounded counts. We also discuss an illustrative data example. Besides this critical presentation of the current state of the art, we identify some existing challenges and opportunities for future research.
Weiß, C. (2020). Stationary count time series models. Wiley Interdisciplinary Reviews: Computational Statistics. https://doi.org/10.1002/wics.1502
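As a concrete member of the thinning-based class surveyed above, the Poisson INAR(1) model X_t = alpha o X_{t-1} + eps_t, with "o" denoting binomial thinning, can be simulated in a few lines. The function name and parameter values here are illustrative.

```python
import numpy as np

def simulate_inar1(n, alpha, lam, rng):
    """Simulate a Poisson INAR(1) series: X_t = alpha o X_{t-1} + eps_t,
    where 'o' is binomial thinning (each of the X_{t-1} individuals survives
    independently with probability alpha) and eps_t ~ Poisson(lam)."""
    x = np.empty(n, dtype=np.int64)
    x[0] = rng.poisson(lam / (1.0 - alpha))          # start at stationary mean
    for t in range(1, n):
        survivors = rng.binomial(x[t - 1], alpha)    # thinning alpha o X_{t-1}
        x[t] = survivors + rng.poisson(lam)          # innovation
    return x

rng = np.random.default_rng(42)
x = simulate_inar1(200_000, alpha=0.5, lam=2.0, rng=rng)
# The stationary distribution is Poisson(lam / (1 - alpha)), so the sample
# mean should be near 4 and the lag-1 autocorrelation near alpha = 0.5.
acf1 = np.corrcoef(x[:-1], x[1:])[0, 1]
```

The stationary distribution here is Poisson, hence equidispersed; the survey compares model families precisely through such dispersion and autocorrelation properties.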
Ideally, statistical parametric model fitting is followed by summary tables that show predictor contributions, visualizations that assess model assumptions and goodness of fit, and test statistics that compare models. In contrast, modern machine-learning fits are usually black-box in nature: they offer high-performing predictions but suffer from an interpretability deficit. We examine how the paradigm of conditional visualization can be used to address this, specifically to explain predictor contributions, assess goodness of fit, and compare multiple competing fits. We compare visualizations from techniques including trellis, condvis, visreg, lime, partial dependence, and ICE plots. Our examples use random forest fits, but all techniques presented are model-agnostic.
Hurley, C. (2020). Model exploration using conditional visualization. Wiley Interdisciplinary Reviews: Computational Statistics. https://doi.org/10.1002/wics.1503
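Partial dependence and ICE curves, two of the techniques compared above, are simple enough to sketch model-agnostically. The predict function below is a hypothetical stand-in for any fitted black box such as a random forest; the actual packages (condvis, visreg, and so on) add plotting and conditioning machinery not shown here.

```python
import numpy as np

def partial_dependence(predict, X, j, grid):
    """Model-agnostic partial dependence of feature j: substitute each grid
    value into every observation and average the predictions. The per-row
    curves before averaging are the ICE (individual conditional expectation)
    curves."""
    curves = np.empty((len(grid), X.shape[0]))
    for i, v in enumerate(grid):
        Xv = X.copy()
        Xv[:, j] = v                 # fix feature j at the grid value
        curves[i] = predict(Xv)
    return curves.mean(axis=1), curves   # PD curve and ICE curves

# Hypothetical black-box fit standing in for, e.g., a random forest.
def predict(A):
    return 2.0 * A[:, 0] + np.sin(A[:, 1])

rng = np.random.default_rng(1)
X = rng.uniform(-1.0, 1.0, size=(500, 2))
grid = np.linspace(-1.0, 1.0, 5)
pd0, ice0 = partial_dependence(predict, X, 0, grid)
# predict is additive in feature 0, so pd0 is linear in the grid with slope 2.
```

Divergence between the averaged PD curve and the individual ICE curves is what reveals interactions, which is one way conditional visualization exposes predictor contributions in a black-box fit.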