Ying Chen, T. Koch, K. Lim, Xiaofei Xu, Nazgul Zakiyeva
In this data-rich era, it is essential to develop advanced techniques to analyze and understand large amounts of data and to extract the underlying information in a flexible way. We provide a review of state-of-the-art statistical time series models for univariate and multivariate functional data with serial dependence. In particular, we review functional autoregressive (FAR) models and their variations under different scenarios. The models include the classic FAR model under stationarity; the FARX and pFAR models, which deal with multiple exogenous functional variables and large-scale mixed-type exogenous variables; the vector FAR model and the common functional principal component technique, which handle multidimensional functional time series; and the warping FAR, varying-coefficient FAR, and adaptive FAR models, which handle seasonal variations, slowly varying effects, and the more challenging cases of structural changes or breaks, respectively. We present the models' setup and detail the estimation procedures. We discuss the models' applicability and illustrate their numerical performance using real-world data of high-resolution natural gas flows in the high-pressure gas pipeline network of Germany. We conduct 1-day-ahead and 14-day-ahead out-of-sample forecasts of the daily gas flow curves and observe that the functional time series models generally deliver stable out-of-sample forecast accuracy.
{"title":"A review study of functional autoregressive models with application to energy forecasting","authors":"Ying Chen, T. Koch, K. Lim, Xiaofei Xu, Nazgul Zakiyeva","doi":"10.1002/wics.1525","DOIUrl":"https://doi.org/10.1002/wics.1525","url":null,"abstract":"In this data‐rich era, it is essential to develop advanced techniques to analyze and understand large amounts of data and extract the underlying information in a flexible way. We provide a review study on the state‐of‐the‐art statistical time series models for univariate and multivariate functional data with serial dependence. In particular, we review functional autoregressive (FAR) models and their variations under different scenarios. The models include the classic FAR model under stationarity; the FARX and pFAR model dealing with multiple exogenous functional variables and large‐scale mixed‐type exogenous variables; the vector FAR model and common functional principal component technique to handle multiple dimensional functional time series; and the warping FAR, varying coefficient‐FAR and adaptive FAR models to handle seasonal variations, slow varying effects and the more challenging cases of structural changes or breaks respectively. We present the models’ setup and detail the estimation procedure. We discuss the models’ applicability and illustrate the numerical performance using real‐world data of high‐resolution natural gas flows in the high‐pressure gas pipeline network of Germany. We conduct 1‐day and 14‐days‐ahead out‐of‐sample forecasts of the daily gas flow curves. We observe that the functional time series models generally produce stable out‐of‐sample forecast accuracy.","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":" ","pages":""},"PeriodicalIF":3.2,"publicationDate":"2020-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/wics.1525","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44328711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Testing for differences between two groups is one of the scenarios most often faced by scientists across all domains and is particularly important in the medical sciences and psychology. The traditional solution to this problem is rooted in the Neyman–Pearson theory of null hypothesis significance testing and uniformly most powerful tests. In the last decade, substantial progress has been made in developing Bayesian versions of the most common parametric and nonparametric two-sample tests, including Student's t-test and the Mann–Whitney U test. In this article, we review the underlying assumptions, models, and implications for research practice of these Bayesian two-sample tests and contrast them with the existing frequentist solutions. We also show that, in general, Bayesian and frequentist two-sample tests behave differently with regard to type I and type II error control, which needs to be carefully balanced in practical research.
{"title":"Bayesian and frequentist testing for differences between two groups with parametric and nonparametric two‐sample tests","authors":"Riko Kelter","doi":"10.1002/wics.1523","DOIUrl":"https://doi.org/10.1002/wics.1523","url":null,"abstract":"Testing for differences between two groups is one of the scenarios most often faced by scientists across all domains and is particularly important in the medical sciences and psychology. The traditional solution to this problem is rooted inside the Neyman–Pearson theory of null hypothesis significance testing and uniformly most powerful tests. In the last decade, a lot of progress has been made in developing Bayesian versions of the most common parametric and nonparametric two‐sample tests, including Student's t‐test and the Mann–Whitney U test. In this article, we review the underlying assumptions, models and implications for research practice of these Bayesian two‐sample tests and contrast them with the existing frequentist solutions. Also, we show that in general Bayesian and frequentist two‐sample tests have different behavior regarding the type I and II error control, which needs to be carefully balanced in practical research.","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":" ","pages":""},"PeriodicalIF":3.2,"publicationDate":"2020-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/wics.1523","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46617382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Digitization, the process of converting information into numbers, leads to bigger and more complex data sets, bigger also with respect to the number of measured variables. This makes it harder or even impossible for the practitioner to identify outliers, that is, observations that are inconsistent with an underlying model. Classical least-squares-based procedures can be affected by such outliers. In the regression context, this means that the parameter estimates are biased, with consequences for the validity of statistical inference, for regression diagnostics, and for prediction accuracy. Robust regression methods aim at assigning appropriate weights to observations that deviate from the model. While robust regression techniques are widely known in the low-dimensional case, researchers and practitioners may be less familiar with developments in this direction for high-dimensional data. Recently, different strategies have been proposed for robust regression in the high-dimensional case, typically based on dimension reduction, on shrinkage (including sparsity), and on combinations of such techniques. A very recent concept is to downweight single cells of the data matrix rather than complete observations, with the goal of making better use of the model-consistent information and thus achieving higher efficiency of the parameter estimates.
{"title":"Robust linear regression for high‐dimensional data: An overview","authors":"P. Filzmoser, K. Nordhausen","doi":"10.1002/wics.1524","DOIUrl":"https://doi.org/10.1002/wics.1524","url":null,"abstract":"Digitization as the process of converting information into numbers leads to bigger and more complex data sets, bigger also with respect to the number of measured variables. This makes it harder or impossible for the practitioner to identify outliers or observations that are inconsistent with an underlying model. Classical least‐squares based procedures can be affected by those outliers. In the regression context, this means that the parameter estimates are biased, with consequences on the validity of the statistical inference, on regression diagnostics, and on the prediction accuracy. Robust regression methods aim at assigning appropriate weights to observations that deviate from the model. While robust regression techniques are widely known in the low‐dimensional case, researchers and practitioners might still not be very familiar with developments in this direction for high‐dimensional data. Recently, different strategies have been proposed for robust regression in the high‐dimensional case, typically based on dimension reduction, on shrinkage, including sparsity, and on combinations of such techniques. A very recent concept is downweighting single cells of the data matrix rather than complete observations, with the goal to make better use of the model‐consistent information, and thus to achieve higher efficiency of the parameter estimates.","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":" ","pages":""},"PeriodicalIF":3.2,"publicationDate":"2020-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/wics.1524","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46937358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The classical modeling of spatial extremes relies on asymptotic models (i.e., max-stable or r-Pareto processes) for block maxima or peaks over high thresholds, respectively. However, at finite levels, empirical evidence often suggests that such asymptotic models are too rigidly constrained and do not adequately capture the frequent situation where more severe events tend to be spatially more localized. In other words, these asymptotic models have a strong tail dependence that persists at increasingly high levels, while data usually suggest that it should weaken instead. Another well-known limitation of classical spatial extremes models is that they are either computationally prohibitive to fit in high dimensions or need to be fitted using less efficient techniques. In this review paper, we describe recent progress in modeling and inference for spatial extremes, focusing on new models that have more flexible tail structures, can bridge asymptotic dependence classes, and are more easily amenable to likelihood-based inference for large datasets. In particular, we discuss various types of random scale constructions, as well as the conditional spatial extremes model, which have recently attracted increasing attention within the statistics-of-extremes community. We illustrate some of these new spatial models on two different environmental applications.
{"title":"Advances in statistical modeling of spatial extremes","authors":"Raphael Huser, J. Wadsworth","doi":"10.1002/wics.1537","DOIUrl":"https://doi.org/10.1002/wics.1537","url":null,"abstract":"The classical modeling of spatial extremes relies on asymptotic models (i.e., max‐stable or r‐Pareto processes) for block maxima or peaks over high thresholds, respectively. However, at finite levels, empirical evidence often suggests that such asymptotic models are too rigidly constrained, and that they do not adequately capture the frequent situation where more severe events tend to be spatially more localized. In other words, these asymptotic models have a strong tail dependence that persists at increasingly high levels, while data usually suggest that it should weaken instead. Another well‐known limitation of classical spatial extremes models is that they are either computationally prohibitive to fit in high dimensions, or they need to be fitted using less efficient techniques. In this review paper, we describe recent progress in the modeling and inference for spatial extremes, focusing on new models that have more flexible tail structures that can bridge asymptotic dependence classes, and that are more easily amenable to likelihood‐based inference for large datasets. In particular, we discuss various types of random scale constructions, as well as the conditional spatial extremes model, which have recently been getting increasing attention within the statistics of extremes community. We illustrate some of these new spatial models on two different environmental applications.","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":" ","pages":""},"PeriodicalIF":3.2,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/wics.1537","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45974489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Group selection arises naturally in many statistical modeling problems, and several group selection methods have been proposed in the last two decades. In this paper, we review Bayesian group selection approaches for linear regression models. We start from the Bayesian indicator approach and then move to Bayesian group LASSO methods. In addition, we consider Bayesian methods for sparse group selection, which can be treated as an extension of group selection. Finally, we mention extensions of Bayesian group selection to generalized linear models and multiple-response models.
{"title":"A review of Bayesian group selection approaches for linear regression models","authors":"Wei Lai, Ray‐Bing Chen","doi":"10.1002/wics.1513","DOIUrl":"https://doi.org/10.1002/wics.1513","url":null,"abstract":"Grouping selection arises naturally in many statistical modeling problems. Several group selection methods have been proposed in the last two decades. In this paper, we review the Bayesian group selection approaches for linear regression models. We start from the Bayesian indicator approach and then move to the Bayesian group LASSO methods. In addition, we also consider the Bayesian methods for the sparse group selection that can be treated as an extension of the group selection. Finally, we mention some extensions of Bayesian group selection for the generalized linear models and the multiple response models.","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":" ","pages":""},"PeriodicalIF":3.2,"publicationDate":"2020-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/wics.1513","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43610944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sufficient dimension reduction, introduced by Li, has seen steady development over the past 30 years in both methodology and application. The main approaches can be categorized into two groups: inverse regression methods and forward regression methods. In this survey, we briefly discuss advances in these methods and present problems in the second group that need further investigation.
{"title":"Advance of the sufficient dimension reduction","authors":"Weiqiang Hang, Yingcun Xia","doi":"10.1002/wics.1516","DOIUrl":"https://doi.org/10.1002/wics.1516","url":null,"abstract":"The sufficient dimension reduction of Li has been seen a steady development in the past 30 years in both methodology and application. The main approaches can be categorized into two groups: The inverse regression methods and forward regression methods. In this survey, we briefly discuss advances of methods and present problems that needs further investigation in the second group.","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":" ","pages":""},"PeriodicalIF":3.2,"publicationDate":"2020-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/wics.1516","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42879477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this article, we provide a comprehensive review of space–time covariance functions. As the spatial domain, we consider either d-dimensional Euclidean space or the unit d-dimensional sphere. We start by providing background information about (spatial) covariance functions and their properties, along with different types of covariance functions. While we focus primarily on Gaussian processes, many of the results are independent of the underlying distribution, as the covariance only depends on second-moment relationships. We discuss properties of space–time covariance functions along with the relevant results associated with spectral representations. Special attention is given to the Gneiting class of covariance functions, which has been especially popular in space–time geostatistical modeling. We then discuss some techniques that are useful for constructing new classes of space–time covariance functions. Separate treatment is reserved for spectral models, as well as for what are termed models with special features. We also discuss the problem of estimating parametric classes of space–time covariance functions. An outlook concludes the paper.
{"title":"30 Years of space–time covariance functions","authors":"E. Porcu, R. Furrer, D. Nychka","doi":"10.1002/wics.1512","DOIUrl":"https://doi.org/10.1002/wics.1512","url":null,"abstract":"In this article, we provide a comprehensive review of space–time covariance functions. As for the spatial domain, we focus on either the d‐dimensional Euclidean space or on the unit d‐dimensional sphere. We start by providing background information about (spatial) covariance functions and their properties along with different types of covariance functions. While we focus primarily on Gaussian processes, many of the results are independent of the underlying distribution, as the covariance only depends on second‐moment relationships. We discuss properties of space–time covariance functions along with the relevant results associated with spectral representations. Special attention is given to the Gneiting class of covariance functions, which has been especially popular in space–time geostatistical modeling. We then discuss some techniques that are useful for constructing new classes of space–time covariance functions. Separate treatment is reserved for spectral models, as well as to what are termed models with special features. We also discuss the problem of estimation of parametric classes of space–time covariance functions. An outlook concludes the paper.","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":" ","pages":""},"PeriodicalIF":3.2,"publicationDate":"2020-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/wics.1512","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45698321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2020-05-01. Epub Date: 2020-01-17. DOI: 10.1002/wics.1495
Karthik Bharath, Sebastian Kurtek
The proliferation of high-resolution imaging data in recent years has led to substantial improvements in the two popular approaches for analyzing shapes of data objects, based on landmarks and/or continuous curves. We provide an expository account of elastic shape analysis of parametric planar curves representing shapes of two-dimensional (2D) objects by discussing its differences from, and its commonalities with, the landmark-based approach. Particular attention is accorded to the role of reparameterization of a curve, which, in addition to rotation, scaling, and translation, represents an important shape-preserving transformation of a curve. The transition to the curve-based approach moves the mathematical setting of shape analysis from finite-dimensional non-Euclidean spaces to infinite-dimensional ones. We discuss some of the challenges associated with the infinite-dimensionality of the shape space, and we illustrate the use of geometry-based methods in the computation of intrinsic statistical summaries and in the definition of statistical models on a 2D imaging dataset consisting of mouse vertebrae. We conclude with an overview of the current state of the art in the field.
{"title":"Analysis of shape data: From landmarks to elastic curves.","authors":"Karthik Bharath, Sebastian Kurtek","doi":"10.1002/wics.1495","DOIUrl":"10.1002/wics.1495","url":null,"abstract":"<p><p>Proliferation of high-resolution imaging data in recent years has led to sub-stantial improvements in the two popular approaches for analyzing shapes of data objects based on landmarks and/or continuous curves. We provide an expository account of elastic shape analysis of parametric planar curves representing shapes of two-dimensional (2D) objects by discussing its differences, and its commonalities, to the landmark-based approach. Particular attention is accorded to the role of reparameterization of a curve, which in addition to rotation, scaling and translation, represents an important shape-preserving transformation of a curve. The transition to the curve-based approach moves the mathematical setting of shape analysis from finite-dimensional non-Euclidean spaces to infinite-dimensional ones. We discuss some of the challenges associated with the infinite-dimensionality of the shape space, and illustrate the use of geometry-based methods in the computation of intrinsic statistical summaries and in the definition of statistical models on a 2D imaging dataset consisting of mouse vertebrae. We conclude with an overview of the current state-of-the-art in the field.</p>","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":"12 3","pages":""},"PeriodicalIF":3.2,"publicationDate":"2020-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8357314/pdf/nihms-1704274.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39316954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
We provide a comprehensive overview of adversarial machine learning, focusing on two application domains: cybersecurity and computer vision. Research in adversarial machine learning addresses a significant threat to the wide application of machine learning techniques, namely that they are vulnerable to carefully crafted attacks from malicious adversaries. For example, deep neural networks fail to correctly classify adversarial images, which are generated by adding imperceptible perturbations to clean images. We first discuss three main categories of attacks against machine learning techniques: poisoning attacks, evasion attacks, and privacy attacks. We then introduce the corresponding defense approaches along with the weaknesses and limitations of existing defenses. We note that adversarial samples in cybersecurity and computer vision are fundamentally different: while adversarial samples in cybersecurity often have different properties/distributions compared with the training data, adversarial images in computer vision are created with minor input perturbations. This further complicates the development of robust learning techniques, because a robust learning technique must withstand different types of attacks.
{"title":"Adversarial machine learning for cybersecurity and computer vision: Current developments and challenges","authors":"B. Xi","doi":"10.1002/wics.1511","DOIUrl":"https://doi.org/10.1002/wics.1511","url":null,"abstract":"We provide a comprehensive overview of adversarial machine learning focusing on two application domains, that is, cybersecurity and computer vision. Research in adversarial machine learning addresses a significant threat to the wide application of machine learning techniques—they are vulnerable to carefully crafted attacks from malicious adversaries. For example, deep neural networks fail to correctly classify adversarial images, which are generated by adding imperceptible perturbations to clean images. We first discuss three main categories of attacks against machine learning techniques—poisoning attacks, evasion attacks, and privacy attacks. Then the corresponding defense approaches are introduced along with the weakness and limitations of the existing defense approaches. We notice adversarial samples in cybersecurity and computer vision are fundamentally different. While adversarial samples in cybersecurity often have different properties/distributions compared with training data, adversarial images in computer vision are created with minor input perturbations. This further complicates the development of robust learning techniques, because a robust learning technique must withstand different types of attacks.","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":" ","pages":""},"PeriodicalIF":3.2,"publicationDate":"2020-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/wics.1511","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43224376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}