首页 > 最新文献

Acta Numerica最新文献

英文 中文
Turnpike in optimal control of PDEs, ResNets, and beyond PDE、ResNets及其他优化控制中的收费公路
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2022-02-08 DOI: 10.1017/S0962492922000046
Borjan Geshkovski, E. Zuazua
The turnpike property in contemporary macroeconomics asserts that if an economic planner seeks to move an economy from one level of capital to another, then the most efficient path, as long as the planner has enough time, is to rapidly move stock to a level close to the optimal stationary or constant path, then allow for capital to develop along that path until the desired term is nearly reached, at which point the stock ought to be moved to the final target. Motivated in part by its nature as a resource allocation strategy, over the past decade, the turnpike property has also been shown to hold for several classes of partial differential equations arising in mechanics. When formalized mathematically, the turnpike theory corroborates insights from economics: for an optimal control problem set in a finite-time horizon, optimal controls and corresponding states are close (often exponentially) most of the time, except near the initial and final times, to the optimal control and the corresponding state for the associated stationary optimal control problem. In particular, the former are mostly constant over time. This fact provides a rigorous meaning to the asymptotic simplification that some optimal control problems appear to enjoy over long time intervals, allowing the consideration of the corresponding stationary problem for computing and applications. We review a slice of the theory developed over the past decade – the controllability of the underlying system is an important ingredient, and can even be used to devise simple turnpike-like strategies which are nearly optimal – and present several novel applications, including, among many others, the characterization of Hamilton–Jacobi–Bellman asymptotics, and stability estimates in deep learning via residual neural networks.
收费高速公路属性在当代宏观经济学断言,如果一个经济规划者试图将经济从一个水平的资本,那么最有效的路径,只要计划有足够的时间,迅速将股票水平接近最优固定或固定路径,然后允许资本发展沿着这条路直到所需的术语几乎达到了,此时的股票应该搬到最终的目标。在过去的十年中,由于其作为一种资源分配策略的性质,收费公路性质也被证明适用于力学中出现的几种偏微分方程。当以数学形式形式化时,收费公路理论证实了经济学的见解:对于有限时间范围内的最优控制问题集,除了接近初始和最终时间外,大多数时候最优控制和相应状态与相关平稳最优控制问题的最优控制和相应状态接近(通常是指数级)。特别是,前者随着时间的推移基本上是不变的。这一事实为一些最优控制问题在长时间间隔内似乎享有的渐近简化提供了严格的意义,允许考虑相应的计算和应用的平稳问题。我们回顾了过去十年来发展起来的理论的一部分-底层系统的可控制性是一个重要的组成部分,甚至可以用来设计简单的近似于最优的收费公路策略-并提出了几个新的应用,包括,在许多其他方面,汉密尔顿-雅可比-贝尔曼渐近的表征,以及通过残差神经网络进行深度学习的稳定性估计。
{"title":"Turnpike in optimal control of PDEs, ResNets, and beyond","authors":"Borjan Geshkovski, E. Zuazua","doi":"10.1017/S0962492922000046","DOIUrl":"https://doi.org/10.1017/S0962492922000046","url":null,"abstract":"The turnpike property in contemporary macroeconomics asserts that if an economic planner seeks to move an economy from one level of capital to another, then the most efficient path, as long as the planner has enough time, is to rapidly move stock to a level close to the optimal stationary or constant path, then allow for capital to develop along that path until the desired term is nearly reached, at which point the stock ought to be moved to the final target. Motivated in part by its nature as a resource allocation strategy, over the past decade, the turnpike property has also been shown to hold for several classes of partial differential equations arising in mechanics. When formalized mathematically, the turnpike theory corroborates insights from economics: for an optimal control problem set in a finite-time horizon, optimal controls and corresponding states are close (often exponentially) most of the time, except near the initial and final times, to the optimal control and the corresponding state for the associated stationary optimal control problem. In particular, the former are mostly constant over time. This fact provides a rigorous meaning to the asymptotic simplification that some optimal control problems appear to enjoy over long time intervals, allowing the consideration of the corresponding stationary problem for computing and applications. We review a slice of the theory developed over the past decade – the controllability of the underlying system is an important ingredient, and can even be used to devise simple turnpike-like strategies which are nearly optimal – and present several novel applications, including, among many others, the characterization of Hamilton–Jacobi–Bellman asymptotics, and stability estimates in deep learning via residual neural networks.","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"31 1","pages":"135 - 263"},"PeriodicalIF":14.2,"publicationDate":"2022-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46313032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Control of port-Hamiltonian differential-algebraic systems and applications 波特-哈密顿微分代数系统的控制及其应用
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2022-01-17 DOI: 10.1017/S0962492922000083
V. Mehrmann, B. Unger
We discuss the modelling framework of port-Hamiltonian descriptor systems and their use in numerical simulation and control. The structure is ideal for automated network-based modelling since it is invariant under power-conserving interconnection, congruence transformations and Galerkin projection. Moreover, stability and passivity properties are easily shown. Condensed forms under orthogonal transformations present easy analysis tools for existence, uniqueness, regularity and numerical methods to check these properties. After recalling the concepts for general linear and nonlinear descriptor systems, we demonstrate that many difficulties that arise in general descriptor systems can be easily overcome within the port-Hamiltonian framework. The properties of port-Hamiltonian descriptor systems are analysed, and time discretization and numerical linear algebra techniques are discussed. Structure-preserving regularization procedures for descriptor systems are presented to make them suitable for simulation and control. Model reduction techniques that preserve the structure and stabilization and optimal control techniques are discussed. The properties of port-Hamiltonian descriptor systems and their use in modelling simulation and control methods are illustrated with several examples from different physical domains. The survey concludes with open problems and research topics that deserve further attention.
讨论了端口-哈密顿广义系统的建模框架及其在数值仿真和控制中的应用。由于该结构在节能互连、同余变换和伽辽金投影下是不变的,因此它是基于自动网络建模的理想结构。此外,稳定性和无源性也很容易显示。正交变换下的凝聚形式提供了简便的存在性、唯一性、正则性分析工具和检验这些性质的数值方法。在回顾一般线性和非线性描述系统的概念后,我们证明了在一般描述系统中出现的许多困难可以在port- hamilton框架内轻松克服。分析了波特-哈密顿广义系统的性质,讨论了时间离散化和数值线性代数技术。为了使广义系统适合于仿真和控制,提出了保持结构的正则化方法。讨论了保持结构稳定的模型简化技术和最优控制技术。通过不同物理域的实例说明了端口-哈密顿广义系统的性质及其在建模、仿真和控制方法中的应用。调查总结了一些有待进一步关注的问题和研究课题。
{"title":"Control of port-Hamiltonian differential-algebraic systems and applications","authors":"V. Mehrmann, B. Unger","doi":"10.1017/S0962492922000083","DOIUrl":"https://doi.org/10.1017/S0962492922000083","url":null,"abstract":"We discuss the modelling framework of port-Hamiltonian descriptor systems and their use in numerical simulation and control. The structure is ideal for automated network-based modelling since it is invariant under power-conserving interconnection, congruence transformations and Galerkin projection. Moreover, stability and passivity properties are easily shown. Condensed forms under orthogonal transformations present easy analysis tools for existence, uniqueness, regularity and numerical methods to check these properties. After recalling the concepts for general linear and nonlinear descriptor systems, we demonstrate that many difficulties that arise in general descriptor systems can be easily overcome within the port-Hamiltonian framework. The properties of port-Hamiltonian descriptor systems are analysed, and time discretization and numerical linear algebra techniques are discussed. Structure-preserving regularization procedures for descriptor systems are presented to make them suitable for simulation and control. Model reduction techniques that preserve the structure and stabilization and optimal control techniques are discussed. The properties of port-Hamiltonian descriptor systems and their use in modelling simulation and control methods are illustrated with several examples from different physical domains. The survey concludes with open problems and research topics that deserve further attention.","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"32 1","pages":"395 - 515"},"PeriodicalIF":14.2,"publicationDate":"2022-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46524517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 36
Asymptotic-preserving schemes for multiscale physical problems 多尺度物理问题的渐近保持格式
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2021-12-11 DOI: 10.1017/S0962492922000010
Shi Jin
We present the asymptotic transitions from microscopic to macroscopic physics, their computational challenges and the asymptotic-preserving (AP) strategies to compute multiscale physical problems efficiently. Specifically, we will first study the asymptotic transition from quantum to classical mechanics, from classical mechanics to kinetic theory, and then from kinetic theory to hydrodynamics. We then review some representative AP schemes that mimic these asymptotic transitions at the discrete level, and hence can be used crossing scales and, in particular, capture the macroscopic behaviour without resolving the microscopic physical scale numerically.
我们提出了从微观到宏观物理的渐近转换,它们的计算挑战以及有效计算多尺度物理问题的渐近保持(AP)策略。具体来说,我们将首先研究从量子力学到经典力学,从经典力学到运动论,然后从运动论到流体力学的渐近跃迁。然后,我们回顾了一些具有代表性的AP方案,这些方案在离散水平上模拟这些渐近转换,因此可以用于跨尺度,特别是在不解决微观物理尺度的情况下捕获宏观行为。
{"title":"Asymptotic-preserving schemes for multiscale physical problems","authors":"Shi Jin","doi":"10.1017/S0962492922000010","DOIUrl":"https://doi.org/10.1017/S0962492922000010","url":null,"abstract":"We present the asymptotic transitions from microscopic to macroscopic physics, their computational challenges and the asymptotic-preserving (AP) strategies to compute multiscale physical problems efficiently. Specifically, we will first study the asymptotic transition from quantum to classical mechanics, from classical mechanics to kinetic theory, and then from kinetic theory to hydrodynamics. We then review some representative AP schemes that mimic these asymptotic transitions at the discrete level, and hence can be used crossing scales and, in particular, capture the macroscopic behaviour without resolving the microscopic physical scale numerically.","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"31 1","pages":"415 - 489"},"PeriodicalIF":14.2,"publicationDate":"2021-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43359804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
ANU volume 30 Cover and Front matter 澳大利亚国立大学第30卷封面和封面问题
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2021-05-01 DOI: 10.1017/s096249292100009x
R. Altmann, P. Henning, D. Peterseim, P. Bartlett, A. Montanari, A. Rakhlin, Ronald A. DeVore, B. Hanin
{"title":"ANU volume 30 Cover and Front matter","authors":"R. Altmann, P. Henning, D. Peterseim, P. Bartlett, A. Montanari, A. Rakhlin, Ronald A. DeVore, B. Hanin","doi":"10.1017/s096249292100009x","DOIUrl":"https://doi.org/10.1017/s096249292100009x","url":null,"abstract":"","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"30 1","pages":"f1 - f6"},"PeriodicalIF":14.2,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47797032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Numerical homogenization beyond scale separation 超越尺度分离的数值均质化
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2021-05-01 DOI: 10.1017/S0962492921000015
R. Altmann, P. Henning, D. Peterseim
Numerical homogenization is a methodology for the computational solution of multiscale partial differential equations. It aims at reducing complex large-scale problems to simplified numerical models valid on some target scale of interest, thereby accounting for the impact of features on smaller scales that are otherwise not resolved. While constructive approaches in the mathematical theory of homogenization are restricted to problems with a clear scale separation, modern numerical homogenization methods can accurately handle problems with a continuum of scales. This paper reviews such approaches embedded in a historical context and provides a unified variational framework for their design and numerical analysis. Apart from prototypical elliptic model problems, the class of partial differential equations covered here includes wave scattering in heterogeneous media and serves as a template for more general multi-physics problems.
数值均匀化是求解多尺度偏微分方程的一种方法。它旨在将复杂的大规模问题简化为在某些感兴趣的目标尺度上有效的简化数值模型,从而考虑到在较小尺度上无法解决的特征的影响。均匀化数学理论中的建设性方法仅限于具有明确尺度分离的问题,而现代数值均匀化方法可以准确地处理具有连续尺度的问题。本文回顾了嵌入在历史背景下的这些方法,并为其设计和数值分析提供了统一的变分框架。除了典型的椭圆模型问题外,这里所涵盖的一类偏微分方程还包括非均匀介质中的波散射,并作为更一般的多物理场问题的模板。
{"title":"Numerical homogenization beyond scale separation","authors":"R. Altmann, P. Henning, D. Peterseim","doi":"10.1017/S0962492921000015","DOIUrl":"https://doi.org/10.1017/S0962492921000015","url":null,"abstract":"Numerical homogenization is a methodology for the computational solution of multiscale partial differential equations. It aims at reducing complex large-scale problems to simplified numerical models valid on some target scale of interest, thereby accounting for the impact of features on smaller scales that are otherwise not resolved. While constructive approaches in the mathematical theory of homogenization are restricted to problems with a clear scale separation, modern numerical homogenization methods can accurately handle problems with a continuum of scales. This paper reviews such approaches embedded in a historical context and provides a unified variational framework for their design and numerical analysis. Apart from prototypical elliptic model problems, the class of partial differential equations covered here includes wave scattering in heterogeneous media and serves as a template for more general multi-physics problems.","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"30 1","pages":"1 - 86"},"PeriodicalIF":14.2,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48106578","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 44
Optimal transportation, modelling and numerical simulation 最优运输,建模和数值模拟
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2021-05-01 DOI: 10.1017/S0962492921000040
J. Benamou
We present an overviewof the basic theory, modern optimal transportation extensions and recent algorithmic advances. Selected modelling and numerical applications illustrate the impact of optimal transportation in numerical analysis.
本文综述了最优交通的基本理论、现代最优交通的扩展和最新的算法进展。所选的模型和数值应用说明了数值分析中最优运输的影响。
{"title":"Optimal transportation, modelling and numerical simulation","authors":"J. Benamou","doi":"10.1017/S0962492921000040","DOIUrl":"https://doi.org/10.1017/S0962492921000040","url":null,"abstract":"We present an overviewof the basic theory, modern optimal transportation extensions and recent algorithmic advances. Selected modelling and numerical applications illustrate the impact of optimal transportation in numerical analysis.","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"30 1","pages":"249 - 325"},"PeriodicalIF":14.2,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41848083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Tensors in computations 计算中的张量
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2021-05-01 DOI: 10.1017/S0962492921000076
Lek-Heng Lim
The notion of a tensor captures three great ideas: equivariance, multilinearity, separability. But trying to be three things at once makes the notion difficult to understand. We will explain tensors in an accessible and elementary way through the lens of linear algebra and numerical linear algebra, elucidated with examples from computational and applied mathematics.
张量的概念包含了三个伟大的概念:等变性、多线性和可分性。但是,试图同时成为三件事会使这个概念难以理解。我们将通过线性代数和数值线性代数的视角,通过计算数学和应用数学的例子,以一种简单易懂的方式解释张量。
{"title":"Tensors in computations","authors":"Lek-Heng Lim","doi":"10.1017/S0962492921000076","DOIUrl":"https://doi.org/10.1017/S0962492921000076","url":null,"abstract":"The notion of a tensor captures three great ideas: equivariance, multilinearity, separability. But trying to be three things at once makes the notion difficult to understand. We will explain tensors in an accessible and elementary way through the lens of linear algebra and numerical linear algebra, elucidated with examples from computational and applied mathematics.","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"30 1","pages":"555 - 764"},"PeriodicalIF":14.2,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44930203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
Learning physics-based models from data: perspectives from inverse problems and model reduction 从数据中学习基于物理的模型:从反问题和模型简化的角度
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2021-05-01 DOI: 10.1017/S0962492921000064
O. Ghattas, K. Willcox
This article addresses the inference of physics models from data, from the perspectives of inverse problems and model reduction. These fields develop formulations that integrate data into physics-based models while exploiting the fact that many mathematical models of natural and engineered systems exhibit an intrinsically low-dimensional solution manifold. In inverse problems, we seek to infer uncertain components of the inputs from observations of the outputs, while in model reduction we seek low-dimensional models that explicitly capture the salient features of the input–output map through approximation in a low-dimensional subspace. In both cases, the result is a predictive model that reflects data-driven learning yet deeply embeds the underlying physics, and thus can be used for design, control and decision-making, often with quantified uncertainties. We highlight recent developments in scalable and efficient algorithms for inverse problems and model reduction governed by large-scale models in the form of partial differential equations. Several illustrative applications to large-scale complex problems across different domains of science and engineering are provided.
本文从反问题和模型约简的角度,讨论了物理模型的数据推理。这些领域开发了将数据集成到基于物理的模型中的公式,同时利用了许多自然和工程系统的数学模型表现出本质上低维解流形的事实。在反问题中,我们试图从输出的观察中推断输入的不确定成分,而在模型约简中,我们寻求通过在低维子空间中的近似来明确捕获输入-输出映射的显著特征的低维模型。在这两种情况下,结果都是一个预测模型,它反映了数据驱动的学习,但又深深嵌入了底层物理,因此可以用于设计、控制和决策,通常具有量化的不确定性。我们强调了最近在可扩展和高效算法的发展,用于逆问题和由偏微分方程形式的大规模模型控制的模型约简。提供了在不同科学和工程领域的大规模复杂问题的几个说明性应用。
{"title":"Learning physics-based models from data: perspectives from inverse problems and model reduction","authors":"O. Ghattas, K. Willcox","doi":"10.1017/S0962492921000064","DOIUrl":"https://doi.org/10.1017/S0962492921000064","url":null,"abstract":"This article addresses the inference of physics models from data, from the perspectives of inverse problems and model reduction. These fields develop formulations that integrate data into physics-based models while exploiting the fact that many mathematical models of natural and engineered systems exhibit an intrinsically low-dimensional solution manifold. In inverse problems, we seek to infer uncertain components of the inputs from observations of the outputs, while in model reduction we seek low-dimensional models that explicitly capture the salient features of the input–output map through approximation in a low-dimensional subspace. In both cases, the result is a predictive model that reflects data-driven learning yet deeply embeds the underlying physics, and thus can be used for design, control and decision-making, often with quantified uncertainties. We highlight recent developments in scalable and efficient algorithms for inverse problems and model reduction governed by large-scale models in the form of partial differential equations. Several illustrative applications to large-scale complex problems across different domains of science and engineering are provided.","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"30 1","pages":"445 - 554"},"PeriodicalIF":14.2,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47260397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 69
Fit without fear: remarkable mathematical phenomena of deep learning through the prism of interpolation 无所畏惧:通过插补棱镜的深度学习的非凡数学现象
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2021-05-01 DOI: 10.1017/S0962492921000039
M. Belkin
In the past decade the mathematical theory of machine learning has lagged far behind the triumphs of deep neural networks on practical challenges. However, the gap between theory and practice is gradually starting to close. In this paper I will attempt to assemble some pieces of the remarkable and still incomplete mathematical mosaic emerging from the efforts to understand the foundations of deep learning. The two key themes will be interpolation and its sibling over-parametrization. Interpolation corresponds to fitting data, even noisy data, exactly. Over-parametrization enables interpolation and provides flexibility to select a suitable interpolating model. As we will see, just as a physical prism separates colours mixed within a ray of light, the figurative prism of interpolation helps to disentangle generalization and optimization properties within the complex picture of modern machine learning. This article is written in the belief and hope that clearer understanding of these issues will bring us a step closer towards a general theory of deep learning and machine learning.
在过去的十年里,机器学习的数学理论在实践挑战方面远远落后于深度神经网络的成功。然而,理论与实践之间的差距正在逐渐缩小。在这篇论文中,我将尝试收集一些引人注目但仍然不完整的数学拼图,这些拼图是在理解深度学习的基础的努力中产生的。两个关键主题将是插值及其兄弟参数化。插值精确地对应于拟合数据,甚至是噪声数据。过参数化实现插值,并提供选择合适插值模型的灵活性。正如我们将看到的,正如物理棱镜分离光线中混合的颜色一样,插值的具象棱镜有助于在现代机器学习的复杂画面中理清泛化和优化特性。本文相信并希望对这些问题的更清晰理解将使我们离深度学习和机器学习的一般理论更近一步。
{"title":"Fit without fear: remarkable mathematical phenomena of deep learning through the prism of interpolation","authors":"M. Belkin","doi":"10.1017/S0962492921000039","DOIUrl":"https://doi.org/10.1017/S0962492921000039","url":null,"abstract":"In the past decade the mathematical theory of machine learning has lagged far behind the triumphs of deep neural networks on practical challenges. However, the gap between theory and practice is gradually starting to close. In this paper I will attempt to assemble some pieces of the remarkable and still incomplete mathematical mosaic emerging from the efforts to understand the foundations of deep learning. The two key themes will be interpolation and its sibling over-parametrization. Interpolation corresponds to fitting data, even noisy data, exactly. Over-parametrization enables interpolation and provides flexibility to select a suitable interpolating model. As we will see, just as a physical prism separates colours mixed within a ray of light, the figurative prism of interpolation helps to disentangle generalization and optimization properties within the complex picture of modern machine learning. This article is written in the belief and hope that clearer understanding of these issues will bring us a step closer towards a general theory of deep learning and machine learning.","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"30 1","pages":"203 - 248"},"PeriodicalIF":14.2,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46793783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 116
ANU volume 30 Cover and Back matter 澳大利亚国立大学第30卷封面和封底
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2021-05-01 DOI: 10.1017/s0962492921000106
{"title":"ANU volume 30 Cover and Back matter","authors":"","doi":"10.1017/s0962492921000106","DOIUrl":"https://doi.org/10.1017/s0962492921000106","url":null,"abstract":"","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"30 1","pages":"b1 - b1"},"PeriodicalIF":14.2,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46952396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Acta Numerica
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1