Ronald DeVore , Robert D. Nowak , Rahul Parhi , Jonathan W. Siegel
{"title":"Weighted variation spaces and approximation by shallow ReLU networks","authors":"Ronald DeVore , Robert D. Nowak , Rahul Parhi , Jonathan W. Siegel","doi":"10.1016/j.acha.2024.101713","DOIUrl":null,"url":null,"abstract":"<div><div>We investigate the approximation of functions <em>f</em> on a bounded domain <span><math><mi>Ω</mi><mo>⊂</mo><msup><mrow><mi>R</mi></mrow><mrow><mi>d</mi></mrow></msup></math></span> by the outputs of single-hidden-layer ReLU neural networks of width <em>n</em>. This form of nonlinear <em>n</em>-term dictionary approximation has been intensely studied since it is the simplest case of neural network approximation (NNA). There are several celebrated approximation results for this form of NNA that introduce novel model classes of functions on Ω whose approximation rates do not grow unbounded with the input dimension. These novel classes include Barron classes, and classes based on sparsity or variation such as the Radon-domain BV classes. The present paper is concerned with the definition of these novel model classes on domains Ω. The current definition of these model classes does not depend on the domain Ω. A new and more proper definition of model classes on domains is given by introducing the concept of weighted variation spaces. These new model classes are intrinsic to the domain itself. The importance of these new model classes is that they are strictly larger than the classical (domain-independent) classes. Yet, it is shown that they maintain the same NNA rates.</div></div>","PeriodicalId":55504,"journal":{"name":"Applied and Computational Harmonic Analysis","volume":"74 ","pages":"Article 101713"},"PeriodicalIF":2.6000,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied and Computational Harmonic Analysis","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1063520324000903","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
We investigate the approximation of functions f on a bounded domain by the outputs of single-hidden-layer ReLU neural networks of width n. This form of nonlinear n-term dictionary approximation has been intensely studied since it is the simplest case of neural network approximation (NNA). There are several celebrated approximation results for this form of NNA that introduce novel model classes of functions on Ω whose approximation rates do not grow unbounded with the input dimension. These novel classes include Barron classes, and classes based on sparsity or variation such as the Radon-domain BV classes. The present paper is concerned with the definition of these novel model classes on domains Ω. The current definition of these model classes does not depend on the domain Ω. A new and more proper definition of model classes on domains is given by introducing the concept of weighted variation spaces. These new model classes are intrinsic to the domain itself. The importance of these new model classes is that they are strictly larger than the classical (domain-independent) classes. Yet, it is shown that they maintain the same NNA rates.
我们研究了宽度为 n 的单隐层 ReLU 神经网络输出对有界域 Ω⊂Rd 上函数 f 的逼近。这种形式的 NNA 有几个著名的逼近结果,它们引入了 Ω 上函数的新模型类,其逼近率不会随着输入维度的增加而无限制地增长。这些新类包括巴伦类,以及基于稀疏性或变化的类,如拉顿域 BV 类。目前这些模型类的定义并不依赖于域 Ω。通过引入加权变异空间的概念,我们给出了关于域上模型类的更恰当的新定义。这些新的模型类是领域本身所固有的。这些新模型类的重要性在于,它们严格来说比经典(与域无关)类大。然而,研究表明它们保持了相同的 NNA 率。
期刊介绍:
Applied and Computational Harmonic Analysis (ACHA) is an interdisciplinary journal that publishes high-quality papers in all areas of mathematical sciences related to the applied and computational aspects of harmonic analysis, with special emphasis on innovative theoretical development, methods, and algorithms, for information processing, manipulation, understanding, and so forth. The objectives of the journal are to chronicle the important publications in the rapidly growing field of data representation and analysis, to stimulate research in relevant interdisciplinary areas, and to provide a common link among mathematical, physical, and life scientists, as well as engineers.