Approximation of Lipschitz Functions using Deep Spline Neural Networks

Journal: SIAM Journal on Mathematics of Data Science (IF 1.9; Q1, Mathematics, Applied); Pub Date: 2022-04-13; DOI: 10.48550/arXiv.2204.06233

Sebastian Neumayer, Alexis Goujon, Pakshal Bohra, M. Unser

Citations: 14

Abstract

Lipschitz-constrained neural networks have many applications in machine learning. Since designing and training expressive Lipschitz-constrained networks is very challenging, there is a need for improved methods and a better theoretical understanding. Unfortunately, it turns out that ReLU networks have provable disadvantages in this setting. Hence, we propose to use learnable spline activation functions with at least 3 linear regions instead. We prove that this choice is optimal among all component-wise $1$-Lipschitz activation functions in the sense that no other weight-constrained architecture can approximate a larger class of functions. Additionally, this choice is at least as expressive as the recently introduced non-component-wise Groupsort activation function for spectral-norm-constrained weights. Previously published numerical results support our theoretical findings.
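To make the central object of the abstract concrete, the following is a minimal sketch of a component-wise, 1-Lipschitz linear spline activation with at least 3 linear regions. It is an illustration only and not the authors' parameterization: the names `clip_slope` and `make_spline_activation`, the knot/slope/bias representation, and the choice to enforce the Lipschitz bound by clipping slopes to [-1, 1] are all assumptions made here. In a trainable network the knots, slopes, and bias would be the learnable parameters.

```python
from typing import Callable, Sequence


def clip_slope(s: float) -> float:
    """Project a slope onto [-1, 1] so that the linear piece is 1-Lipschitz."""
    return max(-1.0, min(1.0, s))


def make_spline_activation(
    knots: Sequence[float], slopes: Sequence[float], bias: float = 0.0
) -> Callable[[float], float]:
    """Build a continuous piecewise-linear activation (hypothetical helper).

    knots  : strictly increasing breakpoints (at least 2, giving >= 3 regions)
    slopes : one slope per linear region, so len(slopes) == len(knots) + 1
    bias   : value of the activation at the first knot
    """
    if len(knots) < 2:
        raise ValueError("need at least 2 knots for 3 linear regions")
    if len(slopes) != len(knots) + 1:
        raise ValueError("need exactly one slope per linear region")
    if any(b <= a for a, b in zip(knots, knots[1:])):
        raise ValueError("knots must be strictly increasing")

    # Assumption of this sketch: the 1-Lipschitz constraint is enforced
    # by clipping every slope to [-1, 1].
    clipped = [clip_slope(s) for s in slopes]

    # Values at the knots follow from continuity, starting at `bias`.
    knot_values = [bias]
    for i in range(1, len(knots)):
        step = clipped[i] * (knots[i] - knots[i - 1])
        knot_values.append(knot_values[-1] + step)

    def activation(x: float) -> float:
        # Leftmost region: extend linearly to the left of the first knot.
        if x <= knots[0]:
            return knot_values[0] + clipped[0] * (x - knots[0])
        # Interior regions between consecutive knots.
        for i in range(1, len(knots)):
            if x <= knots[i]:
                return knot_values[i - 1] + clipped[i] * (x - knots[i - 1])
        # Rightmost region: extend linearly to the right of the last knot.
        return knot_values[-1] + clipped[-1] * (x - knots[-1])

    return activation


# Example: a 3-region activation that is the identity on [-1, 1]
# and constant outside of it.
clip_act = make_spline_activation(knots=[-1.0, 1.0], slopes=[0.0, 1.0, 0.0], bias=-1.0)
```

Because every piece has slope in [-1, 1] and the pieces join continuously, the resulting scalar function is 1-Lipschitz regardless of the raw slope values passed in. Composing such activations with weight matrices whose norm is constrained to at most 1 keeps the whole network 1-Lipschitz, which is the setting the abstract refers to.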


