Riesz Networks: Scale-Invariant Neural Networks in a Single Forward Pass

IF 1.3 4区 数学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Journal of Mathematical Imaging and Vision Pub Date : 2024-02-29 DOI:10.1007/s10851-024-01171-4
Tin Barisin, Katja Schladitz, Claudia Redenbach
{"title":"Riesz Networks: Scale-Invariant Neural Networks in a Single Forward Pass","authors":"Tin Barisin, Katja Schladitz, Claudia Redenbach","doi":"10.1007/s10851-024-01171-4","DOIUrl":null,"url":null,"abstract":"<p>Scale invariance of an algorithm refers to its ability to treat objects equally independently of their size. For neural networks, scale invariance is typically achieved by data augmentation. However, when presented with a scale far outside the range covered by the training set, neural networks may fail to generalize. Here, we introduce the Riesz network, a novel scale- invariant neural network. Instead of standard 2d or 3d convolutions for combining spatial information, the Riesz network is based on the Riesz transform which is a scale-equivariant operation. As a consequence, this network naturally generalizes to unseen or even arbitrary scales in a single forward pass. As an application example, we consider detecting and segmenting cracks in tomographic images of concrete. In this context, ‘scale’ refers to the crack thickness which may vary strongly even within the same sample. To prove its scale invariance, the Riesz network is trained on one fixed crack width. We then validate its performance in segmenting simulated and real tomographic images featuring a wide range of crack widths. An additional experiment is carried out on the MNIST Large Scale data set.</p>","PeriodicalId":16196,"journal":{"name":"Journal of Mathematical Imaging and Vision","volume":null,"pages":null},"PeriodicalIF":1.3000,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Mathematical Imaging and Vision","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s10851-024-01171-4","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Scale invariance of an algorithm refers to its ability to treat objects equally independently of their size. For neural networks, scale invariance is typically achieved by data augmentation. However, when presented with a scale far outside the range covered by the training set, neural networks may fail to generalize. Here, we introduce the Riesz network, a novel scale- invariant neural network. Instead of standard 2d or 3d convolutions for combining spatial information, the Riesz network is based on the Riesz transform which is a scale-equivariant operation. As a consequence, this network naturally generalizes to unseen or even arbitrary scales in a single forward pass. As an application example, we consider detecting and segmenting cracks in tomographic images of concrete. In this context, ‘scale’ refers to the crack thickness which may vary strongly even within the same sample. To prove its scale invariance, the Riesz network is trained on one fixed crack width. We then validate its performance in segmenting simulated and real tomographic images featuring a wide range of crack widths. An additional experiment is carried out on the MNIST Large Scale data set.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
里兹网络一次前向传递中的规模不变神经网络
算法的尺度不变性是指算法能够平等对待不同大小的对象。对于神经网络来说,规模不变性通常是通过数据增强来实现的。然而,当遇到远远超出训练集覆盖范围的尺度时,神经网络可能无法泛化。在此,我们介绍一种新型尺度不变神经网络--Riesz 网络。Riesz 网络以 Riesz 变换为基础,而不是以标准的 2d 或 3d 卷积来组合空间信息,Riesz 变换是一种尺度不变运算。因此,该网络只需一次前向传递,就能自然地泛化到未见过的甚至任意的尺度。作为一个应用实例,我们考虑检测和分割混凝土断层图像中的裂缝。在这种情况下,"尺度 "指的是裂缝厚度,即使在同一个样本中,裂缝厚度也会有很大变化。为了证明其尺度不变性,我们在一个固定的裂缝宽度上对 Riesz 网络进行了训练。然后,我们验证了它在分割具有多种裂缝宽度的模拟和真实断层图像时的性能。我们还在 MNIST 大尺度数据集上进行了额外的实验。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Mathematical Imaging and Vision
Journal of Mathematical Imaging and Vision 工程技术-计算机:人工智能
CiteScore
4.30
自引率
5.00%
发文量
70
审稿时长
3.3 months
期刊介绍: The Journal of Mathematical Imaging and Vision is a technical journal publishing important new developments in mathematical imaging. The journal publishes research articles, invited papers, and expository articles. Current developments in new image processing hardware, the advent of multisensor data fusion, and rapid advances in vision research have led to an explosive growth in the interdisciplinary field of imaging science. This growth has resulted in the development of highly sophisticated mathematical models and theories. The journal emphasizes the role of mathematics as a rigorous basis for imaging science. This provides a sound alternative to present journals in this area. Contributions are judged on the basis of mathematical content. Articles may be physically speculative but need to be mathematically sound. Emphasis is placed on innovative or established mathematical techniques applied to vision and imaging problems in a novel way, as well as new developments and problems in mathematics arising from these applications. The scope of the journal includes: computational models of vision; imaging algebra and mathematical morphology mathematical methods in reconstruction, compactification, and coding filter theory probabilistic, statistical, geometric, topological, and fractal techniques and models in imaging science inverse optics wave theory. Specific application areas of interest include, but are not limited to: all aspects of image formation and representation medical, biological, industrial, geophysical, astronomical and military imaging image analysis and image understanding parallel and distributed computing computer vision architecture design.
期刊最新文献
Mathematical Morphology on Directional Data Mixing Support Detection-Based Alternating Direction Method of Multipliers for Sparse Hyperspectral Image Unmixing Inferring Object Boundaries and Their Roughness with Uncertainty Quantification A Graph Multi-separator Problem for Image Segmentation Parallelly Sliced Optimal Transport on Spheres and on the Rotation Group
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1