{"title":"Lie group convolution neural networks with scale-rotation equivariance.","authors":"Weidong Qiao, Yang Xu, Hui Li","doi":"10.1016/j.neunet.2024.106980","DOIUrl":null,"url":null,"abstract":"<p><p>The weight-sharing mechanism of convolutional kernels ensures the translation equivariance of convolutional neural networks (CNNs) but not scale and rotation equivariance. This study proposes a SIM(2) Lie group-CNN, which can simultaneously keep scale, rotation, and translation equivariance for image classification tasks. The SIM(2) Lie group-CNN includes a lifting module, a series of group convolution modules, a global pooling layer, and a classification layer. The lifting module transfers the input image from Euclidean space to Lie group space, and the group convolution is parameterized through a fully connected network using the Lie Algebra coefficients of Lie group elements as inputs to achieve scale and rotation equivariance. It is worth noting that the mapping relationship between SIM(2) and its Lie Algebra and the distance measure of SIM(2) are defined explicitly in this paper, thus solving the problem of the metric of features on the space of SIM(2) Lie group, which contrasts with other Lie groups characterized by a single element, such as SO(2). The scale-rotation equivariance of Lie group-CNN is verified, and the best recognition accuracy is achieved on three categories of image datasets. Consequently, the SIM(2) Lie group-CNN can successfully extract geometric features and perform equivariant recognition on images with rotation and scale transformations.</p>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"183 ","pages":"106980"},"PeriodicalIF":6.0000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1016/j.neunet.2024.106980","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/28 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The weight-sharing mechanism of convolutional kernels ensures the translation equivariance of convolutional neural networks (CNNs) but not scale and rotation equivariance. This study proposes a SIM(2) Lie group-CNN, which can simultaneously keep scale, rotation, and translation equivariance for image classification tasks. The SIM(2) Lie group-CNN includes a lifting module, a series of group convolution modules, a global pooling layer, and a classification layer. The lifting module transfers the input image from Euclidean space to Lie group space, and the group convolution is parameterized through a fully connected network using the Lie Algebra coefficients of Lie group elements as inputs to achieve scale and rotation equivariance. It is worth noting that the mapping relationship between SIM(2) and its Lie Algebra and the distance measure of SIM(2) are defined explicitly in this paper, thus solving the problem of the metric of features on the space of SIM(2) Lie group, which contrasts with other Lie groups characterized by a single element, such as SO(2). The scale-rotation equivariance of Lie group-CNN is verified, and the best recognition accuracy is achieved on three categories of image datasets. Consequently, the SIM(2) Lie group-CNN can successfully extract geometric features and perform equivariant recognition on images with rotation and scale transformations.
期刊介绍:
Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.