{"title":"Spatial Transformations in Deep Neural Networks","authors":"Michał Bednarek, K. Walas","doi":"10.23919/SPA.2018.8563429","DOIUrl":null,"url":null,"abstract":"Convolutional Neural Networks (CNNs) have brought us the exceptionally significant improvement in the performance of the variety of visual tasks, such as object classification, semantic segmentation or linear regression. However, these powerful neural models suffer from the lack of spatial invariance. In this paper, we introduce the end-to-end system that is able to learn such invariance including in-plane and out-of-plane rotations. We performed extensive experiments on variations of widely known MNIST dataset, which consist of images subjected to deformations. Our comparative results show that we can successfully improve the classification score by implementing so-called Spatial Transformer module.","PeriodicalId":265587,"journal":{"name":"2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/SPA.2018.8563429","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Convolutional Neural Networks (CNNs) have brought us the exceptionally significant improvement in the performance of the variety of visual tasks, such as object classification, semantic segmentation or linear regression. However, these powerful neural models suffer from the lack of spatial invariance. In this paper, we introduce the end-to-end system that is able to learn such invariance including in-plane and out-of-plane rotations. We performed extensive experiments on variations of widely known MNIST dataset, which consist of images subjected to deformations. Our comparative results show that we can successfully improve the classification score by implementing so-called Spatial Transformer module.