深度神经网络中的空间变换

2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA) Pub Date : 2018-09-01 DOI:10.23919/SPA.2018.8563429

Michał Bednarek, K. Walas

{"title":"深度神经网络中的空间变换","authors":"Michał Bednarek, K. Walas","doi":"10.23919/SPA.2018.8563429","DOIUrl":null,"url":null,"abstract":"Convolutional Neural Networks (CNNs) have brought us the exceptionally significant improvement in the performance of the variety of visual tasks, such as object classification, semantic segmentation or linear regression. However, these powerful neural models suffer from the lack of spatial invariance. In this paper, we introduce the end-to-end system that is able to learn such invariance including in-plane and out-of-plane rotations. We performed extensive experiments on variations of widely known MNIST dataset, which consist of images subjected to deformations. Our comparative results show that we can successfully improve the classification score by implementing so-called Spatial Transformer module.","PeriodicalId":265587,"journal":{"name":"2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Spatial Transformations in Deep Neural Networks\",\"authors\":\"Michał Bednarek, K. Walas\",\"doi\":\"10.23919/SPA.2018.8563429\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Convolutional Neural Networks (CNNs) have brought us the exceptionally significant improvement in the performance of the variety of visual tasks, such as object classification, semantic segmentation or linear regression. However, these powerful neural models suffer from the lack of spatial invariance. In this paper, we introduce the end-to-end system that is able to learn such invariance including in-plane and out-of-plane rotations. We performed extensive experiments on variations of widely known MNIST dataset, which consist of images subjected to deformations. Our comparative results show that we can successfully improve the classification score by implementing so-called Spatial Transformer module.\",\"PeriodicalId\":265587,\"journal\":{\"name\":\"2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)\",\"volume\":\"100 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/SPA.2018.8563429\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/SPA.2018.8563429","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

卷积神经网络(cnn)为我们带来了各种视觉任务性能的显著改善，如对象分类、语义分割或线性回归。然而，这些强大的神经模型缺乏空间不变性。在本文中，我们引入了一个端到端系统，它能够学习平面内和平面外旋转的不变性。我们对广为人知的MNIST数据集进行了广泛的实验，该数据集由变形的图像组成。我们的比较结果表明，通过实现所谓的空间转换器模块，我们可以成功地提高分类分数。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Spatial Transformations in Deep Neural Networks

Convolutional Neural Networks (CNNs) have brought us the exceptionally significant improvement in the performance of the variety of visual tasks, such as object classification, semantic segmentation or linear regression. However, these powerful neural models suffer from the lack of spatial invariance. In this paper, we introduce the end-to-end system that is able to learn such invariance including in-plane and out-of-plane rotations. We performed extensive experiments on variations of widely known MNIST dataset, which consist of images subjected to deformations. Our comparative results show that we can successfully improve the classification score by implementing so-called Spatial Transformer module.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

自引率

0.00%

发文量