Spatial Transformer Networks for Curriculum Learning

Fatemeh Azimi, J. Nies, Sebastián M. Palacio, Federico Raue, Jörn Hees, A. Dengel
{"title":"Spatial Transformer Networks for Curriculum Learning","authors":"Fatemeh Azimi, J. Nies, Sebastián M. Palacio, Federico Raue, Jörn Hees, A. Dengel","doi":"10.1109/DICTA56598.2022.10034595","DOIUrl":null,"url":null,"abstract":"Curriculum learning is a bio-inspired training technique that is widely adopted in machine learning for improved optimization and better training of neural networks regarding the convergence rate or obtained accuracy. The main concept in curriculum learning is to start the training with simpler tasks and gradually increase the level of difficulty. Therefore, a natural question is how to determine or generate these simpler tasks. In this work, we take inspiration from Spatial Transformer Networks (STNs) in order to form an easy-to-hard curriculum. As STNs have been proved capable of removing the clutter from the input images and obtaining higher accuracy in image classification tasks, we hypothesize that images processed by STNs can be seen as easier tasks and utilized in the interest of curriculum learning. To this end, we study multiple strategies developed for shaping the training curriculum, using the data generated by STNs. We perform various experiments on cluttered MNIST and Fashion-MNIST datasets, where on the former, we obtain an improvement of 3.8pp in classification accuracy compared to the baseline, indicating that STNs can be considered as a tool for generating the easy-to-hard training schedule required for curriculum learning.","PeriodicalId":159377,"journal":{"name":"2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DICTA56598.2022.10034595","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Curriculum learning is a bio-inspired training technique that is widely adopted in machine learning for improved optimization and better training of neural networks regarding the convergence rate or obtained accuracy. The main concept in curriculum learning is to start the training with simpler tasks and gradually increase the level of difficulty. Therefore, a natural question is how to determine or generate these simpler tasks. In this work, we take inspiration from Spatial Transformer Networks (STNs) in order to form an easy-to-hard curriculum. As STNs have been proved capable of removing the clutter from the input images and obtaining higher accuracy in image classification tasks, we hypothesize that images processed by STNs can be seen as easier tasks and utilized in the interest of curriculum learning. To this end, we study multiple strategies developed for shaping the training curriculum, using the data generated by STNs. We perform various experiments on cluttered MNIST and Fashion-MNIST datasets, where on the former, we obtain an improvement of 3.8pp in classification accuracy compared to the baseline, indicating that STNs can be considered as a tool for generating the easy-to-hard training schedule required for curriculum learning.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
课程学习的空间转换网络
课程学习是一种生物启发的训练技术,被广泛应用于机器学习中,用于提高神经网络在收敛速度或获得精度方面的优化和更好的训练。课程学习的主要理念是从简单的任务开始训练,逐渐增加难度。因此,一个自然的问题是如何确定或生成这些更简单的任务。在这项工作中,我们从空间变压器网络(STNs)中获得灵感,以形成一个简单难的课程。由于STNs已被证明能够去除输入图像中的杂波,并在图像分类任务中获得更高的精度,我们假设经过STNs处理的图像可以被视为更容易的任务,并用于课程学习。为此,我们研究了利用STNs生成的数据制定培训课程的多种策略。我们在杂乱的MNIST和Fashion-MNIST数据集上进行了各种实验,前者的分类准确率比基线提高了3.8pp,这表明stn可以被认为是生成课程学习所需的易难训练计划的工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
FootSeg: Automatic Anatomical Segmentation of Foot Bones from Weight-Bearing Cone Beam CT Scans KENGIC: KEyword-driven and N-Gram Graph based Image Captioning Swin-ResUNet: A Swin-Topology Module for Road Extraction from Remote Sensing Images A Multi-Granularity Feature Fusion Model for Pedestrian Attribute Recognition Disentangling Convolutional Neural Network towards an explainable Vehicle Classifier
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1