基于改进AlexNet模型的场景分类

Lisha Xiao, Qin Yan, Shuyu Deng
{"title":"基于改进AlexNet模型的场景分类","authors":"Lisha Xiao, Qin Yan, Shuyu Deng","doi":"10.1109/ISKE.2017.8258820","DOIUrl":null,"url":null,"abstract":"Scene classification is an important research branch of image comprehension, which gains information from images and interprets them using computer system by imitating the biological systems of human beings. AlexNet model is limited in image classification because of the large convolution kernel and stride in the first convolutional layer leading to over rapid decline of feature maps resolution and excessive compression of spatial information. This paper proposed an improved AlexNet model according to the design principle of convolutional neural networks (CNNs). The large convolution kernel is decomposed into a structure cascaded by two small convolution kernels with reduced stride. Another convolutional layer is added after the first one to enhance the integration process of the low-level features or the spatial information. The asymmetric convolution kernel is applied in the last three convolutional layers. The experiments on two datasets show that the classification accuracy of the improved AlexNet model is higher than those of AlexNet model and ZFNet model for 23 categories of scene classification.","PeriodicalId":208009,"journal":{"name":"2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"45","resultStr":"{\"title\":\"Scene classification with improved AlexNet model\",\"authors\":\"Lisha Xiao, Qin Yan, Shuyu Deng\",\"doi\":\"10.1109/ISKE.2017.8258820\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Scene classification is an important research branch of image comprehension, which gains information from images and interprets them using computer system by imitating the biological systems of human beings. AlexNet model is limited in image classification because of the large convolution kernel and stride in the first convolutional layer leading to over rapid decline of feature maps resolution and excessive compression of spatial information. This paper proposed an improved AlexNet model according to the design principle of convolutional neural networks (CNNs). The large convolution kernel is decomposed into a structure cascaded by two small convolution kernels with reduced stride. Another convolutional layer is added after the first one to enhance the integration process of the low-level features or the spatial information. The asymmetric convolution kernel is applied in the last three convolutional layers. The experiments on two datasets show that the classification accuracy of the improved AlexNet model is higher than those of AlexNet model and ZFNet model for 23 categories of scene classification.\",\"PeriodicalId\":208009,\"journal\":{\"name\":\"2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE)\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"45\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISKE.2017.8258820\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISKE.2017.8258820","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 45

摘要

场景分类是图像理解的一个重要研究分支,它通过模拟人类的生物系统,从图像中获取信息,并利用计算机系统对其进行解释。AlexNet模型在图像分类方面受到限制,因为其卷积核较大,且第一卷积层的步幅过大,导致特征图分辨率下降过快,空间信息压缩过度。本文根据卷积神经网络(cnn)的设计原理,提出了一种改进的AlexNet模型。将大卷积核分解为由两个小卷积核级联而成的结构。在第一层卷积层之后再加一层卷积层,增强底层特征或空间信息的融合过程。在最后三个卷积层中应用非对称卷积核。在两个数据集上的实验表明,在23个类别的场景分类中,改进的AlexNet模型的分类准确率高于AlexNet模型和ZFNet模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Scene classification with improved AlexNet model
Scene classification is an important research branch of image comprehension, which gains information from images and interprets them using computer system by imitating the biological systems of human beings. AlexNet model is limited in image classification because of the large convolution kernel and stride in the first convolutional layer leading to over rapid decline of feature maps resolution and excessive compression of spatial information. This paper proposed an improved AlexNet model according to the design principle of convolutional neural networks (CNNs). The large convolution kernel is decomposed into a structure cascaded by two small convolution kernels with reduced stride. Another convolutional layer is added after the first one to enhance the integration process of the low-level features or the spatial information. The asymmetric convolution kernel is applied in the last three convolutional layers. The experiments on two datasets show that the classification accuracy of the improved AlexNet model is higher than those of AlexNet model and ZFNet model for 23 categories of scene classification.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
An interval-valued fuzzy soft set based triple I method Knowledge-based innovative methods for collaborative quality control in equipment outsourcing chain SimWalk: Learning network latent representations with social relation similarity An evaluation of sustainable development in less developed areas of Western China A data forwarding algorithm based on estimated Hungarian method for underwater sensor networks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1