Cartoon Figure Recognition with The Deep Residual Network
Author: Ziyi Guo
Venue: 2021 IEEE International Conference on Computer Science, Artificial Intelligence and Electronic Engineering (CSAIEE)
Published: 2021-08-20
DOI: https://doi.org/10.1109/CSAIEE54046.2021.9543197
Source: Semanticscholar
Citations: 2

Abstract

Deep learning has produced a growing number of neural network architectures for image recognition, and recognition accuracy varies with the architecture. It is therefore important to match the network architecture to the form of the image data. This paper compares LSTM networks, CNNs, and residual networks in terms of cartoon character recognition accuracy.[1] First, a web crawler collects images of 14 different cartoon characters, and the raw data are manually screened to remove duplicate images, yielding the preliminary dataset. The preliminary data are then augmented by rotating the images, which completes the preprocessing and removes the need for different import code for different forms of data fed into the networks. The LSTM network, the CNN, and the CNN with an added residual function are then used to recognize the preprocessed data. The experiments show that the CNN with the residual function achieves higher accuracy than the LSTM, with a final result of 76.08%.
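The preprocessing steps and the residual idea named in the abstract can be illustrated with a small sketch. This is not the author's code. It rests on several assumptions: images are stood in for by nested lists of pixel values, duplicates are detected by an exact hash (the paper screens them manually), rotation is done in 90-degree steps (the abstract does not state the angles), and "residual function" is read as the usual skip connection that adds a layer's input to its output. All function names are hypothetical.

```python
import hashlib

Image = list[list[int]]  # assumption: a grayscale image as rows of pixel values


def drop_exact_duplicates(images: list[Image]) -> list[Image]:
    """Keep the first copy of each pixel-identical image (stand-in for manual screening)."""
    seen: set[str] = set()
    unique: list[Image] = []
    for img in images:
        digest = hashlib.sha256(repr(img).encode("utf-8")).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique.append(img)
    return unique


def rotate90(img: Image) -> Image:
    """Rotate an image 90 degrees clockwise."""
    return [list(row) for row in zip(*img[::-1])]


def augment_with_rotations(img: Image) -> list[Image]:
    """Return the image plus its 90, 180 and 270 degree rotations (assumed angles)."""
    out = [img]
    for _ in range(3):
        out.append(rotate90(out[-1]))
    return out


def residual(f, x: list[float]) -> list[float]:
    """Skip connection: the output is f(x) + x, element-wise."""
    return [fx + xi for fx, xi in zip(f(x), x)]
```

Under these assumptions, each unique image yields four training samples, and `residual` shows why a block with a skip connection only has to learn the difference between its input and the desired output.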