Cartoon Figure Recognition with The Deep Residual Network

2021 IEEE International Conference on Computer Science, Artificial Intelligence and Electronic Engineering (CSAIEE) Pub Date : 2021-08-20 DOI:10.1109/CSAIEE54046.2021.9543197

Ziyi Guo

{"title":"Cartoon Figure Recognition with The Deep Residual Network","authors":"Ziyi Guo","doi":"10.1109/CSAIEE54046.2021.9543197","DOIUrl":null,"url":null,"abstract":"Because of the wide application of deep learning, there are more neural network structures in image recognition technology nowadays, but there are various differences in the accuracy of image recognition because of the various differences in network structures. For this reason, it is especially important to use different neural network structures for different forms of image data. This paper focuses on exploring the differences between LSTM networks, residual networks, and CNN networks in terms of the accuracy of cartoon character recognition.[1]Firstly, the web crawler acquires 14 different cartoon character images and manually screens the original data to remove the duplicate images and obtain the preliminary data. Then data enhancement was performed on the preliminary data, and the form of rotating the images was selected to complete the pre-processing of the data, which solved the problem of using different code forms for different forms of data importing into the neural network; the LSTM network, CNN network and CNN network with added residual function were used to recognize the pre-processed data. The experiments show that the CNN network structure with residual function can achieve higher accuracy compared to LSTM, with the final result of 76.08%.","PeriodicalId":376014,"journal":{"name":"2021 IEEE International Conference on Computer Science, Artificial Intelligence and Electronic Engineering (CSAIEE)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Computer Science, Artificial Intelligence and Electronic Engineering (CSAIEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSAIEE54046.2021.9543197","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Because of the wide application of deep learning, there are more neural network structures in image recognition technology nowadays, but there are various differences in the accuracy of image recognition because of the various differences in network structures. For this reason, it is especially important to use different neural network structures for different forms of image data. This paper focuses on exploring the differences between LSTM networks, residual networks, and CNN networks in terms of the accuracy of cartoon character recognition.[1]Firstly, the web crawler acquires 14 different cartoon character images and manually screens the original data to remove the duplicate images and obtain the preliminary data. Then data enhancement was performed on the preliminary data, and the form of rotating the images was selected to complete the pre-processing of the data, which solved the problem of using different code forms for different forms of data importing into the neural network; the LSTM network, CNN network and CNN network with added residual function were used to recognize the pre-processed data. The experiments show that the CNN network structure with residual function can achieve higher accuracy compared to LSTM, with the final result of 76.08%.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于深度残差网络的卡通人物识别

由于深度学习的广泛应用，如今的图像识别技术中出现了更多的神经网络结构，但由于网络结构的各种差异，图像识别的准确性也存在着各种差异。因此，针对不同形式的图像数据使用不同的神经网络结构就显得尤为重要。本文主要探讨LSTM网络、残差网络和CNN网络在卡通人物识别准确率方面的差异。[1]首先，网络爬虫获取14张不同的卡通人物图像，对原始数据进行人工筛选，去除重复图像，获得初步数据。然后对初步数据进行数据增强，选择旋转图像的形式完成数据预处理，解决了不同形式的数据导入神经网络时使用不同的编码形式的问题;采用LSTM网络、CNN网络和添加残差函数的CNN网络对预处理后的数据进行识别。实验表明，与LSTM相比，带有残差函数的CNN网络结构可以达到更高的准确率，最终结果为76.08%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2021 IEEE International Conference on Computer Science, Artificial Intelligence and Electronic Engineering (CSAIEE)

自引率

0.00%

发文量