基于深度学习的光学和自然图像鲁棒字符识别

Al.maamoon Rasool Abdali, R. F. Ghani
{"title":"基于深度学习的光学和自然图像鲁棒字符识别","authors":"Al.maamoon Rasool Abdali, R. F. Ghani","doi":"10.1109/SCORED.2019.8896354","DOIUrl":null,"url":null,"abstract":"Character recognition is one of the most critical parts of many computer vision system. And many studies have explored character recognition as two subcategories: character recognition in optical images, character recognition natural images while this separation was to achieve high accuracy in each field separated but it needs more hardware resources to operate two models one for each area. In addition to that most researches divided each field into (digits recognition, character recognition) that add extra cost in both time and hardware resources also we found that both areas still have room for accuracy improvement. This paper tackles the problem of the two subclasses by building one robust, accurate classifier using a convolutional neural network that can recognize (characters and digits) accurately in both optical and natural scene images, the proposed model has been trained on a combination of EMNIST and Char74k data sets with a random data augmentation. The proposed model achieved 92% accuracy in EMINST compared to previous works shows that the proposed model has the highest accuracy among all the previous works based on EMNIST data set. We also tested the model on none-seen data sets (ICDAR203) and the obtained results indicate the high generality and the robustness of the classifier.","PeriodicalId":231004,"journal":{"name":"2019 IEEE Student Conference on Research and Development (SCOReD)","volume":"87 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Robust Character Recognition For Optical And Natural Images Using Deep Learning\",\"authors\":\"Al.maamoon Rasool Abdali, R. F. Ghani\",\"doi\":\"10.1109/SCORED.2019.8896354\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Character recognition is one of the most critical parts of many computer vision system. And many studies have explored character recognition as two subcategories: character recognition in optical images, character recognition natural images while this separation was to achieve high accuracy in each field separated but it needs more hardware resources to operate two models one for each area. In addition to that most researches divided each field into (digits recognition, character recognition) that add extra cost in both time and hardware resources also we found that both areas still have room for accuracy improvement. This paper tackles the problem of the two subclasses by building one robust, accurate classifier using a convolutional neural network that can recognize (characters and digits) accurately in both optical and natural scene images, the proposed model has been trained on a combination of EMNIST and Char74k data sets with a random data augmentation. The proposed model achieved 92% accuracy in EMINST compared to previous works shows that the proposed model has the highest accuracy among all the previous works based on EMNIST data set. We also tested the model on none-seen data sets (ICDAR203) and the obtained results indicate the high generality and the robustness of the classifier.\",\"PeriodicalId\":231004,\"journal\":{\"name\":\"2019 IEEE Student Conference on Research and Development (SCOReD)\",\"volume\":\"87 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE Student Conference on Research and Development (SCOReD)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SCORED.2019.8896354\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Student Conference on Research and Development (SCOReD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCORED.2019.8896354","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

字符识别是许多计算机视觉系统中最关键的部分之一。许多研究将字符识别分为光学图像中的字符识别和自然图像中的字符识别两个子类,这种分离是为了在每个分离领域都达到较高的精度,但需要更多的硬件资源来运行两个模型,每个领域一个。此外,大多数研究将每个领域分为(数字识别,字符识别),这在时间和硬件资源上都增加了额外的成本,我们发现这两个领域的准确性仍然有提高的空间。本文通过使用卷积神经网络构建一个鲁棒、准确的分类器来解决这两个子类的问题,该分类器可以在光学和自然场景图像中准确识别(字符和数字),所提出的模型已经在EMNIST和Char74k数据集的组合上进行了随机数据增强训练。本文模型在EMNIST数据集上的准确率达到了92%,表明本文模型在以往基于EMNIST数据集的研究中准确率最高。我们还在未见过的数据集(ICDAR203)上测试了该模型,得到的结果表明该分类器具有很高的通用性和鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Robust Character Recognition For Optical And Natural Images Using Deep Learning
Character recognition is one of the most critical parts of many computer vision system. And many studies have explored character recognition as two subcategories: character recognition in optical images, character recognition natural images while this separation was to achieve high accuracy in each field separated but it needs more hardware resources to operate two models one for each area. In addition to that most researches divided each field into (digits recognition, character recognition) that add extra cost in both time and hardware resources also we found that both areas still have room for accuracy improvement. This paper tackles the problem of the two subclasses by building one robust, accurate classifier using a convolutional neural network that can recognize (characters and digits) accurately in both optical and natural scene images, the proposed model has been trained on a combination of EMNIST and Char74k data sets with a random data augmentation. The proposed model achieved 92% accuracy in EMINST compared to previous works shows that the proposed model has the highest accuracy among all the previous works based on EMNIST data set. We also tested the model on none-seen data sets (ICDAR203) and the obtained results indicate the high generality and the robustness of the classifier.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Preliminary Implementation of the Next Generation Cannulation Simulator Multi-Input Power Converter for Renewable Energy Sources using Active Current Sharing Schemes Understanding the differences in students' attitudes towards social media use: A case study from Oman Effect Of Neurofeedback 2D and 3D Stimulus Content On Stress Mitigation Investigation of Segmented Rotor FEFSSM with Non-Overlap Windings in Various Slot-Pole Configurations
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1