使用各种深度学习模型对手写城市名称进行分类

Nurseitov Daniyar, B. Kairat, Kanatov Maksat, Alimova Anel
{"title":"使用各种深度学习模型对手写城市名称进行分类","authors":"Nurseitov Daniyar, B. Kairat, Kanatov Maksat, Alimova Anel","doi":"10.1109/ICECCO48375.2019.9043266","DOIUrl":null,"url":null,"abstract":"The digitized text of handwriting would conduce to automate the business processes of many companies, simplifying the work of human being. For example, our state postal service does not have an automated mail processing system that recognizes handwritten addresses on an envelope. Each incoming correspondence is registered in the system by the operator. Automation of this business process on registering post mailing will significantly reduce expenses of postal service on mail delivery.There are two main approaches to handwriting recognition, namely hidden Markov models (HMM) and artificial neural networks (ANN). The methods proposed in this article are based on ANN. The first model is based on deep convolutional neural networks (DCNN) [1] for feature extraction and a fully connected multilayer perceptron (fully connected MLP) for word classification. The next model under consideration, called SimpleHTR, proposed by Harald Scheidl [2], has layers of a convolutional neural network (CNN) and layers of a recurrent neural network (RNN) for disseminating information through an image. Finally, the Connectionist Temporal Classification (CTC) decoding algorithm is executed, which adduces the text to the final version.Models were learned on the dataset of handwritten city names from Cyrillic words. 21,000 images were collected (42 classes of 500 handwriting samples). To increase the data set for training, 207,438 images from available samples were generated.As a result, two approaches for handwriting recognition were analyzed and the SimpleHTR model showed the best results over all.","PeriodicalId":166322,"journal":{"name":"2019 15th International Conference on Electronics, Computer and Computation (ICECCO)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Classification of handwritten names of cities using various deep learning models\",\"authors\":\"Nurseitov Daniyar, B. Kairat, Kanatov Maksat, Alimova Anel\",\"doi\":\"10.1109/ICECCO48375.2019.9043266\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The digitized text of handwriting would conduce to automate the business processes of many companies, simplifying the work of human being. For example, our state postal service does not have an automated mail processing system that recognizes handwritten addresses on an envelope. Each incoming correspondence is registered in the system by the operator. Automation of this business process on registering post mailing will significantly reduce expenses of postal service on mail delivery.There are two main approaches to handwriting recognition, namely hidden Markov models (HMM) and artificial neural networks (ANN). The methods proposed in this article are based on ANN. The first model is based on deep convolutional neural networks (DCNN) [1] for feature extraction and a fully connected multilayer perceptron (fully connected MLP) for word classification. The next model under consideration, called SimpleHTR, proposed by Harald Scheidl [2], has layers of a convolutional neural network (CNN) and layers of a recurrent neural network (RNN) for disseminating information through an image. Finally, the Connectionist Temporal Classification (CTC) decoding algorithm is executed, which adduces the text to the final version.Models were learned on the dataset of handwritten city names from Cyrillic words. 21,000 images were collected (42 classes of 500 handwriting samples). To increase the data set for training, 207,438 images from available samples were generated.As a result, two approaches for handwriting recognition were analyzed and the SimpleHTR model showed the best results over all.\",\"PeriodicalId\":166322,\"journal\":{\"name\":\"2019 15th International Conference on Electronics, Computer and Computation (ICECCO)\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 15th International Conference on Electronics, Computer and Computation (ICECCO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICECCO48375.2019.9043266\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 15th International Conference on Electronics, Computer and Computation (ICECCO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECCO48375.2019.9043266","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

数字化的手写文本将有助于许多公司的业务流程自动化,简化人类的工作。例如,我们国家的邮政服务没有一个自动邮件处理系统来识别信封上手写的地址。每个传入的通信都由接线员在系统中登记。自动登记邮寄业务,可大大减少邮政服务的邮件派递费用。手写识别主要有两种方法,隐马尔可夫模型(HMM)和人工神经网络(ANN)。本文提出的方法是基于人工神经网络的。第一个模型基于深度卷积神经网络(DCNN)[1]进行特征提取,并基于全连接多层感知器(fully connected MLP)进行词分类。下一个正在考虑的模型,称为SimpleHTR,由Harald Scheidl b[2]提出,具有卷积神经网络(CNN)层和递归神经网络(RNN)层,用于通过图像传播信息。最后,执行连接时态分类(CTC)解码算法,将文本引到最终版本。模型是在来自西里尔字母的手写城市名称数据集上学习的。收集了21,000张图像(42类500个笔迹样本)。为了增加训练数据集,从可用样本中生成了207,438张图像。结果,分析了两种手写识别方法,SimpleHTR模型总体上显示出最好的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Classification of handwritten names of cities using various deep learning models
The digitized text of handwriting would conduce to automate the business processes of many companies, simplifying the work of human being. For example, our state postal service does not have an automated mail processing system that recognizes handwritten addresses on an envelope. Each incoming correspondence is registered in the system by the operator. Automation of this business process on registering post mailing will significantly reduce expenses of postal service on mail delivery.There are two main approaches to handwriting recognition, namely hidden Markov models (HMM) and artificial neural networks (ANN). The methods proposed in this article are based on ANN. The first model is based on deep convolutional neural networks (DCNN) [1] for feature extraction and a fully connected multilayer perceptron (fully connected MLP) for word classification. The next model under consideration, called SimpleHTR, proposed by Harald Scheidl [2], has layers of a convolutional neural network (CNN) and layers of a recurrent neural network (RNN) for disseminating information through an image. Finally, the Connectionist Temporal Classification (CTC) decoding algorithm is executed, which adduces the text to the final version.Models were learned on the dataset of handwritten city names from Cyrillic words. 21,000 images were collected (42 classes of 500 handwriting samples). To increase the data set for training, 207,438 images from available samples were generated.As a result, two approaches for handwriting recognition were analyzed and the SimpleHTR model showed the best results over all.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
An Investigation into Peer-to-Peer Network Security Using Wireshark Greenhouse Monitoring and Control System with an Arduino System Transfer Learning Based Histopathologic Image Classification for Burns Recognition Modelling And Realization of a Compact CPW Transmission Lines Using 3D Mmics Technology in ADS Momentum Review of Advances in Machine Learning Based Protein Secondary Structure Prediction
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1