Akm Ashiquzzaman, A. Tushar, S. Dutta, Farzana Mohsin
{"title":"An efficient method for improving classification accuracy of handwritten Bangla compound characters using DCNN with dropout and ELU","authors":"Akm Ashiquzzaman, A. Tushar, S. Dutta, Farzana Mohsin","doi":"10.1109/ICRCICN.2017.8234497","DOIUrl":null,"url":null,"abstract":"Handwritten character recognition is an essential part of optical character recognition domain. Bangla handwritten compound character recognition is a complex task that is challenging due to extensive size of and sheer diversity within the alphabet. The current work proposes a novel method of recognition of compound characters in Bangla language using deep convolutional neural networks (DCNN) and efficient greedy layer-wise training approach. Introduction of dropout technology mitigates data overfitting and Exponential Linear Unit (ELU) is introduced to tackle the vanishing gradient problem during training. ELU is a special rectified linear unit which provides sustainability against the vanishing as well as exploding gradients. Furthermore, dropout influences network elements to learn diverse representation of data, which contributes to generalization of model. The model is tested on CMATERdb 3.1.3.3 data set of compound characters, and the performance is found to outperform existing state-of-the-art methods of Bangla handwritten complex character recognition.","PeriodicalId":166298,"journal":{"name":"2017 Third International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Third International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRCICN.2017.8234497","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 21
Abstract
Handwritten character recognition is an essential part of optical character recognition domain. Bangla handwritten compound character recognition is a complex task that is challenging due to extensive size of and sheer diversity within the alphabet. The current work proposes a novel method of recognition of compound characters in Bangla language using deep convolutional neural networks (DCNN) and efficient greedy layer-wise training approach. Introduction of dropout technology mitigates data overfitting and Exponential Linear Unit (ELU) is introduced to tackle the vanishing gradient problem during training. ELU is a special rectified linear unit which provides sustainability against the vanishing as well as exploding gradients. Furthermore, dropout influences network elements to learn diverse representation of data, which contributes to generalization of model. The model is tested on CMATERdb 3.1.3.3 data set of compound characters, and the performance is found to outperform existing state-of-the-art methods of Bangla handwritten complex character recognition.
手写体字符识别是光学字符识别领域的重要组成部分。孟加拉语手写复合字识别是一项复杂的任务,由于其庞大的规模和绝对的多样性,具有挑战性。本文提出了一种基于深度卷积神经网络(DCNN)和高效贪婪分层训练方法的孟加拉语复合字识别新方法。引入dropout技术减轻了数据的过拟合,并引入指数线性单元(Exponential Linear Unit, ELU)来解决训练过程中梯度消失的问题。ELU是一种特殊的整流线性单元,它提供了对消失和爆炸梯度的可持续性。此外,辍学影响网络元素学习数据的多样化表示,有助于模型的泛化。在CMATERdb 3.1.3.3复合字数据集上对该模型进行了测试,结果表明该模型的性能优于现有的孟加拉文手写复合字识别方法。