Bangla Handwritten Digit Recognition using RNN-CNN Hybrid Approach

A. Hasib Uddin, Joygun Khatun, Mehera Afroz Meghna, Prince Mahmud
{"title":"Bangla Handwritten Digit Recognition using RNN-CNN Hybrid Approach","authors":"A. Hasib Uddin, Joygun Khatun, Mehera Afroz Meghna, Prince Mahmud","doi":"10.1109/ICCIT57492.2022.10055089","DOIUrl":null,"url":null,"abstract":"The automatic recognition of handwritten English material has seen a lot of progress. However, research on automatic Bangla handwriting numerals recognition is far behind. Even the most effective recognizers now in use do not produce an adequate performance for real-world applications. This paper suggested a strategy based on deep neural networks. In this paper, we have used the BanglaLekha-Isolated handwriting dataset along with ResNet50 and DensNet201 models for benchmarking process. Then we proposed two new models one is a Gated Recurrent Unit (GRU) based and another one is a Hybrid of Convolutional Neural Network (CNN) and Convolutional Long Short-term Memory (ConvLSTM). As for our proposed GRU model it performs closely to the DensNet201 and REsNet50 models while requiring very few parameters compared to these two models. On the other hand, our proposed Hybrid ConvLSTM model outperforms both of the aforementioned benchmarking models. Finally, we have developed a new Bangla Handwriting Numerical dataset containing a total of seven thousand training, one thousand validation, and two thousand test images. Our proposed best-performing model (Hybrid ConvLSTM) achieves 98.84% accuracy in the test data of our dataset while the GRU model gained 91.33% test accuracy without any help of image preprocessing steps.","PeriodicalId":255498,"journal":{"name":"2022 25th International Conference on Computer and Information Technology (ICCIT)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 25th International Conference on Computer and Information Technology (ICCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIT57492.2022.10055089","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The automatic recognition of handwritten English material has seen a lot of progress. However, research on automatic Bangla handwriting numerals recognition is far behind. Even the most effective recognizers now in use do not produce an adequate performance for real-world applications. This paper suggested a strategy based on deep neural networks. In this paper, we have used the BanglaLekha-Isolated handwriting dataset along with ResNet50 and DensNet201 models for benchmarking process. Then we proposed two new models one is a Gated Recurrent Unit (GRU) based and another one is a Hybrid of Convolutional Neural Network (CNN) and Convolutional Long Short-term Memory (ConvLSTM). As for our proposed GRU model it performs closely to the DensNet201 and REsNet50 models while requiring very few parameters compared to these two models. On the other hand, our proposed Hybrid ConvLSTM model outperforms both of the aforementioned benchmarking models. Finally, we have developed a new Bangla Handwriting Numerical dataset containing a total of seven thousand training, one thousand validation, and two thousand test images. Our proposed best-performing model (Hybrid ConvLSTM) achieves 98.84% accuracy in the test data of our dataset while the GRU model gained 91.33% test accuracy without any help of image preprocessing steps.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用RNN-CNN混合方法的孟加拉语手写数字识别
手写英语材料的自动识别已经取得了很大的进展。然而,对孟加拉文手写数字自动识别的研究还远远落后。即使是目前使用的最有效的识别器也不能在实际应用中产生足够的性能。本文提出了一种基于深度神经网络的策略。在本文中,我们使用BanglaLekha-Isolated手写数据集以及ResNet50和DensNet201模型进行基准测试过程。然后我们提出了两个新的模型,一个是基于门控循环单元(GRU)的模型,另一个是卷积神经网络(CNN)和卷积长短期记忆(ConvLSTM)的混合模型。对于我们提出的GRU模型,它的性能与DensNet201和REsNet50模型非常接近,而与这两个模型相比,它需要的参数很少。另一方面,我们提出的混合ConvLSTM模型优于上述两种基准测试模型。最后,我们开发了一个新的孟加拉语手写数字数据集,其中总共包含7000个训练图像、1000个验证图像和2000个测试图像。我们提出的最佳模型(Hybrid ConvLSTM)在我们数据集的测试数据中达到了98.84%的准确率,而GRU模型在没有任何图像预处理步骤的情况下获得了91.33%的测试准确率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
SlotFinder: A Spatio-temporal based Car Parking System Land Cover and Land Use Detection using Semi-Supervised Learning Comparative Analysis of Process Scheduling Algorithm using AI models Throughput Optimization of IEEE 802.15.4e TSCH-Based Scheduling: A Deep Neural Network (DNN) Scheme Towards Developing a Voice-Over-Guided System for Visually Impaired People to Learn Writing the Alphabets
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1