Application of image retrieval for aesthetic evaluation and improvement suggestion of isolated Bangla handwritten characters

Mithun Biswas, Rafiqul Islam, Gautam Kumar Shom, Nabeel Mohammed, S. Momen, N. Mansoor, Md. Anowarul Abedin
{"title":"Application of image retrieval for aesthetic evaluation and improvement suggestion of isolated Bangla handwritten characters","authors":"Mithun Biswas, Rafiqul Islam, Gautam Kumar Shom, Nabeel Mohammed, S. Momen, N. Mansoor, Md. Anowarul Abedin","doi":"10.1109/ICSIPA.2017.8120596","DOIUrl":null,"url":null,"abstract":"Bangla is one of the most widely used languages worldwide. This paper presents an application of image retrieval techniques to automatically judge the aesthetic quality of handwritten Bangla isolated characters. Retrieval techniques are also adapted to give improvement suggestions, with a plan to incorporate the methods in applications which can assist in learning/teaching handwriting. The proposed method borrows key concepts from content-based image retrieval. Our method was tested on the BanglaLekha-Isolated data set, which contains images of 84 Bangla characters, with nearly 2000 samples per character. The data set contains evaluation of the aesthetic quality of the handwriting judged on a scale of 1–5. For this work, the dataset was partitioned into a test set of 400 images and a database-set of ≈ 1600 images, per Bangla character. Assuming that a scoring difference of 1 is acceptable, the proposed method achieves an accuracy of 77.39% when using features extracted by a convolutional neural network based autoencoder. Experiments were also done with the popular HOG feature. However, the autoencoder-based results showed clear superiority compared the HOG-based results. Our proposed method for improvement suggestions also shows that it is possible to shows samples from the dataset which will help users improve their handwriting while requiring small changes to their own handwriting.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSIPA.2017.8120596","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Bangla is one of the most widely used languages worldwide. This paper presents an application of image retrieval techniques to automatically judge the aesthetic quality of handwritten Bangla isolated characters. Retrieval techniques are also adapted to give improvement suggestions, with a plan to incorporate the methods in applications which can assist in learning/teaching handwriting. The proposed method borrows key concepts from content-based image retrieval. Our method was tested on the BanglaLekha-Isolated data set, which contains images of 84 Bangla characters, with nearly 2000 samples per character. The data set contains evaluation of the aesthetic quality of the handwriting judged on a scale of 1–5. For this work, the dataset was partitioned into a test set of 400 images and a database-set of ≈ 1600 images, per Bangla character. Assuming that a scoring difference of 1 is acceptable, the proposed method achieves an accuracy of 77.39% when using features extracted by a convolutional neural network based autoencoder. Experiments were also done with the popular HOG feature. However, the autoencoder-based results showed clear superiority compared the HOG-based results. Our proposed method for improvement suggestions also shows that it is possible to shows samples from the dataset which will help users improve their handwriting while requiring small changes to their own handwriting.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
图像检索在孟加拉孤立手写体汉字美学评价中的应用及改进建议
孟加拉语是世界上使用最广泛的语言之一。本文提出了一种基于图像检索技术的手写体孟加拉语孤立字美学质量自动评判方法。检索技术也进行了调整,以提供改进建议,并计划将这些方法纳入应用程序,以帮助学习/教学手写。该方法借鉴了基于内容的图像检索的关键概念。我们的方法在BanglaLekha-Isolated数据集上进行了测试,该数据集包含84个孟加拉语字符的图像,每个字符有近2000个样本。该数据集包含对笔迹美学质量的评估,评分范围为1-5。在这项工作中,每个孟加拉语字符将数据集划分为包含400张图像的测试集和包含约1600张图像的数据库集。假设评分差为1是可以接受的,当使用基于卷积神经网络的自编码器提取特征时,本文方法的准确率达到77.39%。我们还对流行的HOG特征进行了实验。然而,与基于hog的结果相比,基于自编码器的结果显示出明显的优势。我们提出的改进建议方法也表明,可以从数据集中显示样本,这将帮助用户在需要对自己的笔迹进行微小更改的情况下改善他们的笔迹。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Enhanced forensic speaker verification using multi-run ICA in the presence of environmental noise and reverberation conditions A real-time multi-class multi-object tracker using YOLOv2 Hybrid neural network and regression tree ensemble pruned by simulated annealing for virtual flow metering application Hybrid DWT and MFCC feature warping for noisy forensic speaker verification in room reverberation A deep architecture for face recognition based on multiple feature extraction techniques
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1