Region-Based Convolutional Neural Network for Segmenting Text in Epigraphical Images

Padmaprabha Preethi, Hosahalli Ramappa Mamatha
{"title":"Region-Based Convolutional Neural Network for Segmenting Text in Epigraphical Images","authors":"Padmaprabha Preethi, Hosahalli Ramappa Mamatha","doi":"10.47852/bonviewaia2202293","DOIUrl":null,"url":null,"abstract":"Indian history is derived from ancient writings on the inscriptions, palm leaves, copper plates, coins, and many more mediums. Epigraphers read these inscriptions and produce meaningful interpretations. Automating the process of reading is the interest of our study, and in this paper, segmentation to detect text on digitized inscriptional images is dealt in detail. Character segmentation from epigraphical images helps in optical character recognizer in training and recognition of old regional scripts. Epigraphical images are drawn from estampages containing scripts from various periods starting from Brahmi in the 3rd century BC to the medieval period of the 15th century AD. The scripts or characters present in digitized epigraphical images are illegible and have complex noisy background textures. To achieve script/text segmentation, region-based convolutional neural network (CNN) is employed to detect characters in the images. Proposed method uses selective search to identify text regions and forwards them to trained CNN models for drawing feature vectors. These feature vectors are fed to support vector machine classifiers for classification and recognize text by drawing a bounding box based on confidence score. Alexnet, VGG16, Resnet50, and InceptionV3 are used as CNN models for experimentation, and InceptionV3 performed well with good results. A total of 197 images are used for experimentation, out of which 70 samples are of printed denoised epigraphical images, 40 denoised estampage images, and 87 noisy estampage images. The segmentation result of 74.79% for printed denoised epigraphical images, 71.53 % for denoised estampage epigraphical images, and 18.11% for noisy estampage images are recorded by InceptionV3. The segmented characters are used for epigraphical applications like period/era prediction and recognition of characters. FAST and FASTER region-based design approach was also tested and illustrated in this paper.","PeriodicalId":91205,"journal":{"name":"Artificial intelligence and applications (Commerce, Calif.)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence and applications (Commerce, Calif.)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47852/bonviewaia2202293","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Indian history is derived from ancient writings on the inscriptions, palm leaves, copper plates, coins, and many more mediums. Epigraphers read these inscriptions and produce meaningful interpretations. Automating the process of reading is the interest of our study, and in this paper, segmentation to detect text on digitized inscriptional images is dealt in detail. Character segmentation from epigraphical images helps in optical character recognizer in training and recognition of old regional scripts. Epigraphical images are drawn from estampages containing scripts from various periods starting from Brahmi in the 3rd century BC to the medieval period of the 15th century AD. The scripts or characters present in digitized epigraphical images are illegible and have complex noisy background textures. To achieve script/text segmentation, region-based convolutional neural network (CNN) is employed to detect characters in the images. Proposed method uses selective search to identify text regions and forwards them to trained CNN models for drawing feature vectors. These feature vectors are fed to support vector machine classifiers for classification and recognize text by drawing a bounding box based on confidence score. Alexnet, VGG16, Resnet50, and InceptionV3 are used as CNN models for experimentation, and InceptionV3 performed well with good results. A total of 197 images are used for experimentation, out of which 70 samples are of printed denoised epigraphical images, 40 denoised estampage images, and 87 noisy estampage images. The segmentation result of 74.79% for printed denoised epigraphical images, 71.53 % for denoised estampage epigraphical images, and 18.11% for noisy estampage images are recorded by InceptionV3. The segmented characters are used for epigraphical applications like period/era prediction and recognition of characters. FAST and FASTER region-based design approach was also tested and illustrated in this paper.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于区域卷积神经网络的铭文图像文本分割
印度的历史来源于古代铭文、棕榈叶、铜板、硬币和许多其他媒介上的文字。铭文工作者阅读这些铭文,并作出有意义的解释。阅读过程的自动化是我们研究的方向,本文详细讨论了在数字化铭文图像上进行文本分割检测的方法。对铭文图像进行字符分割,有助于光学字符识别器对旧区域文字的训练和识别。铭文图像是从包含从公元前3世纪的婆罗门到公元15世纪的中世纪时期的不同时期的文字的邮票上绘制的。数字化铭文图像中的文字或文字难以辨认,并且具有复杂的噪声背景纹理。为了实现脚本/文本分割,采用基于区域的卷积神经网络(CNN)对图像中的字符进行检测。该方法通过选择性搜索识别文本区域,并将其转发给训练好的CNN模型绘制特征向量。将这些特征向量馈送给支持向量机分类器进行分类,并根据置信度绘制边界框进行文本识别。使用Alexnet、VGG16、Resnet50和InceptionV3作为CNN模型进行实验,InceptionV3表现良好,效果良好。实验共使用了197张图像,其中印刷去噪墓志文图像70张,去噪墓志文图像40张,去噪墓志文图像87张。InceptionV3的分割结果分别为印刷品去噪后的74.79%、去噪后的71.53%和带噪后的18.11%。分割的字符用于铭文应用,如时期/时代预测和字符识别。本文还对FAST和FASTER基于区域的设计方法进行了测试和说明。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Methodology of Measurement Intellectualization based on Regularized Bayesian Approach in Uncertain Conditions Stochastic Dual Coordinate Ascent for Learning Sign Constrained Linear Predictors Data Smoothing Filling Method based on ScRNA-Seq Data Zero-Value Identification Batch-Stochastic Sub-Gradient Method for Solving Non-Smooth Convex Loss Function Problems Teaching Reading Skills More Effectively
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1