Proposed Deep Learning System for Arabic Text Detection and Recognition

Ghufran Jafar Salman, M. S. M. Altaei
{"title":"Proposed Deep Learning System for Arabic Text Detection and Recognition","authors":"Ghufran Jafar Salman, M. S. M. Altaei","doi":"10.1109/DeSE58274.2023.10100235","DOIUrl":null,"url":null,"abstract":"Building a system to recognize Arabic words or texts has been challenging. It's harder when the text is in various sizes and fonts, regardless of font complexities. This work built a smart system to recognize Arabic words and texts by creating a dataset and training it by using deep learning techniques. This system can scan text into a computer texts. Each of the 1,000 words in the dataset was written out 24 different ways, using 24 different Arabic fonts. Words in images were identified and deduced with the use of image processing methods. Finally, the deep learning (Convolution Neural Network CNN) algorithm takes over, extracting features from the truncated word and retrieving text words that are visually similar to the ones that were cut. In experiments, the system achieved 99% accuracy in words detection and 96% accuracy in recognition.","PeriodicalId":346847,"journal":{"name":"2023 15th International Conference on Developments in eSystems Engineering (DeSE)","volume":"43 21","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 15th International Conference on Developments in eSystems Engineering (DeSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DeSE58274.2023.10100235","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Building a system to recognize Arabic words or texts has been challenging. It's harder when the text is in various sizes and fonts, regardless of font complexities. This work built a smart system to recognize Arabic words and texts by creating a dataset and training it by using deep learning techniques. This system can scan text into a computer texts. Each of the 1,000 words in the dataset was written out 24 different ways, using 24 different Arabic fonts. Words in images were identified and deduced with the use of image processing methods. Finally, the deep learning (Convolution Neural Network CNN) algorithm takes over, extracting features from the truncated word and retrieving text words that are visually similar to the ones that were cut. In experiments, the system achieved 99% accuracy in words detection and 96% accuracy in recognition.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
阿拉伯语文本检测与识别的深度学习系统
建立一个识别阿拉伯语单词或文本的系统一直是一个挑战。当文本是各种大小和字体时,不管字体的复杂性如何,这就更难了。这项工作建立了一个智能系统,通过创建一个数据集并使用深度学习技术进行训练,来识别阿拉伯语单词和文本。该系统可以将文本扫描成计算机文本。数据集中的1000个单词中的每一个都有24种不同的写法,使用24种不同的阿拉伯字体。利用图像处理方法对图像中的单词进行识别和推理。最后,深度学习(卷积神经网络CNN)算法接管,从截断的单词中提取特征,并检索视觉上与被截断的单词相似的文本单词。在实验中,该系统的单词检测准确率达到99%,识别准确率达到96%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Using Simulation for Investigating Emergency Traffic Situations Real- Time Healthcare Monitoring and Treatment System Based Microcontroller with IoT Automated Face Mask Detection using Artificial Intelligence and Video Surveillance Management Improvement of the Personnel Delivery System in the Mining Complex using Simulation Models An Exploratory Study on the Impact of Hosting Blockchain Applications in Cloud Infrastructures
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1