A Parallel Text Recognition in Electrical Equipment Nameplate Images Based on Apache Flink

Z. Liu, Lin Li, Da Zhang, Liangshuai Liu, Ze Deng
{"title":"A Parallel Text Recognition in Electrical Equipment Nameplate Images Based on Apache Flink","authors":"Z. Liu, Lin Li, Da Zhang, Liangshuai Liu, Ze Deng","doi":"10.1142/s0218126623501098","DOIUrl":null,"url":null,"abstract":"Information on the equipment nameplate is important for the storage, transportation, verification and maintenance of electrical equipment. However, because a natural image of the device on the text nameplate may be multidirectional, curved, noisy or blurry, automatically recognizing the image from the device nameplate can be difficult. Meanwhile, image preprocessing methods are carried out in a serial manner, so the processing speed with regard to the above problems is slower and takes a longer time. Accordingly, this study proposes a parallel and deep-learning-based text automatic recognition method. In the proposed method, a pretreatment method comprising edge detection, morphological manipulation and projection transformation is used to obtain the corrected nameplate region. The connectionist text proposal network (CTPN) is then activated to detect text lines on the corrected nameplate area. Next, a deep-learning method is proposed to study the classification methods of convolutional recurrent neural networks and connectionist time classification for identifying text in each line of text detected by CTPN. Finally, we use Apache Flink to parallelize the above processes, including parallelization preprocessing and bidirectional long short-term memory parallelization in the process of text line detection and text recognition. Experimental results on the collected nameplate show that the proposed imaging processing method has a good recognition performance and that the parallelization method significantly reduces the data processing time cost.","PeriodicalId":14696,"journal":{"name":"J. Circuits Syst. Comput.","volume":"43 1","pages":"2350109:1-2350109:19"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"J. Circuits Syst. Comput.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s0218126623501098","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Information on the equipment nameplate is important for the storage, transportation, verification and maintenance of electrical equipment. However, because a natural image of the device on the text nameplate may be multidirectional, curved, noisy or blurry, automatically recognizing the image from the device nameplate can be difficult. Meanwhile, image preprocessing methods are carried out in a serial manner, so the processing speed with regard to the above problems is slower and takes a longer time. Accordingly, this study proposes a parallel and deep-learning-based text automatic recognition method. In the proposed method, a pretreatment method comprising edge detection, morphological manipulation and projection transformation is used to obtain the corrected nameplate region. The connectionist text proposal network (CTPN) is then activated to detect text lines on the corrected nameplate area. Next, a deep-learning method is proposed to study the classification methods of convolutional recurrent neural networks and connectionist time classification for identifying text in each line of text detected by CTPN. Finally, we use Apache Flink to parallelize the above processes, including parallelization preprocessing and bidirectional long short-term memory parallelization in the process of text line detection and text recognition. Experimental results on the collected nameplate show that the proposed imaging processing method has a good recognition performance and that the parallelization method significantly reduces the data processing time cost.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于Apache Flink的电气设备铭牌图像并行文本识别
设备铭牌上的信息对于电气设备的储存、运输、验证和维护非常重要。但是,由于文本铭牌上设备的自然图像可能是多向的、弯曲的、嘈杂的或模糊的,因此从设备铭牌上自动识别图像可能很困难。同时,由于图像预处理方法采用串行方式进行,因此对上述问题的处理速度较慢,耗时较长。据此,本研究提出了一种基于并行深度学习的文本自动识别方法。该方法采用边缘检测、形态学处理和投影变换等预处理方法,得到校正后的铭牌区域。然后激活连接文本建议网络(CTPN)以检测正确铭牌区域上的文本行。接下来,提出一种深度学习方法,研究卷积递归神经网络和连接时间分类的分类方法,用于在CTPN检测到的每一行文本中识别文本。最后,我们利用Apache Flink对上述过程进行并行化处理,包括文本行检测和文本识别过程中的并行化预处理和双向长短期记忆并行化。在采集到的铭牌上的实验结果表明,所提出的图像处理方法具有良好的识别性能,并行化方法显著降低了数据处理的时间成本。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Erratum: Diode Connected Transistor-Based Low PDP Adiabatic Full Adder in 7nm FINFET Technology for MIMO Applications An Efficient Control Strategy for an Extended Switched Coupled Inductor Quasi-Z-Source Inverter for 3Φ Grid Connected System An Optimal Partitioning and Floor Planning for VLSI Circuit Design Based on a Hybrid Bio-Inspired Whale Optimization and Adaptive Bird Swarm Optimization (WO-ABSO) Algorithm Mode Switching Technique for High Efficiency Buck Converter Cloud-Edge Computing-Based ICICOS Framework for Industrial Automation and Artificial Intelligence: A Survey
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1