Recognition of multifont English electronic prescribing based on convolution neural network algorithm

IF 1.2 Q3 Computer Science Bio-Algorithms and Med-Systems Pub Date : 2020-09-01 DOI:10.1515/BAMS-2020-0021
M. Mohammed, E. Mohammed, Mohammed S. Jarjees
{"title":"Recognition of multifont English electronic prescribing based on convolution neural network algorithm","authors":"M. Mohammed, E. Mohammed, Mohammed S. Jarjees","doi":"10.1515/BAMS-2020-0021","DOIUrl":null,"url":null,"abstract":"Abstract The printed character recognition is an efficient and automatic method for inputting information to a computer nowadays that is used to translate the printed or handwritten images into an editable and readable text file. This paper aims to recognize a multifont and multisize of the English language printed word for a smart pharmacy purpose. The recognition system has been based on a convolution neural network (CNN) approach where line, word, and character are separately corrected, and then each of the separated characters is fed into the CNN algorithm for recognition purposes. The OpenCV open-source library has been used for preprocessing, which can segment English characters accurately and efficiently, and for recognition, the Keras library with the backend of TensorFlow has been used. The training and testing data sets have been designed to include 23 different fonts with six different sizes. The CNN algorithm achieves the highest accuracy of 96.6% comparing to the other state-of-the-art machine learning methods. The higher classification accuracy of the CNN approach shows that this type of algorithm is ideal for the English language printed word recognition. The highest error rate after testing the system using English electronic prescribing written with all proposed font-types is 0.23% in Georgia font.","PeriodicalId":42620,"journal":{"name":"Bio-Algorithms and Med-Systems","volume":" ","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/BAMS-2020-0021","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bio-Algorithms and Med-Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1515/BAMS-2020-0021","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 8

Abstract

Abstract The printed character recognition is an efficient and automatic method for inputting information to a computer nowadays that is used to translate the printed or handwritten images into an editable and readable text file. This paper aims to recognize a multifont and multisize of the English language printed word for a smart pharmacy purpose. The recognition system has been based on a convolution neural network (CNN) approach where line, word, and character are separately corrected, and then each of the separated characters is fed into the CNN algorithm for recognition purposes. The OpenCV open-source library has been used for preprocessing, which can segment English characters accurately and efficiently, and for recognition, the Keras library with the backend of TensorFlow has been used. The training and testing data sets have been designed to include 23 different fonts with six different sizes. The CNN algorithm achieves the highest accuracy of 96.6% comparing to the other state-of-the-art machine learning methods. The higher classification accuracy of the CNN approach shows that this type of algorithm is ideal for the English language printed word recognition. The highest error rate after testing the system using English electronic prescribing written with all proposed font-types is 0.23% in Georgia font.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于卷积神经网络算法的多字体英文电子处方识别
摘要印刷字符识别是一种高效、自动的计算机信息输入方法,用于将印刷或手写图像转换为可编辑、可读的文本文件。本文旨在识别多字体、多尺寸的英语印刷词,以实现智能药房的目的。识别系统基于卷积神经网络(CNN)方法,其中行、单词和字符被分别校正,然后每个分离的字符被输入到CNN算法中用于识别目的。OpenCV开源库已用于预处理,可以准确高效地分割英文字符;识别方面,使用了带有TensorFlow后端的Keras库。训练和测试数据集被设计为包括23种不同字体和6种不同大小。与其他最先进的机器学习方法相比,CNN算法实现了96.6%的最高精度。CNN方法具有较高的分类精度,表明该算法是英语印刷字识别的理想算法。在使用所有拟议字体类型编写的英语电子处方测试系统后,乔治亚字体的最高错误率为0.23%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Bio-Algorithms and Med-Systems
Bio-Algorithms and Med-Systems MATHEMATICAL & COMPUTATIONAL BIOLOGY-
CiteScore
3.80
自引率
0.00%
发文量
3
期刊介绍: The journal Bio-Algorithms and Med-Systems (BAMS), edited by the Jagiellonian University Medical College, provides a forum for the exchange of information in the interdisciplinary fields of computational methods applied in medicine, presenting new algorithms and databases that allows the progress in collaborations between medicine, informatics, physics, and biochemistry. Projects linking specialists representing these disciplines are welcome to be published in this Journal. Articles in BAMS are published in English. Topics Bioinformatics Systems biology Telemedicine E-Learning in Medicine Patient''s electronic record Image processing Medical databases.
期刊最新文献
Propagation of electrical signals by fungi Light induced spiking of proteinoids Transfer Functions of Proteinoid Microspheres Application of quantum entanglement induced polarization for dual-positron and prompt gamma imaging. Transcriptomic data analysis of melanocytes and melanoma cell lines of LAT transporter genes for precise medicine
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1