Novel Deep Learning Application: Recognizing Inconsistent Characters on Pharmaceutical Packaging

Jarmo Koponen, Keijo Haataja, Pekka J. Toivanen
{"title":"Novel Deep Learning Application: Recognizing Inconsistent Characters on Pharmaceutical Packaging","authors":"Jarmo Koponen, Keijo Haataja, Pekka J. Toivanen","doi":"10.12688/f1000research.131775.2","DOIUrl":null,"url":null,"abstract":"Background Machine vision faces significant challenges when applied to text recognition on cardboard packaging particularly due to multiple printing methods, irregular character shapes, and curved packaging surfaces. Methods This research introduces a novel deep learning application for recognizing binarized expiration date and batch code characters printed using multiple printing methods. The method, based on Region-based Convolutional Neural Networks (R-CNN), enables character recognition directly from in the images without the need for extracting handcrafted features. In detail, this approach performs character recognition by using the whole image as input, extracting and learning salient character features directly from the packaging surface images. Results The R-CNN model, with a precision of 91.1% and an F1 score of 80.9%, effectively recognizes manufacturing markings on pharmaceutical packages, with inconsistencies in the characters’ shapes. In a comparative experiment using the same dataset of images, the R-CNN model significantly outperformed Tesseract OCR, achieving much higher precision, recall, and F1 scores. Conclusions The results of this study reveal that the deep learning method outperforms the well-established optical character recognition method in recognizing text characters printed with different printing methods. Presented in this study, the deep learning method recognizes text characters with high precision. It is also suitable for recognizing text printed on curved surfaces, provided proper preprocessing is applied. The problem investigated in the study differs from previous research in the field, focusing on the recognition of texts printed with different printing methods. The research thus fills a gap in text recognition that existed in the research of the field. Furthermore, the study presents new ideas that will be utilized in our future research.","PeriodicalId":504605,"journal":{"name":"F1000Research","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"F1000Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12688/f1000research.131775.2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background Machine vision faces significant challenges when applied to text recognition on cardboard packaging particularly due to multiple printing methods, irregular character shapes, and curved packaging surfaces. Methods This research introduces a novel deep learning application for recognizing binarized expiration date and batch code characters printed using multiple printing methods. The method, based on Region-based Convolutional Neural Networks (R-CNN), enables character recognition directly from in the images without the need for extracting handcrafted features. In detail, this approach performs character recognition by using the whole image as input, extracting and learning salient character features directly from the packaging surface images. Results The R-CNN model, with a precision of 91.1% and an F1 score of 80.9%, effectively recognizes manufacturing markings on pharmaceutical packages, with inconsistencies in the characters’ shapes. In a comparative experiment using the same dataset of images, the R-CNN model significantly outperformed Tesseract OCR, achieving much higher precision, recall, and F1 scores. Conclusions The results of this study reveal that the deep learning method outperforms the well-established optical character recognition method in recognizing text characters printed with different printing methods. Presented in this study, the deep learning method recognizes text characters with high precision. It is also suitable for recognizing text printed on curved surfaces, provided proper preprocessing is applied. The problem investigated in the study differs from previous research in the field, focusing on the recognition of texts printed with different printing methods. The research thus fills a gap in text recognition that existed in the research of the field. Furthermore, the study presents new ideas that will be utilized in our future research.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
新颖的深度学习应用:识别药品包装上不一致的字符
背景 机器视觉在应用于纸板包装上的文本识别时面临着巨大的挑战,特别是由于多种印刷方法、不规则的字符形状和弯曲的包装表面。方法 本研究介绍了一种新颖的深度学习应用,用于识别使用多种印刷方法印刷的二进制到期日期和批次代码字符。该方法基于基于区域的卷积神经网络(R-CNN),可直接从图像中识别字符,而无需提取手工制作的特征。具体来说,这种方法将整个图像作为输入,直接从包装表面图像中提取和学习突出的字符特征,从而进行字符识别。结果 R-CNN 模型的精确度为 91.1%,F1 得分为 80.9%,能有效识别药品包装上的生产标记,但字符形状不一致。在使用相同图像数据集进行的对比实验中,R-CNN 模型的表现明显优于 Tesseract OCR,获得了更高的精确度、召回率和 F1 分数。结论 本研究结果表明,深度学习方法在识别用不同印刷方法印刷的文本字符方面优于成熟的光学字符识别方法。本研究提出的深度学习方法能识别出高精度的文本字符。只要进行适当的预处理,该方法还适用于识别印刷在曲面上的文字。本研究调查的问题与该领域以往的研究不同,侧重于识别以不同印刷方法印刷的文本。因此,该研究填补了文本识别领域研究的空白。此外,这项研究还提出了一些新的想法,我们将在今后的研究中加以利用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Publisher preferences for a journal transparency tool: A modified three-round Delphi study The Nexus of Climate Change, Food Insecurity, and Conflict in Somalia: A Comprehensive Analysis of Multifaceted Challenges and Resilience Strategies StackbarExtended: a user-friendly stacked bar-plot representation incorporating phylogenetic information and microbiota differential abundance analysis Trends of machine learning for dental caries research in Southeast Asia: insights from a bibliometric analysis Case Report: Localized retinal ischemia revealing an antiphospholipids syndrome: A case report and review of the literature
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1