A custom-built deep learning approach for text extraction from identity card images

Geerish Suddul, Jean Fabrice Laurent Seguin
{"title":"A custom-built deep learning approach for text extraction from identity card images","authors":"Geerish Suddul, Jean Fabrice Laurent Seguin","doi":"10.11591/ijict.v13i1.pp34-41","DOIUrl":null,"url":null,"abstract":"Information found on an identity card is needed for different essential tasks and manually extracting this information is time consuming, resource exhaustive and may be prone to human error. In this study, an optical character recognition (OCR) approach using deep learning techniques is proposed to automatically extract text related information from the image of an identity card in view of developing an automated client onboarding system. The OCR problem is divided into two main parts. Firstly, a custom-built image segmentation model, based on the U-net architecture, is used to detect the location of the text to be extracted. Secondly, using the location of the identified text fields, a (CRNN) based on long short-term memory (LSTM) cells is trained to recognise the characters and build words. Experimental results, based on the national identity card of the Republic of Mauritius, demonstrate that our approach achieves higher accuracy compared to other studies. Our text detection module has an intersection over union (IOU) measure of 0.70 with a pixel accuracy of 98% for text detection and the text recognition module achieved a mean word recognition accuracy of around 97% on main fields of the identity card.","PeriodicalId":245958,"journal":{"name":"International Journal of Informatics and Communication Technology (IJ-ICT)","volume":"40 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Informatics and Communication Technology (IJ-ICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11591/ijict.v13i1.pp34-41","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Information found on an identity card is needed for different essential tasks and manually extracting this information is time consuming, resource exhaustive and may be prone to human error. In this study, an optical character recognition (OCR) approach using deep learning techniques is proposed to automatically extract text related information from the image of an identity card in view of developing an automated client onboarding system. The OCR problem is divided into two main parts. Firstly, a custom-built image segmentation model, based on the U-net architecture, is used to detect the location of the text to be extracted. Secondly, using the location of the identified text fields, a (CRNN) based on long short-term memory (LSTM) cells is trained to recognise the characters and build words. Experimental results, based on the national identity card of the Republic of Mauritius, demonstrate that our approach achieves higher accuracy compared to other studies. Our text detection module has an intersection over union (IOU) measure of 0.70 with a pixel accuracy of 98% for text detection and the text recognition module achieved a mean word recognition accuracy of around 97% on main fields of the identity card.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
从身份证图像中提取文本的定制深度学习方法
不同的基本任务都需要身份证上的信息,而手动提取这些信息既费时又耗费资源,还容易出现人为错误。本研究提出了一种使用深度学习技术的光学字符识别(OCR)方法,用于从身份证图像中自动提取与文本相关的信息,以开发自动客户登录系统。光学字符识别问题分为两个主要部分。首先,使用基于 U-net 架构的定制图像分割模型来检测待提取文本的位置。其次,利用识别出的文本字段的位置,训练一个基于长短期记忆(LSTM)单元的(CRNN)来识别字符和造词。基于毛里求斯共和国国民身份证的实验结果表明,与其他研究相比,我们的方法实现了更高的准确性。我们的文本检测模块的 "交集大于联合"(IOU)测量值为 0.70,文本检测的像素准确率为 98%,文本识别模块对身份证主要字段的平均单词识别准确率约为 97%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Review-based analysis of clustering approaches in a recommendation system Adaptive resource allocation in NOMA-enabled backscatter communications systems Navigating the cyber forensics landscape a review of recent innovations ChatGPT's effect on the job market: how automation affects employment in sectors using ChatGPT for customer service Predicting anomalies in computer networks using autoencoder-based representation learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1