SmartIdOCR: Automatic Detection and Recognition of Identity card number using Deep Networks

M. Gupta, Ronak Shah, Jitesh Rathod, Ajai Kumar
{"title":"SmartIdOCR: Automatic Detection and Recognition of Identity card number using Deep Networks","authors":"M. Gupta, Ronak Shah, Jitesh Rathod, Ajai Kumar","doi":"10.1109/ICIIP53038.2021.9702703","DOIUrl":null,"url":null,"abstract":"Identity authentication is much needed and required in this digital age where the information can be utilized in many areas like banking, finance, insurance, education, etc. The long time in the manual authentication process is tiresome for both sides due to the exchange of data. The challenge lies in verification and information extraction from the ID card during the authentication process. There is an AI-based solution needed to reduce the authentication time. This paper aims to solve this problem by doing real-time authentication of identity cards like PAN and UIDAI using AI techniques with good accuracy. Real-time authentication is done by text detection and text recognition. The text detection is done using a differentiable binarization algorithm. We do not have a real annotated dataset for an ID number. We generated approximately 90000 identity number images synthetically with noise and blur using two fonts. This dataset is divided into training, validation, and testing sets. We present a neural encoder-decoder model with attention for converting ID number line images into editable text. Our method is evaluated based on the text output of the line image. An attention-based approach can tackle this problem in a better way in comparison to other neural techniques using CTC-based models. This paper describes the usage of OpenNMT architecture for recognition due to the flexibility of hyperparameter tuning. We evaluated the text recognition performance on scanned as well as the camera-captured identity card number images. We also compared the current recognition performance of OpenNMT with Tesseract (LSTM) on the same testbed containing 36000 images containing ID numbers only. The proposed approach outperformed the Tesseract in ID number recognition.","PeriodicalId":431272,"journal":{"name":"2021 Sixth International Conference on Image Information Processing (ICIIP)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Sixth International Conference on Image Information Processing (ICIIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIIP53038.2021.9702703","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Identity authentication is much needed and required in this digital age where the information can be utilized in many areas like banking, finance, insurance, education, etc. The long time in the manual authentication process is tiresome for both sides due to the exchange of data. The challenge lies in verification and information extraction from the ID card during the authentication process. There is an AI-based solution needed to reduce the authentication time. This paper aims to solve this problem by doing real-time authentication of identity cards like PAN and UIDAI using AI techniques with good accuracy. Real-time authentication is done by text detection and text recognition. The text detection is done using a differentiable binarization algorithm. We do not have a real annotated dataset for an ID number. We generated approximately 90000 identity number images synthetically with noise and blur using two fonts. This dataset is divided into training, validation, and testing sets. We present a neural encoder-decoder model with attention for converting ID number line images into editable text. Our method is evaluated based on the text output of the line image. An attention-based approach can tackle this problem in a better way in comparison to other neural techniques using CTC-based models. This paper describes the usage of OpenNMT architecture for recognition due to the flexibility of hyperparameter tuning. We evaluated the text recognition performance on scanned as well as the camera-captured identity card number images. We also compared the current recognition performance of OpenNMT with Tesseract (LSTM) on the same testbed containing 36000 images containing ID numbers only. The proposed approach outperformed the Tesseract in ID number recognition.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
SmartIdOCR:使用深度网络自动检测和识别身份证号码
在这个数字时代,身份认证是非常必要的,信息可以在银行、金融、保险、教育等许多领域得到利用。由于数据的交换,手动认证过程的时间长,对双方来说都是令人厌烦的。其难点在于身份验证过程中的身份验证和信息提取。需要一种基于ai的解决方案来减少身份验证时间。本文旨在解决这一问题,利用人工智能技术对PAN、UIDAI等身份证进行实时认证,具有较好的准确性。实时认证是通过文本检测和文本识别来实现的。文本检测使用可微分二值化算法完成。我们没有一个真正带注释的ID号数据集。我们使用两种字体合成了大约90000张带有噪点和模糊的身份证号码图像。该数据集分为训练集、验证集和测试集。我们提出了一种神经编码器-解码器模型,用于将ID数轴图像转换为可编辑的文本。我们的方法是基于线条图像的文本输出来评估的。与使用基于ctc模型的其他神经技术相比,基于注意力的方法可以更好地解决这个问题。由于超参数调优的灵活性,本文描述了OpenNMT体系结构在识别中的应用。我们对扫描的文本识别性能以及相机捕获的身份证号码图像进行了评估。我们还比较了OpenNMT和Tesseract (LSTM)在同一试验台上的当前识别性能,该试验台包含36000张仅包含ID号的图像。该方法在识别身份证号码方面优于Tesseract。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Weather Station Using Raspberry Pi Prediction of Heart Disease using Machine Learning Techniques A Brief Review on Existing Techniques for Detecting Digital Image Forgery A Novel Approach for Excavating Communication Using Taxonomy and Outline Mechanisms A Novel Rough Set Based Image Denoising Algorithm
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1