ImageVeriBypasser: An image verification code recognition approach based on Convolutional Neural Network

IF 3 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Expert Systems Pub Date : 2024-06-25 DOI:10.1111/exsy.13658

Tong Ji, Yuxin Luo, Yifeng Lin, Yuer Yang, Qian Zheng, Siwei Lian, Junjie Li

{"title":"ImageVeriBypasser: An image verification code recognition approach based on Convolutional Neural Network","authors":"Tong Ji, Yuxin Luo, Yifeng Lin, Yuer Yang, Qian Zheng, Siwei Lian, Junjie Li","doi":"10.1111/exsy.13658","DOIUrl":null,"url":null,"abstract":"<p>The recent period has witnessed automated crawlers designed to automatically crack passwords, which greatly risks various aspects of our lives. To prevent passwords from being cracked, image verification codes have been implemented to accomplish the human–machine verification. It is important to note, however, that the most widely-used image verification codes, especially the visual reasoning Completely Automated Public Turing tests to tell Computers and Humans Apart (CAPTCHAs), are still susceptible to attacks by artificial intelligence. Taking the visual reasoning CAPTCHAs representing the image verification codes, this study introduces an enhanced approach for generating image verification codes and proposes an improved Convolutional Neural Network (CNN)-based recognition system. After we add a fully connected layer and briefly solve the edge of stability issue, the accuracy of the improved CNN model can smoothly approach 98.40% within 50 epochs on the image verification codes with four digits using a large initial learning rate of 0.01. Compared with the baseline model, it is approximately 37.82% better in accuracy without obvious curve oscillation. The improved CNN model can also smoothly reach the accuracy of 99.00% within 7500 epochs on the image verification codes with six characters, including digits, upper-case alphabets, lower-case alphabets, and symbols. A detailed comparison between our proposed approach and the baseline one is presented. The relationship between the time consumption and the length of the seeds is compared theoretically. Subsequently, we figure out the threat assignments on the visual reasoning CAPTCHAs with different lengths based on four machine learning models. Based on the threat assignments, the Kaplan-Meier (KM) curves are computed.</p>","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"41 10","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/exsy.13658","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/exsy.13658","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

The recent period has witnessed automated crawlers designed to automatically crack passwords, which greatly risks various aspects of our lives. To prevent passwords from being cracked, image verification codes have been implemented to accomplish the human–machine verification. It is important to note, however, that the most widely-used image verification codes, especially the visual reasoning Completely Automated Public Turing tests to tell Computers and Humans Apart (CAPTCHAs), are still susceptible to attacks by artificial intelligence. Taking the visual reasoning CAPTCHAs representing the image verification codes, this study introduces an enhanced approach for generating image verification codes and proposes an improved Convolutional Neural Network (CNN)-based recognition system. After we add a fully connected layer and briefly solve the edge of stability issue, the accuracy of the improved CNN model can smoothly approach 98.40% within 50 epochs on the image verification codes with four digits using a large initial learning rate of 0.01. Compared with the baseline model, it is approximately 37.82% better in accuracy without obvious curve oscillation. The improved CNN model can also smoothly reach the accuracy of 99.00% within 7500 epochs on the image verification codes with six characters, including digits, upper-case alphabets, lower-case alphabets, and symbols. A detailed comparison between our proposed approach and the baseline one is presented. The relationship between the time consumption and the length of the seeds is compared theoretically. Subsequently, we figure out the threat assignments on the visual reasoning CAPTCHAs with different lengths based on four machine learning models. Based on the threat assignments, the Kaplan-Meier (KM) curves are computed.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

ImageVeriBypasser：基于卷积神经网络的图像验证码识别方法

最近一段时期，自动爬网程序被设计用来自动破解密码，这给我们生活的各个方面带来了极大的风险。为了防止密码被破解，人们开始使用图像验证码来完成人机对话。但值得注意的是，目前使用最广泛的图像验证码，尤其是视觉推理的 "完全自动区分计算机和人类的公共图灵测试（CAPTCHAs）"，仍然容易受到人工智能的攻击。本研究以视觉推理验证码为代表，介绍了一种生成图像验证码的增强方法，并提出了一种基于卷积神经网络（CNN）的改进型识别系统。在增加了一个全连接层并简单解决了稳定性边缘问题后，改进的 CNN 模型在使用 0.01 的大初始学习率时，在 50 个历元内对四位数字的图像验证码的准确率可顺利接近 98.40%。与基线模型相比，其准确率提高了约 37.82%，且没有明显的曲线振荡。改进后的 CNN 模型还能在 7500 个 epochs 内对包含数字、大写字母、小写字母和符号在内的六个字符的图像验证码顺利达到 99.00% 的准确率。报告详细比较了我们提出的方法和基线方法。我们从理论上比较了耗时与种子长度之间的关系。随后，我们根据四种机器学习模型计算出了不同长度的视觉推理验证码的威胁分配。根据威胁分配，计算出 Kaplan-Meier (KM) 曲线。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Expert Systems 工程技术-计算机：理论方法

CiteScore

7.40

自引率

6.10%

发文量

266

审稿时长

24 months

期刊介绍： Expert Systems: The Journal of Knowledge Engineering publishes papers dealing with all aspects of knowledge engineering, including individual methods and techniques in knowledge acquisition and representation, and their application in the construction of systems – including expert systems – based thereon. Detailed scientific evaluation is an essential part of any paper. As well as traditional application areas, such as Software and Requirements Engineering, Human-Computer Interaction, and Artificial Intelligence, we are aiming at the new and growing markets for these technologies, such as Business, Economy, Market Research, and Medical and Health Care. The shift towards this new focus will be marked by a series of special issues covering hot and emergent topics.

期刊最新文献

Advancing Disability Healthcare Solutions Through Privacy-Preserving Federated Learning With Theme Framework Predictive Analysis of Global Terrorist Attacks Using Lexical Patterns Across Multiple Datasets Issue Information Issue Information RETRACTION: Optimization Using Internet of Agent Based Stacked Sparse Autoencoder Model for Heart Disease Prediction