Methodology for eliminating plain regions from captured images
Shiva Shankar Reddy, Vuddagiri MNSSVKR. Gupta, Lokavarapu V. Srinivas, Chigurupati Ravi Swaroop
IAES International Journal of Artificial Intelligence (IJ-AI), Vol. 13, No. 2, pp. 1358-1370, June 2024
DOI: 10.11591/ijai.v13.i2.pp1358-1370
Abstract
Finding relevant content and extracting information from images is highly significant, yet it can be challenging because of variations in the textual content, such as typefaces, font sizes, line orientations, complex image backgrounds, and non-uniform illumination. Despite these challenges, extracting text from captured images remains very important, and proficient text-recognition capabilities are needed to overcome these issues. Although several optical character recognition (OCR) techniques are available, the problem has yet to be fully resolved. Captured images containing text are a rich source of information that should be presented so that viewers can make informed decisions. Extracting text from an image is therefore a complicated process: the text may be of poor quality, appear in a variety of fonts and styles, and occasionally sit on a complicated backdrop, among other difficulties. Several approaches have been tried, but finding a robust solution remains challenging. The maximally stable extremal regions (MSER) approach is developed to identify text regions in an image, and MSER is used to eliminate the plain regions outside the text and non-text areas using geometric features and stroke-width-variation properties.
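
The abstract gives no implementation details, but a minimal sketch of the kind of MSER-plus-geometric-filtering pipeline it describes can be assembled with OpenCV. The function name detect_text_candidates, the input file scene.jpg, and every threshold below are illustrative assumptions rather than the authors' settings; stroke-width filtering is omitted for brevity.

```python
# Minimal sketch: MSER candidate detection followed by a simple geometric
# filter. Thresholds are illustrative assumptions, not the paper's values.
import cv2


def detect_text_candidates(image_path,
                           max_aspect_ratio=3.0,   # assumed elongation limit
                           min_extent=0.2,         # assumed minimum fill ratio
                           max_extent=0.9):        # assumed maximum fill ratio
    """Detect MSER regions and drop those whose geometry looks non-textual."""
    img = cv2.imread(image_path)
    if img is None:
        raise FileNotFoundError(image_path)
    gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

    mser = cv2.MSER_create()
    regions, bboxes = mser.detectRegions(gray)

    text_boxes = []
    for points, (x, y, w, h) in zip(regions, bboxes):
        aspect_ratio = max(w, h) / max(min(w, h), 1)
        extent = len(points) / float(w * h)  # fraction of the box covered
        # Plain/background blobs tend to be very elongated or nearly empty;
        # keep only regions whose geometry is plausibly character-like.
        if aspect_ratio <= max_aspect_ratio and min_extent <= extent <= max_extent:
            text_boxes.append((x, y, w, h))
    return text_boxes


if __name__ == "__main__":
    boxes = detect_text_candidates("scene.jpg")  # hypothetical input image
    print(f"{len(boxes)} candidate text regions kept")
```

A fuller version along the lines the abstract suggests would add a stroke-width-variation check (e.g. via a distance transform over each region) before passing the surviving regions to an OCR engine.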