基于k近邻的光学字符识别

ComTech Pub Date : 2016-03-01 DOI:10.21512/COMTECH.V7I1.2223

V. Ong, Derwin Suhartono

{"title":"基于k近邻的光学字符识别","authors":"V. Ong, Derwin Suhartono","doi":"10.21512/COMTECH.V7I1.2223","DOIUrl":null,"url":null,"abstract":"The growth in computer vision technology has aided society with various kinds of tasks. One of these tasks is the ability of recognizing text contained in an image, or usually referred to as Optical Character Recognition (OCR). There are many kinds of algorithms that can be implemented into an OCR. The K-Nearest Neighbor is one such algorithm. This research aims to find out the process behind the OCR mechanism by using K-Nearest Neighbor algorithm; one of the most influential machine learning algorithms. It also aims to find out how precise the algorithm is in an OCR program. To do that, a simple OCR program to classify alphabets of capital letters is made to produce and compare real results. The result of this research yielded a maximum of 76.9% accuracy with 200 training samples per alphabet. A set of reasons are also given as to why the program is able to reach said level of accuracy.","PeriodicalId":31095,"journal":{"name":"ComTech","volume":"7 1","pages":"53-65"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Using K-Nearest Neighbor in Optical Character Recognition\",\"authors\":\"V. Ong, Derwin Suhartono\",\"doi\":\"10.21512/COMTECH.V7I1.2223\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The growth in computer vision technology has aided society with various kinds of tasks. One of these tasks is the ability of recognizing text contained in an image, or usually referred to as Optical Character Recognition (OCR). There are many kinds of algorithms that can be implemented into an OCR. The K-Nearest Neighbor is one such algorithm. This research aims to find out the process behind the OCR mechanism by using K-Nearest Neighbor algorithm; one of the most influential machine learning algorithms. It also aims to find out how precise the algorithm is in an OCR program. To do that, a simple OCR program to classify alphabets of capital letters is made to produce and compare real results. The result of this research yielded a maximum of 76.9% accuracy with 200 training samples per alphabet. A set of reasons are also given as to why the program is able to reach said level of accuracy.\",\"PeriodicalId\":31095,\"journal\":{\"name\":\"ComTech\",\"volume\":\"7 1\",\"pages\":\"53-65\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ComTech\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21512/COMTECH.V7I1.2223\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ComTech","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21512/COMTECH.V7I1.2223","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

摘要

计算机视觉技术的发展帮助社会完成了各种各样的任务。其中一项任务是识别图像中包含的文本的能力，或通常称为光学字符识别(OCR)。有许多种算法可以实现到OCR中。k近邻算法就是这样一种算法。本研究旨在通过k -最近邻算法找出OCR机制背后的过程;最具影响力的机器学习算法之一。它还旨在找出该算法在OCR程序中的精确度。为此，编写了一个简单的OCR程序来对大写字母的字母进行分类，以产生和比较实际结果。本研究的结果是，在每个字母表200个训练样本的情况下，准确率最高为76.9%。一组原因也给出了为什么该程序能够达到上述水平的准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Using K-Nearest Neighbor in Optical Character Recognition

The growth in computer vision technology has aided society with various kinds of tasks. One of these tasks is the ability of recognizing text contained in an image, or usually referred to as Optical Character Recognition (OCR). There are many kinds of algorithms that can be implemented into an OCR. The K-Nearest Neighbor is one such algorithm. This research aims to find out the process behind the OCR mechanism by using K-Nearest Neighbor algorithm; one of the most influential machine learning algorithms. It also aims to find out how precise the algorithm is in an OCR program. To do that, a simple OCR program to classify alphabets of capital letters is made to produce and compare real results. The result of this research yielded a maximum of 76.9% accuracy with 200 training samples per alphabet. A set of reasons are also given as to why the program is able to reach said level of accuracy.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

ComTech

自引率

0.00%

发文量

审稿时长

16 weeks