Selly Oktaviani, C. A. Sari, Eko Hari Rachmawanto, De Rosal Ignatius Moses Setiadi
{"title":"Optical Character Recognition for Hangul Character using Artificial Neural Network","authors":"Selly Oktaviani, C. A. Sari, Eko Hari Rachmawanto, De Rosal Ignatius Moses Setiadi","doi":"10.1109/iSemantic50169.2020.9234215","DOIUrl":null,"url":null,"abstract":"Korean language is one of the languages that are becoming widely known in the world, along with the occupation of Korean music (K-POP). Hangul is a character used to write Korean, which is not like Latin, which is relatively more easily understood by the majority of people in this world. This research aims to analyze the performance of an Artificial Neural Network (ANN) in recognizing Hangul characters with a simplified optical character recognition (OCR) method. The OCR process is carried out by entering characters in a 15x13 tile area, then the characters that enter fully on the tile will be changed to value 1 while others become 0 values so that a binary image is generated. The next step is the character pattern crafting process towards the field. The results were recognized by ANN, in an experiment using four types of training data composition: testing, namely 50%: 50%, 60%: 40%, 70%: 30%, and 80%: 20%. The dataset used is 40 Hangul characters in which there are 10 sample data each, so in total there are 400 data. Based on testing, the highest accuracy is produced with a composition of 50%: 50% where the accuracy is 97%.","PeriodicalId":345558,"journal":{"name":"2020 International Seminar on Application for Technology of Information and Communication (iSemantic)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Seminar on Application for Technology of Information and Communication (iSemantic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iSemantic50169.2020.9234215","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Korean language is one of the languages that are becoming widely known in the world, along with the occupation of Korean music (K-POP). Hangul is a character used to write Korean, which is not like Latin, which is relatively more easily understood by the majority of people in this world. This research aims to analyze the performance of an Artificial Neural Network (ANN) in recognizing Hangul characters with a simplified optical character recognition (OCR) method. The OCR process is carried out by entering characters in a 15x13 tile area, then the characters that enter fully on the tile will be changed to value 1 while others become 0 values so that a binary image is generated. The next step is the character pattern crafting process towards the field. The results were recognized by ANN, in an experiment using four types of training data composition: testing, namely 50%: 50%, 60%: 40%, 70%: 30%, and 80%: 20%. The dataset used is 40 Hangul characters in which there are 10 sample data each, so in total there are 400 data. Based on testing, the highest accuracy is produced with a composition of 50%: 50% where the accuracy is 97%.