Puja Romulus, Yan Maraden, Prima Dewi Purnamasari, A. A. P. Ratna
{"title":"基于k近邻原理的古代巴塔克字符光学识别实现分析","authors":"Puja Romulus, Yan Maraden, Prima Dewi Purnamasari, A. A. P. Ratna","doi":"10.1109/QIR.2015.7374893","DOIUrl":null,"url":null,"abstract":"This paper is intended to support the preservation of national cultural asset, particularly for ancient symbols. By using image processing principle, an automatic system that can be designed and implemented to translate ancient manuscript documents. The system is composed of several phases, from scanning, preprocessing, segmentation, feature extraction and classification. Sample images of the document are not scanned automatically, but manually produced as monochrome, black for the text and white for the background. These sample images are varied based on font size, rotation, and image size. The system is intended to be adaptable for various condition except for the color variation. The system is implemented as a MATLAB application program to convert an image that contains random Batak symbols into a series of Latin character representation of each word. The experiment results show that the system accuracy is ranged between 42% - 96% and the processing time is ranged from 1.9 - 34 seconds.","PeriodicalId":127270,"journal":{"name":"2015 International Conference on Quality in Research (QiR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"An analysis of optical character recognition implementation for ancient Batak characters using K-nearest neighbors principle\",\"authors\":\"Puja Romulus, Yan Maraden, Prima Dewi Purnamasari, A. A. P. Ratna\",\"doi\":\"10.1109/QIR.2015.7374893\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper is intended to support the preservation of national cultural asset, particularly for ancient symbols. By using image processing principle, an automatic system that can be designed and implemented to translate ancient manuscript documents. The system is composed of several phases, from scanning, preprocessing, segmentation, feature extraction and classification. Sample images of the document are not scanned automatically, but manually produced as monochrome, black for the text and white for the background. These sample images are varied based on font size, rotation, and image size. The system is intended to be adaptable for various condition except for the color variation. The system is implemented as a MATLAB application program to convert an image that contains random Batak symbols into a series of Latin character representation of each word. The experiment results show that the system accuracy is ranged between 42% - 96% and the processing time is ranged from 1.9 - 34 seconds.\",\"PeriodicalId\":127270,\"journal\":{\"name\":\"2015 International Conference on Quality in Research (QiR)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Conference on Quality in Research (QiR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/QIR.2015.7374893\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Quality in Research (QiR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/QIR.2015.7374893","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An analysis of optical character recognition implementation for ancient Batak characters using K-nearest neighbors principle
This paper is intended to support the preservation of national cultural asset, particularly for ancient symbols. By using image processing principle, an automatic system that can be designed and implemented to translate ancient manuscript documents. The system is composed of several phases, from scanning, preprocessing, segmentation, feature extraction and classification. Sample images of the document are not scanned automatically, but manually produced as monochrome, black for the text and white for the background. These sample images are varied based on font size, rotation, and image size. The system is intended to be adaptable for various condition except for the color variation. The system is implemented as a MATLAB application program to convert an image that contains random Batak symbols into a series of Latin character representation of each word. The experiment results show that the system accuracy is ranged between 42% - 96% and the processing time is ranged from 1.9 - 34 seconds.