Vernon Estrada Bugayong, J. Villaverde, N. Linsangan
{"title":"Google Tesseract:使用机器视觉的HDD / SSD标签上的光学字符识别(OCR","authors":"Vernon Estrada Bugayong, J. Villaverde, N. Linsangan","doi":"10.1109/ICCAE55086.2022.9762440","DOIUrl":null,"url":null,"abstract":"This paper is designed to have an optical character recognition system capable of interpreting captured images of hard disk drive and solid-state drive labels with high accuracy. Manual checking of the disk capacity size and part number found on the labels is time consuming, more prone to errors and utilizes more manpower. Automating the inspection through optical character recognition using image pre-processing and machine vision contributes to an easier inspection process, better management of records and faster cycle time. The images captured using a vision camera went through different stages of image pre-processing via OpenCV-Python and recognition through Google Tesseract. Different categorical variables including exposure time and location of texts in a captured image were used to determine and improve the overall recognition accuracy. By improving the lighting condition through the addition of light sources, the developed OCR system was able to achieve a character recognition accuracy of 99.375%.","PeriodicalId":294641,"journal":{"name":"2022 14th International Conference on Computer and Automation Engineering (ICCAE)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Google Tesseract: Optical Character Recognition (OCR) on HDD / SSD Labels Using Machine Vision\",\"authors\":\"Vernon Estrada Bugayong, J. Villaverde, N. Linsangan\",\"doi\":\"10.1109/ICCAE55086.2022.9762440\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper is designed to have an optical character recognition system capable of interpreting captured images of hard disk drive and solid-state drive labels with high accuracy. Manual checking of the disk capacity size and part number found on the labels is time consuming, more prone to errors and utilizes more manpower. Automating the inspection through optical character recognition using image pre-processing and machine vision contributes to an easier inspection process, better management of records and faster cycle time. The images captured using a vision camera went through different stages of image pre-processing via OpenCV-Python and recognition through Google Tesseract. Different categorical variables including exposure time and location of texts in a captured image were used to determine and improve the overall recognition accuracy. By improving the lighting condition through the addition of light sources, the developed OCR system was able to achieve a character recognition accuracy of 99.375%.\",\"PeriodicalId\":294641,\"journal\":{\"name\":\"2022 14th International Conference on Computer and Automation Engineering (ICCAE)\",\"volume\":\"86 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 14th International Conference on Computer and Automation Engineering (ICCAE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCAE55086.2022.9762440\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Computer and Automation Engineering (ICCAE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCAE55086.2022.9762440","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Google Tesseract: Optical Character Recognition (OCR) on HDD / SSD Labels Using Machine Vision
This paper is designed to have an optical character recognition system capable of interpreting captured images of hard disk drive and solid-state drive labels with high accuracy. Manual checking of the disk capacity size and part number found on the labels is time consuming, more prone to errors and utilizes more manpower. Automating the inspection through optical character recognition using image pre-processing and machine vision contributes to an easier inspection process, better management of records and faster cycle time. The images captured using a vision camera went through different stages of image pre-processing via OpenCV-Python and recognition through Google Tesseract. Different categorical variables including exposure time and location of texts in a captured image were used to determine and improve the overall recognition accuracy. By improving the lighting condition through the addition of light sources, the developed OCR system was able to achieve a character recognition accuracy of 99.375%.