Carlos Cuenca-Enrique, Laura Del-Río-Carazo, Emiliano Acquila-Natale, Camilo Macarron
{"title":"Semi-automatic document processing and text-recognition for digital transformation of low-resources organizations","authors":"Carlos Cuenca-Enrique, Laura Del-Río-Carazo, Emiliano Acquila-Natale, Camilo Macarron","doi":"10.23919/cisti54924.2022.9820135","DOIUrl":null,"url":null,"abstract":"The digital transformation of organizations has become critical for their survival. Smaller organizations, and especially those in developing countries, are facing bigger challenges in this process due to limited resources and lack of digital skills. One of the keys of this transformation is the digitalization of information, where optical character recognition (OCR) plays an essential role. This research analyzes current OCR tools and systems for document digitalization in low-resources organizations and presents a semi-automatic document processing and text-recognition solution that addresses their needs. The study then presents an experimental design to assess the efficiency of the tool. The results support the adequacy of using these tools, compared to current manual digitalization processes.","PeriodicalId":187896,"journal":{"name":"2022 17th Iberian Conference on Information Systems and Technologies (CISTI)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 17th Iberian Conference on Information Systems and Technologies (CISTI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/cisti54924.2022.9820135","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The digital transformation of organizations has become critical for their survival. Smaller organizations, and especially those in developing countries, are facing bigger challenges in this process due to limited resources and lack of digital skills. One of the keys of this transformation is the digitalization of information, where optical character recognition (OCR) plays an essential role. This research analyzes current OCR tools and systems for document digitalization in low-resources organizations and presents a semi-automatic document processing and text-recognition solution that addresses their needs. The study then presents an experimental design to assess the efficiency of the tool. The results support the adequacy of using these tools, compared to current manual digitalization processes.