{"title":"基于随机森林的视频文本自动识别系统","authors":"Y. Rachidi","doi":"10.33847/2686-8296.4.2_3","DOIUrl":null,"url":null,"abstract":"In this paper; we introduce a system of automatic recognition of Video Text Amazigh based on the Random Forest. After doing some pretreatments on the video and picture, the text is segmented into lines and then into characters. In the stage of characteristics extraction, we are representing the input data into the vector of primitives. These characteristics are linked to pixels’ densities and they are extracted on binary pictures. In the classification stage, we examine four classification methods with two different classifiers types namely the convolutional neural network (CNN) and the Random Forest method. We carried out the experiments with a database containing 3300 samples collected from different writers. The experimental results show that our proposed OCR system is very efficient and provides good recognition accuracy rate of handwriting characters images acquired via Video camera phone.","PeriodicalId":235278,"journal":{"name":"Journal of Digital Science","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"System of Automatic Recognition of Video Text Amazigh based on the Random Forest\",\"authors\":\"Y. Rachidi\",\"doi\":\"10.33847/2686-8296.4.2_3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper; we introduce a system of automatic recognition of Video Text Amazigh based on the Random Forest. After doing some pretreatments on the video and picture, the text is segmented into lines and then into characters. In the stage of characteristics extraction, we are representing the input data into the vector of primitives. These characteristics are linked to pixels’ densities and they are extracted on binary pictures. In the classification stage, we examine four classification methods with two different classifiers types namely the convolutional neural network (CNN) and the Random Forest method. We carried out the experiments with a database containing 3300 samples collected from different writers. The experimental results show that our proposed OCR system is very efficient and provides good recognition accuracy rate of handwriting characters images acquired via Video camera phone.\",\"PeriodicalId\":235278,\"journal\":{\"name\":\"Journal of Digital Science\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Digital Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.33847/2686-8296.4.2_3\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Digital Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33847/2686-8296.4.2_3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
System of Automatic Recognition of Video Text Amazigh based on the Random Forest
In this paper; we introduce a system of automatic recognition of Video Text Amazigh based on the Random Forest. After doing some pretreatments on the video and picture, the text is segmented into lines and then into characters. In the stage of characteristics extraction, we are representing the input data into the vector of primitives. These characteristics are linked to pixels’ densities and they are extracted on binary pictures. In the classification stage, we examine four classification methods with two different classifiers types namely the convolutional neural network (CNN) and the Random Forest method. We carried out the experiments with a database containing 3300 samples collected from different writers. The experimental results show that our proposed OCR system is very efficient and provides good recognition accuracy rate of handwriting characters images acquired via Video camera phone.