{"title":"基于OPENCV的电子教科书数字化","authors":"Zhi-Ming Deng, Minyong Shi, Chunfang Li","doi":"10.1109/ICMLC51923.2020.9469536","DOIUrl":null,"url":null,"abstract":"The traditional digitization method of electronic textbooks is limited by text data and illustration layout, and the data processing effect is poor. In order to adapt to the complex and changeable data formats, this paper proposes an adaptive data partitioning technique. We divide all the texts and illustrations in the textbooks into independent data blocks, locate and cut them, and use OCR technology to identify the information of each area to make the processing goals more clear. Experiments were conducted on the junior middle school history textbooks in terms of data recognition rate. The experimental results show that the method proposed in this paper has a good effect on the digitalization of electronic textbooks.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"233 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Digitalization of Electronic Textbook Based on OPENCV\",\"authors\":\"Zhi-Ming Deng, Minyong Shi, Chunfang Li\",\"doi\":\"10.1109/ICMLC51923.2020.9469536\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The traditional digitization method of electronic textbooks is limited by text data and illustration layout, and the data processing effect is poor. In order to adapt to the complex and changeable data formats, this paper proposes an adaptive data partitioning technique. We divide all the texts and illustrations in the textbooks into independent data blocks, locate and cut them, and use OCR technology to identify the information of each area to make the processing goals more clear. Experiments were conducted on the junior middle school history textbooks in terms of data recognition rate. The experimental results show that the method proposed in this paper has a good effect on the digitalization of electronic textbooks.\",\"PeriodicalId\":170815,\"journal\":{\"name\":\"2020 International Conference on Machine Learning and Cybernetics (ICMLC)\",\"volume\":\"233 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Machine Learning and Cybernetics (ICMLC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLC51923.2020.9469536\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLC51923.2020.9469536","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Digitalization of Electronic Textbook Based on OPENCV
The traditional digitization method of electronic textbooks is limited by text data and illustration layout, and the data processing effect is poor. In order to adapt to the complex and changeable data formats, this paper proposes an adaptive data partitioning technique. We divide all the texts and illustrations in the textbooks into independent data blocks, locate and cut them, and use OCR technology to identify the information of each area to make the processing goals more clear. Experiments were conducted on the junior middle school history textbooks in terms of data recognition rate. The experimental results show that the method proposed in this paper has a good effect on the digitalization of electronic textbooks.