Shape analysis of Pashto script and creation of image database for OCR
Mehreen Wahab, Hassan Amin, F. Ahmed
2009 International Conference on Emerging Technologies, published 2009-12-11
DOI: https://doi.org/10.1109/ICET.2009.5353160
Citation count: 19
Abstract
Developing optical character recognition (OCR) for a cursive script such as Pashto requires detailed knowledge of the shape variation within the script. An image dataset is essential for training and testing OCR approaches. This paper outlines the main features of Pashto script and describes the development of an image dataset for an OCR system.