{"title":"脚本独立文本预处理和分割的OCR","authors":"Archana S. Sawant, D. Chougule","doi":"10.1109/EESCO.2015.7253643","DOIUrl":null,"url":null,"abstract":"Optical Character Recognition (OCR) systems have been numerously developed for the recognition of printed script in many languages. Multiple approaches to pre-processing and segmentation exist for various scripts where OCR accuracy mainly depends on the text pre-processing and segmentation algorithm being used for the document. When the document is scanned it can be put in any arbitrary angle which would appear in the image as skew angle. Our experimental results proposed in the paper assures the superior algorithm for correction of skew angle of the text document. Projection Profile based methods used makes segmentation easy to separate the text in document image into lines, words and characters independent of the Language in the Text.","PeriodicalId":305584,"journal":{"name":"2015 International Conference on Electrical, Electronics, Signals, Communication and Optimization (EESCO)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Script independent text pre-processing and segmentation for OCR\",\"authors\":\"Archana S. Sawant, D. Chougule\",\"doi\":\"10.1109/EESCO.2015.7253643\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Optical Character Recognition (OCR) systems have been numerously developed for the recognition of printed script in many languages. Multiple approaches to pre-processing and segmentation exist for various scripts where OCR accuracy mainly depends on the text pre-processing and segmentation algorithm being used for the document. When the document is scanned it can be put in any arbitrary angle which would appear in the image as skew angle. Our experimental results proposed in the paper assures the superior algorithm for correction of skew angle of the text document. Projection Profile based methods used makes segmentation easy to separate the text in document image into lines, words and characters independent of the Language in the Text.\",\"PeriodicalId\":305584,\"journal\":{\"name\":\"2015 International Conference on Electrical, Electronics, Signals, Communication and Optimization (EESCO)\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Conference on Electrical, Electronics, Signals, Communication and Optimization (EESCO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EESCO.2015.7253643\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Electrical, Electronics, Signals, Communication and Optimization (EESCO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EESCO.2015.7253643","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Script independent text pre-processing and segmentation for OCR
Optical Character Recognition (OCR) systems have been numerously developed for the recognition of printed script in many languages. Multiple approaches to pre-processing and segmentation exist for various scripts where OCR accuracy mainly depends on the text pre-processing and segmentation algorithm being used for the document. When the document is scanned it can be put in any arbitrary angle which would appear in the image as skew angle. Our experimental results proposed in the paper assures the superior algorithm for correction of skew angle of the text document. Projection Profile based methods used makes segmentation easy to separate the text in document image into lines, words and characters independent of the Language in the Text.