{"title":"基于分词驱动和两级DTW的离线手写维吾尔语词识别","authors":"Xu Yamei, Xu Jili","doi":"10.1109/CCET48361.2019.8989253","DOIUrl":null,"url":null,"abstract":"Offline handwritten Uighur scripts is cursive and have a large vocabulary, which makes the word recognition more complicated. In this paper, we propose a segmentation-driven recognition algorithm for offline handwritten Uighur word based on grapheme analysis and two-level DTW (dynamic time wrapping). Firstly, a MSAC (main segmentation and additional clustering) algorithm is adopted to over-segment a handwritten Uighur word into two grapheme sequences. After then, a hierarchical hybrid Uighur character classifier is designed to enhance the character recognition accuracy. Finally, a novel maximum likelihood algorithm with two-level DTW is presented to select the best hypothesis of character sequence from grapheme merging and decide the word class. Experiment results show that the proposed algorithm can achieve high character segmentation accuracy and word recognition rate simultaneously.","PeriodicalId":231425,"journal":{"name":"2019 IEEE 2nd International Conference on Computer and Communication Engineering Technology (CCET)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Offline Handwritten Uighur Word Recognition Based on Segmentation-driven and Two-level DTW\",\"authors\":\"Xu Yamei, Xu Jili\",\"doi\":\"10.1109/CCET48361.2019.8989253\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Offline handwritten Uighur scripts is cursive and have a large vocabulary, which makes the word recognition more complicated. In this paper, we propose a segmentation-driven recognition algorithm for offline handwritten Uighur word based on grapheme analysis and two-level DTW (dynamic time wrapping). Firstly, a MSAC (main segmentation and additional clustering) algorithm is adopted to over-segment a handwritten Uighur word into two grapheme sequences. After then, a hierarchical hybrid Uighur character classifier is designed to enhance the character recognition accuracy. Finally, a novel maximum likelihood algorithm with two-level DTW is presented to select the best hypothesis of character sequence from grapheme merging and decide the word class. Experiment results show that the proposed algorithm can achieve high character segmentation accuracy and word recognition rate simultaneously.\",\"PeriodicalId\":231425,\"journal\":{\"name\":\"2019 IEEE 2nd International Conference on Computer and Communication Engineering Technology (CCET)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 2nd International Conference on Computer and Communication Engineering Technology (CCET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCET48361.2019.8989253\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 2nd International Conference on Computer and Communication Engineering Technology (CCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCET48361.2019.8989253","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Offline Handwritten Uighur Word Recognition Based on Segmentation-driven and Two-level DTW
Offline handwritten Uighur scripts is cursive and have a large vocabulary, which makes the word recognition more complicated. In this paper, we propose a segmentation-driven recognition algorithm for offline handwritten Uighur word based on grapheme analysis and two-level DTW (dynamic time wrapping). Firstly, a MSAC (main segmentation and additional clustering) algorithm is adopted to over-segment a handwritten Uighur word into two grapheme sequences. After then, a hierarchical hybrid Uighur character classifier is designed to enhance the character recognition accuracy. Finally, a novel maximum likelihood algorithm with two-level DTW is presented to select the best hypothesis of character sequence from grapheme merging and decide the word class. Experiment results show that the proposed algorithm can achieve high character segmentation accuracy and word recognition rate simultaneously.