{"title":"基于自然卷积和多尺度特征解码器的历史文献文本二值化","authors":"Hanif Rasyidi, S. Khan","doi":"10.1109/DICTA47822.2019.8946108","DOIUrl":null,"url":null,"abstract":"This paper presents a segmentation-based binarization model to extract text information from the historical document using convolutional neural networks. The proposed method uses atrous convolution feature extraction to learn useful text pattern from the document without making a significant reduction on the spatial size of the image. The model then combines the extracted feature using a multi-scale decoder to construct a binary image that contains only text information from the document. We train our model using a series of DIBCO competition datasets and compare the results with the existing text binarization methods as well as a state-of-the-art object segmentation model. The experiment results on the H-DIBCO 2016 dataset show that our method has an excellent performance on the pseudo F-Score metric that surpasses the result of various existing methods.","PeriodicalId":6696,"journal":{"name":"2019 Digital Image Computing: Techniques and Applications (DICTA)","volume":"163 1","pages":"1-8"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Historical Document Text Binarization using Atrous Convolution and Multi-Scale Feature Decoder\",\"authors\":\"Hanif Rasyidi, S. Khan\",\"doi\":\"10.1109/DICTA47822.2019.8946108\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a segmentation-based binarization model to extract text information from the historical document using convolutional neural networks. The proposed method uses atrous convolution feature extraction to learn useful text pattern from the document without making a significant reduction on the spatial size of the image. The model then combines the extracted feature using a multi-scale decoder to construct a binary image that contains only text information from the document. We train our model using a series of DIBCO competition datasets and compare the results with the existing text binarization methods as well as a state-of-the-art object segmentation model. The experiment results on the H-DIBCO 2016 dataset show that our method has an excellent performance on the pseudo F-Score metric that surpasses the result of various existing methods.\",\"PeriodicalId\":6696,\"journal\":{\"name\":\"2019 Digital Image Computing: Techniques and Applications (DICTA)\",\"volume\":\"163 1\",\"pages\":\"1-8\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 Digital Image Computing: Techniques and Applications (DICTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DICTA47822.2019.8946108\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Digital Image Computing: Techniques and Applications (DICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DICTA47822.2019.8946108","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Historical Document Text Binarization using Atrous Convolution and Multi-Scale Feature Decoder
This paper presents a segmentation-based binarization model to extract text information from the historical document using convolutional neural networks. The proposed method uses atrous convolution feature extraction to learn useful text pattern from the document without making a significant reduction on the spatial size of the image. The model then combines the extracted feature using a multi-scale decoder to construct a binary image that contains only text information from the document. We train our model using a series of DIBCO competition datasets and compare the results with the existing text binarization methods as well as a state-of-the-art object segmentation model. The experiment results on the H-DIBCO 2016 dataset show that our method has an excellent performance on the pseudo F-Score metric that surpasses the result of various existing methods.