Nom Document Background Removal Using Generative Adversarial Network
Authors: Loc Ho, S. Tran, Dinh Dien
Venue: 2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)
Publication date: 2021-09-13
DOI: 10.1109/ICSIPA52582.2021.9576764
URL: https://doi.org/10.1109/ICSIPA52582.2021.9576764
Citations: 0
Abstract
In this research, we present a new technique to improve the performance of a Nom-character recognition system. Nom-character recognition is a challenging problem in pattern recognition. In particular, the characters in historical documents are often blurred or distorted, and the pages may also contain ink strokes and symbols added by readers. A Generative Adversarial Network (GAN) is an advanced type of deep neural network used to generate artificial images of objects [28]. Many GAN variants have been developed recently to make the learning process more stable and the generated output more realistic, and to make better use of the features extracted from the data. We use a recent GAN variant to extract characters from images with complex backgrounds and varying brightness. The task is to recover clean text images from complex, noisy background sources. We test the approach on the Nom Dataset, which is characterized by multiple forms of noise. The results suggest that this approach can help improve Nom-character recognition systems.
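The abstract does not say which GAN variant is used or how it is trained. As a rough illustration only, the sketch below assumes a paired, conditional setup in the style of image-to-image translation: a generator G maps a noisy page image to a clean one, and a discriminator D scores how plausible a clean image looks. All function names, the toy pixel values, and the `l1_weight` setting are hypothetical and are not taken from the paper.

```python
import math

EPS = 1e-12  # keeps log() away from zero


def bce(prob, target):
    """Binary cross-entropy for one discriminator output in (0, 1)."""
    prob = min(max(prob, EPS), 1.0 - EPS)
    return -(target * math.log(prob) + (1.0 - target) * math.log(1.0 - prob))


def l1_distance(a, b):
    """Mean absolute difference between two equally sized flat pixel lists."""
    if len(a) != len(b):
        raise ValueError("images must have the same number of pixels")
    return sum(abs(p - q) for p, q in zip(a, b)) / len(a)


def discriminator_loss(d_real, d_fake):
    """D is pushed to score the real clean image as 1 and G's output as 0."""
    return bce(d_real, 1.0) + bce(d_fake, 0.0)


def generator_loss(d_fake, generated, clean, l1_weight=100.0):
    """G is pushed to fool D while staying close to the clean target.

    l1_weight is an assumed setting, not a value reported by the authors.
    """
    return bce(d_fake, 1.0) + l1_weight * l1_distance(generated, clean)


if __name__ == "__main__":
    clean = [1.0, 1.0, 0.0, 1.0]      # toy 2x2 clean glyph, flattened
    generated = [0.9, 1.0, 0.1, 0.8]  # toy generator output for the noisy input
    print("D loss:", discriminator_loss(d_real=0.9, d_fake=0.2))
    print("G loss:", generator_loss(d_fake=0.2, generated=generated, clean=clean))
```

In a real system D and G would be neural networks and these losses would be averaged over batches of images; the scalar version here only shows how the adversarial term and the reconstruction term pull in the same direction when the goal is a clean text image.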