Hoai Viet Nguyen, Linh Doan Bao, Hoang Viet Trinh, Hoang Huy Phan, Ta Minh Thanh
{"title":"MC-OCR挑战2021:面向无约束移动捕获越南收据的文档理解","authors":"Hoai Viet Nguyen, Linh Doan Bao, Hoang Viet Trinh, Hoang Huy Phan, Ta Minh Thanh","doi":"10.1109/RIVF51545.2021.9642126","DOIUrl":null,"url":null,"abstract":"The Mobile capture receipts Optical Character Recognition (MC-OCR) [14] challenge deliver two tasks: Receipt Image Quality Evaluation and Key Information Extraction. In the first task, we introduce a regression model to map various inputs, for instance the probability of the output OCR, cropped text boxes, images to actual label. In the second task, we propose a stacked multi-model as a solution to solve this problem. The robust models are incorporated by image segmentation, image classification, text detection, text recognition, and text classification. Follow this solution, we can get vital tackle various noise receipt types: horizontal, skew, and blur receipt.","PeriodicalId":6860,"journal":{"name":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","volume":"105 1","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"MC-OCR Challenge 2021: Towards Document Understanding for Unconstrained Mobile-Captured Vietnamese Receipts\",\"authors\":\"Hoai Viet Nguyen, Linh Doan Bao, Hoang Viet Trinh, Hoang Huy Phan, Ta Minh Thanh\",\"doi\":\"10.1109/RIVF51545.2021.9642126\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Mobile capture receipts Optical Character Recognition (MC-OCR) [14] challenge deliver two tasks: Receipt Image Quality Evaluation and Key Information Extraction. In the first task, we introduce a regression model to map various inputs, for instance the probability of the output OCR, cropped text boxes, images to actual label. In the second task, we propose a stacked multi-model as a solution to solve this problem. The robust models are incorporated by image segmentation, image classification, text detection, text recognition, and text classification. Follow this solution, we can get vital tackle various noise receipt types: horizontal, skew, and blur receipt.\",\"PeriodicalId\":6860,\"journal\":{\"name\":\"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)\",\"volume\":\"105 1\",\"pages\":\"1-5\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RIVF51545.2021.9642126\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RIVF51545.2021.9642126","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
MC-OCR Challenge 2021: Towards Document Understanding for Unconstrained Mobile-Captured Vietnamese Receipts
The Mobile capture receipts Optical Character Recognition (MC-OCR) [14] challenge deliver two tasks: Receipt Image Quality Evaluation and Key Information Extraction. In the first task, we introduce a regression model to map various inputs, for instance the probability of the output OCR, cropped text boxes, images to actual label. In the second task, we propose a stacked multi-model as a solution to solve this problem. The robust models are incorporated by image segmentation, image classification, text detection, text recognition, and text classification. Follow this solution, we can get vital tackle various noise receipt types: horizontal, skew, and blur receipt.