基于深度学习和图像处理的数据增强表信息提取

Izuardo Zulkarnain, Rin Rin Nurmalasari, F. N. Azizah
{"title":"基于深度学习和图像处理的数据增强表信息提取","authors":"Izuardo Zulkarnain, Rin Rin Nurmalasari, F. N. Azizah","doi":"10.1109/TSSA56819.2022.10063909","DOIUrl":null,"url":null,"abstract":"Generally, the extraction of information in the table is done quickly if the table is within a document with a tabular structure. However, in the case of tables presented in the image document, steps are needed first to detect the table. Seeing a table in image documents becomes more complicated if the table to be seen does not have clear boundaries. This research focuses on extracting information from borderless tables in image documents. The study applies the Mask RCNN-FPN deep learning model to detect borderless tables using augmentation data. The use of data augmentation is expected to increase the accuracy of deep learning models even though there is only a small amount of training data available. The data augmentation technique proposed in this study is a fine-tuning method with CutMask augmentation data. For model formation and testing, this study uses the UNLV data set. This data set consists of scanned images of documents from various sources, including financial reports, journals, and different tabular research papers. The total amount of data used is 427 samples. After data augmentation, the amount of data used is 854 samples. The table detection model is based on the Mask RCNN created using Python. The testing parameters used for table detection quality are precise detection, partial detection, false detection, Precision, Recall, and F-Measure. The table's structure recognition rate is measured from the detection intersection value, rows, columns, and cells compared to ground truth. The test results show that using data augmentation with the CutMask technique can improve the performance of deep learning models to detect borderless tables. The use of image processing is shown to enhance table segmentation. However, the table structure recognition result does not offer a good result compared to the effects of other research.","PeriodicalId":164665,"journal":{"name":"2022 16th International Conference on Telecommunication Systems, Services, and Applications (TSSA)","volume":"77 7","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Table Information Extraction Using Data Augmentation on Deep Learning and Image Processing\",\"authors\":\"Izuardo Zulkarnain, Rin Rin Nurmalasari, F. N. Azizah\",\"doi\":\"10.1109/TSSA56819.2022.10063909\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Generally, the extraction of information in the table is done quickly if the table is within a document with a tabular structure. However, in the case of tables presented in the image document, steps are needed first to detect the table. Seeing a table in image documents becomes more complicated if the table to be seen does not have clear boundaries. This research focuses on extracting information from borderless tables in image documents. The study applies the Mask RCNN-FPN deep learning model to detect borderless tables using augmentation data. The use of data augmentation is expected to increase the accuracy of deep learning models even though there is only a small amount of training data available. The data augmentation technique proposed in this study is a fine-tuning method with CutMask augmentation data. For model formation and testing, this study uses the UNLV data set. This data set consists of scanned images of documents from various sources, including financial reports, journals, and different tabular research papers. The total amount of data used is 427 samples. After data augmentation, the amount of data used is 854 samples. The table detection model is based on the Mask RCNN created using Python. The testing parameters used for table detection quality are precise detection, partial detection, false detection, Precision, Recall, and F-Measure. The table's structure recognition rate is measured from the detection intersection value, rows, columns, and cells compared to ground truth. The test results show that using data augmentation with the CutMask technique can improve the performance of deep learning models to detect borderless tables. The use of image processing is shown to enhance table segmentation. However, the table structure recognition result does not offer a good result compared to the effects of other research.\",\"PeriodicalId\":164665,\"journal\":{\"name\":\"2022 16th International Conference on Telecommunication Systems, Services, and Applications (TSSA)\",\"volume\":\"77 7\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 16th International Conference on Telecommunication Systems, Services, and Applications (TSSA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TSSA56819.2022.10063909\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 16th International Conference on Telecommunication Systems, Services, and Applications (TSSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TSSA56819.2022.10063909","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

通常,如果表位于具有表格结构的文档中,则可以快速提取表中的信息。但是,对于图像文档中显示的表,首先需要执行检测表的步骤。如果要查看的表没有明确的边界,那么在图像文档中查看表将变得更加复杂。本研究的重点是从图像文档中的无边界表中提取信息。该研究应用Mask RCNN-FPN深度学习模型使用增强数据检测无边界表。数据增强的使用有望提高深度学习模型的准确性,即使只有少量的训练数据可用。本研究提出的数据增强技术是一种基于CutMask增强数据的微调方法。对于模型的形成和测试,本研究使用UNLV数据集。该数据集由来自各种来源的文件的扫描图像组成,包括财务报告、期刊和不同的表格研究论文。使用的数据总量为427个样本。数据扩充后,使用的数据量为854个样本。表检测模型基于使用Python创建的掩码RCNN。表检测质量的检测参数有:精确检测、部分检测、误检、Precision、Recall、F-Measure。表的结构识别率是从检测的交叉值、行、列和单元与地面真值的比较中测量的。测试结果表明,使用CutMask技术的数据增强可以提高深度学习模型检测无边界表的性能。使用图像处理显示增强表分割。然而,与其他研究结果相比,表结构识别结果并不理想。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Table Information Extraction Using Data Augmentation on Deep Learning and Image Processing
Generally, the extraction of information in the table is done quickly if the table is within a document with a tabular structure. However, in the case of tables presented in the image document, steps are needed first to detect the table. Seeing a table in image documents becomes more complicated if the table to be seen does not have clear boundaries. This research focuses on extracting information from borderless tables in image documents. The study applies the Mask RCNN-FPN deep learning model to detect borderless tables using augmentation data. The use of data augmentation is expected to increase the accuracy of deep learning models even though there is only a small amount of training data available. The data augmentation technique proposed in this study is a fine-tuning method with CutMask augmentation data. For model formation and testing, this study uses the UNLV data set. This data set consists of scanned images of documents from various sources, including financial reports, journals, and different tabular research papers. The total amount of data used is 427 samples. After data augmentation, the amount of data used is 854 samples. The table detection model is based on the Mask RCNN created using Python. The testing parameters used for table detection quality are precise detection, partial detection, false detection, Precision, Recall, and F-Measure. The table's structure recognition rate is measured from the detection intersection value, rows, columns, and cells compared to ground truth. The test results show that using data augmentation with the CutMask technique can improve the performance of deep learning models to detect borderless tables. The use of image processing is shown to enhance table segmentation. However, the table structure recognition result does not offer a good result compared to the effects of other research.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Design of a compact antenna and rectifier for a dual band rectenna operating at 2.4 GHz and 5.8 GHz Road Segmentation with U-Net Architecture Using Jetson AGX Xavier For Autonomous Vehicle Speed Control System of BLDC Motor Based on DSP TMS320F28027F Design and Control of Swerve Drive Mechanism for Autonomous Mobile Robot Application of Certainty Factor Method to Diagnose Venereal Diseases Using Confusion Matrix for Multi-Class Classification
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1