从收据中提取信息的图卷积神经网络过滤器大小的效率评估

International Journal of Information Technology Pub Date : 2024-08-30 DOI:10.1007/s41870-024-02089-1

An C. Tran, Bao Thai Le, Hai Thanh Nguyen

{"title":"从收据中提取信息的图卷积神经网络过滤器大小的效率评估","authors":"An C. Tran, Bao Thai Le, Hai Thanh Nguyen","doi":"10.1007/s41870-024-02089-1","DOIUrl":null,"url":null,"abstract":"<p>Graph Neural Networks (GNNs) have attracted considerable attention due to their ability to analyze structured data represented as graphs. In invoice information extraction, GNNs have proven to be a powerful tool for automatically extracting relevant information from invoices, streamlining data entry processes, and improving efficiency. By modeling the invoice layout as a graph and exploiting the inherent structural dependencies, GNNs enable end-to-end extraction by encoding the graph structure and using deep learning techniques. This work proposes a Graph Convolution Network to extract information from invoices. Furthermore, an evaluation of the effect of filter sizes on the model’s accuracy was performed. We built an extraction model based on the filter size selected by the evaluation. We achieved the accuracy of the test set of 96.4% and the training set of 98.5% on the dataset of about 1.500 invoice images we collected.</p>","PeriodicalId":14138,"journal":{"name":"International Journal of Information Technology","volume":"62 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficiency evaluation of filter sizes on graph convolutional neural networks for information extraction from receipts\",\"authors\":\"An C. Tran, Bao Thai Le, Hai Thanh Nguyen\",\"doi\":\"10.1007/s41870-024-02089-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Graph Neural Networks (GNNs) have attracted considerable attention due to their ability to analyze structured data represented as graphs. In invoice information extraction, GNNs have proven to be a powerful tool for automatically extracting relevant information from invoices, streamlining data entry processes, and improving efficiency. By modeling the invoice layout as a graph and exploiting the inherent structural dependencies, GNNs enable end-to-end extraction by encoding the graph structure and using deep learning techniques. This work proposes a Graph Convolution Network to extract information from invoices. Furthermore, an evaluation of the effect of filter sizes on the model’s accuracy was performed. We built an extraction model based on the filter size selected by the evaluation. We achieved the accuracy of the test set of 96.4% and the training set of 98.5% on the dataset of about 1.500 invoice images we collected.</p>\",\"PeriodicalId\":14138,\"journal\":{\"name\":\"International Journal of Information Technology\",\"volume\":\"62 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Information Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/s41870-024-02089-1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s41870-024-02089-1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

图形神经网络（GNN）因其分析以图形表示的结构化数据的能力而备受关注。在发票信息提取方面，图形神经网络已被证明是自动提取发票相关信息、简化数据录入流程和提高效率的有力工具。通过将发票布局建模为图，并利用固有的结构依赖性，GNN 可通过编码图结构和使用深度学习技术实现端到端提取。这项工作提出了一种图卷积网络，用于从发票中提取信息。此外，还评估了过滤器大小对模型准确性的影响。我们根据评估所选择的过滤器大小建立了一个提取模型。在我们收集的约 1,500 张发票图像的数据集上，测试集的准确率达到 96.4%，训练集的准确率达到 98.5%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Efficiency evaluation of filter sizes on graph convolutional neural networks for information extraction from receipts

Graph Neural Networks (GNNs) have attracted considerable attention due to their ability to analyze structured data represented as graphs. In invoice information extraction, GNNs have proven to be a powerful tool for automatically extracting relevant information from invoices, streamlining data entry processes, and improving efficiency. By modeling the invoice layout as a graph and exploiting the inherent structural dependencies, GNNs enable end-to-end extraction by encoding the graph structure and using deep learning techniques. This work proposes a Graph Convolution Network to extract information from invoices. Furthermore, an evaluation of the effect of filter sizes on the model’s accuracy was performed. We built an extraction model based on the filter size selected by the evaluation. We achieved the accuracy of the test set of 96.4% and the training set of 98.5% on the dataset of about 1.500 invoice images we collected.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Information Technology

自引率

0.00%

发文量