从收据中提取信息的图卷积神经网络过滤器大小的效率评估

An C. Tran, Bao Thai Le, Hai Thanh Nguyen
{"title":"从收据中提取信息的图卷积神经网络过滤器大小的效率评估","authors":"An C. Tran, Bao Thai Le, Hai Thanh Nguyen","doi":"10.1007/s41870-024-02089-1","DOIUrl":null,"url":null,"abstract":"<p>Graph Neural Networks (GNNs) have attracted considerable attention due to their ability to analyze structured data represented as graphs. In invoice information extraction, GNNs have proven to be a powerful tool for automatically extracting relevant information from invoices, streamlining data entry processes, and improving efficiency. By modeling the invoice layout as a graph and exploiting the inherent structural dependencies, GNNs enable end-to-end extraction by encoding the graph structure and using deep learning techniques. This work proposes a Graph Convolution Network to extract information from invoices. Furthermore, an evaluation of the effect of filter sizes on the model’s accuracy was performed. We built an extraction model based on the filter size selected by the evaluation. We achieved the accuracy of the test set of 96.4% and the training set of 98.5% on the dataset of about 1.500 invoice images we collected.</p>","PeriodicalId":14138,"journal":{"name":"International Journal of Information Technology","volume":"62 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficiency evaluation of filter sizes on graph convolutional neural networks for information extraction from receipts\",\"authors\":\"An C. Tran, Bao Thai Le, Hai Thanh Nguyen\",\"doi\":\"10.1007/s41870-024-02089-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Graph Neural Networks (GNNs) have attracted considerable attention due to their ability to analyze structured data represented as graphs. In invoice information extraction, GNNs have proven to be a powerful tool for automatically extracting relevant information from invoices, streamlining data entry processes, and improving efficiency. By modeling the invoice layout as a graph and exploiting the inherent structural dependencies, GNNs enable end-to-end extraction by encoding the graph structure and using deep learning techniques. This work proposes a Graph Convolution Network to extract information from invoices. Furthermore, an evaluation of the effect of filter sizes on the model’s accuracy was performed. We built an extraction model based on the filter size selected by the evaluation. We achieved the accuracy of the test set of 96.4% and the training set of 98.5% on the dataset of about 1.500 invoice images we collected.</p>\",\"PeriodicalId\":14138,\"journal\":{\"name\":\"International Journal of Information Technology\",\"volume\":\"62 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Information Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/s41870-024-02089-1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s41870-024-02089-1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

图形神经网络(GNN)因其分析以图形表示的结构化数据的能力而备受关注。在发票信息提取方面,图形神经网络已被证明是自动提取发票相关信息、简化数据录入流程和提高效率的有力工具。通过将发票布局建模为图,并利用固有的结构依赖性,GNN 可通过编码图结构和使用深度学习技术实现端到端提取。这项工作提出了一种图卷积网络,用于从发票中提取信息。此外,还评估了过滤器大小对模型准确性的影响。我们根据评估所选择的过滤器大小建立了一个提取模型。在我们收集的约 1,500 张发票图像的数据集上,测试集的准确率达到 96.4%,训练集的准确率达到 98.5%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Efficiency evaluation of filter sizes on graph convolutional neural networks for information extraction from receipts

Graph Neural Networks (GNNs) have attracted considerable attention due to their ability to analyze structured data represented as graphs. In invoice information extraction, GNNs have proven to be a powerful tool for automatically extracting relevant information from invoices, streamlining data entry processes, and improving efficiency. By modeling the invoice layout as a graph and exploiting the inherent structural dependencies, GNNs enable end-to-end extraction by encoding the graph structure and using deep learning techniques. This work proposes a Graph Convolution Network to extract information from invoices. Furthermore, an evaluation of the effect of filter sizes on the model’s accuracy was performed. We built an extraction model based on the filter size selected by the evaluation. We achieved the accuracy of the test set of 96.4% and the training set of 98.5% on the dataset of about 1.500 invoice images we collected.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Statistical cryptanalysis of seven classical lightweight ciphers CNN-BO-LSTM: an ensemble framework for prognosis of liver cancer Architecting lymphoma fusion: PROMETHEE-II guided optimization of combination therapeutic synergy RBCA-ETS: enhancing extractive text summarization with contextual embedding and word-level attention RAMD and transient analysis of a juice clarification unit in sugar plants
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1