Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning最新文献

英文中文

TIRec: Transformer-based Invoice Text Recognition 基于变压器的发票文本识别

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

Pub Date : 2023-03-17 DOI: 10.1145/3590003.3590034

Yanlan Chen

A novel invoice text recognition model is proposed. In the past few years, researchers have explored text recognition methods with RNN-like structures to model semantic information. However, RNN-based approaches have some obvious drawbacks, such as the level-by-level decoding approach and the one-way serial transmission of semantic information, which greatly limit semantic information's effectiveness and computational efficiency. In contrast, invoice text has obvious contextual relationships due to its fixed text pattern, the text font in the invoice is more fixed and the complexity of the background is much lower than that of natural scenes. To further exploit these contextual relationships and adapt to the characteristics of invoice text, we propose a new text recognition framework inspired by Transformer [1]. Self-attention-based architectures, in particular Transformer, have been successful in natural language processing (NLP). It has demonstrated powerful semantic information modeling capabilities in NLP. Inspired by its success, we try to apply Transformer to invoice text recognition. Unlike the RNN-based approach, we reduce the parameters of the vision network used to extract image features, use the Convolutional Vision Transformer Attention module to capture the semantic information, and use the Transformer decoding module to decode all characters in parallel. We hope that this Transformer-based architecture can better model the semantic information in invoices while remaining lightweight. Meanwhile, we collected text images of more than 40,000 train invoices, VAT invoices, rolled invoices, and cab invoices. Experiments on the collected invoice text recognition dataset show that our approach outperforms previous methods in terms of accuracy and speed.

提出了一种新的发票文本识别模型。在过去的几年里，研究人员探索了使用类rnn结构来建模语义信息的文本识别方法。然而，基于rnn的方法存在一些明显的缺陷，如逐级解码方式和语义信息的单向串行传输，这极大地限制了语义信息的有效性和计算效率。相比之下，发票文本由于其固定的文本模式，具有明显的上下文关系，发票中的文本字体更加固定，背景的复杂性远低于自然场景。为了进一步利用这些上下文关系并适应发票文本的特征，我们提出了一个受Transformer[1]启发的新的文本识别框架。基于自关注的体系结构，特别是Transformer，在自然语言处理(NLP)中已经取得了成功。它展示了在自然语言处理中强大的语义信息建模能力。受其成功的启发，我们尝试将Transformer应用于发票文本识别。与基于rnn的方法不同，我们减少了用于提取图像特征的视觉网络的参数，使用卷积视觉变压器注意力模块捕获语义信息，并使用变压器解码模块并行解码所有字符。我们希望这种基于transformer的体系结构能够更好地为发票中的语义信息建模，同时保持轻量级。同时，我们收集了4万多张火车发票、增值税发票、滚动发票和出租车发票的文字图像。在已收集的发票文本识别数据集上的实验表明，我们的方法在准确率和速度上都优于以前的方法。

{"title":"TIRec: Transformer-based Invoice Text Recognition","authors":"Yanlan Chen","doi":"10.1145/3590003.3590034","DOIUrl":"https://doi.org/10.1145/3590003.3590034","url":null,"abstract":"A novel invoice text recognition model is proposed. In the past few years, researchers have explored text recognition methods with RNN-like structures to model semantic information. However, RNN-based approaches have some obvious drawbacks, such as the level-by-level decoding approach and the one-way serial transmission of semantic information, which greatly limit semantic information's effectiveness and computational efficiency. In contrast, invoice text has obvious contextual relationships due to its fixed text pattern, the text font in the invoice is more fixed and the complexity of the background is much lower than that of natural scenes. To further exploit these contextual relationships and adapt to the characteristics of invoice text, we propose a new text recognition framework inspired by Transformer [1]. Self-attention-based architectures, in particular Transformer, have been successful in natural language processing (NLP). It has demonstrated powerful semantic information modeling capabilities in NLP. Inspired by its success, we try to apply Transformer to invoice text recognition. Unlike the RNN-based approach, we reduce the parameters of the vision network used to extract image features, use the Convolutional Vision Transformer Attention module to capture the semantic information, and use the Transformer decoding module to decode all characters in parallel. We hope that this Transformer-based architecture can better model the semantic information in invoices while remaining lightweight. Meanwhile, we collected text images of more than 40,000 train invoices, VAT invoices, rolled invoices, and cab invoices. Experiments on the collected invoice text recognition dataset show that our approach outperforms previous methods in terms of accuracy and speed.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125562497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Foreign object recognition method of transmission line based on improved outlier rate method 基于改进离群率法的传输线异物识别方法

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

Pub Date : 2023-03-17 DOI: 10.1145/3590003.3590073

Dongmei Liu, Zhongwang Zhu, Bo Chen

Foreign matters hanging on the transmission line can be regarded as a potential risk of the transmission system, which will not only affect the normal power supply of the transmission line, but also pose a greater threat to pedestrians and vehicles under the line. Aiming at the low efficiency and high false detection rate of traditional foreign object recognition methods for hanging foreign objects, this paper proposes a foreign object recognition method for transmission lines based on improved outlier rate method. It proposes to use Hough line transformation to extract the transmission line, and then conduct convolution operation on the area where the transmission line is located and the non-transmission line area, and set the corresponding outlier rate in combination with the actual error to identify the foreign matters in the transmission line.

悬挂在输电线路上的异物可视为输电系统的潜在风险，不仅会影响输电线路的正常供电，还会对线路下的行人和车辆构成较大的威胁。针对传统的悬挂异物识别方法效率低、误检率高的问题，提出了一种基于改进离群率法的传输线异物识别方法。提出利用霍夫线变换对传输线进行提取，然后对传输线所在区域和非传输线区域进行卷积运算，并结合实际误差设置相应的离群率，识别传输线内异物。

引用次数: 0

PhyGNNet: Solving spatiotemporal PDEs with Physics-informed Graph Neural Network PhyGNNet:利用物理信息图神经网络求解时空偏微分方程

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

Pub Date : 2022-08-07 DOI: 10.1145/3590003.3590029

Longxiang Jiang, Liyuan Wang, Xinkun Chu, Yonghao Xiao, Hao Zhang

Partial differential equations (PDEs) are a common means of describing physical processes. Solving PDEs can obtain simulated results of physical evolution. Currently, the mainstream neural network method is to minimize the loss of PDEs thus constraining neural networks to fit the solution mappings. By the implementation of differentiation, the methods can be divided into PINN methods based on automatic differentiation and other methods based on discrete differentiation. PINN methods rely on automatic backpropagation, and the computation step is time-consuming, for iterative training, the complexity of the neural network and the number of collocation points are limited to a small condition, thus abating accuracy. The discrete differentiation is more efficient in computation, following the regular computational domain assumption. However, in practice, the assumption does not necessarily hold. In this paper, we propose a PhyGNNet method to solve PDEs based on graph neural network and discrete differentiation on irregular domain. Meanwhile, to verify the validity of the method, we solve Burgers equation and conduct a numerical comparison with PINN. The results show that the proposed method performs better both in fit ability and time extrapolation than PINN. Code is available at https://github.com/echowve/phygnnet.

偏微分方程(PDEs)是描述物理过程的常用方法。求解偏微分方程可以得到物理演化的模拟结果。目前，主流的神经网络方法是最小化偏微分方程的损失，从而约束神经网络拟合解映射。通过微分的实现，可以将方法分为基于自动微分的PINN方法和基于离散微分的其他方法。PINN方法依赖于自动反向传播，计算步骤耗时，对于迭代训练，神经网络的复杂性和配点数被限制在很小的条件下，从而降低了精度。离散微分遵循规则的计算域假设，计算效率更高。然而，在实践中，这种假设并不一定成立。本文提出了一种基于图神经网络和不规则域上离散微分的PDEs求解方法。同时，为了验证该方法的有效性，我们求解Burgers方程，并与PINN进行数值比较。结果表明，该方法在拟合能力和时间外推方面都优于PINN算法。代码可从https://github.com/echowve/phygnnet获得。

{"title":"PhyGNNet: Solving spatiotemporal PDEs with Physics-informed Graph Neural Network","authors":"Longxiang Jiang, Liyuan Wang, Xinkun Chu, Yonghao Xiao, Hao Zhang","doi":"10.1145/3590003.3590029","DOIUrl":"https://doi.org/10.1145/3590003.3590029","url":null,"abstract":"Partial differential equations (PDEs) are a common means of describing physical processes. Solving PDEs can obtain simulated results of physical evolution. Currently, the mainstream neural network method is to minimize the loss of PDEs thus constraining neural networks to fit the solution mappings. By the implementation of differentiation, the methods can be divided into PINN methods based on automatic differentiation and other methods based on discrete differentiation. PINN methods rely on automatic backpropagation, and the computation step is time-consuming, for iterative training, the complexity of the neural network and the number of collocation points are limited to a small condition, thus abating accuracy. The discrete differentiation is more efficient in computation, following the regular computational domain assumption. However, in practice, the assumption does not necessarily hold. In this paper, we propose a PhyGNNet method to solve PDEs based on graph neural network and discrete differentiation on irregular domain. Meanwhile, to verify the validity of the method, we solve Burgers equation and conduct a numerical comparison with PINN. The results show that the proposed method performs better both in fit ability and time extrapolation than PINN. Code is available at https://github.com/echowve/phygnnet.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127997209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning 2023第二届亚洲算法、计算与机器学习会议论文集

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

Pub Date : 1900-01-01 DOI: 10.1145/3590003

引用次数: 0

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀