Neural Network Based Transaction Classification System for Chinese Transaction Behavior Analysis

Jianyang Yu, Yuanyuan Qiao, Nanfei Shu, Kewu Sun, Shenshen Zhou, Jie Yang
{"title":"Neural Network Based Transaction Classification System for Chinese Transaction Behavior Analysis","authors":"Jianyang Yu, Yuanyuan Qiao, Nanfei Shu, Kewu Sun, Shenshen Zhou, Jie Yang","doi":"10.1109/BigDataCongress.2019.00021","DOIUrl":null,"url":null,"abstract":"With the rapid development of Chinese economy, it is significant to examine the economic activities in China. Each transaction behavior is recorded by the invoice. The invoice contains the transaction content, the classification of the transaction behavior (in accordance with the Tax Classification and Coding for Commodities and Services issued by the state) and transaction price, etc. Our work uses real mass invoice data collected from Zhejiang Province and conducts a multi-dimensional analysis of Chinese transaction behavior based on transaction behavior classification model. Firstly, we propose a compositional CNN-RNN model with attention mechanism to recommend the corresponding categories of transaction behavior collected from tax invoices. It maps the transaction behavior recorded in the invoice to transaction code in the Tax Classification and Coding for Commodities and Services issued by the state. Preliminary experiments show that the top-one accuracy of classifying transaction behavior achieves 75%. Then, we focus on the quantity distribution of invoice data and draw a conclusion that the major category with larger number of invoice records is more diversified in subdivided categories. After that, we studied the price distribution of various transaction behaviors to discover the difference in price distribution between different industries. Prices in the major categories of goods are more concentrated in the middle or lower prices. We can analyze the regional industrial structure through the price distribution of the industry which makes sense to study the economy of the region from the perspective of industry.","PeriodicalId":335850,"journal":{"name":"2019 IEEE International Congress on Big Data (BigDataCongress)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Congress on Big Data (BigDataCongress)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BigDataCongress.2019.00021","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

With the rapid development of Chinese economy, it is significant to examine the economic activities in China. Each transaction behavior is recorded by the invoice. The invoice contains the transaction content, the classification of the transaction behavior (in accordance with the Tax Classification and Coding for Commodities and Services issued by the state) and transaction price, etc. Our work uses real mass invoice data collected from Zhejiang Province and conducts a multi-dimensional analysis of Chinese transaction behavior based on transaction behavior classification model. Firstly, we propose a compositional CNN-RNN model with attention mechanism to recommend the corresponding categories of transaction behavior collected from tax invoices. It maps the transaction behavior recorded in the invoice to transaction code in the Tax Classification and Coding for Commodities and Services issued by the state. Preliminary experiments show that the top-one accuracy of classifying transaction behavior achieves 75%. Then, we focus on the quantity distribution of invoice data and draw a conclusion that the major category with larger number of invoice records is more diversified in subdivided categories. After that, we studied the price distribution of various transaction behaviors to discover the difference in price distribution between different industries. Prices in the major categories of goods are more concentrated in the middle or lower prices. We can analyze the regional industrial structure through the price distribution of the industry which makes sense to study the economy of the region from the perspective of industry.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于神经网络的中国交易行为分析交易分类系统
随着中国经济的快速发展,研究中国的经济活动意义重大。每一笔交易行为都由发票记录。发票包含交易内容、交易行为分类(根据国家颁布的《商品和服务税收分类与编码》)和交易价格等。我们的研究利用从浙江省采集的真实海量发票数据,基于交易行为分类模型对中国人的交易行为进行了多维度分析。首先,我们提出了一个具有关注机制的 CNN-RNN 组成模型,以推荐从税务发票中收集到的交易行为的相应类别。它将发票中记录的交易行为与国家颁布的《商品和服务税收分类与编码》中的交易代码进行映射。初步实验表明,交易行为分类的最高准确率达到 75%。然后,我们重点研究了发票数据的数量分布,得出了发票记录数量较多的大类在细分类别中更加多样化的结论。之后,我们研究了各种交易行为的价格分布,发现了不同行业之间价格分布的差异。大类商品的价格更多集中在中低价位。我们可以通过产业的价格分布来分析区域产业结构,这对于从产业角度研究区域经济是很有意义的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
PREMISES, a Scalable Data-Driven Service to Predict Alarms in Slowly-Degrading Multi-Cycle Industrial Processes Context-Aware Enforcement of Privacy Policies in Edge Computing Efficient Re-Computation of Big Data Analytics Processes in the Presence of Changes: Computational Framework, Reference Architecture, and Applications Reducing Feature Embedding Data for Discovering Relations in Big Text Data Distributed, Numerically Stable Distance and Covariance Computation with MPI for Extremely Large Datasets
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1