{"title":"Sequential classification of customer behavior based on sequence-to-sequence learning with gated-attention neural networks","authors":"Licheng Zhao, Yi Zuo, Katsutoshi Yada","doi":"10.1007/s11634-022-00517-3","DOIUrl":null,"url":null,"abstract":"<div><p>During the last decade, an increasing number of supermarkets have begun to use RFID technology to track consumers' in-store movements to collect data on their shopping behavioral. Marketers hope that such new types of RFID data will improve the accuracy of the existing customer segmentation, and provide effective marketing positioning from the customer’s perspective. Therefore, this paper presents an integrated work on combining RFID data with traditional point of sales (POS) data, and proposes a sequential classification-based model to classify and identify consumers’ purchasing behavior. We chose an island area of the supermarket to perform the tracking experiment and collected customer behavioral data for two months. RFID data are used to extract behavior explanatory variables, such as residence time and wandering direction. For these customers, we extracted their purchasing historical data for the past three months from the POS system to define customer background and segmentation. Finally, this paper proposes a novel classification model based on sequence-to-sequence (Seq2seq) learning architecture. The encoder–decoder of Seq2seq uses an attention mechanism to pursue sequential inputs, with gating units in the encoder and decoder adjusting the output weights based on the input variables. The experimental results showed that the proposed model has a higher accuracy and area under curve value for customer classification and recognition compared with other benchmark models. Furthermore, the validity of behavioral description variables among heterogeneous customers was verified by adjusting the attention mechanism.</p></div>","PeriodicalId":49270,"journal":{"name":"Advances in Data Analysis and Classification","volume":"17 3","pages":"549 - 581"},"PeriodicalIF":1.4000,"publicationDate":"2022-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Data Analysis and Classification","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s11634-022-00517-3","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
Abstract
During the last decade, an increasing number of supermarkets have begun to use RFID technology to track consumers' in-store movements to collect data on their shopping behavioral. Marketers hope that such new types of RFID data will improve the accuracy of the existing customer segmentation, and provide effective marketing positioning from the customer’s perspective. Therefore, this paper presents an integrated work on combining RFID data with traditional point of sales (POS) data, and proposes a sequential classification-based model to classify and identify consumers’ purchasing behavior. We chose an island area of the supermarket to perform the tracking experiment and collected customer behavioral data for two months. RFID data are used to extract behavior explanatory variables, such as residence time and wandering direction. For these customers, we extracted their purchasing historical data for the past three months from the POS system to define customer background and segmentation. Finally, this paper proposes a novel classification model based on sequence-to-sequence (Seq2seq) learning architecture. The encoder–decoder of Seq2seq uses an attention mechanism to pursue sequential inputs, with gating units in the encoder and decoder adjusting the output weights based on the input variables. The experimental results showed that the proposed model has a higher accuracy and area under curve value for customer classification and recognition compared with other benchmark models. Furthermore, the validity of behavioral description variables among heterogeneous customers was verified by adjusting the attention mechanism.
期刊介绍:
The international journal Advances in Data Analysis and Classification (ADAC) is designed as a forum for high standard publications on research and applications concerning the extraction of knowable aspects from many types of data. It publishes articles on such topics as structural, quantitative, or statistical approaches for the analysis of data; advances in classification, clustering, and pattern recognition methods; strategies for modeling complex data and mining large data sets; methods for the extraction of knowledge from data, and applications of advanced methods in specific domains of practice. Articles illustrate how new domain-specific knowledge can be made available from data by skillful use of data analysis methods. The journal also publishes survey papers that outline, and illuminate the basic ideas and techniques of special approaches.