Data Technologies and Applications最新文献_第5页

Channel attention-based spatial-temporal graph neural networks for traffic prediction 基于通道注意力的时空图神经网络交通预测

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2023-04-29 DOI: 10.1108/dta-09-2022-0378

Bin Wang, Fan Gao, Le Tong, Qian Zhang, Sulei Zhu

PurposeTraffic flow prediction has always been a top priority of intelligent transportation systems. There are many mature methods for short-term traffic flow prediction. However, the existing methods are often insufficient in capturing long-term spatial-temporal dependencies. To predict long-term dependencies more accurately, in this paper, a new and more effective traffic flow prediction model is proposed.Design/methodology/approachThis paper proposes a new and more effective traffic flow prediction model, named channel attention-based spatial-temporal graph neural networks. A graph convolutional network is used to extract local spatial-temporal correlations, a channel attention mechanism is used to enhance the influence of nearby spatial-temporal dependencies on decision-making and a transformer mechanism is used to capture long-term dependencies.FindingsThe proposed model is applied to two common highway datasets: METR-LA collected in Los Angeles and PEMS-BAY collected in the California Bay Area. This model outperforms the other five in terms of performance on three performance metrics a popular model.Originality/value(1) Based on the spatial-temporal synchronization graph convolution module, a spatial-temporal channel attention module is designed to increase the influence of proximity dependence on decision-making by enhancing or suppressing different channels. (2) To better capture long-term dependencies, the transformer module is introduced.

目的交通流量预测一直是智能交通系统的首要任务。短期交通流量预测有许多成熟的方法。然而，现有的方法往往不足以捕捉长期的时空依赖关系。为了更准确地预测长期相关性，本文提出了一种新的、更有效的交通流预测模型。设计/方法论/方法本文提出了一种新的、更有效的交通流预测模型，称为基于通道注意力的时空图神经网络。图卷积网络用于提取局部时空相关性，通道注意力机制用于增强附近时空相关性对决策的影响，变换器机制用于捕获长期相关性。发现所提出的模型应用于两个常见的公路数据集：洛杉矶收集的METR-LA和加利福尼亚湾区收集的PEMS-BAY。该模型在三个性能指标上的性能优于其他五个，这是一个流行的模型。独创性/价值（1）基于时空同步图卷积模块，设计了一个时空通道注意力模块，通过增强或抑制不同通道来增加邻近度依赖对决策的影响。（2）为了更好地捕捉长期依赖关系，引入了transformer模块。

{"title":"Channel attention-based spatial-temporal graph neural networks for traffic prediction","authors":"Bin Wang, Fan Gao, Le Tong, Qian Zhang, Sulei Zhu","doi":"10.1108/dta-09-2022-0378","DOIUrl":"https://doi.org/10.1108/dta-09-2022-0378","url":null,"abstract":"PurposeTraffic flow prediction has always been a top priority of intelligent transportation systems. There are many mature methods for short-term traffic flow prediction. However, the existing methods are often insufficient in capturing long-term spatial-temporal dependencies. To predict long-term dependencies more accurately, in this paper, a new and more effective traffic flow prediction model is proposed.Design/methodology/approachThis paper proposes a new and more effective traffic flow prediction model, named channel attention-based spatial-temporal graph neural networks. A graph convolutional network is used to extract local spatial-temporal correlations, a channel attention mechanism is used to enhance the influence of nearby spatial-temporal dependencies on decision-making and a transformer mechanism is used to capture long-term dependencies.FindingsThe proposed model is applied to two common highway datasets: METR-LA collected in Los Angeles and PEMS-BAY collected in the California Bay Area. This model outperforms the other five in terms of performance on three performance metrics a popular model.Originality/value(1) Based on the spatial-temporal synchronization graph convolution module, a spatial-temporal channel attention module is designed to increase the influence of proximity dependence on decision-making by enhancing or suppressing different channels. (2) To better capture long-term dependencies, the transformer module is introduced.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49082434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Machine learning approaches for prediction of ovarian cancer driver genes from mutational and network analysis 从突变和网络分析中预测卵巢癌驱动基因的机器学习方法

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2023-04-28 DOI: 10.1108/dta-03-2022-0096

Rucha Wadapurkar, S. Bapat, Rupali A. Mahajan, R. Vyas

PurposeOvarian cancer (OC) is the most common type of gynecologic cancer in the world with a high rate of mortality. Due to manifestation of generic symptoms and absence of specific biomarkers, OC is usually diagnosed at a late stage. Machine learning models can be employed to predict driver genes implicated in causative mutations.Design/methodology/approachIn the present study, a comprehensive next generation sequencing (NGS) analysis of whole exome sequences of 47 OC patients was carried out to identify clinically significant mutations. Nine functional features of 708 mutations identified were input into a machine learning classification model by employing the eXtreme Gradient Boosting (XGBoost) classifier method for prediction of OC driver genes.FindingsThe XGBoost classifier model yielded a classification accuracy of 0.946, which was superior to that obtained by other classifiers such as decision tree, Naive Bayes, random forest and support vector machine. Further, an interaction network was generated to identify and establish correlations with cancer-associated pathways and gene ontology data.Originality/valueThe final results revealed 12 putative candidate cancer driver genes, namely LAMA3, LAMC3, COL6A1, COL5A1, COL2A1, UGT1A1, BDNF, ANK1, WNT10A, FZD4, PLEKHG5 and CYP2C9, that may have implications in clinical diagnosis.

目的癌症是世界上最常见的妇科癌症，死亡率高。由于一般症状的表现和缺乏特定的生物标志物，OC通常在晚期被诊断出来。机器学习模型可以用来预测与致病突变有关的驱动基因。设计/方法/方法在本研究中，对47名OC患者的全外显子组序列进行了全面的下一代测序（NGS）分析，以确定具有临床意义的突变。通过使用用于预测OC驱动基因的极限梯度增强（XGBoost）分类器方法，将所识别的708个突变的9个功能特征输入到机器学习分类模型中。结果XGBoost分类器模型的分类精度为0.946，优于决策树、朴素贝叶斯、随机森林和支持向量机等其他分类器。此外，生成了一个交互网络，以识别和建立与癌症相关途径和基因本体数据的相关性。原创性/价值最终结果揭示了12个推定的癌症驱动基因候选，即LAMA3、LAMC3、COL6A1、COL5A1、COL2A1、UGT1A1、BDNF、ANK1、WNT10A、FZD4、PLEKHG5和CYP2C9，这些基因可能对临床诊断有影响。

{"title":"Machine learning approaches for prediction of ovarian cancer driver genes from mutational and network analysis","authors":"Rucha Wadapurkar, S. Bapat, Rupali A. Mahajan, R. Vyas","doi":"10.1108/dta-03-2022-0096","DOIUrl":"https://doi.org/10.1108/dta-03-2022-0096","url":null,"abstract":"PurposeOvarian cancer (OC) is the most common type of gynecologic cancer in the world with a high rate of mortality. Due to manifestation of generic symptoms and absence of specific biomarkers, OC is usually diagnosed at a late stage. Machine learning models can be employed to predict driver genes implicated in causative mutations.Design/methodology/approachIn the present study, a comprehensive next generation sequencing (NGS) analysis of whole exome sequences of 47 OC patients was carried out to identify clinically significant mutations. Nine functional features of 708 mutations identified were input into a machine learning classification model by employing the eXtreme Gradient Boosting (XGBoost) classifier method for prediction of OC driver genes.FindingsThe XGBoost classifier model yielded a classification accuracy of 0.946, which was superior to that obtained by other classifiers such as decision tree, Naive Bayes, random forest and support vector machine. Further, an interaction network was generated to identify and establish correlations with cancer-associated pathways and gene ontology data.Originality/valueThe final results revealed 12 putative candidate cancer driver genes, namely LAMA3, LAMC3, COL6A1, COL5A1, COL2A1, UGT1A1, BDNF, ANK1, WNT10A, FZD4, PLEKHG5 and CYP2C9, that may have implications in clinical diagnosis.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48683941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

News recommendations based on collaborative topic modeling and collaborative filtering with generative adversarial networks 基于协同主题建模和生成对抗网络协同过滤的新闻推荐

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2023-03-30 DOI: 10.1108/dta-08-2022-0315

Duen-Ren Liu, Yang Huang, Jhen-Jie Jhao, Shin-Jye Lee

PurposeOnline news websites provide huge amounts of timely news, bringing the challenge of recommending personalized news articles. Generative adversarial networks (GAN) based on collaborative filtering (CFGAN) can achieve effective recommendation quality. However, CFGAN ignores item contents, which contain more latent preference features than just user ratings. It is important to consider both ratings and item contents in making preference predictions. This study aims to improve news recommendation by proposing a GAN-based news recommendation model considering both ratings (implicit feedback) and the latent features of news content.Design/methodology/approachThe collaborative topic modeling (CTM) can improve user preference prediction by combining matrix factorization (MF) with latent topics of item content derived from latent topic modeling. This study proposes a novel hybrid news recommendation model, Hybrid-CFGAN, which modifies the architecture of the CFGAN model with enhanced preference learning from the CTM. The proposed Hybrid-CFGAN model contains parallel neural networks – original rating-based preference learning and CTM-based preference learning, which consider both ratings and news content with user preferences derived from the CTM model. A tunable parameter is used to adjust the weights of the two preference learnings, while concatenating the preference outputs of the two parallel neural networks.FindingsThis study uses the dataset collected from an online news website, NiusNews, to conduct an experimental evaluation. The results show that the proposed Hybrid-CFGAN model can achieve better performance than the state-of-the-art GAN-based recommendation methods. The proposed novel Hybrid-CFGAN model can enhance existing GAN-based recommendation and increase the performance of preference predictions on textual content such as news articles.Originality/valueAs the existing CFGAN model does not consider content information and solely relies on history logs, it may not be effective in recommending news articles. Our proposed Hybrid-CFGAN model modified the architecture of the CFGAN generator by adding a parallel neural network to gain the relevant information from news content and user preferences derived from the CTM model. The novel idea of adjusting the preference learning from two parallel neural networks – original rating-based preference learning and CTM-based preference learning – contributes to improve the recommendation quality of the proposed model by considering both ratings and latent preferences derived from item contents. The proposed novel recommendation model can improve news recommendation, thereby increasing the commercial value of news media platforms.

目的在线新闻网站提供了大量及时的新闻，带来了个性化新闻文章推荐的挑战。基于协同过滤(CFGAN)的生成式对抗网络(GAN)可以达到有效的推荐质量。然而，CFGAN忽略了项目内容，其中包含比用户评分更多的潜在偏好特征。在进行偏好预测时，同时考虑评分和项目内容是很重要的。本研究旨在通过提出一种基于gan的新闻推荐模型，同时考虑新闻内容的评级(隐式反馈)和潜在特征，从而改进新闻推荐。设计/方法/方法协同主题建模(CTM)将矩阵分解(MF)与潜在主题建模衍生的项目内容潜在主题相结合，提高了用户偏好预测的精度。本文提出了一种新的混合新闻推荐模型hybrid -CFGAN，该模型改进了CFGAN模型的结构，增强了对CTM的偏好学习。提出的Hybrid-CFGAN模型包含并行神经网络-基于原始评级的偏好学习和基于CTM的偏好学习，它们同时考虑评级和新闻内容以及从CTM模型派生的用户偏好。在连接两个并行神经网络的偏好输出时，使用一个可调参数来调整两个偏好学习的权重。本研究使用在线新闻网站NiusNews收集的数据集进行实验评估。结果表明，所提出的Hybrid-CFGAN模型比目前基于gan的推荐方法具有更好的性能。提出的新型Hybrid-CFGAN模型可以增强现有的基于gan的推荐，并提高对文本内容(如新闻文章)的偏好预测性能。原创性/价值由于现有的CFGAN模型不考虑内容信息，仅依赖历史日志，因此可能无法有效推荐新闻文章。我们提出的Hybrid-CFGAN模型修改了CFGAN生成器的架构，通过添加一个并行神经网络从CTM模型衍生的新闻内容和用户偏好中获取相关信息。利用基于原始评分的偏好学习和基于ctm的偏好学习这两个并行神经网络调整偏好学习的新思想，通过同时考虑来自项目内容的评分和潜在偏好，有助于提高所提出模型的推荐质量。本文提出的新推荐模型可以改进新闻推荐，从而提高新闻媒体平台的商业价值。

{"title":"News recommendations based on collaborative topic modeling and collaborative filtering with generative adversarial networks","authors":"Duen-Ren Liu, Yang Huang, Jhen-Jie Jhao, Shin-Jye Lee","doi":"10.1108/dta-08-2022-0315","DOIUrl":"https://doi.org/10.1108/dta-08-2022-0315","url":null,"abstract":"PurposeOnline news websites provide huge amounts of timely news, bringing the challenge of recommending personalized news articles. Generative adversarial networks (GAN) based on collaborative filtering (CFGAN) can achieve effective recommendation quality. However, CFGAN ignores item contents, which contain more latent preference features than just user ratings. It is important to consider both ratings and item contents in making preference predictions. This study aims to improve news recommendation by proposing a GAN-based news recommendation model considering both ratings (implicit feedback) and the latent features of news content.Design/methodology/approachThe collaborative topic modeling (CTM) can improve user preference prediction by combining matrix factorization (MF) with latent topics of item content derived from latent topic modeling. This study proposes a novel hybrid news recommendation model, Hybrid-CFGAN, which modifies the architecture of the CFGAN model with enhanced preference learning from the CTM. The proposed Hybrid-CFGAN model contains parallel neural networks – original rating-based preference learning and CTM-based preference learning, which consider both ratings and news content with user preferences derived from the CTM model. A tunable parameter is used to adjust the weights of the two preference learnings, while concatenating the preference outputs of the two parallel neural networks.FindingsThis study uses the dataset collected from an online news website, NiusNews, to conduct an experimental evaluation. The results show that the proposed Hybrid-CFGAN model can achieve better performance than the state-of-the-art GAN-based recommendation methods. The proposed novel Hybrid-CFGAN model can enhance existing GAN-based recommendation and increase the performance of preference predictions on textual content such as news articles.Originality/valueAs the existing CFGAN model does not consider content information and solely relies on history logs, it may not be effective in recommending news articles. Our proposed Hybrid-CFGAN model modified the architecture of the CFGAN generator by adding a parallel neural network to gain the relevant information from news content and user preferences derived from the CTM model. The novel idea of adjusting the preference learning from two parallel neural networks – original rating-based preference learning and CTM-based preference learning – contributes to improve the recommendation quality of the proposed model by considering both ratings and latent preferences derived from item contents. The proposed novel recommendation model can improve news recommendation, thereby increasing the commercial value of news media platforms.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41499535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Data mining–based stock price prediction using hybridization of technical and fundamental analysis 基于数据挖掘的技术分析和基本面分析相结合的股价预测

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2023-03-20 DOI: 10.1108/dta-04-2022-0142

Jasleen Kaur, Khushdeep Dharni

PurposeThe stock market generates massive databases of various financial companies that are highly volatile and complex. To forecast daily stock values of these companies, investors frequently use technical analysis or fundamental analysis. Data mining techniques coupled with fundamental and technical analysis types have the potential to give satisfactory results for stock market prediction. In the current paper, an effort is made to investigate the accuracy of stock market predictions by using the combined approach of variables from technical and fundamental analysis for the creation of a data mining predictive model.Design/methodology/approachWe chose 381 companies from the National Stock Exchange of India's CNX 500 index and conducted a two-stage data analysis. The first stage is identifying key fundamental variables and constructing a portfolio based on that study. Artificial neural network (ANN), support vector machines (SVM) and decision tree J48 were used to build the models. The second stage entails applying technical analysis to forecast price movements in the companies included in the portfolios. ANN and SVM techniques were used to create predictive models for all companies in the portfolios. We also estimated returns using trading decisions based on the model's output and then compared them to buy-and-hold returns and the return of the NIFTY 50 index, which served as a benchmark.FindingsThe results show that the returns of both the portfolios are higher than the benchmark buy-and-hold strategy return. It can be concluded that data mining techniques give better results, irrespective of the type of stock, and have the ability to make up for poor stocks. The comparison of returns of portfolios with the return of NIFTY as a benchmark also indicates that both the portfolios are generating higher returns as compared to the return generated by NIFTY.Originality/valueAs stock prices are influenced by both technical and fundamental indicators, the current paper explored the combined effect of technical analysis and fundamental analysis variables for Indian stock market prediction. Further, the results obtained by individual analysis have also been compared. The proposed method under study can also be utilized to determine whether to hold stocks for the long or short term using trend-based research.

股票市场产生了大量的各种金融公司数据库，这些数据库高度波动和复杂。为了预测这些公司的每日股价，投资者经常使用技术分析或基本面分析。数据挖掘技术与基础和技术分析相结合，有可能为股票市场预测提供令人满意的结果。在当前的论文中，通过使用技术和基本面分析变量的组合方法来创建数据挖掘预测模型，努力研究股票市场预测的准确性。设计/方法/方法我们从印度国家证券交易所的CNX 500指数中选择了381家公司，并进行了两阶段的数据分析。第一阶段是确定关键的基本变量，并在此基础上构建投资组合。采用人工神经网络(ANN)、支持向量机(SVM)和决策树J48建立模型。第二阶段需要应用技术分析来预测投资组合中公司的价格走势。使用人工神经网络和支持向量机技术为投资组合中的所有公司创建预测模型。我们还根据模型的输出使用交易决策来估计回报，然后将其与买入并持有的回报和作为基准的NIFTY 50指数的回报进行比较。结果表明，两种投资组合的收益均高于基准买入并持有策略的收益。可以得出结论，无论股票类型如何，数据挖掘技术都能提供更好的结果，并且有能力弥补糟糕的股票。将投资组合的回报与NIFTY的回报作为基准进行比较也表明，与NIFTY产生的回报相比，这两种投资组合产生的回报都更高。由于股票价格同时受到技术指标和基本面指标的影响，本文探讨了技术分析和基本面分析变量对印度股市预测的联合作用。此外，还对个别分析所得的结果进行了比较。研究中提出的方法也可以利用基于趋势的研究来确定是长期持有股票还是短期持有股票。

{"title":"Data mining–based stock price prediction using hybridization of technical and fundamental analysis","authors":"Jasleen Kaur, Khushdeep Dharni","doi":"10.1108/dta-04-2022-0142","DOIUrl":"https://doi.org/10.1108/dta-04-2022-0142","url":null,"abstract":"PurposeThe stock market generates massive databases of various financial companies that are highly volatile and complex. To forecast daily stock values of these companies, investors frequently use technical analysis or fundamental analysis. Data mining techniques coupled with fundamental and technical analysis types have the potential to give satisfactory results for stock market prediction. In the current paper, an effort is made to investigate the accuracy of stock market predictions by using the combined approach of variables from technical and fundamental analysis for the creation of a data mining predictive model.Design/methodology/approachWe chose 381 companies from the National Stock Exchange of India's CNX 500 index and conducted a two-stage data analysis. The first stage is identifying key fundamental variables and constructing a portfolio based on that study. Artificial neural network (ANN), support vector machines (SVM) and decision tree J48 were used to build the models. The second stage entails applying technical analysis to forecast price movements in the companies included in the portfolios. ANN and SVM techniques were used to create predictive models for all companies in the portfolios. We also estimated returns using trading decisions based on the model's output and then compared them to buy-and-hold returns and the return of the NIFTY 50 index, which served as a benchmark.FindingsThe results show that the returns of both the portfolios are higher than the benchmark buy-and-hold strategy return. It can be concluded that data mining techniques give better results, irrespective of the type of stock, and have the ability to make up for poor stocks. The comparison of returns of portfolios with the return of NIFTY as a benchmark also indicates that both the portfolios are generating higher returns as compared to the return generated by NIFTY.Originality/valueAs stock prices are influenced by both technical and fundamental indicators, the current paper explored the combined effect of technical analysis and fundamental analysis variables for Indian stock market prediction. Further, the results obtained by individual analysis have also been compared. The proposed method under study can also be utilized to determine whether to hold stocks for the long or short term using trend-based research.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44417754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Music sentiment classification based on an optimized CNN-RF-QPSO model 基于优化CNN-RF-QPSO模型的音乐情感分类

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2023-03-17 DOI: 10.1108/dta-07-2022-0267

Rui Tian, Ruheng Yin, Feng Gan

PurposeMusic sentiment analysis helps to promote the diversification of music information retrieval methods. Traditional music emotion classification tasks suffer from high manual workload and low classification accuracy caused by difficulty in feature extraction and inaccurate manual determination of hyperparameter. In this paper, the authors propose an optimized convolution neural network-random forest (CNN-RF) model for music sentiment classification which is capable of optimizing the manually selected hyperparameters to improve the accuracy of music sentiment classification and reduce labor costs and human classification errors.Design/methodology/approachA CNN-RF music sentiment classification model is designed based on quantum particle swarm optimization (QPSO). First, the audio data are transformed into a Mel spectrogram, and feature extraction is conducted by a CNN. Second, the music features extracted are processed by RF algorithm to complete a preliminary emotion classification. Finally, to select the suitable hyperparameters for a CNN, the QPSO algorithm is adopted to extract the best hyperparameters and obtain the final classification results.FindingsThe model has gone through experimental validations and achieved a classification accuracy of 97 per cent for different sentiment categories with shortened training time. The proposed method with QPSO achieved 1.2 and 1.6 per cent higher accuracy than that with particle swarm optimization and genetic algorithm, respectively. The proposed model had great potential for music sentiment classification.Originality/valueThe dual contribution of this work comprises the proposed model which integrated two deep learning models and the introduction of a QPSO into model optimization. With these two innovations, the efficiency and accuracy of music emotion recognition and classification have been significantly improved.

目的音乐情感分析有助于促进音乐信息检索方法的多样化。传统的音乐情感分类任务由于特征提取困难和超参数的手工确定不准确，导致手工工作量大，分类精度低。在本文中，作者提出了一种用于音乐情感分类的优化卷积神经网络随机森林（CNN-RF）模型，该模型能够优化手动选择的超参数，以提高音乐情感分类精度，降低人工成本和人工分类误差。设计/方法论/方法基于量子粒子群优化（QPSO）设计了一个CNN-RF音乐情感分类模型。首先，将音频数据转换为梅尔谱图，并通过CNN进行特征提取。其次，通过RF算法对提取的音乐特征进行处理，完成初步的情绪分类。最后，为了为CNN选择合适的超参数，采用QPSO算法提取最佳超参数并获得最终的分类结果。发现该模型经过了实验验证，在缩短训练时间的情况下，对不同情绪类别的分类准确率达到97%。与粒子群优化和遗传算法相比，QPSO方法的精度分别提高了1.2%和1.6%。所提出的模型在音乐情感分类方面具有很大的潜力。原创性/价值这项工作的双重贡献包括所提出的模型，该模型集成了两个深度学习模型，并在模型优化中引入了QPSO。通过这两项创新，音乐情感识别和分类的效率和准确性得到了显著提高。

{"title":"Music sentiment classification based on an optimized CNN-RF-QPSO model","authors":"Rui Tian, Ruheng Yin, Feng Gan","doi":"10.1108/dta-07-2022-0267","DOIUrl":"https://doi.org/10.1108/dta-07-2022-0267","url":null,"abstract":"PurposeMusic sentiment analysis helps to promote the diversification of music information retrieval methods. Traditional music emotion classification tasks suffer from high manual workload and low classification accuracy caused by difficulty in feature extraction and inaccurate manual determination of hyperparameter. In this paper, the authors propose an optimized convolution neural network-random forest (CNN-RF) model for music sentiment classification which is capable of optimizing the manually selected hyperparameters to improve the accuracy of music sentiment classification and reduce labor costs and human classification errors.Design/methodology/approachA CNN-RF music sentiment classification model is designed based on quantum particle swarm optimization (QPSO). First, the audio data are transformed into a Mel spectrogram, and feature extraction is conducted by a CNN. Second, the music features extracted are processed by RF algorithm to complete a preliminary emotion classification. Finally, to select the suitable hyperparameters for a CNN, the QPSO algorithm is adopted to extract the best hyperparameters and obtain the final classification results.FindingsThe model has gone through experimental validations and achieved a classification accuracy of 97 per cent for different sentiment categories with shortened training time. The proposed method with QPSO achieved 1.2 and 1.6 per cent higher accuracy than that with particle swarm optimization and genetic algorithm, respectively. The proposed model had great potential for music sentiment classification.Originality/valueThe dual contribution of this work comprises the proposed model which integrated two deep learning models and the introduction of a QPSO into model optimization. With these two innovations, the efficiency and accuracy of music emotion recognition and classification have been significantly improved.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47188804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multi-relation global context learning for session-based recommendation 基于会话推荐的多关系全局上下文学习

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2023-03-16 DOI: 10.1108/dta-07-2022-0290

Yishan Liu, Wenming Cao, Guitao Cao

PurposeSession-based recommendation aims to predict the user's next preference based on the user's recent activities. Although most existing studies consider the global characteristics of items, they only learn the global characteristics of items based on a single connection relationship, which cannot fully capture the complex transformation relationship between items. We believe that multiple relationships between items in learning sessions can improve the performance of session recommendation tasks and the scalability of recommendation models. At the same time, high-quality global features of the item help to explore the potential common preferences of users.Design/methodology/approachThis work proposes a session-based recommendation method with a multi-relation global context–enhanced network to capture this global transition relationship. Specifically, we construct a multi-relation global item graph based on a group of sessions, use a graded attention mechanism to learn different types of connection relations independently and obtain the global feature of the item according to the multi-relation weight.FindingsWe did related experiments on three benchmark datasets. The experimental results show that our proposed model is superior to the existing state-of-the-art methods, which verifies the effectiveness of our model.Originality/valueFirst, we construct a multi-relation global item graph to learn the complex transition relations of the global context of the item and effectively mine the potential association of items between different sessions. Second, our model effectively improves the scalability of the model by obtaining high-quality item global features and enables some previously unconsidered items to make it onto the candidate list.

基于会话的推荐旨在根据用户最近的活动来预测用户的下一个偏好。虽然现有的研究大多考虑了项目的全局特征，但它们只是基于单一的连接关系来学习项目的全局特征，无法充分捕捉到项目之间复杂的转换关系。我们认为，学习会话中项目之间的多重关系可以提高会话推荐任务的性能和推荐模型的可扩展性。同时，产品的高质量全球特征有助于探索用户潜在的共同偏好。设计/方法/方法本工作提出了一种基于会话的推荐方法，该方法采用多关系全局上下文增强网络来捕获这种全局转换关系。具体而言，我们基于一组会话构建了一个多关系全局项目图，使用分级注意机制独立学习不同类型的连接关系，并根据多关系权重获得项目的全局特征。我们在三个基准数据集上做了相关的实验。实验结果表明，该模型优于现有的最先进的方法，验证了该模型的有效性。首先，我们构建了一个多关系的全局项目图，学习了项目全局上下文的复杂过渡关系，有效挖掘了不同会话之间项目的潜在关联。其次，我们的模型通过获得高质量的物品全局特征，有效地提高了模型的可扩展性，并使一些之前未考虑的物品进入候选列表。

{"title":"Multi-relation global context learning for session-based recommendation","authors":"Yishan Liu, Wenming Cao, Guitao Cao","doi":"10.1108/dta-07-2022-0290","DOIUrl":"https://doi.org/10.1108/dta-07-2022-0290","url":null,"abstract":"PurposeSession-based recommendation aims to predict the user's next preference based on the user's recent activities. Although most existing studies consider the global characteristics of items, they only learn the global characteristics of items based on a single connection relationship, which cannot fully capture the complex transformation relationship between items. We believe that multiple relationships between items in learning sessions can improve the performance of session recommendation tasks and the scalability of recommendation models. At the same time, high-quality global features of the item help to explore the potential common preferences of users.Design/methodology/approachThis work proposes a session-based recommendation method with a multi-relation global context–enhanced network to capture this global transition relationship. Specifically, we construct a multi-relation global item graph based on a group of sessions, use a graded attention mechanism to learn different types of connection relations independently and obtain the global feature of the item according to the multi-relation weight.FindingsWe did related experiments on three benchmark datasets. The experimental results show that our proposed model is superior to the existing state-of-the-art methods, which verifies the effectiveness of our model.Originality/valueFirst, we construct a multi-relation global item graph to learn the complex transition relations of the global context of the item and effectively mine the potential association of items between different sessions. Second, our model effectively improves the scalability of the model by obtaining high-quality item global features and enables some previously unconsidered items to make it onto the candidate list.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48165487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A new method based on ensemble time series for fast and accurate clustering 一种基于集成时间序列的快速准确聚类方法

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2023-03-15 DOI: 10.1108/dta-08-2022-0300

A. Ghorbanian, H. Razavi

PurposeThe common methods for clustering time series are the use of specific distance criteria or the use of standard clustering algorithms. Ensemble clustering is one of the common techniques used in data mining to increase the accuracy of clustering. In this study, based on segmentation, selecting the best segments, and using ensemble clustering for selected segments, a multistep approach has been developed for the whole clustering of time series data.Design/methodology/approachFirst, this approach divides the time series dataset into equal segments. In the next step, using one or more internal clustering criteria, the best segments are selected, and then the selected segments are combined for final clustering. By using a loop and how to select the best segments for the final clustering (using one criterion or several criteria simultaneously), two algorithms have been developed in different settings. A logarithmic relationship limits the number of segments created in the loop.FindingAccording to Rand's external criteria and statistical tests, at first, the best setting of the two developed algorithms has been selected. Then this setting has been compared to different algorithms in the literature on clustering accuracy and execution time. The obtained results indicate more accuracy and less execution time for the proposed approach.Originality/valueThis paper proposed a fast and accurate approach for time series clustering in three main steps. This is the first work that uses a combination of segmentation and ensemble clustering. More accuracy and less execution time are the remarkable achievements of this study.

常用的时间序列聚类方法是使用特定的距离标准或使用标准聚类算法。集成聚类是数据挖掘中用于提高聚类精度的常用技术之一。本研究提出了一种多步骤的时间序列数据整体聚类方法，该方法基于分割、选择最佳片段，并对所选片段使用集成聚类。设计/方法/方法首先，该方法将时间序列数据集分成相等的部分。下一步，使用一个或多个内部聚类标准，选择最佳段，然后将选择的段组合进行最终聚类。通过使用循环以及如何为最终聚类选择最佳片段(同时使用一个标准或几个标准)，在不同的设置下开发了两种算法。对数关系限制了在循环中创建的段的数量。根据Rand的外部标准和统计检验，首先选择了两种开发算法的最佳设置。然后将此设置与文献中不同的算法在聚类精度和执行时间上进行了比较。结果表明，该方法具有较高的精度和较短的执行时间。本文提出了一种快速准确的时间序列聚类方法，分为三个主要步骤。这是第一次使用分割和集成聚类相结合的工作。准确性提高，执行时间缩短是本研究的显著成果。

{"title":"A new method based on ensemble time series for fast and accurate clustering","authors":"A. Ghorbanian, H. Razavi","doi":"10.1108/dta-08-2022-0300","DOIUrl":"https://doi.org/10.1108/dta-08-2022-0300","url":null,"abstract":"PurposeThe common methods for clustering time series are the use of specific distance criteria or the use of standard clustering algorithms. Ensemble clustering is one of the common techniques used in data mining to increase the accuracy of clustering. In this study, based on segmentation, selecting the best segments, and using ensemble clustering for selected segments, a multistep approach has been developed for the whole clustering of time series data.Design/methodology/approachFirst, this approach divides the time series dataset into equal segments. In the next step, using one or more internal clustering criteria, the best segments are selected, and then the selected segments are combined for final clustering. By using a loop and how to select the best segments for the final clustering (using one criterion or several criteria simultaneously), two algorithms have been developed in different settings. A logarithmic relationship limits the number of segments created in the loop.FindingAccording to Rand's external criteria and statistical tests, at first, the best setting of the two developed algorithms has been selected. Then this setting has been compared to different algorithms in the literature on clustering accuracy and execution time. The obtained results indicate more accuracy and less execution time for the proposed approach.Originality/valueThis paper proposed a fast and accurate approach for time series clustering in three main steps. This is the first work that uses a combination of segmentation and ensemble clustering. More accuracy and less execution time are the remarkable achievements of this study.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42318784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

A novel twin-support vector machine for binary classification to imbalanced data 一种新的双支持向量机对不平衡数据进行二值分类

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2023-03-09 DOI: 10.1108/dta-08-2022-0302

Jingyi Li, S. Chao

PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classifier degradation. To address this, this paper proposes a twin-support vector machines for binary classification on imbalanced data.Design/methodology/approachIn the proposed method, the authors construct two support vector machines to focus on majority classes and minority classes, respectively. In order to promote the learning ability of the two support vector machines, a new kernel is derived for them.Findings(1) A novel twin-support vector machine is proposed for binary classification on imbalanced data, and new kernels are derived. (2) For imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned by using optimizing kernels. (3) Classifiers based on twin architectures have more advantages than those based on single architecture for binary classification on imbalanced data.Originality/valueFor imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned through using optimizing kernels.

目的对不平衡数据进行二值分类是一个挑战;由于阶级的不平衡，少数阶级很容易被多数阶级所掩盖。然而，现有的大多数分类器更擅长识别多数类，从而忽略了少数类，从而导致分类器退化。为了解决这个问题，本文提出了一种双支持向量机对不平衡数据进行二值分类。在提出的方法中，作者构建了两个支持向量机，分别关注多数类和少数类。(1)提出了一种新的双支持向量机对不平衡数据进行二值分类，并推导了新的核。(2)对于不平衡数据，数据分布的复杂性对分类结果有负面影响;然而，通过使用优化核可以获得高级分类结果和期望的边界。(3)对于非平衡数据的二值分类，基于双体系结构的分类器比基于单体系结构的分类器更有优势。独创性/价值对于不平衡数据，数据分布的复杂性对分类结果有负面影响;然而，通过使用优化核可以获得高级分类结果并学习到期望的边界。

{"title":"A novel twin-support vector machine for binary classification to imbalanced data","authors":"Jingyi Li, S. Chao","doi":"10.1108/dta-08-2022-0302","DOIUrl":"https://doi.org/10.1108/dta-08-2022-0302","url":null,"abstract":"PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classifier degradation. To address this, this paper proposes a twin-support vector machines for binary classification on imbalanced data.Design/methodology/approachIn the proposed method, the authors construct two support vector machines to focus on majority classes and minority classes, respectively. In order to promote the learning ability of the two support vector machines, a new kernel is derived for them.Findings(1) A novel twin-support vector machine is proposed for binary classification on imbalanced data, and new kernels are derived. (2) For imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned by using optimizing kernels. (3) Classifiers based on twin architectures have more advantages than those based on single architecture for binary classification on imbalanced data.Originality/valueFor imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned through using optimizing kernels.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":"49 1","pages":"385-396"},"PeriodicalIF":1.6,"publicationDate":"2023-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73540195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Machine learning methods for results merging in patent retrieval 专利检索中结果合并的机器学习方法

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2023-02-27 DOI: 10.1108/dta-06-2021-0156

Vasileios Stamatis, M. Salampasis, K. Diamantaras

PurposeIn federated search, a query is sent simultaneously to multiple resources and each one of them returns a list of results. These lists are merged into a single list using the results merging process. In this work, the authors apply machine learning methods for results merging in federated patent search. Even though several methods for results merging have been developed, none of them were tested on patent data nor considered several machine learning models. Thus, the authors experiment with state-of-the-art methods using patent data and they propose two new methods for results merging that use machine learning models.Design/methodology/approachThe methods are based on a centralized index containing samples of documents from all the remote resources, and they implement machine learning models to estimate comparable scores for the documents retrieved by different resources. The authors examine the new methods in cooperative and uncooperative settings where document scores from the remote search engines are available and not, respectively. In uncooperative environments, they propose two methods for assigning document scores.FindingsThe effectiveness of the new results merging methods was measured against state-of-the-art models and found to be superior to them in many cases with significant improvements. The random forest model achieves the best results in comparison to all other models and presents new insights for the results merging problem.Originality/valueIn this article the authors prove that machine learning models can substitute other standard methods and models that used for results merging for many years. Our methods outperformed state-of-the-art estimation methods for results merging, and they proved that they are more effective for federated patent search.

目的在联合搜索中，一个查询被同时发送到多个资源，每个资源都返回一个结果列表。使用结果合并过程将这些列表合并为单个列表。在这项工作中，作者将机器学习方法应用于联邦专利搜索中的结果合并。尽管已经开发了几种结果合并方法，但没有一种方法在专利数据上进行测试，也没有考虑几种机器学习模型。因此，作者使用专利数据对最先进的方法进行了实验，并提出了两种使用机器学习模型的结果合并新方法。设计/方法论/方法论这些方法基于一个集中索引，该索引包含来自所有远程资源的文档样本，它们实现了机器学习模型，以估计不同资源检索到的文档的可比分数。作者研究了在合作和不合作环境中的新方法，其中远程搜索引擎的文档分数分别可用和不可用。在不合作的环境中，他们提出了两种分配文档分数的方法。发现新的结果合并方法的有效性是根据最先进的模型进行测量的，发现在许多情况下都优于它们，并有显著的改进。与所有其他模型相比，随机森林模型获得了最好的结果，并为结果合并问题提供了新的见解。独创性/价值在这篇文章中，作者证明了机器学习模型可以替代多年来用于结果合并的其他标准方法和模型。我们的方法在结果合并方面优于最先进的估计方法，并证明它们在联合专利搜索中更有效。

{"title":"Machine learning methods for results merging in patent retrieval","authors":"Vasileios Stamatis, M. Salampasis, K. Diamantaras","doi":"10.1108/dta-06-2021-0156","DOIUrl":"https://doi.org/10.1108/dta-06-2021-0156","url":null,"abstract":"PurposeIn federated search, a query is sent simultaneously to multiple resources and each one of them returns a list of results. These lists are merged into a single list using the results merging process. In this work, the authors apply machine learning methods for results merging in federated patent search. Even though several methods for results merging have been developed, none of them were tested on patent data nor considered several machine learning models. Thus, the authors experiment with state-of-the-art methods using patent data and they propose two new methods for results merging that use machine learning models.Design/methodology/approachThe methods are based on a centralized index containing samples of documents from all the remote resources, and they implement machine learning models to estimate comparable scores for the documents retrieved by different resources. The authors examine the new methods in cooperative and uncooperative settings where document scores from the remote search engines are available and not, respectively. In uncooperative environments, they propose two methods for assigning document scores.FindingsThe effectiveness of the new results merging methods was measured against state-of-the-art models and found to be superior to them in many cases with significant improvements. The random forest model achieves the best results in comparison to all other models and presents new insights for the results merging problem.Originality/valueIn this article the authors prove that machine learning models can substitute other standard methods and models that used for results merging for many years. Our methods outperformed state-of-the-art estimation methods for results merging, and they proved that they are more effective for federated patent search.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44771332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

SanMove: next location recommendation via self-attention network SanMove:通过自关注网络推荐下一个位置

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2023-02-25 DOI: 10.1108/dta-03-2022-0093

Bin Wang, Huifeng Li, Le Tong, Qian Zhang, Sulei Zhu, Tao Yang

PurposeThis paper aims to address the following issues: (1) most existing methods are based on recurrent network, which is time-consuming to train long sequences due to not allowing for full parallelism; (2) personalized preference generally are not considered reasonably; (3) existing methods rarely systematically studied how to efficiently utilize various auxiliary information (e.g. user ID and time stamp) in trajectory data and the spatiotemporal relations among nonconsecutive locations.Design/methodology/approachThe authors propose a novel self-attention network–based model named SanMove to predict the next location via capturing the long- and short-term mobility patterns of users. Specifically, SanMove uses a self-attention module to capture each user's long-term preference, which can represent her personalized location preference. Meanwhile, the authors use a spatial-temporal guided noninvasive self-attention (STNOVA) module to exploit auxiliary information in the trajectory data to learn the user's short-term preference.FindingsThe authors evaluate SanMove on two real-world datasets. The experimental results demonstrate that SanMove is not only faster than the state-of-the-art recurrent neural network (RNN) based predict model but also outperforms the baselines for next location prediction.Originality/valueThe authors propose a self-attention-based sequential model named SanMove to predict the user's trajectory, which comprised long-term and short-term preference learning modules. SanMove allows full parallel processing of trajectories to improve processing efficiency. They propose an STNOVA module to capture the sequential transitions of current trajectories. Moreover, the self-attention module is used to process historical trajectory sequences in order to capture the personalized location preference of each user. The authors conduct extensive experiments on two check-in datasets. The experimental results demonstrate that the model has a fast training speed and excellent performance compared with the existing RNN-based methods for next location prediction.

本文旨在解决以下问题:(1)现有方法大多基于循环网络，由于不允许完全并行，导致长序列训练时间长;(2)个性化偏好普遍没有得到合理考虑;(3)现有方法很少系统地研究如何有效利用轨迹数据中的各种辅助信息(如用户ID和时间戳)以及非连续位置之间的时空关系。设计/方法/方法作者提出了一种新的基于自关注网络的模型SanMove，该模型通过捕捉用户的长期和短期移动模式来预测下一个位置。具体来说，SanMove使用一个自我关注模块来捕捉每个用户的长期偏好，这可以代表她个性化的位置偏好。同时，利用时空导向非侵入性自注意(STNOVA)模块，利用轨迹数据中的辅助信息来学习用户的短期偏好。研究结果作者在两个真实世界的数据集上评估了SanMove。实验结果表明，SanMove不仅比最先进的基于递归神经网络(RNN)的预测模型更快，而且在下一个位置预测方面也优于基线。原创性/价值作者提出了一个基于自我注意的序列模型SanMove来预测用户的轨迹，该模型由长期偏好学习和短期偏好学习模块组成。SanMove允许轨迹的完全并行处理，以提高处理效率。他们提出了一个STNOVA模块来捕捉当前轨迹的顺序转换。此外，利用自关注模块对历史轨迹序列进行处理，以捕获每个用户的个性化位置偏好。作者在两个检入数据集上进行了广泛的实验。实验结果表明，与现有的基于rnn的下一个位置预测方法相比，该模型具有较快的训练速度和优异的性能。

{"title":"SanMove: next location recommendation via self-attention network","authors":"Bin Wang, Huifeng Li, Le Tong, Qian Zhang, Sulei Zhu, Tao Yang","doi":"10.1108/dta-03-2022-0093","DOIUrl":"https://doi.org/10.1108/dta-03-2022-0093","url":null,"abstract":"PurposeThis paper aims to address the following issues: (1) most existing methods are based on recurrent network, which is time-consuming to train long sequences due to not allowing for full parallelism; (2) personalized preference generally are not considered reasonably; (3) existing methods rarely systematically studied how to efficiently utilize various auxiliary information (e.g. user ID and time stamp) in trajectory data and the spatiotemporal relations among nonconsecutive locations.Design/methodology/approachThe authors propose a novel self-attention network–based model named SanMove to predict the next location via capturing the long- and short-term mobility patterns of users. Specifically, SanMove uses a self-attention module to capture each user's long-term preference, which can represent her personalized location preference. Meanwhile, the authors use a spatial-temporal guided noninvasive self-attention (STNOVA) module to exploit auxiliary information in the trajectory data to learn the user's short-term preference.FindingsThe authors evaluate SanMove on two real-world datasets. The experimental results demonstrate that SanMove is not only faster than the state-of-the-art recurrent neural network (RNN) based predict model but also outperforms the baselines for next location prediction.Originality/valueThe authors propose a self-attention-based sequential model named SanMove to predict the user's trajectory, which comprised long-term and short-term preference learning modules. SanMove allows full parallel processing of trajectories to improve processing efficiency. They propose an STNOVA module to capture the sequential transitions of current trajectories. Moreover, the self-attention module is used to process historical trajectory sequences in order to capture the personalized location preference of each user. The authors conduct extensive experiments on two check-in datasets. The experimental results demonstrate that the model has a fast training speed and excellent performance compared with the existing RNN-based methods for next location prediction.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":"49 1","pages":"330-343"},"PeriodicalIF":1.6,"publicationDate":"2023-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76459007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0