首页 > 最新文献

Applied Intelligence最新文献

英文 中文
A BERT-based review helpfulness prediction model utilizing consistency of ratings and texts
IF 3.4 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-15 DOI: 10.1007/s10489-024-06100-x
Xinzhe Li, Qinglong Li, Dongyeop Ryu, Jaekyeong Kim

Predicting review helpfulness (RH) to ensure that consumers make effective purchasing decisions is a significant area of study. Many scholars have attempted to develop accurate review helpfulness prediction (RHP) methodologies. However, most previous studies have mainly focused on predictions using product review texts, and few studies have used product satisfaction as indicated by star ratings, particularly the consistency between review texts and star ratings. This study proposes a novel model called BHelP-CoRT (Bidirectional Encoder Representations from Transformers based RHP model utilizing consistency of ratings and texts) to predict RH. The proposed model consists of a review text encoder, star rating encoder, and text-rating interaction. The review text encoder was developed by applying the BERT model to extract contextual semantic features embedded in review texts. The star rating encoder was designed to embed star ratings into feature vectors. The text-rating interaction was constructed by applying an attention mechanism to extract the text-rating interaction and introduce consistency into the RHP tasks. This study conducted extensive experiments to demonstrate the effectiveness of the proposed model from multiple perspectives using real-world online reviews collected from Amazon. The experimental results show that the proposed model outperforms the state-of-the-art models, indicating that it can improve the RHP performance. Specifically, this effectiveness is reflected in the processing of reviews containing inconsistent information. This study supports the marketing efforts of the e-commerce industry by providing an RHP service to address consumer information overload.

Graphical abstract

{"title":"A BERT-based review helpfulness prediction model utilizing consistency of ratings and texts","authors":"Xinzhe Li,&nbsp;Qinglong Li,&nbsp;Dongyeop Ryu,&nbsp;Jaekyeong Kim","doi":"10.1007/s10489-024-06100-x","DOIUrl":"10.1007/s10489-024-06100-x","url":null,"abstract":"<div><p>Predicting review helpfulness (RH) to ensure that consumers make effective purchasing decisions is a significant area of study. Many scholars have attempted to develop accurate review helpfulness prediction (RHP) methodologies. However, most previous studies have mainly focused on predictions using product review texts, and few studies have used product satisfaction as indicated by star ratings, particularly the consistency between review texts and star ratings. This study proposes a novel model called BHelP-CoRT (Bidirectional Encoder Representations from Transformers based RHP model utilizing consistency of ratings and texts) to predict RH. The proposed model consists of a review text encoder, star rating encoder, and text-rating interaction. The review text encoder was developed by applying the BERT model to extract contextual semantic features embedded in review texts. The star rating encoder was designed to embed star ratings into feature vectors. The text-rating interaction was constructed by applying an attention mechanism to extract the text-rating interaction and introduce consistency into the RHP tasks. This study conducted extensive experiments to demonstrate the effectiveness of the proposed model from multiple perspectives using real-world online reviews collected from Amazon. The experimental results show that the proposed model outperforms the state-of-the-art models, indicating that it can improve the RHP performance. Specifically, this effectiveness is reflected in the processing of reviews containing inconsistent information. This study supports the marketing efforts of the e-commerce industry by providing an RHP service to address consumer information overload.</p><h3>Graphical abstract</h3><div><figure><div><div><picture><source><img></source></picture></div></div></figure></div></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143423296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GANet: geometry-aware network for RGB-D semantic segmentation
IF 3.4 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-15 DOI: 10.1007/s10489-025-06337-0
Chunqi Tian, Weirong Xu, Lizhi Bai, Jun Yang, Yanjun Xu

The field of RGB-D semantic segmentation has attracted considerable interest in recent times. The challenge is to develop an effective method for combining RGB images, which capture colour variations, with depth images, which provide robust information about object geometry regardless of lighting conditions. Treating both image types equally through the same convolution operator fails to take into account their inherent differences. Thus, in this paper, we propose a novel approach that combines a geometry-aware convolution (GAConv) module and a multiscale fusion module (MFM) with the aim of enhancing the performance of RGB-D image segmentation. The GAConv module effectively captures fine-grained geometric details from depth images, while the MFM module enables efficient integration of multi-scale features, allowing the network to utilise both spatial and semantic information. Extensive experimentation was conducted on the NYUv2 and SUN RGB-D datasets, wherein our model demonstrated consistent superiority over existing state-of-the-art methods in terms of pixel accuracy and mean intersection over union (mIoU).

近来,RGB-D 语义分割领域引起了广泛关注。RGB 图像能捕捉色彩的变化,而深度图像则能提供有关物体几何形状的可靠信息,因此,如何开发一种有效的方法将 RGB 图像与深度图像结合起来是一项挑战。通过相同的卷积算子对这两种图像进行平等处理,无法考虑到它们之间的内在差异。因此,在本文中,我们提出了一种结合几何感知卷积(GAConv)模块和多尺度融合模块(MFM)的新方法,旨在提高 RGB-D 图像分割的性能。GAConv 模块能有效捕捉深度图像中细粒度的几何细节,而 MFM 模块则能有效整合多尺度特征,使网络同时利用空间和语义信息。我们在 NYUv2 和 SUN RGB-D 数据集上进行了广泛的实验,结果表明我们的模型在像素精确度和平均交集大于联合(mIoU)方面始终优于现有的先进方法。
{"title":"GANet: geometry-aware network for RGB-D semantic segmentation","authors":"Chunqi Tian,&nbsp;Weirong Xu,&nbsp;Lizhi Bai,&nbsp;Jun Yang,&nbsp;Yanjun Xu","doi":"10.1007/s10489-025-06337-0","DOIUrl":"10.1007/s10489-025-06337-0","url":null,"abstract":"<div><p>The field of RGB-D semantic segmentation has attracted considerable interest in recent times. The challenge is to develop an effective method for combining RGB images, which capture colour variations, with depth images, which provide robust information about object geometry regardless of lighting conditions. Treating both image types equally through the same convolution operator fails to take into account their inherent differences. Thus, in this paper, we propose a novel approach that combines a geometry-aware convolution (<i>GAConv</i>) module and a multiscale fusion module (MFM) with the aim of enhancing the performance of RGB-D image segmentation. The <i>GAConv</i> module effectively captures fine-grained geometric details from depth images, while the MFM module enables efficient integration of multi-scale features, allowing the network to utilise both spatial and semantic information. Extensive experimentation was conducted on the NYUv2 and SUN RGB-D datasets, wherein our model demonstrated consistent superiority over existing state-of-the-art methods in terms of pixel accuracy and mean intersection over union (mIoU).</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143423167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scnet: spectral convolutional networks for multivariate time series classification
IF 3.4 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-15 DOI: 10.1007/s10489-025-06352-1
Xing Wu, Xinyu Xing, Junfeng Yao, Quan Qian, Jun Song

With the widespread application of time series data, the study of classification techniques has become an important topic. Although existing multivariate time series classification (MTSC) methods have made progress, they often rely on one-dimensional (1D) time series, which limits their ability to capture complex temporal dynamics and multiscale features. To address these challenges, a Spectral Convolutional Network (SCNet) is introduced in this work. SCNet effectively transforms 1D time series data into the frequency domain using an enhanced Discrete Fourier Transform (enhanced_DFT), revealing periodicity and key frequency components while reshaping the data into a two-dimensional (2D) time series for better representation. Furthermore, it uses a Spectral Energy Prioritization method to optimize frequency domain energy distribution and a multiscale convolutional module to capture features at different scales, improving the model’s ability to analyze short-term and long-term trends. To validate the effectiveness and superiority, we conducted extensive experiments on 10 sub-datasets from the well-known UEA dataset. The results show that our proposed SCNet achieved the highest average accuracy of 74.3%, which is 2.2% higher than the current state-of-the-art models, demonstrating its potential for practical application and efficiency in MTSC task.

{"title":"Scnet: spectral convolutional networks for multivariate time series classification","authors":"Xing Wu,&nbsp;Xinyu Xing,&nbsp;Junfeng Yao,&nbsp;Quan Qian,&nbsp;Jun Song","doi":"10.1007/s10489-025-06352-1","DOIUrl":"10.1007/s10489-025-06352-1","url":null,"abstract":"<div><p>With the widespread application of time series data, the study of classification techniques has become an important topic. Although existing multivariate time series classification (MTSC) methods have made progress, they often rely on one-dimensional (1D) time series, which limits their ability to capture complex temporal dynamics and multiscale features. To address these challenges, a Spectral Convolutional Network (SCNet) is introduced in this work. SCNet effectively transforms 1D time series data into the frequency domain using an enhanced Discrete Fourier Transform (enhanced_DFT), revealing periodicity and key frequency components while reshaping the data into a two-dimensional (2D) time series for better representation. Furthermore, it uses a Spectral Energy Prioritization method to optimize frequency domain energy distribution and a multiscale convolutional module to capture features at different scales, improving the model’s ability to analyze short-term and long-term trends. To validate the effectiveness and superiority, we conducted extensive experiments on 10 sub-datasets from the well-known UEA dataset. The results show that our proposed SCNet achieved the highest average accuracy of 74.3%, which is 2.2% higher than the current state-of-the-art models, demonstrating its potential for practical application and efficiency in MTSC task.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10489-025-06352-1.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143423299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
NoRD: A framework for noise-resilient self-distillation through relative supervision
IF 3.4 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-15 DOI: 10.1007/s10489-025-06355-y
Saurabh Sharma, Shikhar Singh Lodhi, Vanshika Srivastava, Joydeep Chandra

Knowledge distillation (KD) has become a pivotal technique in deep learning, facilitating model compression and regularization by transferring knowledge from one neural network to another, enhancing its capabilities for downstream tasks such as classification. However, real-world datasets often suffer from noisy label problems, significantly hindering neural network learning in supervised tasks. Recent advancements in KD aim to improve noise-robustness and regularization in deep neural networks through different learning paradigms. Yet, prevalent approaches often exhibit noise-prone behaviors as the student network heavily relies on the teacher’s learning. To address this challenge, we propose a robust knowledge transfer method, NoRD: a Noise-Resilient Self-Distillation framework. This approach leverages relative self-supervision combined with decision matching to minimize noise susceptibility during the knowledge transfer process. Our study evaluates this technique on CIFAR-10, CIFAR-100, and MNIST datasets with synthetic label noise. Results showcase that our method achieves 8-10% higher test accuracy compared to state-of-the-art noise-robust loss functions at noise rates exceeding 50%, surpassing well-known KD methods by 4-5% in top-1 test accuracy. The code is available at https://github.com/philsaurabh/NoRD_Applied-Intelligence.

知识蒸馏(KD)已成为深度学习中的一项关键技术,它通过将知识从一个神经网络转移到另一个神经网络,促进了模型压缩和正则化,增强了神经网络在分类等下游任务中的能力。然而,现实世界的数据集往往存在噪声标签问题,严重阻碍了神经网络在监督任务中的学习。最近,KD 的研究进展旨在通过不同的学习范式,提高深度神经网络的抗噪性和正则化。然而,由于学生网络在很大程度上依赖于教师的学习,因此流行的方法往往表现出易受噪声影响的行为。为了应对这一挑战,我们提出了一种稳健的知识转移方法--NoRD:一种抗噪声自蒸馏框架。这种方法利用相对自监督与决策匹配相结合的方式,最大限度地降低知识转移过程中的噪声敏感性。我们的研究在具有合成标签噪声的 CIFAR-10、CIFAR-100 和 MNIST 数据集上对该技术进行了评估。结果表明,在噪声率超过 50%的情况下,我们的方法与最先进的抗噪损失函数相比,测试准确率提高了 8-10%,在前 1 位测试准确率方面比著名的 KD 方法高出 4-5%。代码见 https://github.com/philsaurabh/NoRD_Applied-Intelligence。
{"title":"NoRD: A framework for noise-resilient self-distillation through relative supervision","authors":"Saurabh Sharma,&nbsp;Shikhar Singh Lodhi,&nbsp;Vanshika Srivastava,&nbsp;Joydeep Chandra","doi":"10.1007/s10489-025-06355-y","DOIUrl":"10.1007/s10489-025-06355-y","url":null,"abstract":"<div><p>Knowledge distillation (KD) has become a pivotal technique in deep learning, facilitating model compression and regularization by transferring knowledge from one neural network to another, enhancing its capabilities for downstream tasks such as classification. However, real-world datasets often suffer from noisy label problems, significantly hindering neural network learning in supervised tasks. Recent advancements in KD aim to improve noise-robustness and regularization in deep neural networks through different learning paradigms. Yet, prevalent approaches often exhibit noise-prone behaviors as the student network heavily relies on the teacher’s learning. To address this challenge, we propose a robust knowledge transfer method, <b>NoRD</b>: a <b>N</b>oise-<b>R</b>esilient Self-<b>D</b>istillation framework. This approach leverages relative self-supervision combined with decision matching to minimize noise susceptibility during the knowledge transfer process. Our study evaluates this technique on CIFAR-10, CIFAR-100, and MNIST datasets with synthetic label noise. Results showcase that our method achieves 8-10% higher test accuracy compared to state-of-the-art noise-robust loss functions at noise rates exceeding 50%, surpassing well-known KD methods by 4-5% in top-1 test accuracy. The code is available at https://github.com/philsaurabh/NoRD_Applied-Intelligence.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143423166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Knowledge-guided classification and regression surrogates co-assisted multi-objective soft subspace clustering algorithm
IF 3.4 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-15 DOI: 10.1007/s10489-025-06266-y
Feng Zhao, Lu Li, Hanqiang Liu

The efficiency of multi-objective soft subspace clustering algorithms (MSSCAs) can be low when applied to large-scale datasets. This inefficiency arises because the multi-objective evolutionary algorithms (MOEAs) utilized in MSSCAs often require a large number of soft subspace clustering objective function evaluations due to their population-based nature. Moreover, relying solely on negative Shannon entropy to constrain feature weights is inadequate for soft subspace clustering algorithms. To address these issues, a knowledge-guided classification and regression surrogates co-assisted multi-objective soft subspace clustering (KCRS-MOSSC) algorithm is presented. First, an inter-cluster feature weight dissimilarity function is designed to further constrain the feature weights. Furthermore, a novel surrogate-based optimization framework called the knowledge-guided classification and regression surrogates co-assisted multi-objective evolutionary framework (KCRS-MOEF) is proposed to efficiently optimize the proposed inter-cluster feature weight dissimilarity function, intra-cluster compactness function, inter-cluster separation function, and negative Shannon entropy function. In KCRS-MOEF, a classification decision tree is utilized as the classification surrogate model to help generate a set of promising offspring, while a radial basis function (RBF) model is employed as the regression surrogate model to assist in the infill criterion by predicting the objective function values of the offspring. Furthermore, to fully leverage the knowledge of the evolutionary process, an infill criterion guided by dynamic process knowledge of elite individuals is designed to enhance the convergence and diversity of the population. Finally, a clustering ensemble strategy based on knee point guidance is proposed to generate a final solution from a set of non-dominated individuals. KCRS-MOEF outperforms state-of-the-art counterparts in terms of convergence, diversity, and time efficiency, as demonstrated in four experiments conducted on the DTLZ benchmark. Furthermore, experiments on various datasets show that the clustering performance and time efficiency of KCRS-MOSSC exceed those of comparison algorithms.

{"title":"Knowledge-guided classification and regression surrogates co-assisted multi-objective soft subspace clustering algorithm","authors":"Feng Zhao,&nbsp;Lu Li,&nbsp;Hanqiang Liu","doi":"10.1007/s10489-025-06266-y","DOIUrl":"10.1007/s10489-025-06266-y","url":null,"abstract":"<div><p>The efficiency of multi-objective soft subspace clustering algorithms (MSSCAs) can be low when applied to large-scale datasets. This inefficiency arises because the multi-objective evolutionary algorithms (MOEAs) utilized in MSSCAs often require a large number of soft subspace clustering objective function evaluations due to their population-based nature. Moreover, relying solely on negative Shannon entropy to constrain feature weights is inadequate for soft subspace clustering algorithms. To address these issues, a knowledge-guided classification and regression surrogates co-assisted multi-objective soft subspace clustering (KCRS-MOSSC) algorithm is presented. First, an inter-cluster feature weight dissimilarity function is designed to further constrain the feature weights. Furthermore, a novel surrogate-based optimization framework called the knowledge-guided classification and regression surrogates co-assisted multi-objective evolutionary framework (KCRS-MOEF) is proposed to efficiently optimize the proposed inter-cluster feature weight dissimilarity function, intra-cluster compactness function, inter-cluster separation function, and negative Shannon entropy function. In KCRS-MOEF, a classification decision tree is utilized as the classification surrogate model to help generate a set of promising offspring, while a radial basis function (RBF) model is employed as the regression surrogate model to assist in the infill criterion by predicting the objective function values of the offspring. Furthermore, to fully leverage the knowledge of the evolutionary process, an infill criterion guided by dynamic process knowledge of elite individuals is designed to enhance the convergence and diversity of the population. Finally, a clustering ensemble strategy based on knee point guidance is proposed to generate a final solution from a set of non-dominated individuals. KCRS-MOEF outperforms state-of-the-art counterparts in terms of convergence, diversity, and time efficiency, as demonstrated in four experiments conducted on the DTLZ benchmark. Furthermore, experiments on various datasets show that the clustering performance and time efficiency of KCRS-MOSSC exceed those of comparison algorithms.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143423298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced decision framework for two-player zero-sum Markov games with diverse opponent policies
IF 3.4 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-14 DOI: 10.1007/s10489-025-06344-1
Jin Zhu, Xuan Wang, Dullerud Geir E.

This paper takes into account a general two-player zero-sum Markov game scenario in which our agent faces multi-type opponents with multiple policies. To enhance our agent’s return against opponent’s diverse policies, a novel Decision-making Framework based on Opponent Distinguishing and Policy Judgment (DF-ODPJ) is proposed. On the basis of the pre-trained Nash equilibrium strategies, DF-ODPJ can distinguish the opponent’s type by sampling from the interaction trajectory. Then a fast criterion is proposed to judge the opponent’s policy which is proven to minimize the misjudgment probability with optimal threshold calculated. According to the identification results, appropriate policies are generated to enhance the return. The proposed DF-ODPJ is more flexible since it is orthogonal to existing Nash equilibrium algorithms and single-agent reinforcement learning algorithms. The experimental results on grid world, video games, and UAV aerial combat environments illustrate the effectiveness of DF-ODPJ. The code is available at https://github.com/ChenXJ295/DF-ODPJ.

{"title":"Enhanced decision framework for two-player zero-sum Markov games with diverse opponent policies","authors":"Jin Zhu,&nbsp;Xuan Wang,&nbsp;Dullerud Geir E.","doi":"10.1007/s10489-025-06344-1","DOIUrl":"10.1007/s10489-025-06344-1","url":null,"abstract":"<div><p>This paper takes into account a general two-player zero-sum Markov game scenario in which our agent faces multi-type opponents with multiple policies. To enhance our agent’s return against opponent’s diverse policies, a novel Decision-making Framework based on Opponent Distinguishing and Policy Judgment (DF-ODPJ) is proposed. On the basis of the pre-trained Nash equilibrium strategies, DF-ODPJ can distinguish the opponent’s type by sampling from the interaction trajectory. Then a fast criterion is proposed to judge the opponent’s policy which is proven to minimize the misjudgment probability with optimal threshold calculated. According to the identification results, appropriate policies are generated to enhance the return. The proposed DF-ODPJ is more flexible since it is orthogonal to existing Nash equilibrium algorithms and single-agent reinforcement learning algorithms. The experimental results on grid world, video games, and UAV aerial combat environments illustrate the effectiveness of DF-ODPJ. The code is available at https://github.com/ChenXJ295/DF-ODPJ.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143404187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic fusion of multi-source heterogeneous data using MOE mechanism for stock prediction
IF 3.4 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-14 DOI: 10.1007/s10489-025-06330-7
Yuxin Dong, Zirui Wu, Yongtao Hao

Stock prices are influenced by numerous factors, including social media, news, and financial reports, serving as indicators of financial market dynamics. However, harnessing diverse information from different sources and structures to predict price trends remains challenging. In this paper, we propose a dual-stage deep learning model based on the Mixture-of-Expert (MoE) mechanism. In stage one, three distinct expert networks encode information about price movements, financial news, and investor sentiments through multi-source interaction attention. In stage two, a gated network dynamically fuses outputs, capturing temporal relationships in windowed data. Experimental results on the Chinese stock market demonstrate our model outperforms existing ones in forecasting tasks.

{"title":"Dynamic fusion of multi-source heterogeneous data using MOE mechanism for stock prediction","authors":"Yuxin Dong,&nbsp;Zirui Wu,&nbsp;Yongtao Hao","doi":"10.1007/s10489-025-06330-7","DOIUrl":"10.1007/s10489-025-06330-7","url":null,"abstract":"<div><p>Stock prices are influenced by numerous factors, including social media, news, and financial reports, serving as indicators of financial market dynamics. However, harnessing diverse information from different sources and structures to predict price trends remains challenging. In this paper, we propose a dual-stage deep learning model based on the Mixture-of-Expert (MoE) mechanism. In stage one, three distinct expert networks encode information about price movements, financial news, and investor sentiments through multi-source interaction attention. In stage two, a gated network dynamically fuses outputs, capturing temporal relationships in windowed data. Experimental results on the Chinese stock market demonstrate our model outperforms existing ones in forecasting tasks.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143404189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A code completion approach combining pointer network and Transformer-XL network
IF 3.4 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-14 DOI: 10.1007/s10489-025-06315-6
Xiangping Zhang, Jianxun Liu, Teng Long, Haize Hu

Code completion is a crucial aspect of contemporary integrated development environments (IDEs), as it not only streamlines the software development process but also bolsters the quality of software products. By leveraging large-scale codes to learn the probability distribution among code token units, deep learning methods have demonstrated significant improvements in the accuracy of token unit recommendations. However, the efficacy of code completion employing deep learning is often compromised by information loss. To mitigate this issue, we introduce a novel code language model that incorporates both the pointer network and the Transformer-XL architecture to surpass the constraints of current approaches in code completion. Our proposed model accepts as input the original code snippet and its corresponding abstract syntax tree (AST), utilizing the Transformer-XL model as the foundational architecture for capturing long-term dependencies. Additionally, we incorporate a pointer network as an adjunct component to forecast Out-of-Vocabulary (OoV) words. Our approach has been rigorously evaluated on the authentic PY150 and JS150 datasets. The comparative experimental results demonstrate the effectiveness of our model in improving the accuracy of the code completion task at the token unit level.

{"title":"A code completion approach combining pointer network and Transformer-XL network","authors":"Xiangping Zhang,&nbsp;Jianxun Liu,&nbsp;Teng Long,&nbsp;Haize Hu","doi":"10.1007/s10489-025-06315-6","DOIUrl":"10.1007/s10489-025-06315-6","url":null,"abstract":"<div><p>Code completion is a crucial aspect of contemporary integrated development environments (IDEs), as it not only streamlines the software development process but also bolsters the quality of software products. By leveraging large-scale codes to learn the probability distribution among code token units, deep learning methods have demonstrated significant improvements in the accuracy of token unit recommendations. However, the efficacy of code completion employing deep learning is often compromised by information loss. To mitigate this issue, we introduce a novel code language model that incorporates both the pointer network and the Transformer-XL architecture to surpass the constraints of current approaches in code completion. Our proposed model accepts as input the original code snippet and its corresponding abstract syntax tree (AST), utilizing the Transformer-XL model as the foundational architecture for capturing long-term dependencies. Additionally, we incorporate a pointer network as an adjunct component to forecast Out-of-Vocabulary (OoV) words. Our approach has been rigorously evaluated on the authentic PY150 and JS150 datasets. The comparative experimental results demonstrate the effectiveness of our model in improving the accuracy of the code completion task at the token unit level.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143423220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Loop closure detection based on image feature matching and motion trajectory similarity for mobile robot 基于图像特征匹配和运动轨迹相似性的移动机器人环路闭合检测
IF 3.4 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-13 DOI: 10.1007/s10489-024-05874-4
Weilong Hao, Peng Wang, Cui Ni, Wenjun Huangfu, Zhu Liu, Kaiyuan Qi

In visual simultaneous localization and mapping (SLAM), loop closure detection plays an irreplaceable role in eliminating cumulative errors, optimizing robot poses, and ensuring map consistency. Most loop closure detection algorithms adopt single feature or feature fusion to detect loop closures, which makes it difficult to ensure accuracy in environments with changing lighting or high-similarity scenes. In this work, image features and motion trajectories are combined to improve the effectiveness of loop closure detection via a staged detection method. First, histogram equalization is used to reduce the algorithm’s sensitivity to lighting. Then, LBP features are used to divide keyframes into multiple sequences, and the sequence where the loop closure candidate frame is located is determined according to the image feature matching results. Then, the most matched keyframe is searched in the sequence as a candidate loop closure. Finally, the true loop closure is confirmed by comparing the motion trajectory similarity to improve the algorithm’s adaptability to high-similarity scenes. The experimental results show that in different application scenarios, the proposed method can achieve good results in terms of precision, recall, area under the curve (AUC), and recall when the precision is 100%.

{"title":"Loop closure detection based on image feature matching and motion trajectory similarity for mobile robot","authors":"Weilong Hao,&nbsp;Peng Wang,&nbsp;Cui Ni,&nbsp;Wenjun Huangfu,&nbsp;Zhu Liu,&nbsp;Kaiyuan Qi","doi":"10.1007/s10489-024-05874-4","DOIUrl":"10.1007/s10489-024-05874-4","url":null,"abstract":"<div><p>In visual simultaneous localization and mapping (SLAM), loop closure detection plays an irreplaceable role in eliminating cumulative errors, optimizing robot poses, and ensuring map consistency. Most loop closure detection algorithms adopt single feature or feature fusion to detect loop closures, which makes it difficult to ensure accuracy in environments with changing lighting or high-similarity scenes. In this work, image features and motion trajectories are combined to improve the effectiveness of loop closure detection via a staged detection method. First, histogram equalization is used to reduce the algorithm’s sensitivity to lighting. Then, LBP features are used to divide keyframes into multiple sequences, and the sequence where the loop closure candidate frame is located is determined according to the image feature matching results. Then, the most matched keyframe is searched in the sequence as a candidate loop closure. Finally, the true loop closure is confirmed by comparing the motion trajectory similarity to improve the algorithm’s adaptability to high-similarity scenes. The experimental results show that in different application scenarios, the proposed method can achieve good results in terms of precision, recall, area under the curve (AUC), and recall when the precision is 100%.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143396744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatial-temporal clustering enhanced multi-graph convolutional network for traffic flow prediction 用于交通流量预测的时空聚类增强型多图卷积网络
IF 3.4 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-13 DOI: 10.1007/s10489-025-06329-0
Yinxin Bao, Qinqin Shen, Yang Cao, Quan Shi

Dynamics and uncertainty are the fundamental reasons for the difficulty in accurately predicting traffic flow. In recent years, graph convolutional networks have been widely used in traffic flow prediction because of their excellent dynamic feature mapping ability. However, the existing models usually overlook the correlations among the nodes and the complex impact of external factors on traffic flow, which make it challenging to explore the complex spatial-temporal features. To overcome these shortcomings, we propose a novel Spatial-temporal Clustering enhanced Multi-Graph Convolutional Network (SCM-GCN) for traffic flow prediction. First, a Spatial-Temporal Clustering (STS) module based on the improved adjacency matrix DBSCAN clustering algorithm is constructed, this module divides traffic nodes into multiple highly correlated clusters, each of which consists of multi-graph features and time-varying features. Then, a Multi-Graph Spatial Feature Extraction (MGSFE) module that integrates the graph convolution operation and attention mechanism is designed to extract dynamic spatial features of multi-graph and time-varying features. Next, the Time-Varying Feature Extraction (TVFE) module based on the dilated convolution and gated attention mechanism is constructed. It integrates the output of the MGSFE module to extract dynamic temporal features of time-varying features and output the predicted values. Finally, the comparison and ablation experiments on four datasets show that the proposed model performs better than state-of-the-art models. The key source code and data are available at https://github.com/Bounger2/SCMGCN.

{"title":"Spatial-temporal clustering enhanced multi-graph convolutional network for traffic flow prediction","authors":"Yinxin Bao,&nbsp;Qinqin Shen,&nbsp;Yang Cao,&nbsp;Quan Shi","doi":"10.1007/s10489-025-06329-0","DOIUrl":"10.1007/s10489-025-06329-0","url":null,"abstract":"<div><p>Dynamics and uncertainty are the fundamental reasons for the difficulty in accurately predicting traffic flow. In recent years, graph convolutional networks have been widely used in traffic flow prediction because of their excellent dynamic feature mapping ability. However, the existing models usually overlook the correlations among the nodes and the complex impact of external factors on traffic flow, which make it challenging to explore the complex spatial-temporal features. To overcome these shortcomings, we propose a novel Spatial-temporal Clustering enhanced Multi-Graph Convolutional Network (SCM-GCN) for traffic flow prediction. First, a Spatial-Temporal Clustering (STS) module based on the improved adjacency matrix DBSCAN clustering algorithm is constructed, this module divides traffic nodes into multiple highly correlated clusters, each of which consists of multi-graph features and time-varying features. Then, a Multi-Graph Spatial Feature Extraction (MGSFE) module that integrates the graph convolution operation and attention mechanism is designed to extract dynamic spatial features of multi-graph and time-varying features. Next, the Time-Varying Feature Extraction (TVFE) module based on the dilated convolution and gated attention mechanism is constructed. It integrates the output of the MGSFE module to extract dynamic temporal features of time-varying features and output the predicted values. Finally, the comparison and ablation experiments on four datasets show that the proposed model performs better than state-of-the-art models. The key source code and data are available at https://github.com/Bounger2/SCMGCN.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143396746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Applied Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1