Although deep reinforcement learning has achieved notable practical success, its theoretical foundations were scarcely explored until recently. Moreover, the convergence rate of existing neural temporal-difference (TD) learning algorithms is limited, largely due to their high sensitivity to stepsize choices. To mitigate this issue, we propose an adaptive neural TD algorithm (AdaBNTD), inspired by the superior performance of adaptive gradient techniques in training deep neural networks. We also derive non-asymptotic bounds for AdaBNTD under Markovian observations. In particular, AdaBNTD converges to the global optimum of the mean squared projected Bellman error (MSPBE) at a rate of \(\mathcal{O}(1/\sqrt{K})\), where K denotes the iteration count. Finally, the effectiveness of AdaBNTD is verified on several reinforcement learning benchmark domains.
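The abstract does not spell out AdaBNTD's update rule; as a hedged illustration of the general idea only, the sketch below runs semi-gradient TD(0) with an AdaGrad-style per-coordinate adaptive stepsize on a two-state Markov reward process, using tabular (linear) features rather than the paper's neural network. All names and parameters here are ours, not the paper's.

```python
import numpy as np

# Two-state Markov reward process: the chain alternates deterministically
# between states; reward 1 is received in state 0, 0 in state 1; discount 0.9.
P = np.array([[0.0, 1.0], [1.0, 0.0]])
r = np.array([1.0, 0.0])
gamma = 0.9
v_true = np.linalg.solve(np.eye(2) - gamma * P, r)  # exact state values

rng = np.random.default_rng(0)
w = np.zeros(2)       # tabular value estimates (one weight per state)
m = np.zeros(2)       # AdaGrad-style accumulator of squared TD gradients
alpha, eps = 0.5, 1e-8

s = 0
for k in range(5000):
    s_next = rng.choice(2, p=P[s])
    delta = r[s] + gamma * w[s_next] - w[s]  # TD error
    g = np.zeros(2)
    g[s] = -delta                            # semi-gradient w.r.t. w
    m += g ** 2                              # adapts the per-coordinate stepsize
    w -= alpha / np.sqrt(m + eps) * g
    s = s_next
```

The accumulated squared gradients shrink the effective stepsize automatically, which is the kind of stepsize-insensitivity the abstract attributes to adaptive methods.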
{"title":"Adaptive temporal-difference learning via deep neural network function approximation: a non-asymptotic analysis","authors":"Guoyong Wang, Tiange Fu, Ruijuan Zheng, Xuhui Zhao, Junlong Zhu, Mingchuan Zhang","doi":"10.1007/s40747-024-01757-w","DOIUrl":"https://doi.org/10.1007/s40747-024-01757-w","url":null,"abstract":"<p>Although deep reinforcement learning has achieved notable practical achievements, its theoretical foundations have been scarcely explored until recent times. Nonetheless, the rate of convergence for current neural temporal-difference (TD) learning algorithms is constrained, largely due to their high sensitivity to stepsize choices. In order to mitigate this issue, we propose an adaptive neural TD algorithm (<b>AdaBNTD</b>) inspired by the superior performance of adaptive gradient techniques in training deep neural networks. Simultaneously, we derive non-asymptotic bounds for <b>AdaBNTD</b> within the Markovian observation framework. In particular, <b>AdaBNTD</b> is capable of converging to the global optimum of the mean square projection Bellman error (MSPBE) with a convergence rate of <span>({{mathcal {O}}}(1/sqrt{K}))</span>, where <i>K</i> denotes the iteration count. Besides, the effectiveness <b>AdaBNTD</b> is also verified through several reinforcement learning benchmark domains.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"31 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142981771","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2025-01-15 | DOI: 10.1007/s40747-024-01768-7
Ming Jiang, Zhiwei Liu, Yan Xu
Traffic prediction plays an increasingly important role in intelligent transportation systems and smart cities. Both travelers and urban managers rely on accurate traffic information to make decisions about route selection and traffic management. Owing to various human and natural factors, traffic data often contains missing values. Addressing the impact of missing data on traffic flow prediction has become a widely discussed topic in the academic community and holds significant practical importance. Existing spatiotemporal graph models typically rely on complete data; missing values can significantly degrade prediction performance and disrupt the construction of dynamic graph structures. To address this challenge, this paper proposes a neural network architecture designed specifically for missing-data scenarios: the graph convolutional recurrent ordinary differential equation network (GCRNODE). GCRNODE combines recurrent networks based on ordinary differential equations (ODEs) with spatiotemporal memory graph convolutional networks, enabling accurate traffic prediction and effective modeling of dynamic graph structures even in the presence of missing data. GCRNODE uses an ODE to model the evolution of traffic flow and updates the ODE's hidden states from observed data. Additionally, GCRNODE employs a data-independent spatiotemporal memory graph convolutional network to capture dynamic spatial dependencies in missing-data scenarios. Experimental results on three real-world traffic datasets demonstrate that GCRNODE outperforms baseline models under various missing-data rates and scenarios, indicating stronger adaptability and robustness in handling missing data and modeling spatiotemporal dependencies.
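The abstract's ODE-plus-update recurrence resembles the ODE-RNN pattern: evolve the hidden state through an ODE between time steps, and apply a learned update only when an observation is present, so missing entries simply let the state coast. A minimal sketch under that assumption (the parameterization and shapes are ours, not GCRNODE's):

```python
import numpy as np

rng = np.random.default_rng(1)
d_h, d_x = 4, 1
# Hypothetical parameters of a tiny ODE-RNN (names/shapes are illustrative).
W_f = rng.normal(scale=0.1, size=(d_h, d_h))         # dynamics: dh/dt = tanh(W_f h)
W_u = rng.normal(scale=0.1, size=(d_h, d_h + d_x))   # observation update

def ode_rnn(xs, mask, dt=0.1, n_euler=5):
    """xs: (T, d_x) observations; mask[t] = False means x[t] is missing."""
    h = np.zeros(d_h)
    states = []
    for t in range(len(xs)):
        for _ in range(n_euler):               # Euler-integrate h between ticks
            h = h + dt * np.tanh(W_f @ h)
        if mask[t]:                            # incorporate data only if observed
            h = np.tanh(W_u @ np.concatenate([h, xs[t]]))
        states.append(h.copy())
    return np.stack(states)

T = 8
xs = rng.normal(size=(T, d_x))
mask = np.array([True, True, False, True, False, False, True, True])
H = ode_rnn(xs, mask)
```

At masked steps the hidden state still evolves continuously, which is what lets an ODE-based recurrence degrade gracefully under missing data.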
{"title":"A traffic prediction method for missing data scenarios: graph convolutional recurrent ordinary differential equation network","authors":"Ming Jiang, Zhiwei Liu, Yan Xu","doi":"10.1007/s40747-024-01768-7","DOIUrl":"https://doi.org/10.1007/s40747-024-01768-7","url":null,"abstract":"<p>Traffic prediction plays an increasingly important role in intelligent transportation systems and smart cities. Both travelers and urban managers rely on accurate traffic information to make decisions about route selection and traffic management. Due to various factors, both human and natural, traffic data often contains missing values. Addressing the impact of missing data on traffic flow prediction has become a widely discussed topic in the academic community and holds significant practical importance. Existing spatiotemporal graph models typically rely on complete data, and the presence of missing values can significantly degrade prediction performance and disrupt the construction of dynamic graph structures. To address this challenge, this paper proposes a neural network architecture designed specifically for missing data scenarios—graph convolutional recurrent ordinary differential equation network (GCRNODE). GCRNODE combines recurrent networks based on ordinary differential equation (ODE) with spatiotemporal memory graph convolutional networks, enabling accurate traffic prediction and effective modeling of dynamic graph structures even in the presence of missing data. GCRNODE uses ODE to model the evolution of traffic flow and updates the hidden states of the ODE through observed data. Additionally, GCRNODE employs a data-independent spatiotemporal memory graph convolutional network to capture the dynamic spatial dependencies in missing data scenarios. The experimental results on three real-world traffic datasets demonstrate that GCRNODE outperforms baseline models in prediction performance under various missing data rates and scenarios. 
This indicates that the proposed method has stronger adaptability and robustness in handling missing data and modeling spatiotemporal dependencies.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"52 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142981773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Forecasting the remaining useful life (RUL) is a crucial aspect of prognostics and health management (PHM), which has garnered significant attention in academic and industrial domains in recent decades. Accurate RUL prediction relies on an appropriate degradation model for the system. In this paper, a general representation of diffusion process models with three sources of uncertainty is constructed for RUL estimation. Using a time-space transformation, analytic expressions that approximate the RUL probability density function (PDF) are derived. The results demonstrate that the proposed model is more general, covering several existing simplified cases. The model parameters are then estimated with an adaptive technique based on the Kalman filter and expectation maximization with Rauch-Tung-Striebel smoothing (KF-EM-RTS). KF-EM-RTS can adaptively estimate and update unknown parameters, overcoming the limitations imposed by the strong Markovian nature of diffusion models. Linear and nonlinear degradation datasets from real working environments are used to validate the proposed model. The experiments indicate that the proposed model achieves accurate RUL estimation.
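A minimal special case of such a diffusion degradation model is a Wiener process with drift, X_t = mu*t + sigma*B_t, whose first hitting time of a failure threshold w (the RUL from time zero) follows an inverse-Gaussian law with mean w/mu. The Monte Carlo sketch below simulates that special case; all parameters are illustrative, and the paper's model is strictly more general:

```python
import numpy as np

# Drift-diffusion degradation path; failure when X_t first reaches w.
mu, sigma, w = 0.5, 0.2, 10.0          # illustrative parameters
dt, n_paths, n_steps = 0.01, 2000, 5000

rng = np.random.default_rng(2)
x = np.zeros(n_paths)
hit = np.full(n_paths, np.nan)         # first hitting time per path
for k in range(1, n_steps + 1):
    x += mu * dt + sigma * np.sqrt(dt) * rng.normal(size=n_paths)
    newly = np.isnan(hit) & (x >= w)   # paths crossing the threshold this step
    hit[newly] = k * dt

mean_rul = np.nanmean(hit)
print(mean_rul)  # close to the theoretical mean w/mu = 20
```

Comparing the empirical hitting-time histogram against the analytic inverse-Gaussian PDF is the usual sanity check for this kind of approximation.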
{"title":"A generalized diffusion model for remaining useful life prediction with uncertainty","authors":"Bincheng Wen, Xin Zhao, Xilang Tang, Mingqing Xiao, Haizhen Zhu, Jianfeng Li","doi":"10.1007/s40747-024-01773-w","DOIUrl":"https://doi.org/10.1007/s40747-024-01773-w","url":null,"abstract":"<p>Forecasting the remaining useful life (RUL) is a crucial aspect of prognostics and health management (PHM), which has garnered significant attention in academic and industrial domains in recent decades. The accurate prediction of RUL relies on the creation of an appropriate degradation model for the system. In this paper, a general representation of diffusion process models with three sources of uncertainty for RUL estimation is constructed. According to time-space transformation, the analytic equations that approximate the RUL probability distribution function (PDF) are inferred. The results demonstrate that the proposed model is more general, covering several existing simplified cases. The parameters of the model are then calculated utilizing an adaptive technique based on the Kalman filter and expectation maximization with Rauch-Tung-Striebel (KF-EM-RTS). KF-EM-RTS can adaptively estimate and update unknown parameters, overcoming the limits of strong Markovian nature of diffusion model. Linear and nonlinear degradation datasets from real working environments are used to validate the proposed model. 
The experiments indicate that the proposed model can achieve accurate RUL estimation results.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"7 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142981761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2025-01-15 | DOI: 10.1007/s40747-024-01706-7
Yinghan Hong, Fangqing Liu, Han Huang, Yi Xiang, Xueming Yan, Guizhen Mai
Automated test case generation for path coverage (ATCG-PC) is a major challenge in search-based software engineering due to its complexity as a large-scale black-box optimization problem. Existing search-based approaches often fail to achieve high path coverage in large-scale unit programs, owing to their expansive decision spaces and the presence of hundreds of feasible paths. In this paper, we present a microscale (i.e., operating on small subsets of the decomposed decision set) search-based algorithm with time-space transfer (MISA-TST). The algorithm identifies more accurate subspaces containing optimal solutions through two strategies: the dimension partition strategy employs a relationship matrix to track subspaces corresponding to the target paths, while the specific value strategy focuses the search on the neighborhood of specific dimension values rather than the entire dimension space. Experiments on nine normal-scale and six large-scale benchmarks, where the large-scale unit programs encompass hundreds of feasible paths or more than 1.00E+50 test cases, demonstrate that MISA-TST achieves significantly higher path coverage than other state-of-the-art algorithms on most benchmarks. Furthermore, the combination of the two time-space transfer strategies significantly enhances the performance of search-based algorithms like MISA, especially on large-scale unit programs.
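The microscale idea, searching a small decomposed subset of decision variables near promising specific values instead of the full input space, can be sketched as a toy hill climb. The unit under test, fitness function, chosen dimensions, and parameters below are all hypothetical, not the paper's:

```python
import numpy as np

rng = np.random.default_rng(4)

# Toy "unit under test": the target path is taken when x[0] lies in a narrow
# band and int(x[2]) equals a specific value (purely illustrative).
def covers_target(x):
    return 9.9 <= x[0] <= 10.1 and int(x[2]) == 42

dim, lo, hi = 5, 0.0, 100.0

def fitness(x):
    """Branch-distance-style score: lower means closer to the target path."""
    return abs(x[0] - 10.0) + abs(int(x[2]) - 42)

# Microscale search: mutate only the small decision subset {0, 2} around the
# best test case found so far, instead of the full 5-dimensional space.
best = rng.uniform(lo, hi, dim)
for _ in range(3000):
    cand = best.copy()
    for d in (0, 2):                   # decomposed decision subset
        cand[d] = np.clip(best[d] + rng.normal(scale=5.0), lo, hi)
    if fitness(cand) < fitness(best):  # keep only improving test cases
        best = cand
print(fitness(best))
```

Restricting mutation to the relevant dimensions shrinks the effective search space, which is the intuition behind searching "microscale" subsets rather than the whole decision vector.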
{"title":"Microscale search-based algorithm based on time-space transfer for automated test case generation","authors":"Yinghan Hong, Fangqing Liu, Han Huang, Yi Xiang, Xueming Yan, Guizhen Mai","doi":"10.1007/s40747-024-01706-7","DOIUrl":"https://doi.org/10.1007/s40747-024-01706-7","url":null,"abstract":"<p>Automated test case generation for path coverage (ATCG-PC) is a major challenge in search-based software engineering due to its complexity as a large-scale black-box optimization problem. However, existing search-based approaches often fail to achieve high path coverage in large-scale unit programs. This is due to their expansive decision space and the presence of hundreds of feasible paths. In this paper, we present a microscale (small-size subsets of the decomposed decision set) search-based algorithm with time-space transfer (MISA-TST). This algorithm aims to identify more accurate subspaces consisting of optimal solutions based on two strategies. The dimension partition strategy employs a relationship matrix to track subspaces corresponding to the target paths. Additionally, the specific value strategy allows MISA-TST to focus the search on the neighborhood of specific dimension values rather than the entire dimension space. Experiments conducted on nine normal-scale and six large-scale benchmarks demonstrate the effectiveness of MISA-TST. The large-scale unit programs encompass hundreds of feasible paths or more than 1.00E+50 test cases. The results show that MISA-TST achieves significantly higher path coverage than other state-of-the-art algorithms in most benchmarks. 
Furthermore, the combination of the two time-space transfer strategies significantly enhances the performance of search-based algorithms like MISA, especially in large-scale unit programs.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"12 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142981764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2025-01-15 | DOI: 10.1007/s40747-024-01776-7
Haoran Sun, Yang Li, Guanci Yang, Zhidong Su, Kexin Luo
Multi-object tracking (MOT) faces challenges from multiple perception views due to the diversity of application scenarios. Different views (front-view and top-view) have different imaging and data distribution characteristics, but current MOT methods do not consider these differences and adopt a single, unified association strategy for all occlusion situations. This paper proposes a view adaptive multi-object tracking method based on depth relationship cues (ViewTrack) to enable MOT to adapt to a scene's dynamic changes. First, exploiting the depth relationships between objects derived from bounding-box positions, a view-type recognition method based on depth relationship cues (VTRM) is proposed to perceive changes of depth and view within the dynamic scene. Second, by adjusting the interval partitioning strategy to adapt to changes in view characteristics, a view adaptive partitioning method for tracklet sets and detection sets (VAPM) is proposed to achieve sparse decomposition in occluded scenes. Then, combining pedestrian displacement with Intersection over Union (IoU), a displacement modulated Intersection over Union method (DMIoU) is proposed to improve the association accuracy between detection and tracklet boxes. Finally, comparisons with 12 representative methods demonstrate that ViewTrack outperforms them on multiple metrics across the benchmark datasets. The code is available at https://github.com/Hamor404/ViewTrack.
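The abstract does not give DMIoU's formula. One plausible reading, shown purely as an illustration, is an IoU score discounted by the detection-to-tracklet center displacement normalized by the tracklet diagonal; the penalty form, weight, and function names below are our assumptions, not the paper's definition:

```python
def iou(a, b):
    """Intersection over Union for boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area = lambda r: (r[2] - r[0]) * (r[3] - r[1])
    return inter / (area(a) + area(b) - inter)

def dm_iou(det, trk, lam=0.5):
    """IoU discounted by normalized center displacement (illustrative form)."""
    dx = (det[0] + det[2]) / 2 - (trk[0] + trk[2]) / 2
    dy = (det[1] + det[3]) / 2 - (trk[1] + trk[3]) / 2
    diag = ((trk[2] - trk[0]) ** 2 + (trk[3] - trk[1]) ** 2) ** 0.5
    disp = (dx ** 2 + dy ** 2) ** 0.5 / diag   # displacement in tracklet scale
    return iou(det, trk) - lam * disp

print(dm_iou((0, 0, 10, 10), (0, 0, 10, 10)))  # identical boxes -> 1.0
```

A displacement-aware score of this kind penalizes associations whose overlap comes with implausibly large motion, which is the stated motivation for modulating IoU with pedestrian displacement.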
{"title":"View adaptive multi-object tracking method based on depth relationship cues","authors":"Haoran Sun, Yang Li, Guanci Yang, Zhidong Su, Kexin Luo","doi":"10.1007/s40747-024-01776-7","DOIUrl":"https://doi.org/10.1007/s40747-024-01776-7","url":null,"abstract":"<p>Multi-object tracking (MOT) tasks face challenges from multiple perception views due to the diversity of application scenarios. Different views (front-view and top-view) have different imaging and data distribution characteristics, but the current MOT methods do not consider these differences and only adopt a unified association strategy to deal with various occlusion situations. This paper proposed View Adaptive Multi-Object Tracking Method Based on Depth Relationship Cues (ViewTrack) to enable MOT to adapt to the scene's dynamic changes. Firstly, based on exploiting the depth relationships between objects by using the position information of the bounding box, a view-type recognition method based on depth relationship cues (VTRM) is proposed to perceive the changes of depth and view within the dynamic scene. Secondly, by adjusting the interval partitioning strategy to adapt to the changes in view characteristics, a view adaptive partitioning method for tracklet sets and detection sets (VAPM) is proposed to achieve sparse decomposition in occluded scenes. Then, combining pedestrian displacement with Intersection over Union (IoU), a displacement modulated Intersection over Union method (DMIoU) is proposed to improve the association accuracy between detection and tracklet boxes. Finally, the comparison results with 12 representative methods demonstrate that ViewTrack outperforms multiple metrics on the benchmark datasets. 
The code is available at https://github.com/Hamor404/ViewTrack.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"7 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142981766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2025-01-15 | DOI: 10.1007/s40747-024-01763-y
Adam Robson, Kamlesh Mistry, Wai-Lok Woo
This paper proposes two novel group-based frameworks that can be incorporated into almost any nature-inspired optimization algorithm. The proposed Group-Based (GB) and Cross Group-Based (XGB) frameworks implement a strategy that modifies the attraction and movement behaviors of the base algorithm, together with a mechanism that maintains continuing variance within population groupings, while preserving the computational simplicity that has helped nature-inspired optimization algorithms gain popularity in the field of feature selection. Through this functionality, the proposed frameworks seek to increase search diversity within the population swarm, addressing issues such as premature convergence and oscillations within the swarm. The frameworks show promising results when implemented in the Bat algorithm (BA), Firefly algorithm (FA), and Particle Swarm Optimization (PSO), all of which are popular in feature selection and have been shown to perform well across a variety of domains thanks to their powerful search capabilities.
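As a hedged sketch of the group-based idea grafted onto one of the named base algorithms (PSO), the toy below splits the swarm into fixed groups and attracts each particle to its group best rather than a single global best, preserving diversity across groups. The grouping rule, coefficients, and objective are illustrative, not the paper's GB/XGB mechanics:

```python
import numpy as np

rng = np.random.default_rng(3)
sphere = lambda X: np.sum(X ** 2, axis=1)   # toy objective to minimize

n, dim, n_groups = 30, 5, 3
X = rng.uniform(-5, 5, (n, dim))
V = np.zeros((n, dim))
group = np.arange(n) % n_groups             # fixed group assignment (illustrative)
P, P_val = X.copy(), sphere(X)              # personal bests and their values

for it in range(200):
    for g in range(n_groups):
        idx = np.where(group == g)[0]
        g_best = P[idx[np.argmin(P_val[idx])]]   # attract to the group best only
        r1, r2 = rng.random((2, len(idx), dim))
        V[idx] = (0.7 * V[idx]
                  + 1.5 * r1 * (P[idx] - X[idx])
                  + 1.5 * r2 * (g_best - X[idx]))
    X = X + V
    f = sphere(X)
    better = f < P_val
    P[better], P_val[better] = X[better], f[better]

print(P_val.min())  # near 0 on the sphere function
```

Because no particle ever sees another group's best, each group explores its own basin, which is one simple way to delay the premature convergence a single global attractor can cause.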
{"title":"A novel group-based framework for nature-inspired optimization algorithms with adaptive movement behavior","authors":"Adam Robson, Kamlesh Mistry, Wai-Lok Woo","doi":"10.1007/s40747-024-01763-y","DOIUrl":"https://doi.org/10.1007/s40747-024-01763-y","url":null,"abstract":"<p>This paper proposes two novel group-based frameworks that can be implemented into almost any nature-inspired optimization algorithm. The proposed Group-Based (GB) and Cross Group-Based (XGB) framework implements a strategy which modifies the attraction and movement behaviors of base nature-inspired optimization algorithms and a mechanism that creates a continuing variance within population groupings, while attempting to maintain levels of computational simplicity that have helped nature-inspired optimization algorithms gain notoriety within the field of feature selection. Through this functionality, the proposed framework seeks to increase search diversity within the population swarm to address issues such as premature convergence, and oscillations within the swarm. 
The proposed frameworks have shown promising results when implemented into the Bat algorithm (BA), Firefly algorithm (FA), and Particle Swarm Optimization algorithm (PSO), all of which are popular when applied to the field of feature selection, and have been shown to perform well in a variety of domains, gaining notoriety due to their powerful search capabilities.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"67 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142981762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2025-01-15 | DOI: 10.1007/s40747-024-01772-x
Xinning Liu, Li Han, Ling Kang, Jiannan Liu, Huadong Miao
The flexible job shop scheduling problem (FJSP) holds significant importance in both theoretical research and practical applications. Given the complexity and diversity of FJSP, improving the generalization and quality of scheduling methods has become a hot topic of interest in both industry and academia. To address this, this paper proposes a Preference-Based Mask-PPO (PBMP) algorithm, which leverages the strengths of preference learning and invalid action masking to optimize FJSP solutions. First, a reward predictor based on preference learning is designed to model reward prediction by comparing random fragments, eliminating the need for complex reward function design. Second, a novel intelligent switching mechanism is introduced, where proximal policy optimization (PPO) is employed to enhance exploration during sampling, and masked proximal policy optimization (Mask-PPO) refines the action space during training, significantly improving efficiency and solution quality. Furthermore, the Pearson correlation coefficient (PCC) is used to evaluate the performance of the preference model. Finally, comparative experiments on FJSP benchmark instances of varying sizes demonstrate that PBMP outperforms traditional scheduling strategies such as dispatching rules, OR-Tools, and other deep reinforcement learning (DRL) algorithms, achieving superior scheduling policies and faster convergence. Even with increasing instance sizes, preference learning proves to be an effective reward mechanism in reinforcement learning for FJSP. The ablation study further highlights the advantages of each key component in the PBMP algorithm across performance metrics.
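Invalid action masking, one of the two mechanisms PBMP leverages, is commonly implemented by sending the logits of infeasible actions to negative infinity before the softmax, so they receive zero probability (and zero gradient). A minimal sketch, with an illustrative policy and mask rather than the paper's FJSP action space:

```python
import numpy as np

def masked_policy(logits, valid):
    """Zero out invalid actions by sending their logits to -inf before softmax."""
    masked = np.where(valid, logits, -np.inf)
    z = masked - masked.max()          # subtract max for numerical stability
    p = np.exp(z)                      # exp(-inf) = 0 for masked entries
    return p / p.sum()

logits = np.array([2.0, 1.0, 0.5, 3.0])
valid = np.array([True, False, True, False])  # e.g. operations whose machine is busy
p = masked_policy(logits, valid)
print(p)  # zero probability on masked actions; the rest renormalized
```

In a scheduling setting the mask would be recomputed at every step from the current shop state, so the policy only ever samples dispatchable operations.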
{"title":"Preference learning based deep reinforcement learning for flexible job shop scheduling problem","authors":"Xinning Liu, Li Han, Ling Kang, Jiannan Liu, Huadong Miao","doi":"10.1007/s40747-024-01772-x","DOIUrl":"https://doi.org/10.1007/s40747-024-01772-x","url":null,"abstract":"<p>The flexible job shop scheduling problem (FJSP) holds significant importance in both theoretical research and practical applications. Given the complexity and diversity of FJSP, improving the generalization and quality of scheduling methods has become a hot topic of interest in both industry and academia. To address this, this paper proposes a Preference-Based Mask-PPO (PBMP) algorithm, which leverages the strengths of preference learning and invalid action masking to optimize FJSP solutions. First, a reward predictor based on preference learning is designed to model reward prediction by comparing random fragments, eliminating the need for complex reward function design. Second, a novel intelligent switching mechanism is introduced, where proximal policy optimization (PPO) is employed to enhance exploration during sampling, and masked proximal policy optimization (Mask-PPO) refines the action space during training, significantly improving efficiency and solution quality. Furthermore, the Pearson correlation coefficient (PCC) is used to evaluate the performance of the preference model. Finally, comparative experiments on FJSP benchmark instances of varying sizes demonstrate that PBMP outperforms traditional scheduling strategies such as dispatching rules, OR-Tools, and other deep reinforcement learning (DRL) algorithms, achieving superior scheduling policies and faster convergence. Even with increasing instance sizes, preference learning proves to be an effective reward mechanism in reinforcement learning for FJSP. 
The ablation study further highlights the advantages of each key component in the PBMP algorithm across performance metrics.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"74 2 Pt 2 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142981765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In the real world, stance detection tasks often involve assessing the stance or attitude of a given text toward new, unseen targets, a task known as zero-shot stance detection. However, zero-shot stance detection often suffers from issues such as sparse data annotation and inherent task complexity, which can lead to lower performance. To address these challenges, we propose combining fine-tuning of Large Language Models (LLMs) with knowledge augmentation for zero-shot stance detection. Specifically, we leverage stance detection and related tasks from debate corpora to perform multi-task fine-tuning of LLMs. This approach aims to learn and transfer the capability of zero-shot stance detection and reasoning analysis from relevant data. Additionally, we enhance the model’s semantic understanding of the given text and targets by retrieving relevant knowledge from external knowledge bases as context, alleviating the lack of relevant contextual knowledge. Compared to ChatGPT, our model achieves a significant improvement in the average F1 score, with an increase of 15.74% on the SemEval 2016 Task 6 A and 3.55% on the P-Stance dataset. Our model outperforms current state-of-the-art models on these two datasets, demonstrating the superiority of multi-task fine-tuning with debate data and knowledge augmentation.
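The knowledge-augmentation step, retrieving related background text and prepending it to the stance prompt as context, can be sketched with a toy bag-of-words retriever. The knowledge base, similarity measure, and prompt template below are illustrative; the paper's retrieval pipeline and LLM fine-tuning details differ:

```python
from collections import Counter
import math

# Tiny illustrative knowledge base (stand-in for an external KB).
kb = [
    "Nuclear energy produces low-carbon electricity.",
    "Wind turbines require rare-earth minerals.",
    "Carbon taxes put a price on greenhouse gas emissions.",
]

def cosine(a, b):
    """Bag-of-words cosine similarity between two strings."""
    ca, cb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(ca[w] * cb[w] for w in ca)
    na = math.sqrt(sum(v * v for v in ca.values()))
    nb = math.sqrt(sum(v * v for v in cb.values()))
    return dot / (na * nb) if na and nb else 0.0

def build_prompt(text, target, k=1):
    """Prepend the k most similar KB entries as context for stance detection."""
    ctx = sorted(kb, key=lambda d: cosine(d, text + " " + target), reverse=True)[:k]
    return (f"Background: {' '.join(ctx)}\n"
            f"Text: {text}\nTarget: {target}\n"
            "Stance (favor/against/neutral):")

prompt = build_prompt("Nuclear energy is the cleanest option we have.", "nuclear power")
print(prompt)
```

In practice the lexical retriever would be replaced by a dense embedding search over a real knowledge base, but the prompt shape stays the same: retrieved context, then text, target, and the stance question.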
{"title":"Enhancing zero-shot stance detection via multi-task fine-tuning with debate data and knowledge augmentation","authors":"Qinlong Fan, Jicang Lu, Yepeng Sun, Qiankun Pi, Shouxin Shang","doi":"10.1007/s40747-024-01767-8","DOIUrl":"https://doi.org/10.1007/s40747-024-01767-8","url":null,"abstract":"<p>In the real world, stance detection tasks often involve assessing the stance or attitude of a given text toward new, unseen targets, a task known as zero-shot stance detection. However, zero-shot stance detection often suffers from issues such as sparse data annotation and inherent task complexity, which can lead to lower performance. To address these challenges, we propose combining fine-tuning of Large Language Models (LLMs) with knowledge augmentation for zero-shot stance detection. Specifically, we leverage stance detection and related tasks from debate corpora to perform multi-task fine-tuning of LLMs. This approach aims to learn and transfer the capability of zero-shot stance detection and reasoning analysis from relevant data. Additionally, we enhance the model’s semantic understanding of the given text and targets by retrieving relevant knowledge from external knowledge bases as context, alleviating the lack of relevant contextual knowledge. Compared to ChatGPT, our model achieves a significant improvement in the average F1 score, with an increase of 15.74% on the SemEval 2016 Task 6 A and 3.55% on the P-Stance dataset. 
Our model outperforms current state-of-the-art models on these two datasets, demonstrating the superiority of multi-task fine-tuning with debate data and knowledge augmentation.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"1 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142981772","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2025-01-09 | DOI: 10.1007/s40747-024-01741-4
Chenghang Lai, Shoumeng Qiu
Humans can predict what will happen in the near future, an ability essential for survival that machines lack. To equip machines with this ability, we introduce the multi-modal knowledge extraction and reasoning (MKER) framework, which combines external commonsense knowledge, internal visual relation knowledge, and basic information to make inferences. The framework is built on an encoder-decoder structure with three essential components: a visual language reasoning module, an adaptive cross-modality feature fusion module, and a future event description generation module. The visual language reasoning module extracts the relationships among the most informative objects, and the dynamic evolution of those relationships, from sequences of scene graphs and commonsense graphs. A long short-term memory model explores changes in object relationships over time to form dynamic object relationships. Furthermore, the adaptive cross-modality feature fusion module aligns video and language information, using object relationship knowledge as guidance to learn a vision-language representation. Finally, the future event description generation module decodes the fused information and generates a language description of the next event. Experimental results demonstrate that MKER outperforms existing methods, and ablation studies further illustrate the effectiveness of the designed modules. This work advances the field by providing a way to predict future events, enhance machine understanding, and interact with dynamic environments.
{"title":"MKER: multi-modal knowledge extraction and reasoning for future event prediction","authors":"Chenghang Lai, Shoumeng Qiu","doi":"10.1007/s40747-024-01741-4","DOIUrl":"https://doi.org/10.1007/s40747-024-01741-4","url":null,"abstract":"<p>Humans can predict what will happen shortly, which is essential for survival, but machines cannot. To equip machines with the ability, we introduce the innovative multi-modal knowledge extraction and reasoning (MKER) framework. This framework combines external commonsense knowledge, internal visual relation knowledge, and basic information to make inference. This framework is built on an encoder-decoder structure with three essential components: a visual language reasoning module, an adaptive cross-modality feature fusion module, and a future event description generation module. The visual language reasoning module extracts the object relationships among the most informative objects and the dynamic evolution of the relationship, which comes from the sequence scene graphs and commonsense graphs. The long short-term memory model is employed to explore changes in the object relationships at different times to form a dynamic object relationship. Furthermore, the adaptive cross-modality feature fusion module aligns video and language information by using object relationship knowledge as guidance to learn vision-language representation. Finally, the future event description generation module decodes the fused information and generates the language description of the next event. Experimental results demonstrate that MKER outperforms existing methods. Ablation studies further illustrate the effectiveness of the designed module. 
This work advances the field by providing a way to predict future events, enhance machine understanding, and interact with dynamic environments.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"28 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142937248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
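The two core mechanisms the MKER abstract describes — an LSTM rolling per-frame relation embeddings into a dynamic relation state, and relation-guided attention fusing video features with that state — can be sketched roughly as below. This is a minimal illustrative sketch, not the paper's implementation: the embedding size, weight shapes, and all variable names are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
D = 8  # embedding size (illustrative)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h, c, W):
    """One LSTM cell step over a relation embedding x (biases omitted)."""
    z = W @ np.concatenate([x, h])            # all four gates in one matmul
    i, f, o, g = np.split(z, 4)
    i, f, o, g = sigmoid(i), sigmoid(f), sigmoid(o), np.tanh(g)
    c_new = f * c + i * g                     # update cell state
    h_new = o * np.tanh(c_new)                # expose hidden state
    return h_new, c_new

# per-frame object-relation embeddings, standing in for features
# extracted from sequential scene graphs and commonsense graphs
relations = [rng.normal(size=D) for _ in range(5)]

W = rng.normal(scale=0.1, size=(4 * D, 2 * D))
h, c = np.zeros(D), np.zeros(D)
for r in relations:
    h, c = lstm_step(r, h, c, W)              # h: dynamic relation state

# relation-guided fusion: video frame features attend to the relation state
frames = rng.normal(size=(5, D))              # stand-in visual features
scores = frames @ h / np.sqrt(D)
weights = np.exp(scores - scores.max())
weights /= weights.sum()                      # softmax attention weights
fused = weights @ frames                      # vision-language representation
```

The fused vector would then feed a decoder that generates the next-event description; that generation step is omitted here.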
This paper presents RL4CEP, a reinforcement learning (RL) approach to dynamically update complex event processing (CEP) rules. RL4CEP uses Double Deep Q-Networks to update the threshold values used by CEP rules. It is implemented using Apache Flink as the CEP engine and Apache Kafka for message distribution. RL4CEP is a generic approach for scenarios in which CEP rules need to be updated dynamically; in this paper, we apply it to a financial trading use case. Our experimental results, based on three financial trading rules and eight financial datasets, demonstrate the merits of RL4CEP in improving overall profit compared to baseline and state-of-the-art approaches, with reasonable consumption of resources (RAM and CPU). Finally, our experiments indicate that RL4CEP executes considerably faster than traditional CEP engines processing static rules.
{"title":"RL4CEP: reinforcement learning for updating CEP rules","authors":"Afef Mdhaffar, Ghassen Baklouti, Yassine Rebai, Mohamed Jmaiel, Bernd Freisleben","doi":"10.1007/s40747-024-01742-3","DOIUrl":"https://doi.org/10.1007/s40747-024-01742-3","url":null,"abstract":"<p>This paper presents RL4CEP, a reinforcement learning (RL) approach to dynamically update complex event processing (CEP) rules. RL4CEP uses Double Deep Q-Networks to update the threshold values used by CEP rules. It is implemented using Apache Flink as a CEP engine and Apache Kafka for message distribution. RL4CEP is a generic approach for scenarios in which CEP rules need to be updated dynamically. In this paper, we use RL4CEP in a financial trading use case. Our experimental results based on three financial trading rules and eight financial datasets demonstrate the merits of RL4CEP in improving the overall profit, when compared to baseline and state-of-the-art approaches, with a reasonable consumption of resources, i.e., RAM and CPU. Finally, our experiments indicate that RL4CEP is executed quite fast compared to traditional CEP engines processing static rules.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"204 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142936799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}