Pub Date: 2024-08-13 | DOI: 10.1016/j.swevo.2024.101699
Drones are increasingly utilized for transportation reconnaissance due to their expansive field of view, cost-effectiveness, and agility. They can conduct pre-reconnaissance along the planned routes of ground vehicles carrying valuable goods to ensure the safety of the vehicles and their cargo. This gives rise to a novel routing problem for vehicles and their pre-reconnaissance drones: an integrated point and arc routing problem with temporal and spatial synchronization. A mixed-integer linear programming model is developed to formulate the problem with its complex synchronization constraints. A Variable Neighborhood Search algorithm integrated with a temporal- and spatial-synchronization-based greedy search and a simulated annealing strategy is designed to solve the model. The proposed algorithm is tested on a practical case based on real urban data from Beijing, China, and on random instances of different sizes. Computational results indicate that it solves the problem efficiently and outperforms both the simulated annealing algorithm and the greedy algorithm.
Title: An improved variable neighborhood search algorithm embedded temporal and spatial synchronization for vehicle and drone cooperative routing problem with pre-reconnaissance (Swarm and Evolutionary Computation)
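The core hybridization described above — neighborhood-search moves accepted under a simulated-annealing criterion — can be sketched generically. The following is a minimal illustration, not the authors' implementation: the solution representation, neighborhood operators, and all parameters (`t0`, `cooling`, `iters`) are placeholder assumptions.

```python
import math
import random

def sa_accept(current_cost, candidate_cost, temperature):
    """Metropolis rule: always accept an improvement; accept a worse
    candidate with probability exp(-delta / T)."""
    delta = candidate_cost - current_cost
    if delta <= 0:
        return True
    return random.random() < math.exp(-delta / temperature)

def vns_with_sa(initial, neighborhoods, cost, t0=10.0, cooling=0.9, iters=300):
    """Generic VNS skeleton with an SA acceptance rule: an accepted move
    restarts the search from the first neighborhood, a rejection switches
    to the next ("shaking") operator."""
    best = current = initial
    temp = t0
    k = 0
    for _ in range(iters):  # fixed evaluation budget
        candidate = neighborhoods[k](current)
        if sa_accept(cost(current), cost(candidate), temp):
            current = candidate
            k = 0  # accepted: return to the first neighborhood
        else:
            k = (k + 1) % len(neighborhoods)  # rejected: try the next one
        if cost(current) < cost(best):
            best = current
        temp *= cooling  # geometric cooling schedule
    return best
```

The acceptance rule lets early, high-temperature iterations escape local optima, while geometric cooling makes late iterations behave like a pure descent.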
Pub Date: 2024-08-11 | DOI: 10.1016/j.swevo.2024.101694
Smart agriculture aligns with the principles of sustainable development, making it a crucial direction for future agriculture. This study focuses on a cooperative plant protection task allocation problem (CPPTAP) for multiple unmanned aerial vehicles (UAVs) with a common deadline in smart agriculture. CPPTAP permits multiple UAVs to spray pesticide on the same field, so the completion time of each task fluctuates with the cooperation among UAVs. We present a mathematical model and a learning-based memetic algorithm (L-MA) to maximize the total area of the fields to be sprayed. In the evolutionary stage, mutation and repair operators based on value information are applied to balance exploration and exploitation, while a problem-specific local search strategy is designed to enhance the exploitation capability. A knowledge-based UAV allocation method (KUAM) is employed to maximize UAV utilization efficiency and minimize conflicts. Throughout the search process, Q-learning is utilized to assist the aforementioned operators and to decide the number of cooperating UAVs per field. The effectiveness of L-MA is validated by comparing it against other state-of-the-art algorithms. The results demonstrate that L-MA outperforms the compared algorithms by a statistically significant margin.
Title: A learning-based memetic algorithm for a cooperative task allocation problem of multiple unmanned aerial vehicles in smart agriculture
Pub Date: 2024-08-10 | DOI: 10.1016/j.swevo.2024.101695
The reliability-redundancy allocation problem (RRAP) is an optimization problem that maximizes system reliability under a set of constraints. In most studies on the RRAP, a subsystem uses either active redundant components or cold standby components. This paper presents a new model for the RRAP of a system with a mixed redundancy strategy, in which all components can be heterogeneous. This formulation leads to a more precise solution; however, the RRAP is NP-hard, and the new mixed heterogeneous model is even more complicated to solve. After formulating the problem, a novel evolutionary strategy optimization algorithm is proposed to solve it. The problem contains both discrete and continuous variables, and a different mutation strategy is designed for each. The new formulation of the problem and the new solution method lead to better results than those reported in other recent papers. To provide a fuller comparison, we also implement the new heterogeneous model with the PSO and SPSO algorithms. Results show improvements in both system reliability and the number of fitness evaluations.
Title: A novel evolutionary strategy optimization algorithm for reliability redundancy allocation problem with heterogeneous components
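An evolution strategy with separate mutation operators for the discrete (redundancy counts) and continuous (component reliabilities) variable groups, as the abstract describes, might look like the following (1+1)-style sketch. The toy series-system objective, the penalty, and all parameters are illustrative assumptions, not the paper's model.

```python
import random

def mutate(counts, rels, step=0.05):
    """Group-specific mutation: integer redundancy counts take a +/-1 jump,
    continuous reliabilities get a small Gaussian perturbation clipped to (0, 1)."""
    new_counts = [max(1, n + random.choice((-1, 0, 1))) for n in counts]
    new_rels = [min(0.999999, max(0.000001, r + random.gauss(0.0, step)))
                for r in rels]
    return new_counts, new_rels

def one_plus_one_es(fitness, counts, rels, iters=500):
    """(1+1) evolution strategy: the child replaces the parent only when
    it is at least as fit (maximization)."""
    best = (counts, rels)
    best_fit = fitness(*best)
    for _ in range(iters):
        child = mutate(*best)
        f = fitness(*child)
        if f >= best_fit:
            best, best_fit = child, f
    return best, best_fit

def toy_series_reliability(counts, rels, budget=6.0):
    """Toy RRAP objective: reliability of a series system of redundant
    parallel subsystems, penalized when a simple cost budget is exceeded."""
    rel = 1.0
    for n, r in zip(counts, rels):
        rel *= 1.0 - (1.0 - r) ** n
    cost = sum(n * r ** 2 for n, r in zip(counts, rels))
    return rel - max(0.0, cost - budget)
```

Because acceptance is monotone, the returned fitness can never be worse than the starting design's.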
Pub Date: 2024-08-10 | DOI: 10.1016/j.swevo.2024.101701
The increasing accessibility of extensive datasets has amplified the importance of extracting insights from high-dimensional data. However, selecting relevant features in these high-dimensional spaces is made more difficult by the curse of dimensionality. Although Evolutionary Algorithms (EAs) have shown promise for feature selection in the literature, designing EAs for high dimensions remains challenging. To address this problem, a novel concept of Evolutionary Reinforced Markov Chain is proposed in this paper. The proposed work has the following contributions and merits: (i) the paradigms of evolutionary computation, reinforcement learning, and Markov chains are combined into an integrated, recursive framework for feature selection in high-dimensional spaces; (ii) to support global convergence of the algorithm and manage its computational complexity, a restricted group of the most effective agents is maintained within the evolutionary population; (iii) the dynamic Markov chain process efficiently manages agent evolution and communication, ensuring effective navigation through the search space; (iv) agents moving in a promising direction are rewarded with an increase in their associated transition probability, while agents moving in the wrong direction are discouraged with a decrease, which promotes an equilibrium state and leads to convergence; (v) the effective size of successful agents is reduced recursively while progressing through different states, further speeding up convergence and decreasing the number of selected features; (vi) performance comparison with state-of-the-art feature selection methods shows that the proposed method significantly improves on existing methods.
Title: Reinforced steering Evolutionary Markov Chain for high-dimensional feature selection
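Point (iv) can be illustrated with a minimal transition-probability update: a multiplicative reward or penalty followed by renormalization. The factors below are assumed for illustration; the paper's exact update rule is not given in the abstract.

```python
def update_transition_probs(probs, agent, improved, reward=1.2, penalty=0.8):
    """Multiplicatively reward the transition probability of an agent that
    moved in a promising direction (improved fitness) and penalize one that
    did not, then renormalize so the distribution still sums to one."""
    factor = reward if improved else penalty
    scaled = [p * factor if i == agent else p for i, p in enumerate(probs)]
    total = sum(scaled)
    return [p / total for p in scaled]
```

Renormalization means a reward for one agent implicitly shifts probability mass away from the others, which is what drives the equilibrium described in the abstract.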
Pub Date: 2024-08-10 | DOI: 10.1016/j.swevo.2024.101683
Compared with common multi-objective optimization problems, constrained multi-objective optimization problems demand additional consideration of how constraints are treated. Recently, many constrained multi-objective evolutionary algorithms have been presented to reconcile constraint satisfaction with objective optimization. Notably, evolutionary multi-task mechanisms have also been exploited for constrained multi-objective problems, frequently with remarkable outcomes. However, previous methods do not handle all types of constraint landscapes and are superior only for certain classes of problems. Thus, this paper proposes a novel dynamic-multi-task-assisted constrained multi-objective optimization algorithm, termed DTCMO, involving three dynamic tasks. The main task approaches the constrained Pareto front by adding new constraints dynamically. Two auxiliary tasks explore the unconstrained Pareto front and the constrained Pareto front with dynamically changing constraint boundaries, respectively. In addition, the first auxiliary task stops evolving automatically after reaching the unconstrained Pareto front, avoiding the waste of subsequent computational resources. A series of experiments against eight mainstream algorithms on five benchmark problems confirms the generality and superiority of DTCMO.
Title: Dynamic-multi-task-assisted evolutionary algorithm for constrained multi-objective optimization
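The auxiliary task with dynamically changing constraint boundaries suggests a shrinking constraint relaxation: the tolerated violation starts wide and tightens to zero over the run, pulling the population from the unconstrained front toward the constrained one. The quadratic schedule below is one common choice, sketched as an assumption; the abstract does not specify DTCMO's actual rule.

```python
def constraint_boundary(gen, max_gen, eps0, power=2.0):
    """Dynamically tightening constraint relaxation: the allowed constraint
    violation starts at eps0 at generation 0 and shrinks to exactly 0 at
    the final generation, following a polynomial decay."""
    return eps0 * (1.0 - gen / max_gen) ** power
```

A solution is then treated as feasible at generation `gen` whenever its total constraint violation is below `constraint_boundary(gen, max_gen, eps0)`.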
Pub Date: 2024-08-09 | DOI: 10.1016/j.swevo.2024.101686
With the increasing demand for surgeries, surgery scheduling has become an important problem in hospital management. Efficient surgery scheduling promotes the optimal use of surgical resources, leading to highly efficient surgery assignments. This work addresses surgery scheduling problems with surgical resource setup times. First, a mathematical model is established to describe the considered problems, with the objective of minimizing the maximum completion time of the surgeries (makespan). Second, a modified artificial bee colony (ABC) algorithm, named QABC, is proposed. Six local search operators are developed based on the characteristics of the problem to strengthen the local search capability of the algorithm. To further improve performance, a Q-learning strategy is combined with the ABC algorithm: during each iteration, Q-learning guides the selection of search operators. Finally, the effectiveness of the local search operators and the Q-learning-based operator selection is verified by solving 20 cases of varying scale, and QABC is compared with both the Gurobi solver and state-of-the-art algorithms. The experimental results and comparisons show that QABC is more effective than its peers for solving surgery scheduling problems with setup times.
Title: A Q-learning based artificial bee colony algorithm for solving surgery scheduling problems with setup time
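Q-learning-guided operator selection of the kind described above can be sketched as a small tabular learner choosing among local search operators. The single-state design, epsilon-greedy policy, and parameter values below are illustrative assumptions, not QABC's actual components; a natural reward would be the makespan improvement achieved by the chosen operator.

```python
import random

class OperatorSelector:
    """Tabular Q-learning over a set of local search operators, with a
    single state and epsilon-greedy action selection."""

    def __init__(self, n_ops, alpha=0.1, gamma=0.9, epsilon=0.2):
        self.q = [0.0] * n_ops
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon

    def select(self):
        if random.random() < self.epsilon:                       # explore
            return random.randrange(len(self.q))
        return max(range(len(self.q)), key=self.q.__getitem__)   # exploit

    def update(self, op, reward):
        # Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s,a') - Q(s,a))
        target = reward + self.gamma * max(self.q)
        self.q[op] += self.alpha * (target - self.q[op])
```

Over iterations, operators that repeatedly yield improvement accumulate higher Q-values and are selected more often, while epsilon keeps every operator occasionally tried.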
Pub Date: 2024-08-09 | DOI: 10.1016/j.swevo.2024.101697
In this paper, we address the problem of dispatching multiple automated guided vehicles (AGVs) in an actual production workshop, aiming to minimize the transportation cost. To solve this problem, a self-adaptive memetic algorithm with Q-learning (Q-SAMA) is proposed. An improved nearest-neighbor task division heuristic is used to generate high-quality initial solutions, and Q-learning is integrated to select appropriate neighborhood operators, enhancing the algorithm's exploration ability. A restart strategy is introduced to prevent the algorithm from falling into local optima. To adapt Q-SAMA to different stages of the search, fixed crossover and mutation probabilities are no longer used; instead, self-adaptive probabilities are derived from the population's degree of concentration and the sparsity of individuals' fitness values. Finally, experimental results validate the effectiveness of the proposed method: it yields better results than five other state-of-the-art algorithms.
Title: A self-adaptive memetic algorithm with Q-learning for solving the multi-AGVs dispatching problem
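A self-adaptive probability driven by the population's degree of concentration, as described above, can be sketched as follows. The concentration measure (a normalized fitness spread) and the bounds are assumptions for illustration, not Q-SAMA's actual formula.

```python
def adaptive_mutation_rate(fitnesses, p_min=0.1, p_max=0.9):
    """Self-adaptive mutation probability: when the population is highly
    concentrated (small spread of fitness values) the rate rises to push
    exploration; when the population is diverse the rate falls toward p_min."""
    lo, hi = min(fitnesses), max(fitnesses)
    spread = (hi - lo) / (abs(hi) + abs(lo) + 1e-12)   # 0 = fully concentrated
    concentration = 1.0 - min(1.0, spread)
    return p_min + (p_max - p_min) * concentration
```

Tying the rate to population state rather than to a fixed constant is what lets the algorithm behave differently in early (diverse) and late (converged) search stages.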
Pub Date: 2024-08-09 | DOI: 10.1016/j.swevo.2024.101685
When evolutionary algorithms are employed to tackle constrained multiobjective optimization problems (CMOPs), constraint handling techniques (CHTs) play a pivotal role. To date, several CHTs have been designed, but each is effective only for certain types of constraint landscapes, and for CMOPs with unknown properties their optimization performance and efficiency remain uncertain. To tackle this issue, we attempt to mine and utilize knowledge of the constraint landscape to solve CMOPs. Specifically, the evolutionary process is divided into three stages: a learning stage, a classification stage, and an evolving stage. During the learning stage, two populations, mainPop and auxPop, cooperatively evolve with and without considering constraints, respectively: mainPop locates the feasible regions, while auxPop is employed to estimate their size. Subsequently, in the classification stage, the learned landscape knowledge determines the category of the problem: a CMOP with small feasible regions or a CMOP with large feasible regions. Then, in the evolving stage, CHTI, which includes a population exchange method and a feasible-region relaxation method, is proposed for CMOPs with small feasible regions, while CHTII, which encompasses a dynamic resource allocation method and a coevolutionary method, is designed for CMOPs with large feasible regions. The proposed framework is executed on extensive benchmark test suites and achieves superior or at least competitive performance compared with other state-of-the-art algorithms. Furthermore, the framework has been successfully applied to the robotic manipulator path planning problem.
Title: Constraint landscape knowledge assisted constrained multiobjective optimization
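The classification stage can be illustrated by estimating feasible-region size from the unconstrained auxiliary population: the fraction of auxPop that happens to be feasible serves as a proxy for how large the feasible regions are. The threshold below is an assumption for illustration, not the paper's value.

```python
def classify_feasible_region(aux_population, is_feasible, threshold=0.2):
    """Classify a CMOP as having large or small feasible regions from the
    fraction of an unconstrained auxiliary population that is feasible."""
    ratio = sum(1 for x in aux_population if is_feasible(x)) / len(aux_population)
    return "large" if ratio >= threshold else "small"
```

The classifier's output would then dispatch the run to CHTI (small regions) or CHTII (large regions).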
Pub Date: 2024-08-09 | DOI: 10.1016/j.swevo.2024.101693
In dynamic multi-objective optimization problems (DMOPs), once an environmental change is detected, an appropriate response strategy must be employed to react quickly. Predictive mechanisms are effective at detecting the patterns of change in a problem and are often used to track the Pareto front (PF) in a new environment. However, these methods often rely on historical optimization results to approximate solutions in the new environment, and low-quality historical solutions can yield poor predictions and mislead population convergence. This paper proposes a dual mechanism of prediction and archive (DMPA_DMOEA) to address this problem. The improvements include: (1) well-distributed solutions from the previous environment are retained to ensure that reliable solutions exist in the new environment; (2) an LSTM neural network is used to construct the predictor, which makes full use of historical information and fits the nonlinear relationship among successive Pareto sets (PS), improving the accuracy of the predicted solutions; (3) the archived solutions and the predicted solutions collectively form the initial population for the new environment, which improves its quality and maintains excellent tracking performance. Finally, multiple benchmark problems with different types of change are tested to validate the effectiveness of the proposed algorithm. Experimental results show that the proposed algorithm handles DMOPs effectively and is markedly superior to state-of-the-art algorithms.
Title: A dynamic multi-objective optimization algorithm with a dual mechanism based on prediction and archive
Pub Date : 2024-08-08DOI: 10.1016/j.swevo.2024.101692
The success of deep learning in several challenging tasks has been carried over to complex control problems from different domains through Deep Reinforcement Learning (DRL). Although DRL tasks have been extensively formulated and solved as single-objective problems, nearly all real-world RL problems feature two or more conflicting objectives, where the goal is to obtain a high-quality, diverse set of optimal policies covering different objective preferences. Consequently, the development of Multi-Objective Deep Reinforcement Learning (MODRL) algorithms has gained considerable traction in the literature. Evolutionary Algorithms (EAs) have been demonstrated to be scalable alternatives to classical DRL paradigms when the task is formulated as an optimization problem, so it is reasonable to employ Multi-objective Evolutionary Algorithms (MOEAs) to handle MODRL tasks. However, several factors constrain progress along this line: first, there is no general problem formulation of MODRL tasks from an optimization perspective; second, benchmark assessment of MOEAs on MODRL problems poses several challenges. To overcome these limitations: (i) we present a formulation of MODRL tasks as general multi-objective optimization problems and analyze their complex characteristics from an optimization perspective; (ii) we present an end-to-end framework, termed DRLXBench, to generate MODRL benchmark test problems for the seamless running of MOEAs; (iii) we propose a test suite comprising 12 MODRL problems with different characteristics, such as many objectives, degenerate Pareto fronts, and concave and convex problems; and (iv) we present and discuss baseline results on the proposed test problems using seven representative MOEAs.
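Treating DRL as a multi-objective optimization problem, as the abstract describes, amounts to evaluating each policy on several conflicting returns and searching for the non-dominated set. The numpy sketch below illustrates that framing under stated assumptions: `evaluate_policy` is a hypothetical stand-in for an episodic rollout (it is not part of DRLXBench), and the two analytic objectives merely emulate a speed-versus-energy trade-off.

```python
import numpy as np

def evaluate_policy(theta):
    """Hypothetical stand-in for a DRL rollout: maps policy parameters
    theta to two conflicting episodic returns (both maximized)."""
    speed = -np.sum((theta - 1.0) ** 2)    # best near theta = +1
    energy = -np.sum((theta + 1.0) ** 2)   # best near theta = -1
    return np.array([speed, energy])

def nondominated(points):
    """Indices of non-dominated rows under maximization."""
    keep = []
    for i, p in enumerate(points):
        dominated = any(
            np.all(q >= p) and np.any(q > p)
            for j, q in enumerate(points) if j != i
        )
        if not dominated:
            keep.append(i)
    return keep

rng = np.random.default_rng(1)
pop = rng.uniform(-2.0, 2.0, (50, 4))              # candidate policy vectors
objs = np.array([evaluate_policy(t) for t in pop])  # 50 x 2 objective matrix
front = nondominated(objs)                          # what a MOEA approximates
```

A MOEA applied to such a problem evolves `pop` directly in parameter space, which is the scalable alternative to gradient-based DRL that the abstract refers to.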
{"title":"Deep reinforcement learning as multiobjective optimization benchmarks: Problem formulation and performance assessment","authors":"","doi":"10.1016/j.swevo.2024.101692","DOIUrl":"10.1016/j.swevo.2024.101692","url":null,"abstract":"<div><p>The success of deep learning in several challenging tasks has been carried over to complex control problems from different domains through Deep Reinforcement Learning (DRL). Although DRL tasks have been extensively formulated and solved as single-objective problems, nearly all real-world RL problems feature two or more conflicting objectives, where the goal is to obtain a high-quality, diverse set of optimal policies covering different objective preferences. Consequently, the development of Multi-Objective Deep Reinforcement Learning (MODRL) algorithms has gained considerable traction in the literature. Evolutionary Algorithms (EAs) have been demonstrated to be scalable alternatives to classical DRL paradigms when the task is formulated as an optimization problem, so it is reasonable to employ Multi-objective Evolutionary Algorithms (MOEAs) to handle MODRL tasks. However, several factors constrain progress along this line: first, there is no general problem formulation of MODRL tasks from an optimization perspective; second, benchmark assessment of MOEAs on MODRL problems poses several challenges. To overcome these limitations: (i) we present a formulation of MODRL tasks as general multi-objective optimization problems and analyze their complex characteristics from an optimization perspective; (ii) we present an end-to-end framework, termed DRLXBench, to generate MODRL benchmark test problems for the seamless running of MOEAs; (iii) we propose a test suite comprising 12 MODRL problems with different characteristics, such as many objectives, degenerate Pareto fronts, and concave and convex problems; and (iv) we present and discuss baseline results on the proposed test problems using seven representative MOEAs.</p></div>","PeriodicalId":48682,"journal":{"name":"Swarm and Evolutionary Computation","volume":null,"pages":null},"PeriodicalIF":8.2,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141935098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}