The orienteering problem (OP) is widely applied in real life. However, as the scale of real-world problem scenarios grows quickly, traditional exact, heuristics, and learning-based methods have difficulty balancing optimization accuracy and efficiency. This study proposes a problem decomposition-based double-layer optimization framework named DEA-DYPN to solve OPs. Using a diversity evolutionary algorithm (DEA) as the external optimizer and a dynamic pointer network (DYPN) as the inner optimizer, we significantly reduce the difficulty of solving large-scale OPs. Several targeted optimization operators are innovatively designed for stronger search ability, including a greedy population initialization heuristic, an elite strategy, a population restart mechanism, and a fitness-sharing selection strategy. Moreover, a dynamic embedding mechanism is introduced to DYPN to improve its characteristic learning ability. Extensive comparative experiments on OP instances with sizes from 20 to 500 are conducted for algorithmic performance validation. More experiments and analyses, including the significance test, stability analysis, complexity analysis, sensitivity analysis, and ablation experiments, are also conducted for comprehensive algorithmic evaluation. Experimental results show that our proposed DEA-DYPN ranks first according to the Friedman test and outperforms the competitor algorithms by 69%.
定向行走问题(OP)在现实生活中应用广泛。然而,随着现实世界问题场景规模的快速增长,传统的精确、启发式和基于学习的方法难以兼顾优化精度和效率。本研究提出了一种基于问题分解的双层优化框架,名为 DEA-DYPN,用于解决 OPs。以多样性进化算法(DEA)为外部优化器,以动态指针网络(DYPN)为内部优化器,大大降低了大规模 OP 的求解难度。为了增强搜索能力,我们创新性地设计了几种有针对性的优化算子,包括贪婪种群初始化启发式、精英策略、种群重启机制和适配性共享选择策略。此外,DYPN 还引入了动态嵌入机制,以提高其特有的学习能力。为了验证算法的性能,我们在 20 到 500 个 OP 实例上进行了广泛的对比实验。此外,还进行了更多的实验和分析,包括显著性检验、稳定性分析、复杂性分析、灵敏度分析和消融实验,以对算法进行综合评估。实验结果表明,根据弗里德曼测试,我们提出的 DEA-DYPN 排在第一位,比竞争算法高出 69%。
{"title":"Solving Orienteering Problems by Hybridizing Evolutionary Algorithm and Deep Reinforcement Learning","authors":"Rui Wang;Wei Liu;Kaiwen Li;Tao Zhang;Ling Wang;Xin Xu","doi":"10.1109/TAI.2024.3409520","DOIUrl":"https://doi.org/10.1109/TAI.2024.3409520","url":null,"abstract":"The orienteering problem (OP) is widely applied in real life. However, as the scale of real-world problem scenarios grows quickly, traditional exact, heuristics, and learning-based methods have difficulty balancing optimization accuracy and efficiency. This study proposes a problem decomposition-based double-layer optimization framework named DEA-DYPN to solve OPs. Using a diversity evolutionary algorithm (DEA) as the external optimizer and a dynamic pointer network (DYPN) as the inner optimizer, we significantly reduce the difficulty of solving large-scale OPs. Several targeted optimization operators are innovatively designed for stronger search ability, including a greedy population initialization heuristic, an elite strategy, a population restart mechanism, and a fitness-sharing selection strategy. Moreover, a dynamic embedding mechanism is introduced to DYPN to improve its characteristic learning ability. Extensive comparative experiments on OP instances with sizes from 20 to 500 are conducted for algorithmic performance validation. More experiments and analyses, including the significance test, stability analysis, complexity analysis, sensitivity analysis, and ablation experiments, are also conducted for comprehensive algorithmic evaluation. Experimental results show that our proposed DEA-DYPN ranks first according to the Friedman test and outperforms the competitor algorithms by 69%.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 11","pages":"5493-5508"},"PeriodicalIF":0.0,"publicationDate":"2024-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-04DOI: 10.1109/TAI.2024.3408717
Kyle Worrall;Zongyu Yin;Tom Collins
There have been many attempts to model the ability of human musicians to take a score and perform or render it expressively, by adding tempo, timing, loudness, and articulation changes to nonexpressive music data. While expressive rendering models exist in academic research, most of these are not open source or accessible, meaning they are difficult to evaluate empirically and have not been widely adopted in professional music software. Systematic comparative evaluation of such algorithms stopped after the last performance rendering contest (RENCON) in 2013, making it difficult to compare newer models to existing work in a fair and valid way. In this article, we introduce the first transformer-based model for expressive rendering, cue-free express + pedal (CFE + P), which predicts expressive attributes such as notewise dynamics and micro-timing adjustments, and beatwise tempo and sustain pedal use based only on the start and end times and pitches of notes (e.g., inexpressive musical instrument digital interface (MIDI) input). We perform two comparative evaluations on our model against a nonmachine learning baseline taken from professional music software and two open-source algorithms—a feedforward neural network (FFNN) and hierarchical recurrent neural network (HRNN). The results of two listening studies indicate that our model renders passages that outperform what can be done in professional music software such as Logic Pro and Ableton Live. 1