Evaluating multiobjective evolutionary algorithms: A real-world benchmarking framework

MODSIM2023, 25th International Congress on Modelling and Simulation. Pub Date : 2023-08-01 DOI:10.36334/modsim.2023.salazar

J. Z. Salazar, D. Hadka, P. Reed, H. Seada, K. Deb

{"title":"Evaluating multiobjective evolutionary algorithms: A real-world benchmarking framework","authors":"J. Z. Salazar, D. Hadka, P. Reed, H. Seada, K. Deb","doi":"10.36334/modsim.2023.salazar","DOIUrl":null,"url":null,"abstract":": Multiobjective evolutionary algorithms (MOEAs) have shown significant progress in addressing well-defined test problems, but their effectiveness in real-world applications remains uncertain. To bridge this gap, we provide a comprehensive benchmarking framework designed to rigorously evaluate state-of-the-art MOEAs in real-world scenarios. Our framework comprises a suite of statistical evaluation metrics, for a collection of diverse real-world benchmark applications representing various mathematical problem difficulties. In this study, we carefully selected four benchmark applications with 3 to 10 objectives, capturing challenging characteristics such as stochastic objectives, severe constraints, strong nonlinearity, and complex Pareto frontiers. We evaluated the performance of five popular MOEAs, including NSGA-II, NSGA-III, RVEA, MOEA/D, and the Borg MOEA, using our benchmarking framework. Multiobjective evolutionary algorithms (MOEAs) have shown significant progress in addressing well-defined test problems, but their effectiveness in real-world applications remains uncertain. To bridge this gap, we provide a comprehensive benchmarking framework designed to rigorously evaluate state-of-the-art MOEAs in real-world scenarios. Our framework comprises a suite of statistical evaluation metrics, for a collection of diverse real-world benchmark applications representing various mathematical problem difficulties. The results revealed distinct differences in the performance of the evaluated MOEAs across the real-world applications. Surprisingly, MOEAs that excelled on standard test problems struggled when confronted with the complexities inherent in real-world applications. These findings underscore the need to enhance the adaptability and ease-of-use of MOEAs, considering the often ill-defined nature of real-world problem solving. Furthermore, our study provides insights into successful algorithmic design choices for MOEAs. Optimal selection strategies and archive mechanisms are crucial to prevent deterioration, maintain diversity, and provide adequate selection pressure throughout the optimization process. Additionally, the choice of stable and flexible operators plays a vital role in reliably driving the search towards the Pareto front. Recent advancements in hyper-heuristics and multi-operator MOEAs offer promising automated approaches for tackling these challenges. We found that epsilon non-dominated sorting effectively maintains diversity and selection pressure for problems with up to ten objectives when the entire Pareto front is desired. Moreover, auto-adaptive search operators demonstrate their efficacy in adapting to the search landscape of diverse real-world applications. However, the performance of reference point/vector methods deteriorated at higher dimensions, indicating the need for further investigation. Our study highlights the inadequacy of existing test benchmarks in differentiating MOEAs based on real-world performance. While considerable efforts have focused on optimizing algorithms for test problems, the subpar performance of MOEAs in real-world settings persists. The benchmarking framework and results presented here aim to foster collaborative efforts, encouraging the development of a diverse suite of real-world benchmarking problems. The flexibility of our framework also allows for exploration of new or hybrid algorithm architectures in benchmarking studies.","PeriodicalId":390064,"journal":{"name":"MODSIM2023, 25th International Congress on Modelling and Simulation.","volume":"68 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"MODSIM2023, 25th International Congress on Modelling and Simulation.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36334/modsim.2023.salazar","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

: Multiobjective evolutionary algorithms (MOEAs) have shown significant progress in addressing well-defined test problems, but their effectiveness in real-world applications remains uncertain. To bridge this gap, we provide a comprehensive benchmarking framework designed to rigorously evaluate state-of-the-art MOEAs in real-world scenarios. Our framework comprises a suite of statistical evaluation metrics, for a collection of diverse real-world benchmark applications representing various mathematical problem difficulties. In this study, we carefully selected four benchmark applications with 3 to 10 objectives, capturing challenging characteristics such as stochastic objectives, severe constraints, strong nonlinearity, and complex Pareto frontiers. We evaluated the performance of five popular MOEAs, including NSGA-II, NSGA-III, RVEA, MOEA/D, and the Borg MOEA, using our benchmarking framework. Multiobjective evolutionary algorithms (MOEAs) have shown significant progress in addressing well-defined test problems, but their effectiveness in real-world applications remains uncertain. To bridge this gap, we provide a comprehensive benchmarking framework designed to rigorously evaluate state-of-the-art MOEAs in real-world scenarios. Our framework comprises a suite of statistical evaluation metrics, for a collection of diverse real-world benchmark applications representing various mathematical problem difficulties. The results revealed distinct differences in the performance of the evaluated MOEAs across the real-world applications. Surprisingly, MOEAs that excelled on standard test problems struggled when confronted with the complexities inherent in real-world applications. These findings underscore the need to enhance the adaptability and ease-of-use of MOEAs, considering the often ill-defined nature of real-world problem solving. Furthermore, our study provides insights into successful algorithmic design choices for MOEAs. Optimal selection strategies and archive mechanisms are crucial to prevent deterioration, maintain diversity, and provide adequate selection pressure throughout the optimization process. Additionally, the choice of stable and flexible operators plays a vital role in reliably driving the search towards the Pareto front. Recent advancements in hyper-heuristics and multi-operator MOEAs offer promising automated approaches for tackling these challenges. We found that epsilon non-dominated sorting effectively maintains diversity and selection pressure for problems with up to ten objectives when the entire Pareto front is desired. Moreover, auto-adaptive search operators demonstrate their efficacy in adapting to the search landscape of diverse real-world applications. However, the performance of reference point/vector methods deteriorated at higher dimensions, indicating the need for further investigation. Our study highlights the inadequacy of existing test benchmarks in differentiating MOEAs based on real-world performance. While considerable efforts have focused on optimizing algorithms for test problems, the subpar performance of MOEAs in real-world settings persists. The benchmarking framework and results presented here aim to foster collaborative efforts, encouraging the development of a diverse suite of real-world benchmarking problems. The flexibility of our framework also allows for exploration of new or hybrid algorithm architectures in benchmarking studies.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

评估多目标进化算法:一个真实世界的基准框架

多目标进化算法(moea)在解决定义良好的测试问题方面取得了重大进展，但其在实际应用中的有效性仍不确定。为了弥补这一差距，我们提供了一个全面的基准框架，旨在严格评估现实世界中最先进的moea。我们的框架包括一套统计评估指标，用于代表各种数学问题困难的各种现实世界基准应用程序的集合。在这项研究中，我们精心选择了4个具有3到10个目标的基准应用程序，捕捉了随机目标、严格约束、强非线性和复杂帕累托边界等具有挑战性的特征。我们使用我们的基准测试框架评估了五种流行的MOEA的性能，包括NSGA-II、NSGA-III、RVEA、MOEA/D和Borg MOEA。多目标进化算法(moea)在解决定义良好的测试问题方面取得了重大进展，但其在实际应用中的有效性仍不确定。为了弥补这一差距，我们提供了一个全面的基准框架，旨在严格评估现实世界中最先进的moea。我们的框架包括一套统计评估指标，用于代表各种数学问题困难的各种现实世界基准应用程序的集合。结果显示，经过评估的moea在实际应用中的性能存在明显差异。令人惊讶的是，在标准测试问题上表现出色的moea在面对现实世界应用程序中固有的复杂性时却举步维艰。考虑到现实世界问题解决的不明确性质，这些发现强调了增强moea的适应性和易用性的必要性。此外，我们的研究为moea的成功算法设计选择提供了见解。在整个优化过程中，优化选择策略和存档机制对于防止退化、保持多样性和提供足够的选择压力至关重要。此外，选择稳定和灵活的算子在可靠地推动搜索到帕累托前沿方面起着至关重要的作用。最近在超启发式和多操作员moea方面的进展为解决这些挑战提供了有前途的自动化方法。我们发现，当需要整个Pareto前线时，epsilon非支配排序有效地保持了多达十个目标问题的多样性和选择压力。此外，自适应搜索操作符在适应各种实际应用程序的搜索环境方面显示出其有效性。然而，参考点/矢量法在高维下的性能下降，表明需要进一步研究。我们的研究强调了现有测试基准在根据实际性能区分moea方面的不足。虽然在优化测试问题的算法方面已经付出了相当大的努力，但moea在现实环境中的表现仍然不理想。本文介绍的基准测试框架和结果旨在促进协作，鼓励开发各种现实世界的基准测试问题。我们框架的灵活性也允许在基准研究中探索新的或混合算法架构。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

MODSIM2023, 25th International Congress on Modelling and Simulation.

自引率

0.00%

发文量