Evaluating multiobjective evolutionary algorithms: A real-world benchmarking framework

J. Z. Salazar, D. Hadka, P. Reed, H. Seada, K. Deb
{"title":"Evaluating multiobjective evolutionary algorithms: A real-world benchmarking framework","authors":"J. Z. Salazar, D. Hadka, P. Reed, H. Seada, K. Deb","doi":"10.36334/modsim.2023.salazar","DOIUrl":null,"url":null,"abstract":": Multiobjective evolutionary algorithms (MOEAs) have shown significant progress in addressing well-defined test problems, but their effectiveness in real-world applications remains uncertain. To bridge this gap, we provide a comprehensive benchmarking framework designed to rigorously evaluate state-of-the-art MOEAs in real-world scenarios. Our framework comprises a suite of statistical evaluation metrics, for a collection of diverse real-world benchmark applications representing various mathematical problem difficulties. In this study, we carefully selected four benchmark applications with 3 to 10 objectives, capturing challenging characteristics such as stochastic objectives, severe constraints, strong nonlinearity, and complex Pareto frontiers. We evaluated the performance of five popular MOEAs, including NSGA-II, NSGA-III, RVEA, MOEA/D, and the Borg MOEA, using our benchmarking framework. Multiobjective evolutionary algorithms (MOEAs) have shown significant progress in addressing well-defined test problems, but their effectiveness in real-world applications remains uncertain. To bridge this gap, we provide a comprehensive benchmarking framework designed to rigorously evaluate state-of-the-art MOEAs in real-world scenarios. Our framework comprises a suite of statistical evaluation metrics, for a collection of diverse real-world benchmark applications representing various mathematical problem difficulties. The results revealed distinct differences in the performance of the evaluated MOEAs across the real-world applications. Surprisingly, MOEAs that excelled on standard test problems struggled when confronted with the complexities inherent in real-world applications. These findings underscore the need to enhance the adaptability and ease-of-use of MOEAs, considering the often ill-defined nature of real-world problem solving. Furthermore, our study provides insights into successful algorithmic design choices for MOEAs. Optimal selection strategies and archive mechanisms are crucial to prevent deterioration, maintain diversity, and provide adequate selection pressure throughout the optimization process. Additionally, the choice of stable and flexible operators plays a vital role in reliably driving the search towards the Pareto front. Recent advancements in hyper-heuristics and multi-operator MOEAs offer promising automated approaches for tackling these challenges. We found that epsilon non-dominated sorting effectively maintains diversity and selection pressure for problems with up to ten objectives when the entire Pareto front is desired. Moreover, auto-adaptive search operators demonstrate their efficacy in adapting to the search landscape of diverse real-world applications. However, the performance of reference point/vector methods deteriorated at higher dimensions, indicating the need for further investigation. Our study highlights the inadequacy of existing test benchmarks in differentiating MOEAs based on real-world performance. While considerable efforts have focused on optimizing algorithms for test problems, the subpar performance of MOEAs in real-world settings persists. The benchmarking framework and results presented here aim to foster collaborative efforts, encouraging the development of a diverse suite of real-world benchmarking problems. The flexibility of our framework also allows for exploration of new or hybrid algorithm architectures in benchmarking studies.","PeriodicalId":390064,"journal":{"name":"MODSIM2023, 25th International Congress on Modelling and Simulation.","volume":"68 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"MODSIM2023, 25th International Congress on Modelling and Simulation.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36334/modsim.2023.salazar","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

: Multiobjective evolutionary algorithms (MOEAs) have shown significant progress in addressing well-defined test problems, but their effectiveness in real-world applications remains uncertain. To bridge this gap, we provide a comprehensive benchmarking framework designed to rigorously evaluate state-of-the-art MOEAs in real-world scenarios. Our framework comprises a suite of statistical evaluation metrics, for a collection of diverse real-world benchmark applications representing various mathematical problem difficulties. In this study, we carefully selected four benchmark applications with 3 to 10 objectives, capturing challenging characteristics such as stochastic objectives, severe constraints, strong nonlinearity, and complex Pareto frontiers. We evaluated the performance of five popular MOEAs, including NSGA-II, NSGA-III, RVEA, MOEA/D, and the Borg MOEA, using our benchmarking framework. Multiobjective evolutionary algorithms (MOEAs) have shown significant progress in addressing well-defined test problems, but their effectiveness in real-world applications remains uncertain. To bridge this gap, we provide a comprehensive benchmarking framework designed to rigorously evaluate state-of-the-art MOEAs in real-world scenarios. Our framework comprises a suite of statistical evaluation metrics, for a collection of diverse real-world benchmark applications representing various mathematical problem difficulties. The results revealed distinct differences in the performance of the evaluated MOEAs across the real-world applications. Surprisingly, MOEAs that excelled on standard test problems struggled when confronted with the complexities inherent in real-world applications. These findings underscore the need to enhance the adaptability and ease-of-use of MOEAs, considering the often ill-defined nature of real-world problem solving. Furthermore, our study provides insights into successful algorithmic design choices for MOEAs. Optimal selection strategies and archive mechanisms are crucial to prevent deterioration, maintain diversity, and provide adequate selection pressure throughout the optimization process. Additionally, the choice of stable and flexible operators plays a vital role in reliably driving the search towards the Pareto front. Recent advancements in hyper-heuristics and multi-operator MOEAs offer promising automated approaches for tackling these challenges. We found that epsilon non-dominated sorting effectively maintains diversity and selection pressure for problems with up to ten objectives when the entire Pareto front is desired. Moreover, auto-adaptive search operators demonstrate their efficacy in adapting to the search landscape of diverse real-world applications. However, the performance of reference point/vector methods deteriorated at higher dimensions, indicating the need for further investigation. Our study highlights the inadequacy of existing test benchmarks in differentiating MOEAs based on real-world performance. While considerable efforts have focused on optimizing algorithms for test problems, the subpar performance of MOEAs in real-world settings persists. The benchmarking framework and results presented here aim to foster collaborative efforts, encouraging the development of a diverse suite of real-world benchmarking problems. The flexibility of our framework also allows for exploration of new or hybrid algorithm architectures in benchmarking studies.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
评估多目标进化算法:一个真实世界的基准框架
多目标进化算法(moea)在解决定义良好的测试问题方面取得了重大进展,但其在实际应用中的有效性仍不确定。为了弥补这一差距,我们提供了一个全面的基准框架,旨在严格评估现实世界中最先进的moea。我们的框架包括一套统计评估指标,用于代表各种数学问题困难的各种现实世界基准应用程序的集合。在这项研究中,我们精心选择了4个具有3到10个目标的基准应用程序,捕捉了随机目标、严格约束、强非线性和复杂帕累托边界等具有挑战性的特征。我们使用我们的基准测试框架评估了五种流行的MOEA的性能,包括NSGA-II、NSGA-III、RVEA、MOEA/D和Borg MOEA。多目标进化算法(moea)在解决定义良好的测试问题方面取得了重大进展,但其在实际应用中的有效性仍不确定。为了弥补这一差距,我们提供了一个全面的基准框架,旨在严格评估现实世界中最先进的moea。我们的框架包括一套统计评估指标,用于代表各种数学问题困难的各种现实世界基准应用程序的集合。结果显示,经过评估的moea在实际应用中的性能存在明显差异。令人惊讶的是,在标准测试问题上表现出色的moea在面对现实世界应用程序中固有的复杂性时却举步维艰。考虑到现实世界问题解决的不明确性质,这些发现强调了增强moea的适应性和易用性的必要性。此外,我们的研究为moea的成功算法设计选择提供了见解。在整个优化过程中,优化选择策略和存档机制对于防止退化、保持多样性和提供足够的选择压力至关重要。此外,选择稳定和灵活的算子在可靠地推动搜索到帕累托前沿方面起着至关重要的作用。最近在超启发式和多操作员moea方面的进展为解决这些挑战提供了有前途的自动化方法。我们发现,当需要整个Pareto前线时,epsilon非支配排序有效地保持了多达十个目标问题的多样性和选择压力。此外,自适应搜索操作符在适应各种实际应用程序的搜索环境方面显示出其有效性。然而,参考点/矢量法在高维下的性能下降,表明需要进一步研究。我们的研究强调了现有测试基准在根据实际性能区分moea方面的不足。虽然在优化测试问题的算法方面已经付出了相当大的努力,但moea在现实环境中的表现仍然不理想。本文介绍的基准测试框架和结果旨在促进协作,鼓励开发各种现实世界的基准测试问题。我们框架的灵活性也允许在基准研究中探索新的或混合算法架构。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Modelling of the activated sludge process with a stratified settling unit Recent changes in the water and ecological condition at the arid Tarim River Basin A study on internal observation of vertical protective nets of temporary structures using image processing techniques Developing synthetic datasets for reef modelling Modelling hydrological impact of remotely sensed vegetation change
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1