Learning to Schedule Heuristics for the Simultaneous Stochastic Optimization of Mining Complexes

Yassine Yaakoubi, R. Dimitrakopoulos
{"title":"Learning to Schedule Heuristics for the Simultaneous Stochastic Optimization of Mining Complexes","authors":"Yassine Yaakoubi, R. Dimitrakopoulos","doi":"10.2139/ssrn.4229477","DOIUrl":null,"url":null,"abstract":"The simultaneous stochastic optimization of mining complexes (SSOMC) is a large-scale stochastic combinatorial optimization problem that simultaneously manages the extraction of materials from multiple mines and their processing using interconnected facilities to generate a set of final products, while taking into account material supply (geological) uncertainty to manage the associated risk. Although simulated annealing has been shown to outperform comparing methods for solving the SSOMC, early performance might dominate recent performance in that a combination of the heuristics' performance is used to determine which perturbations to apply. This work proposes a data-driven framework for heuristic scheduling in a fully self-managed hyper-heuristic to solve the SSOMC. The proposed learn-to-perturb (L2P) hyper-heuristic is a multi-neighborhood simulated annealing algorithm. The L2P selects the heuristic (perturbation) to be applied in a self-adaptive manner using reinforcement learning to efficiently explore which local search is best suited for a particular search point. Several state-of-the-art agents have been incorporated into L2P to better adapt the search and guide it towards better solutions. By learning from data describing the performance of the heuristics, a problem-specific ordering of heuristics that collectively finds better solutions faster is obtained. L2P is tested on several real-world mining complexes, with an emphasis on efficiency, robustness, and generalization capacity. Results show a reduction in the number of iterations by 30-50% and in the computational time by 30-45%.","PeriodicalId":10582,"journal":{"name":"Comput. Oper. Res.","volume":"54 1","pages":"106349"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Comput. Oper. Res.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.4229477","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The simultaneous stochastic optimization of mining complexes (SSOMC) is a large-scale stochastic combinatorial optimization problem that simultaneously manages the extraction of materials from multiple mines and their processing using interconnected facilities to generate a set of final products, while taking into account material supply (geological) uncertainty to manage the associated risk. Although simulated annealing has been shown to outperform comparing methods for solving the SSOMC, early performance might dominate recent performance in that a combination of the heuristics' performance is used to determine which perturbations to apply. This work proposes a data-driven framework for heuristic scheduling in a fully self-managed hyper-heuristic to solve the SSOMC. The proposed learn-to-perturb (L2P) hyper-heuristic is a multi-neighborhood simulated annealing algorithm. The L2P selects the heuristic (perturbation) to be applied in a self-adaptive manner using reinforcement learning to efficiently explore which local search is best suited for a particular search point. Several state-of-the-art agents have been incorporated into L2P to better adapt the search and guide it towards better solutions. By learning from data describing the performance of the heuristics, a problem-specific ordering of heuristics that collectively finds better solutions faster is obtained. L2P is tested on several real-world mining complexes, with an emphasis on efficiency, robustness, and generalization capacity. Results show a reduction in the number of iterations by 30-50% and in the computational time by 30-45%.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
采矿复合体同步随机优化的学习调度启发式
矿山复合体同步随机优化(SSOMC)是一个大规模随机组合优化问题,它同时管理从多个矿山中提取材料并使用相互关联的设施进行加工以产生一组最终产品,同时考虑材料供应(地质)的不确定性以管理相关风险。虽然模拟退火在解决SSOMC方面的表现优于比较方法,但早期的性能可能会主导最近的性能,因为启发式性能的组合用于确定应用哪种扰动。本文提出了一个数据驱动的启发式调度框架,在一个完全自我管理的超启发式中解决SSOMC问题。提出的L2P超启发式算法是一种多邻域模拟退火算法。L2P选择启发式(扰动)以自适应的方式应用,使用强化学习来有效地探索最适合特定搜索点的局部搜索。几个最先进的代理已被纳入L2P,以更好地适应搜索并引导其找到更好的解决方案。通过从描述启发式性能的数据中学习,可以获得特定于问题的启发式排序,从而更快地找到更好的解决方案。L2P在几个现实世界的采矿复合体上进行了测试,重点是效率、鲁棒性和泛化能力。结果表明,迭代次数减少了30-50%,计算时间减少了30-45%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A family of hybrid conjugate gradient method with restart procedure for unconstrained optimizations and image restorations Dual-neighborhood iterated local search for routing and wavelength assignment Using submodularity in solving the robust bandwidth packing problem with queuing delay guarantees Loads scheduling for demand response in energy communities Simulation-based inventory management of perishable products via linear discrete choice models
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1