Lorenzo Canonne , Bilel Derbel , Miwako Tsuji , Mitsuhisa Sato
{"title":"超级计算机 \"Fugaku \"上的大规模协同灰箱并行优化","authors":"Lorenzo Canonne , Bilel Derbel , Miwako Tsuji , Mitsuhisa Sato","doi":"10.1016/j.jpdc.2024.104921","DOIUrl":null,"url":null,"abstract":"<div><p>We design, develop and analyze parallel variants of a state-of-the-art graybox optimization algorithm, namely <span>Drils</span> (Deterministic Recombination and Iterated Local Search), for attacking large-scale pseudo-boolean optimization problems on top of the large-scale computing facilities offered by the supercomputer Fugaku. We first adopt a Master/Worker design coupled with a fully distributed Island-based model, ending up with a number of hybrid OpenMP/MPI implementations of high-level parallel <span>Drils</span> versions. We show that such a design, although effective, can be substantially improved by enabling a more focused iteration-level cooperation mechanism between the core graybox components of the original serial <span>Drils</span> algorithm. Extensive experiments are conducted in order to provide a systematic analysis of the impact of the designed parallel algorithms on search behavior, and their ability to compute high-quality solutions using increasing number of CPU-cores. Results using up to 1024×12-cores NUMA nodes, and NK-landscapes with up to <span><math><mn>10</mn><mo>,</mo><mn>000</mn></math></span> binary variables are reported, providing evidence on the relative strength of the designed hybrid cooperative graybox parallel search.</p></div>","PeriodicalId":54775,"journal":{"name":"Journal of Parallel and Distributed Computing","volume":"191 ","pages":"Article 104921"},"PeriodicalIF":3.4000,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Large-scale and cooperative graybox parallel optimization on the supercomputer Fugaku\",\"authors\":\"Lorenzo Canonne , Bilel Derbel , Miwako Tsuji , Mitsuhisa Sato\",\"doi\":\"10.1016/j.jpdc.2024.104921\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>We design, develop and analyze parallel variants of a state-of-the-art graybox optimization algorithm, namely <span>Drils</span> (Deterministic Recombination and Iterated Local Search), for attacking large-scale pseudo-boolean optimization problems on top of the large-scale computing facilities offered by the supercomputer Fugaku. We first adopt a Master/Worker design coupled with a fully distributed Island-based model, ending up with a number of hybrid OpenMP/MPI implementations of high-level parallel <span>Drils</span> versions. We show that such a design, although effective, can be substantially improved by enabling a more focused iteration-level cooperation mechanism between the core graybox components of the original serial <span>Drils</span> algorithm. Extensive experiments are conducted in order to provide a systematic analysis of the impact of the designed parallel algorithms on search behavior, and their ability to compute high-quality solutions using increasing number of CPU-cores. Results using up to 1024×12-cores NUMA nodes, and NK-landscapes with up to <span><math><mn>10</mn><mo>,</mo><mn>000</mn></math></span> binary variables are reported, providing evidence on the relative strength of the designed hybrid cooperative graybox parallel search.</p></div>\",\"PeriodicalId\":54775,\"journal\":{\"name\":\"Journal of Parallel and Distributed Computing\",\"volume\":\"191 \",\"pages\":\"Article 104921\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2024-05-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Parallel and Distributed Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0743731524000856\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Parallel and Distributed Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0743731524000856","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
摘要
我们设计、开发并分析了一种最先进的灰盒优化算法的并行变体,即 Drils(确定性重组和迭代局部搜索),用于在超级计算机富加库提供的大规模计算设施之上解决大规模伪布尔优化问题。我们首先采用了 Master/Worker 设计和基于岛的完全分布式模型,最终得到了一些高级并行 Drils 版本的 OpenMP/MPI 混合实现。我们的研究表明,这种设计虽然有效,但可以通过在原始串行 Drils 算法的核心灰盒组件之间建立更集中的迭代级合作机制来大幅改进。我们进行了广泛的实验,以便系统分析所设计的并行算法对搜索行为的影响,以及使用越来越多的 CPU 核心计算高质量解决方案的能力。报告了使用多达 1024×12 核 NUMA 节点和多达 10,000 个二进制变量的 NK-landscapes 的结果,为所设计的混合合作灰箱并行搜索的相对优势提供了证据。
Large-scale and cooperative graybox parallel optimization on the supercomputer Fugaku
We design, develop and analyze parallel variants of a state-of-the-art graybox optimization algorithm, namely Drils (Deterministic Recombination and Iterated Local Search), for attacking large-scale pseudo-boolean optimization problems on top of the large-scale computing facilities offered by the supercomputer Fugaku. We first adopt a Master/Worker design coupled with a fully distributed Island-based model, ending up with a number of hybrid OpenMP/MPI implementations of high-level parallel Drils versions. We show that such a design, although effective, can be substantially improved by enabling a more focused iteration-level cooperation mechanism between the core graybox components of the original serial Drils algorithm. Extensive experiments are conducted in order to provide a systematic analysis of the impact of the designed parallel algorithms on search behavior, and their ability to compute high-quality solutions using increasing number of CPU-cores. Results using up to 1024×12-cores NUMA nodes, and NK-landscapes with up to binary variables are reported, providing evidence on the relative strength of the designed hybrid cooperative graybox parallel search.
期刊介绍:
This international journal is directed to researchers, engineers, educators, managers, programmers, and users of computers who have particular interests in parallel processing and/or distributed computing.
The Journal of Parallel and Distributed Computing publishes original research papers and timely review articles on the theory, design, evaluation, and use of parallel and/or distributed computing systems. The journal also features special issues on these topics; again covering the full range from the design to the use of our targeted systems.