{"title":"在新的 Sunway 架构上重新设计弹性全波形反演","authors":"Mengyuan Hua, Wubing Wan, Zhaoqi Sun, Zekun Yin, Puyu Xiong, Xiaohui Liu, Haodong Tian, Ping Gao, Weiguo Liu, Hua Wang, Wenlai Zhao, Zhenchun Huang","doi":"10.1002/eng2.12819","DOIUrl":null,"url":null,"abstract":"IFOS3D is a three‐dimensional elastic full‐waveform inversion (EFWI) tool designed for high‐resolution estimation of the Earth's material properties within 3D subsurface structures. However, due to the significant computational costs associated with 3D EFWI, leveraging the computing power of a supercomputer for implementation is a logical choice. In this article, we introduce several innovative process‐level and thread‐level optimizations based on heterogeneous many‐core architectures in the new Sunway supercomputer, which is a powerful system globally. These optimizations encompass a process‐level communication overlapping strategy, thread‐level data partitioning and layout approaches, a remote memory access optimized master‐slave communication scheme, and a thread‐level data reuse and overlapping strategy. Through these optimizations, we achieve significant improvements in each iteration, with a kernel function speedup of approximately 59 and an overall program speedup of about 14. Our findings demonstrate the ability of our proposed optimization strategies to overcome the computational challenges associated with 3D EFWI, providing a promising framework for future advancements in the field of subsurface imaging.","PeriodicalId":502604,"journal":{"name":"Engineering Reports","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Redesigning elastic full‐waveform inversion on the new Sunway architecture\",\"authors\":\"Mengyuan Hua, Wubing Wan, Zhaoqi Sun, Zekun Yin, Puyu Xiong, Xiaohui Liu, Haodong Tian, Ping Gao, Weiguo Liu, Hua Wang, Wenlai Zhao, Zhenchun Huang\",\"doi\":\"10.1002/eng2.12819\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"IFOS3D is a three‐dimensional elastic full‐waveform inversion (EFWI) tool designed for high‐resolution estimation of the Earth's material properties within 3D subsurface structures. However, due to the significant computational costs associated with 3D EFWI, leveraging the computing power of a supercomputer for implementation is a logical choice. In this article, we introduce several innovative process‐level and thread‐level optimizations based on heterogeneous many‐core architectures in the new Sunway supercomputer, which is a powerful system globally. These optimizations encompass a process‐level communication overlapping strategy, thread‐level data partitioning and layout approaches, a remote memory access optimized master‐slave communication scheme, and a thread‐level data reuse and overlapping strategy. Through these optimizations, we achieve significant improvements in each iteration, with a kernel function speedup of approximately 59 and an overall program speedup of about 14. Our findings demonstrate the ability of our proposed optimization strategies to overcome the computational challenges associated with 3D EFWI, providing a promising framework for future advancements in the field of subsurface imaging.\",\"PeriodicalId\":502604,\"journal\":{\"name\":\"Engineering Reports\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-11-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Reports\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/eng2.12819\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Reports","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/eng2.12819","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Redesigning elastic full‐waveform inversion on the new Sunway architecture
IFOS3D is a three‐dimensional elastic full‐waveform inversion (EFWI) tool designed for high‐resolution estimation of the Earth's material properties within 3D subsurface structures. However, due to the significant computational costs associated with 3D EFWI, leveraging the computing power of a supercomputer for implementation is a logical choice. In this article, we introduce several innovative process‐level and thread‐level optimizations based on heterogeneous many‐core architectures in the new Sunway supercomputer, which is a powerful system globally. These optimizations encompass a process‐level communication overlapping strategy, thread‐level data partitioning and layout approaches, a remote memory access optimized master‐slave communication scheme, and a thread‐level data reuse and overlapping strategy. Through these optimizations, we achieve significant improvements in each iteration, with a kernel function speedup of approximately 59 and an overall program speedup of about 14. Our findings demonstrate the ability of our proposed optimization strategies to overcome the computational challenges associated with 3D EFWI, providing a promising framework for future advancements in the field of subsurface imaging.