在新的 Sunway 架构上重新设计弹性全波形反演

Mengyuan Hua, Wubing Wan, Zhaoqi Sun, Zekun Yin, Puyu Xiong, Xiaohui Liu, Haodong Tian, Ping Gao, Weiguo Liu, Hua Wang, Wenlai Zhao, Zhenchun Huang
{"title":"在新的 Sunway 架构上重新设计弹性全波形反演","authors":"Mengyuan Hua, Wubing Wan, Zhaoqi Sun, Zekun Yin, Puyu Xiong, Xiaohui Liu, Haodong Tian, Ping Gao, Weiguo Liu, Hua Wang, Wenlai Zhao, Zhenchun Huang","doi":"10.1002/eng2.12819","DOIUrl":null,"url":null,"abstract":"IFOS3D is a three‐dimensional elastic full‐waveform inversion (EFWI) tool designed for high‐resolution estimation of the Earth's material properties within 3D subsurface structures. However, due to the significant computational costs associated with 3D EFWI, leveraging the computing power of a supercomputer for implementation is a logical choice. In this article, we introduce several innovative process‐level and thread‐level optimizations based on heterogeneous many‐core architectures in the new Sunway supercomputer, which is a powerful system globally. These optimizations encompass a process‐level communication overlapping strategy, thread‐level data partitioning and layout approaches, a remote memory access optimized master‐slave communication scheme, and a thread‐level data reuse and overlapping strategy. Through these optimizations, we achieve significant improvements in each iteration, with a kernel function speedup of approximately 59 and an overall program speedup of about 14. Our findings demonstrate the ability of our proposed optimization strategies to overcome the computational challenges associated with 3D EFWI, providing a promising framework for future advancements in the field of subsurface imaging.","PeriodicalId":502604,"journal":{"name":"Engineering Reports","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Redesigning elastic full‐waveform inversion on the new Sunway architecture\",\"authors\":\"Mengyuan Hua, Wubing Wan, Zhaoqi Sun, Zekun Yin, Puyu Xiong, Xiaohui Liu, Haodong Tian, Ping Gao, Weiguo Liu, Hua Wang, Wenlai Zhao, Zhenchun Huang\",\"doi\":\"10.1002/eng2.12819\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"IFOS3D is a three‐dimensional elastic full‐waveform inversion (EFWI) tool designed for high‐resolution estimation of the Earth's material properties within 3D subsurface structures. However, due to the significant computational costs associated with 3D EFWI, leveraging the computing power of a supercomputer for implementation is a logical choice. In this article, we introduce several innovative process‐level and thread‐level optimizations based on heterogeneous many‐core architectures in the new Sunway supercomputer, which is a powerful system globally. These optimizations encompass a process‐level communication overlapping strategy, thread‐level data partitioning and layout approaches, a remote memory access optimized master‐slave communication scheme, and a thread‐level data reuse and overlapping strategy. Through these optimizations, we achieve significant improvements in each iteration, with a kernel function speedup of approximately 59 and an overall program speedup of about 14. Our findings demonstrate the ability of our proposed optimization strategies to overcome the computational challenges associated with 3D EFWI, providing a promising framework for future advancements in the field of subsurface imaging.\",\"PeriodicalId\":502604,\"journal\":{\"name\":\"Engineering Reports\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-11-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Reports\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/eng2.12819\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Reports","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/eng2.12819","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

IFOS3D 是一种三维弹性全波形反演(EFWI)工具,旨在对三维地下结构中的地球材料属性进行高分辨率估算。然而,由于三维弹性全波形反演需要大量计算成本,因此利用超级计算机的计算能力来实施是一个合理的选择。在本文中,我们介绍了基于新型 Sunway 超级计算机的异构多核架构的若干创新性进程级和线程级优化。这些优化包括进程级通信重叠策略、线程级数据分区和布局方法、远程内存访问优化的主从通信方案以及线程级数据重用和重叠策略。通过这些优化,我们在每次迭代中都取得了显著改进,内核函数速度提高了约 59 倍,程序整体速度提高了约 14 倍。我们的研究结果表明,我们提出的优化策略有能力克服与 3D EFWI 相关的计算挑战,为地下成像领域的未来发展提供了一个前景广阔的框架。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Redesigning elastic full‐waveform inversion on the new Sunway architecture
IFOS3D is a three‐dimensional elastic full‐waveform inversion (EFWI) tool designed for high‐resolution estimation of the Earth's material properties within 3D subsurface structures. However, due to the significant computational costs associated with 3D EFWI, leveraging the computing power of a supercomputer for implementation is a logical choice. In this article, we introduce several innovative process‐level and thread‐level optimizations based on heterogeneous many‐core architectures in the new Sunway supercomputer, which is a powerful system globally. These optimizations encompass a process‐level communication overlapping strategy, thread‐level data partitioning and layout approaches, a remote memory access optimized master‐slave communication scheme, and a thread‐level data reuse and overlapping strategy. Through these optimizations, we achieve significant improvements in each iteration, with a kernel function speedup of approximately 59 and an overall program speedup of about 14. Our findings demonstrate the ability of our proposed optimization strategies to overcome the computational challenges associated with 3D EFWI, providing a promising framework for future advancements in the field of subsurface imaging.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Conventional and artificial intelligence based maximum power point tracking techniques for efficient solar power generation Optimal path calculation method of optical network under complex constraints A method for detecting navigable areas in narrow rivers under complex reflection conditions A task‐centric knowledge graph construction method based on multi‐modal representation learning for industrial maintenance automation Multi‐objective assessment of the water‐energy‐environment‐food nexus involving a life cycle assessment approach
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1