Pub Date : 2024-08-14DOI: 10.1016/j.cpc.2024.109346
I.S. Gordeev , A.N. Bugay
A new type of a galactic cosmic rays (GCR) simulator, provided at the JINR Laboratory of Radiation Biology, is potentially capable of generating a complex radiation field with inclusions of a variety of ions with a wide energy range and with required abundance at the charged particle accelerators. This complex multicomponent radiation field simulates radiation environment inside a spacecraft during an interplanetary flight, for example, to Mars. The article provides an analytical description of the GCR simulator as well as a description of a specially developed software that enables selection of necessary parameters of a simulator model for creating relevant mixed radiation conditions. The software implements processing of data obtained with Monte Carlo-based FLUKA and PHITS programs, fitting and optimization of model parameters as well as data visualization tools.
{"title":"Computer modeling of a new type galactic cosmic rays simulator","authors":"I.S. Gordeev , A.N. Bugay","doi":"10.1016/j.cpc.2024.109346","DOIUrl":"10.1016/j.cpc.2024.109346","url":null,"abstract":"<div><p>A new type of a galactic cosmic rays (GCR) simulator, provided at the JINR Laboratory of Radiation Biology, is potentially capable of generating a complex radiation field with inclusions of a variety of ions with a wide energy range and with required abundance at the charged particle accelerators. This complex multicomponent radiation field simulates radiation environment inside a spacecraft during an interplanetary flight, for example, to Mars. The article provides an analytical description of the GCR simulator as well as a description of a specially developed software that enables selection of necessary parameters of a simulator model for creating relevant mixed radiation conditions. The software implements processing of data obtained with Monte Carlo-based FLUKA and PHITS programs, fitting and optimization of model parameters as well as data visualization tools.</p></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"305 ","pages":"Article 109346"},"PeriodicalIF":7.2,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142006817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-14DOI: 10.1016/j.cpc.2024.109349
Min Choi , Mahmut Sait Okyay , Adrian Perez Dieguez , Mauro Del Ben , Khaled Z. Ibrahim , Bryan M. Wong
We present a new software module, QRCODE (Quantum Research for Calculating Optically Driven Excitations), for massively parallelized real-time time-dependent density functional theory (RT-TDDFT) calculations of periodic systems in the open-source Qbox software package. Our approach utilizes a custom implementation of a fast Fourier transformation scheme that significantly reduces inter-node message passing interface (MPI) communication of the major computational kernel and shows impressive scaling up to 16,344 CPU cores. In addition to improving computational performance, QRCODE contains a suite of various time propagators for accurate RT-TDDFT calculations. As benchmark applications of QRCODE, we calculate the current density and optical absorption spectra of hexagonal boron nitride (h-BN) and photo-driven reaction dynamics of the ozone-oxygen reaction. We also calculate the second and higher harmonic generation of monolayer and multi-layer boron nitride structures as examples of large material systems. Our optimized implementation of RT-TDDFT in QRCODE enables large-scale calculations of real-time electron dynamics of chemical and material systems with enhanced computational performance and impressive scaling across several thousand CPU cores.
我们介绍了一种新的软件模块 QRCODE(计算光驱动激发的量子研究),用于在开源 Qbox 软件包中对周期系统进行大规模并行化实时时变密度泛函理论(RT-TDDFT)计算。我们的方法利用了快速傅立叶变换方案的定制实现,大大减少了主要计算内核的节点间消息传递接口(MPI)通信,并显示了高达 16,344 个 CPU 内核的惊人扩展能力。除了提高计算性能,QRCODE 还包含一套用于精确 RT-TDDFT 计算的各种时间传播器。作为 QRCODE 的基准应用,我们计算了六方氮化硼(h-BN)的电流密度和光学吸收光谱,以及臭氧-氧气反应的光驱动反应动力学。我们还以大型材料系统为例,计算了单层和多层氮化硼结构的二次谐波和高次谐波生成。我们在 QRCODE 中优化了 RT-TDDFT 的实现,从而能够大规模计算化学和材料系统的实时电子动力学,提高了计算性能,并在数千个 CPU 内核上实现了令人印象深刻的扩展。
{"title":"QRCODE: Massively parallelized real-time time-dependent density functional theory for periodic systems","authors":"Min Choi , Mahmut Sait Okyay , Adrian Perez Dieguez , Mauro Del Ben , Khaled Z. Ibrahim , Bryan M. Wong","doi":"10.1016/j.cpc.2024.109349","DOIUrl":"10.1016/j.cpc.2024.109349","url":null,"abstract":"<div><p>We present a new software module, QRCODE (Quantum Research for Calculating Optically Driven Excitations), for massively parallelized real-time time-dependent density functional theory (RT-TDDFT) calculations of periodic systems in the open-source Qbox software package. Our approach utilizes a custom implementation of a fast Fourier transformation scheme that significantly reduces inter-node message passing interface (MPI) communication of the major computational kernel and shows impressive scaling up to 16,344 CPU cores. In addition to improving computational performance, QRCODE contains a suite of various time propagators for accurate RT-TDDFT calculations. As benchmark applications of QRCODE, we calculate the current density and optical absorption spectra of hexagonal boron nitride (h-BN) and photo-driven reaction dynamics of the ozone-oxygen reaction. We also calculate the second and higher harmonic generation of monolayer and multi-layer boron nitride structures as examples of large material systems. Our optimized implementation of RT-TDDFT in QRCODE enables large-scale calculations of real-time electron dynamics of chemical and material systems with enhanced computational performance and impressive scaling across several thousand CPU cores.</p></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"305 ","pages":"Article 109349"},"PeriodicalIF":7.2,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0010465524002728/pdfft?md5=a5f7076a1b55e6e220016f622443470d&pid=1-s2.0-S0010465524002728-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142098480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-12DOI: 10.1016/j.cpc.2024.109345
A. Fierro , A. Alibalazadeh , J. Stephens , C. Moore
A highly parallelizable fluid plasma simulation tool based upon the first-order drift-diffusion equations is discussed. Atmospheric pressure plasmas have densities and gradients that require small element sizes in order to accurately simulate the plasm resulting in computational meshes on the order of millions to tens of millions of elements for realistic size plasma reactors. To enable simulations of this nature, parallel computing is required and must be optimized for the particular problem. Here, a finite-volume, electrostatic drift-diffusion implementation for low-temperature plasma is discussed. The implementation is built upon the Message Passing Interface (MPI) library in C++ using Object Oriented Programming. The underlying numerical method is outlined in detail and benchmarked against simple streamer formation from other streamer codes. Electron densities, electric field, and propagation speeds are compared with the reference case and show good agreement. Convergence studies are also performed showing a minimal space step of approximately 4 μm required to reduce relative error to below 1% during early streamer simulation times and even finer space steps are required for longer times. Additionally, strong and weak scaling of the implementation are studied and demonstrate the excellent performance behavior of the implementation up to 100 million elements on 1024 processors. Finally, different advection schemes are compared for the simple streamer problem to analyze the influence of numerical diffusion on the resulting quantities of interest.
{"title":"Massively parallel axisymmetric fluid model for streamer discharges","authors":"A. Fierro , A. Alibalazadeh , J. Stephens , C. Moore","doi":"10.1016/j.cpc.2024.109345","DOIUrl":"10.1016/j.cpc.2024.109345","url":null,"abstract":"<div><p>A highly parallelizable fluid plasma simulation tool based upon the first-order drift-diffusion equations is discussed. Atmospheric pressure plasmas have densities and gradients that require small element sizes in order to accurately simulate the plasm resulting in computational meshes on the order of millions to tens of millions of elements for realistic size plasma reactors. To enable simulations of this nature, parallel computing is required and must be optimized for the particular problem. Here, a finite-volume, electrostatic drift-diffusion implementation for low-temperature plasma is discussed. The implementation is built upon the Message Passing Interface (MPI) library in C++ using Object Oriented Programming. The underlying numerical method is outlined in detail and benchmarked against simple streamer formation from other streamer codes. Electron densities, electric field, and propagation speeds are compared with the reference case and show good agreement. Convergence studies are also performed showing a minimal space step of approximately 4 μm required to reduce relative error to below 1% during early streamer simulation times and even finer space steps are required for longer times. Additionally, strong and weak scaling of the implementation are studied and demonstrate the excellent performance behavior of the implementation up to 100 million elements on 1024 processors. Finally, different advection schemes are compared for the simple streamer problem to analyze the influence of numerical diffusion on the resulting quantities of interest.</p></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"305 ","pages":"Article 109345"},"PeriodicalIF":7.2,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142006816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-10DOI: 10.1016/j.cpc.2024.109341
Niko Oinonen , Aliaksandr V. Yakutovich , Aurelio Gallardo , Martin Ondráček , Prokop Hapala , Ondřej Krejčí
The Probe-Particle Model combine theories designed for the simulation of scanning probe microscopy experiments, employing non-reactive, flexible tip apices to achieve sub-molecular resolution. In the article we present the latest version of the Probe-Particle Model implemented in the open-source ppafm package, highlighting substantial advancements in accuracy, computational performance, and user-friendliness. To demonstrate this we provide a comprehensive review of approaches for simulating non-contact Atomic Force Microscopy. They vary in complexity from simple Lennard-Jones potential to the latest full density-based model. We compared those approaches with ab initio calculated references, showcasing their respective merits. All parts of the ppafm package have undergone acceleration by 1-2 orders of magnitude using OpenMP and OpenCL technologies. The updated package includes an interactive graphical user interface and seamless integration into the Python ecosystem via pip, facilitating advanced scripting and interoperability with other software. This adaptability positions ppafm as an ideal tool for high-throughput applications, including the training of machine learning models for the automatic recovery of atomic structures from nc-AFM measurements. We envision significant potential for this application in future single-molecule analysis, synthesis, and advancements in surface science in general. Additionally, we discuss simulations of other sub-molecular scanning-probe imaging techniques, such as bond-resolved scanning tunneling microscopy and kelvin probe force microscopy, all built on the robust foundation of the Probe-Particle Model. Altogether this demonstrates the broad impact of the model across diverse domains of on-surface science and molecular chemistry.
{"title":"Advancing scanning probe microscopy simulations: A decade of development in probe-particle models","authors":"Niko Oinonen , Aliaksandr V. Yakutovich , Aurelio Gallardo , Martin Ondráček , Prokop Hapala , Ondřej Krejčí","doi":"10.1016/j.cpc.2024.109341","DOIUrl":"10.1016/j.cpc.2024.109341","url":null,"abstract":"<div><p>The Probe-Particle Model combine theories designed for the simulation of scanning probe microscopy experiments, employing non-reactive, flexible tip apices to achieve <em>sub-molecular resolution</em>. In the article we present the latest version of the Probe-Particle Model implemented in the open-source <span>ppafm</span> package, highlighting substantial advancements in accuracy, computational performance, and user-friendliness. To demonstrate this we provide a comprehensive review of approaches for simulating non-contact Atomic Force Microscopy. They vary in complexity from simple Lennard-Jones potential to the latest <em>full density-based model</em>. We compared those approaches with <em>ab initio</em> calculated references, showcasing their respective merits. All parts of the <span>ppafm</span> package have undergone acceleration by 1-2 orders of magnitude using OpenMP and OpenCL technologies. The updated package includes an interactive graphical user interface and seamless integration into the Python ecosystem via <span>pip</span>, facilitating advanced scripting and interoperability with other software. This adaptability positions <span>ppafm</span> as an ideal tool for high-throughput applications, including the training of machine learning models for the automatic recovery of atomic structures from nc-AFM measurements. We envision significant potential for this application in future single-molecule analysis, synthesis, and advancements in surface science in general. Additionally, we discuss simulations of other <em>sub-molecular</em> scanning-probe imaging techniques, such as bond-resolved scanning tunneling microscopy and kelvin probe force microscopy, all built on the robust foundation of the Probe-Particle Model. Altogether this demonstrates the broad impact of the model across diverse domains of on-surface science and molecular chemistry.</p></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"305 ","pages":"Article 109341"},"PeriodicalIF":7.2,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0010465524002649/pdfft?md5=3c013c21ac97945bc843aef1b4dd416b&pid=1-s2.0-S0010465524002649-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141979431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-08DOI: 10.1016/j.cpc.2024.109343
Yan Wang , Xufeng Xiao , Hong Zhang , Xu Qian , Songhe Song
The numerical simulation of diblock copolymers under hydrodynamic action in complex domains is of great significance in academic research and industrial applications. The purpose of this study is to establish a fast, stable, and easily implementable numerical simulation framework for them. A hydrodynamically coupled diblock copolymer phase field model is considered, which includes a conserved Allen-Cahn-Ohta-Kawasaki type equation and an incompressible Navier-Stokes equation. However, rapid numerical simulation of the model in complex domains faces significant challenges, including discretization of complex boundaries, huge computational costs of three-dimensional (3D) problems, strong nonlinear coupling between multiple equations, and preserving the volume conservation properties. To overcome the above difficulties, a new modified model that can be computed in the regular domain is established by diffusion domain (DD) method, avoiding numerical discretization of complex boundaries. Then, we develop a stabilized second-order dimension splitting (DS) technique for the modified model. This approach effectively decomposes 2D or 3D problems into 1D sub-problems in different directions, significantly improving the computation efficiency. For spatial discretization, the central difference scheme is applied on mark and cell (MAC) grid, and the discrete volume conservation is ensured by proper processing. Finally, the efficacy of the modified model and numerical scheme is verified through numerical experiments. A series of numerical simulations are performed to investigate the effects of complex domains and fluid dynamics on the evolution of diblock copolymers.
{"title":"Efficient diffusion domain modeling and fast numerical methods for diblock copolymer melt in complex domains","authors":"Yan Wang , Xufeng Xiao , Hong Zhang , Xu Qian , Songhe Song","doi":"10.1016/j.cpc.2024.109343","DOIUrl":"10.1016/j.cpc.2024.109343","url":null,"abstract":"<div><p>The numerical simulation of diblock copolymers under hydrodynamic action in complex domains is of great significance in academic research and industrial applications. The purpose of this study is to establish a fast, stable, and easily implementable numerical simulation framework for them. A hydrodynamically coupled diblock copolymer phase field model is considered, which includes a conserved Allen-Cahn-Ohta-Kawasaki type equation and an incompressible Navier-Stokes equation. However, rapid numerical simulation of the model in complex domains faces significant challenges, including discretization of complex boundaries, huge computational costs of three-dimensional (3D) problems, strong nonlinear coupling between multiple equations, and preserving the volume conservation properties. To overcome the above difficulties, a new modified model that can be computed in the regular domain is established by diffusion domain (DD) method, avoiding numerical discretization of complex boundaries. Then, we develop a stabilized second-order dimension splitting (DS) technique for the modified model. This approach effectively decomposes 2D or 3D problems into 1D sub-problems in different directions, significantly improving the computation efficiency. For spatial discretization, the central difference scheme is applied on mark and cell (MAC) grid, and the discrete volume conservation is ensured by proper processing. Finally, the efficacy of the modified model and numerical scheme is verified through numerical experiments. A series of numerical simulations are performed to investigate the effects of complex domains and fluid dynamics on the evolution of diblock copolymers.</p></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"305 ","pages":"Article 109343"},"PeriodicalIF":7.2,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141984589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-08DOI: 10.1016/j.cpc.2024.109342
Sheng-Jer Chen, Hsiu-Yu Yu
The phase-field model is a prominent mesoscopic computational framework for predicting diverse phase change processes. Recent advancements in machine learning algorithms offer the potential to accelerate simulations by data-driven dimensionality reduction techniques. Here, we detail our development of a multivariate spatiotemporal predicting network, termed the linearized Motion-Aware Unit (L-MAU), to predict phase-field microstructures at reduced dimensions precisely. We employ the numerical Cahn-Hilliard equation incorporating the Flory-Huggins free energy function and concentration-dependent mobility to generate training and validation data. This comprehensive dataset encompasses slow- and fast-coarsening systems exhibiting droplet-like and bicontinuous patterns. To address computational complexity, we propose three dimensionality reduction pipelines: (I) two-point correlation function (TPCF) with principal component analysis (PCA), (II) low-compression autoencoder (LCA) with PCA, and (III) high-compression autoencoder (HCA). Following the steps of transformation, prediction, and reconstruction, we rigorously evaluate the results using statistical descriptors, including the average TPCF, structure factor, domain growth, and the structural similarity index measure (SSIM), to ensure the fidelity of machine predictions. A comparative analysis reveals that the dual-stage LCA approach with 300 principal components delivers optimal outcomes with accurate evolution dynamics and reconstructed morphologies. Moreover, incorporating the physical mass-conservation constraint into this dual-stage configuration (designated as C-LCA) produces more coherent and compact low-dimensional representations, further enhancing spatiotemporal feature predictions. This novel dimensionality reduction approach enables high-fidelity predictions of phase-field evolutions with controllable errors, and the final recovered microstructures may improve numerical integration robustly to achieve desired later-stage phase separation morphologies.
{"title":"L-MAU: A multivariate time-series network for predicting the Cahn-Hilliard microstructure evolutions via low-dimensional approaches","authors":"Sheng-Jer Chen, Hsiu-Yu Yu","doi":"10.1016/j.cpc.2024.109342","DOIUrl":"10.1016/j.cpc.2024.109342","url":null,"abstract":"<div><p>The phase-field model is a prominent mesoscopic computational framework for predicting diverse phase change processes. Recent advancements in machine learning algorithms offer the potential to accelerate simulations by data-driven dimensionality reduction techniques. Here, we detail our development of a multivariate spatiotemporal predicting network, termed the linearized Motion-Aware Unit (L-MAU), to predict phase-field microstructures at reduced dimensions precisely. We employ the numerical Cahn-Hilliard equation incorporating the Flory-Huggins free energy function and concentration-dependent mobility to generate training and validation data. This comprehensive dataset encompasses slow- and fast-coarsening systems exhibiting droplet-like and bicontinuous patterns. To address computational complexity, we propose three dimensionality reduction pipelines: (I) two-point correlation function (TPCF) with principal component analysis (PCA), (II) low-compression autoencoder (LCA) with PCA, and (III) high-compression autoencoder (HCA). Following the steps of transformation, prediction, and reconstruction, we rigorously evaluate the results using statistical descriptors, including the average TPCF, structure factor, domain growth, and the structural similarity index measure (SSIM), to ensure the fidelity of machine predictions. A comparative analysis reveals that the dual-stage LCA approach with 300 principal components delivers optimal outcomes with accurate evolution dynamics and reconstructed morphologies. Moreover, incorporating the physical mass-conservation constraint into this dual-stage configuration (designated as C-LCA) produces more coherent and compact low-dimensional representations, further enhancing spatiotemporal feature predictions. This novel dimensionality reduction approach enables high-fidelity predictions of phase-field evolutions with controllable errors, and the final recovered microstructures may improve numerical integration robustly to achieve desired later-stage phase separation morphologies.</p></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"305 ","pages":"Article 109342"},"PeriodicalIF":7.2,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142083135","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-08DOI: 10.1016/j.cpc.2024.109344
Aditya Vasudevan , Jorge Zorrilla Prieto , Sergei Zorkaltsev , Maciej Haranczyk
Local geometrical features of a porous material such as the shape and size of a pore or the curvature of a solid ligament do often affect the macroscopic properties of the material, and their characterization is necessary to fully understand the structure-property relationships. In this contribution, we present an approach to automatically segment large porous structures into such local features. Our work takes inspiration from techniques available in Topological Data Analysis (TDA). In particular, using Morse theory, we generate Morse-Smale Complexes of our structures that segment the structure, and/or its porosity into individual features that can then be compared. We develop a tool written in C++ that is built on the topology toolkit (TTK) library, an open source platform for the topological analysis of scalar data, with which we can perform segmentation of these structures. Our tool takes a volumetric grid representation as an input, which can be generated from atomistic or mesh structure models and any function defined on such grid, e.g. the distance to the surface or the interaction energy with a probe. We demonstrate the applicability of the tool by two examples related with analysis of porosity in zeolite materials as well as analysis of ligaments in a porous metal structure. Specifically, by segmenting the pores in the structure we demonstrate some applications to zeolites such as assessing pore-similarity between structures or evaluating the accessible volume to a target molecule such as methane that can be adsorbed to its surface. Moreover, once the Morse-Smale complexes are generated, we can construct graph representations of the void space, replacing the entire pore structure by a simply connected graph. Similarly, the same tool is used to segment and generates graphs representing the solid structure and we show how they can be used to correlate structure and mechanical properties of the material. The code is published as open-source and can be accessed here: https://github.com/AMDatIMDEA/tda-segmentor
多孔材料的局部几何特征(如孔隙的形状和大小或固体韧带的曲率)通常会影响材料的宏观特性,因此,要充分了解结构与特性之间的关系,就必须对这些特征进行表征。在本文中,我们提出了一种将大型多孔结构自动分割为此类局部特征的方法。我们的工作从拓扑数据分析(TDA)技术中获得灵感。特别是,利用莫尔斯理论,我们生成了结构的莫尔斯-尺度复合物,将结构和/或其孔隙率分割成可以比较的单个特征。我们开发了一种用 C++ 编写的工具,该工具基于拓扑工具包(TTK)库,这是一个用于标量数据拓扑分析的开源平台,我们可以利用它对这些结构进行分割。我们的工具将体积网格表示法作为输入,该表示法可由原子或网格结构模型以及定义在此类网格上的任何函数(如到表面的距离或与探针的相互作用能量)生成。我们通过分析沸石材料中的孔隙率以及分析多孔金属结构中的韧带这两个实例来展示该工具的适用性。具体来说,通过分割结构中的孔隙,我们展示了沸石的一些应用,如评估结构之间的孔隙相似性或评估目标分子(如可吸附在其表面的甲烷)的可及体积。此外,一旦生成莫尔斯-斯马尔复合体,我们就可以构建空隙空间的图示,用简单连接的图取代整个孔隙结构。同样,同样的工具也可用于分割和生成表示固体结构的图形,我们将展示如何利用它们来关联材料的结构和机械特性。代码以开源形式发布,可在此处访问: https://github.com/AMDatIMDEA/tda-segmentor
{"title":"tda-segmentor: A tool to extract and analyze local structure and porosity features in porous materials","authors":"Aditya Vasudevan , Jorge Zorrilla Prieto , Sergei Zorkaltsev , Maciej Haranczyk","doi":"10.1016/j.cpc.2024.109344","DOIUrl":"10.1016/j.cpc.2024.109344","url":null,"abstract":"<div><p>Local geometrical features of a porous material such as the shape and size of a pore or the curvature of a solid ligament do often affect the macroscopic properties of the material, and their characterization is necessary to fully understand the structure-property relationships. In this contribution, we present an approach to automatically segment large porous structures into such local features. Our work takes inspiration from techniques available in Topological Data Analysis (TDA). In particular, using Morse theory, we generate Morse-Smale Complexes of our structures that segment the structure, and/or its porosity into individual features that can then be compared. We develop a tool written in C<span>++</span> that is built on the topology toolkit (TTK) library, an open source platform for the topological analysis of scalar data, with which we can perform segmentation of these structures. Our tool takes a volumetric grid representation as an input, which can be generated from atomistic or mesh structure models and any function defined on such grid, e.g. the distance to the surface or the interaction energy with a probe. We demonstrate the applicability of the tool by two examples related with analysis of porosity in zeolite materials as well as analysis of ligaments in a porous metal structure. Specifically, by segmenting the pores in the structure we demonstrate some applications to zeolites such as assessing pore-similarity between structures or evaluating the accessible volume to a target molecule such as methane that can be adsorbed to its surface. Moreover, once the Morse-Smale complexes are generated, we can construct graph representations of the void space, replacing the entire pore structure by a simply connected graph. Similarly, the same tool is used to segment and generates graphs representing the solid structure and we show how they can be used to correlate structure and mechanical properties of the material. The code is published as open-source and can be accessed here: <span><span>https://github.com/AMDatIMDEA/tda-segmentor</span><svg><path></path></svg></span></p></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"305 ","pages":"Article 109344"},"PeriodicalIF":7.2,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142077043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-08DOI: 10.1016/j.cpc.2024.109333
Jiatao Zhang , Xiaohu Guo , Xiufang Feng , Li Zhu , Xiaolu Su
This paper presents a new preprocessing algorithm to generate accurate initial conditions for particle-method-based CFD simulations with complex geometries. The algorithm is based on the improved Marching Cubes method (MC) with the newly proposed isosurface particle redistribution optimisation. It can not only produce topologically accurate isosurfaces and boundary particles that encompass the entire boundary surface but also offers a seamless method for evenly distributing internal fluid particles, eliminating the necessity for additional fluid field reconstruction algorithms. To address the issue of particle clustering on the surface boundary caused by MC intersection with sharp corners in complex geometries, we have introduced an iterative particle-moving algorithm. This algorithm aims to both achieve a uniform distribution of boundary particles across the surface and to recompute their normal vectors due to particles movement. In introducing our newly developed preprocessing algorithm, we have taken the initiative to systematically elucidate the entire process of generating boundary particles on complex surfaces using optimization theory, marking a pioneering effort in this regard. The developed particle preprocessing optimization techniques can use inputs from both the volume image data format from MRI/CT and standard CAD files, such as STL models. We have used various test cases with standard CAD geometries and complex real-world application geometries to validate and test the algorithms. The results demonstrate the impressive ability of our preprocessing toolkit1 to handle real complex geometries, along with the robustness and efficiency of the newly developed algorithms.
{"title":"A complex geometry isosurface reconstruction algorithm for particle based CFD simulations","authors":"Jiatao Zhang , Xiaohu Guo , Xiufang Feng , Li Zhu , Xiaolu Su","doi":"10.1016/j.cpc.2024.109333","DOIUrl":"10.1016/j.cpc.2024.109333","url":null,"abstract":"<div><p>This paper presents a new preprocessing algorithm to generate accurate initial conditions for particle-method-based CFD simulations with complex geometries. The algorithm is based on the improved Marching Cubes method (MC) with the newly proposed isosurface particle redistribution optimisation. It can not only produce topologically accurate isosurfaces and boundary particles that encompass the entire boundary surface but also offers a seamless method for evenly distributing internal fluid particles, eliminating the necessity for additional fluid field reconstruction algorithms. To address the issue of particle clustering on the surface boundary caused by MC intersection with sharp corners in complex geometries, we have introduced an iterative particle-moving algorithm. This algorithm aims to both achieve a uniform distribution of boundary particles across the surface and to recompute their normal vectors due to particles movement. In introducing our newly developed preprocessing algorithm, we have taken the initiative to systematically elucidate the entire process of generating boundary particles on complex surfaces using optimization theory, marking a pioneering effort in this regard. The developed particle preprocessing optimization techniques can use inputs from both the volume image data format from MRI/CT and standard CAD files, such as STL models. We have used various test cases with standard CAD geometries and complex real-world application geometries to validate and test the algorithms. The results demonstrate the impressive ability of our preprocessing toolkit<span><span><sup>1</sup></span></span> to handle real complex geometries, along with the robustness and efficiency of the newly developed algorithms.</p></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"305 ","pages":"Article 109333"},"PeriodicalIF":7.2,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142013032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-05DOI: 10.1016/j.cpc.2024.109332
Semih Akkurt , Freddie Witherden , Peter Vincent
In this article, cache blocking is implemented for the Navier Stokes equations with anti-aliasing support on mixed grids in PyFR for CPUs. In particular, cache blocking is used as an alternative to kernel fusion to eliminate unnecessary data movements between kernels at the main memory level. Specifically, kernels that exchange data are grouped together, and these groups are then executed on small sub-regions of the domain that fit in per-core private data cache. Additionally, cache blocking is also used to efficiently implement a tensor product factorisation of the interpolation operators associated with anti-aliasing. By using cache blocking, the intermediate results between application of the sparse factors are stored in per-core private data cache, and a significant amount of data movement from main memory is avoided. In order to assess the performance gains a theoretical model is developed, and the implementation is benchmarked using a compressible 3D Taylor-Green vortex test case on both hexahedral and prismatic grids, with third-, fourth-, and fifth-order solution polynomials. The expected performance gains based on the theoretical model range from 1.99 to 2.83, and the speedups obtained in practice range from 1.51 to 3.91 compared to PyFR v1.11.0.
在本文中,PyFR for CPU 实现了在混合网格上支持抗锯齿的纳维-斯托克斯方程缓存阻塞。尤其是,缓存阻塞被用作内核融合的替代方法,以消除主内存级内核间不必要的数据移动。具体来说,将交换数据的内核分组,然后在适合每个内核专用数据缓存的小域子区域上执行这些分组。此外,缓存阻塞还用于高效地实现与抗锯齿相关的插值运算符的张量乘积因式分解。通过使用缓存阻塞,稀疏因子应用之间的中间结果被存储在每核专用数据缓存中,从而避免了从主存储器移动大量数据。为了评估性能增益,我们开发了一个理论模型,并使用六面体和棱柱网格上的可压缩三维泰勒-格林涡旋测试案例,以及三阶、四阶和五阶求解多项式,对实施情况进行了基准测试。与 PyFR v1.11.0 相比,基于理论模型的预期性能增益从 1.99 到 2.83 不等,而实际获得的速度提升从 1.51 到 3.91 不等。
{"title":"Cache blocking for flux reconstruction: Extension to Navier-Stokes equations and anti-aliasing","authors":"Semih Akkurt , Freddie Witherden , Peter Vincent","doi":"10.1016/j.cpc.2024.109332","DOIUrl":"10.1016/j.cpc.2024.109332","url":null,"abstract":"<div><p>In this article, cache blocking is implemented for the Navier Stokes equations with anti-aliasing support on mixed grids in PyFR for CPUs. In particular, cache blocking is used as an alternative to kernel fusion to eliminate unnecessary data movements between kernels at the main memory level. Specifically, kernels that exchange data are grouped together, and these groups are then executed on small sub-regions of the domain that fit in per-core private data cache. Additionally, cache blocking is also used to efficiently implement a tensor product factorisation of the interpolation operators associated with anti-aliasing. By using cache blocking, the intermediate results between application of the sparse factors are stored in per-core private data cache, and a significant amount of data movement from main memory is avoided. In order to assess the performance gains a theoretical model is developed, and the implementation is benchmarked using a compressible 3D Taylor-Green vortex test case on both hexahedral and prismatic grids, with third-, fourth-, and fifth-order solution polynomials. The expected performance gains based on the theoretical model range from 1.99 to 2.83, and the speedups obtained in practice range from 1.51 to 3.91 compared to PyFR v1.11.0.</p></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"305 ","pages":"Article 109332"},"PeriodicalIF":7.2,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0010465524002558/pdfft?md5=f253f41651251a63812f0a8e5e79c01d&pid=1-s2.0-S0010465524002558-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141941229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-02DOI: 10.1016/j.cpc.2024.109331
Joao Cunha , José Queiroz , Carlos Silva , Fabio Gentile , Diogo E. Aguiam
We introduce a new open-source software package written in Python to design and model micro optical elements, such as diffractive lenses, holograms, as well as other components within the broad area of flat optics, and generate their corresponding (production-ready) lithography mask files. To this aim, the package provides functions to design a multitude of kinoform lenses, phase masks and holograms, but is versatile and the user can implement any arbitrary numerical or analytical optical component designs. For validating the designs, this package provides scalar diffraction propagation to simulate optical field propagation in different regimes covering near- and far-field regions (Fresnel, Fraunhofer and Rayleigh-Sommerfeld). Particularly, by implementing Rayleigh-Sommerfeld propagation, we demonstrate accurate field propagation within near- and far-field ranges, providing versatility and accuracy. Importantly, the package allows to directly export production-ready multilevel/binary lithography mask files of the designed optical components. Additionally, metasurface masks can conveniently be generated for any user-defined meta-element library given as input. Finally, the software package capabilities are illustrated with examples of mask design and modeling of diffractive lenses, holograms, and metasurfaces susceptible of being fabricated via lithography techniques. Beyond lithography, the package can also straightforwardly be used in other applications requiring mask generation, such as beam shaping, optical trapping and digital holography.
{"title":"pyMOE: Mask design and modeling for micro optical elements and flat optics","authors":"Joao Cunha , José Queiroz , Carlos Silva , Fabio Gentile , Diogo E. Aguiam","doi":"10.1016/j.cpc.2024.109331","DOIUrl":"10.1016/j.cpc.2024.109331","url":null,"abstract":"<div><p>We introduce a new open-source software package written in Python to design and model micro optical elements, such as diffractive lenses, holograms, as well as other components within the broad area of flat optics, and generate their corresponding (production-ready) lithography mask files. To this aim, the package provides functions to design a multitude of kinoform lenses, phase masks and holograms, but is versatile and the user can implement any arbitrary numerical or analytical optical component designs. For validating the designs, this package provides scalar diffraction propagation to simulate optical field propagation in different regimes covering near- and far-field regions (Fresnel, Fraunhofer and Rayleigh-Sommerfeld). Particularly, by implementing Rayleigh-Sommerfeld propagation, we demonstrate accurate field propagation within near- and far-field ranges, providing versatility and accuracy. Importantly, the package allows to directly export production-ready multilevel/binary lithography mask files of the designed optical components. Additionally, metasurface masks can conveniently be generated for any user-defined meta-element library given as input. Finally, the software package capabilities are illustrated with examples of mask design and modeling of diffractive lenses, holograms, and metasurfaces susceptible of being fabricated via lithography techniques. Beyond lithography, the package can also straightforwardly be used in other applications requiring mask generation, such as beam shaping, optical trapping and digital holography.</p></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"305 ","pages":"Article 109331"},"PeriodicalIF":7.2,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0010465524002546/pdfft?md5=72d18388a6734669c1b1b763c0bc9586&pid=1-s2.0-S0010465524002546-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142006818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}