Data-driven sparse reconstruction of flow over a stalled aerofoil using experimental data
D. Carter, Francis De Voogt, R. Soares, B. Ganapathisubramani
DOI: 10.1017/dce.2021.5 | Published: 2021-05-31
Abstract Recent work has demonstrated the use of sparse sensors in combination with the proper orthogonal decomposition (POD) to produce data-driven reconstructions of full velocity fields in a variety of flows. The present work investigates the fidelity of such techniques applied to a stalled NACA 0012 aerofoil at $Re_c = 75{,}000$ and an angle of attack $\alpha = 12^{\circ}$, as measured experimentally using planar time-resolved particle image velocimetry. In contrast to many previous studies, the flow lacks a dominant shedding frequency and exhibits a broad range of singular values due to the turbulence in the separated region. Several reconstruction methodologies for linear state estimation based on classical compressed sensing and extended POD methodologies are presented, as well as nonlinear refinement through the use of a shallow neural network (SNN). It is found that the linear reconstructions inspired by the extended POD are inferior to the compressed sensing approach, provided that the sparse sensors avoid regions of the flow with small variance across the global POD basis. Regardless of the linear method used, the nonlinear SNN gives strikingly similar performance in its refinement of the reconstructions. The capability of sparse sensors to reconstruct separated turbulent flow measurements is further discussed and directions for future work are suggested.

Impact Statement Sparse reconstruction of full-field information from a limited subset of data is widely relevant to data-centric engineering applications, from reconstructing human faces with limited pixels to predicting laminar and turbulent flow fields from limited sensors. The focus of the present study is the latter, with high relevance to active flow control in aerospace and related industries. There are multiple data-driven methodologies for obtaining flow field reconstructions from sparse measurements, ranging from the linear, unsupervised proper orthogonal decomposition to nonlinear, supervised neural networks (NNs). The feasibility of such methods for flow fields that are highly turbulent, and obtained experimentally, remains an open area of research. The present study reveals the capability of these techniques to create a time-invariant library that can predict instantaneous states of the flow from sparse measurements alone (provided that these states are within the bounds of the applied training data). The proposed linear methods, as well as the NN architecture, provide well-characterized frameworks for future efforts in sparse sensing and state estimation applications, particularly for highly nonlinear underlying systems such as turbulent flow.
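A minimal sketch of the linear POD-based estimation described above: a POD basis is learned from training snapshots, and the coefficients of a new snapshot are recovered from sparse sensor readings by least squares. This illustrates the general compressed-sensing-style approach, not the authors' code; all shapes and names are assumptions.

```python
import numpy as np

def pod_basis(X, r):
    """Rank-r POD basis via SVD of the mean-subtracted snapshot matrix X."""
    X_mean = X.mean(axis=1, keepdims=True)
    U, _, _ = np.linalg.svd(X - X_mean, full_matrices=False)
    return X_mean[:, 0], U[:, :r]

def reconstruct(y, sensor_idx, x_mean, Phi):
    """Least-squares POD coefficients from sparse sensor readings y.

    C = Phi[sensor_idx] maps coefficients to sensor values; solving
    min_a ||C a - (y - mean_at_sensors)|| is the linear state estimate.
    """
    C = Phi[sensor_idx, :]                       # measurement matrix
    a, *_ = np.linalg.lstsq(C, y - x_mean[sensor_idx], rcond=None)
    return x_mean + Phi @ a                      # full-field estimate

rng = np.random.default_rng(0)
X = rng.standard_normal((5000, 400))             # stand-in for PIV snapshots
x_mean, Phi = pod_basis(X, r=20)
sensors = rng.choice(5000, size=40, replace=False)
u_hat = reconstruct(X[sensors, 0], sensors, x_mean, Phi)
```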
{"title":"Data-driven sparse reconstruction of flow over a stalled aerofoil using experimental data","authors":"D. Carter, Francis De Voogt, R. Soares, B. Ganapathisubramani","doi":"10.1017/dce.2021.5","DOIUrl":"https://doi.org/10.1017/dce.2021.5","url":null,"abstract":"Abstract Recent work has demonstrated the use of sparse sensors in combination with the proper orthogonal decomposition (POD) to produce data-driven reconstructions of the full velocity fields in a variety of flows. The present work investigates the fidelity of such techniques applied to a stalled NACA 0012 aerofoil at $ {Re}_c=75,000 $ at an angle of attack $ alpha ={12}^{circ } $ as measured experimentally using planar time-resolved particle image velocimetry. In contrast to many previous studies, the flow is absent of any dominant shedding frequency and exhibits a broad range of singular values due to the turbulence in the separated region. Several reconstruction methodologies for linear state estimation based on classical compressed sensing and extended POD methodologies are presented as well as nonlinear refinement through the use of a shallow neural network (SNN). It is found that the linear reconstructions inspired by the extended POD are inferior to the compressed sensing approach provided that the sparse sensors avoid regions of the flow with small variance across the global POD basis. Regardless of the linear method used, the nonlinear SNN gives strikingly similar performance in its refinement of the reconstructions. The capability of sparse sensors to reconstruct separated turbulent flow measurements is further discussed and directions for future work suggested. Impact Statement Sparse reconstruction of full-field information using a limited subset of data is widely relevant to data-centric engineering applications; from reconstructing human faces with limited pixels to predicting laminar and turbulent flow fields from limited sensors. The focus of the present study is of the latter example with high relevance to active flow control in aerospace and related industry. There are multiple data-driven methodologies for obtaining flow field reconstructions from sparse measurements ranging from the linear unsupervised proper orthogonal decomposition to the use of nonlinear supervised NNs. The feasibility of such methods to flow fields that are highly turbulent as well as obtained via experiment remains an open area of research. The present study reveals the capability of these techniques to create a time-invariant library that can predict instantaneous states of the flow from sparse measurements alone (provided that these states are within the bounds of the applied training data). 
The proposed linear methods, as well as the NN architecture, provide well-characterized frameworks for future efforts in sparse sensing and state estimation applications: particularly for highly nonlinear underlying systems such as turbulent flow.","PeriodicalId":34169,"journal":{"name":"DataCentric Engineering","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1017/dce.2021.5","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43448134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Feature extraction and artificial neural networks for the on-the-fly classification of high-dimensional thermochemical spaces in adaptive-chemistry simulations – ERRATUM","authors":"G. D’Alessio, A. Cuoci, A. Parente","doi":"10.1017/dce.2021.4","DOIUrl":"https://doi.org/10.1017/dce.2021.4","url":null,"abstract":"","PeriodicalId":34169,"journal":{"name":"DataCentric Engineering","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1017/dce.2021.4","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48660539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
On the reproducibility of fully convolutional neural networks for modeling time–space-evolving physical systems
Wagner G. Pinto, Antonio Alguacil, M. Bauerheim
DOI: 10.1017/dce.2022.18 | Published: 2021-05-12

Abstract The reproducibility of a deep-learning fully convolutional neural network is evaluated by training the same network several times under identical conditions (database, hyperparameters, and hardware) with nondeterministic graphics processing unit operations. The network is trained to model three typical time–space-evolving physical systems in two dimensions: the heat, Burgers’, and wave equations. The behavior of the networks is evaluated on both recursive and nonrecursive tasks. Significant changes in the models’ properties (weights and feature fields) are observed. When tested on various benchmarks, these models systematically return estimations with a high level of deviation, especially for the recurrent analysis, which strongly amplifies the variability due to the nondeterminism. Training with double floating-point precision provides slightly better estimations and significantly reduces the variability of both the network parameters and the testing error range.
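The two controls discussed above, deterministic GPU kernels and double-precision training, can be illustrated in PyTorch (an assumed framework choice, not the authors' setup):

```python
import torch

torch.manual_seed(0)
torch.use_deterministic_algorithms(True)  # raise on nondeterministic kernels
torch.backends.cudnn.benchmark = False    # disable autotuned (variable) kernels
# On CUDA, the env var CUBLAS_WORKSPACE_CONFIG=":4096:8" may also be required.

model = torch.nn.Sequential(
    torch.nn.Conv2d(1, 8, 3, padding=1),
    torch.nn.ReLU(),
    torch.nn.Conv2d(8, 1, 3, padding=1),
).double()                                # float64 weights and activations

x = torch.randn(4, 1, 32, 32, dtype=torch.float64)
y = model(x)                              # reruns with the same seed now agree
```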
{"title":"On the reproducibility of fully convolutional neural networks for modeling time–space-evolving physical systems","authors":"Wagner G. Pinto, Antonio Alguacil, M. Bauerheim","doi":"10.1017/dce.2022.18","DOIUrl":"https://doi.org/10.1017/dce.2022.18","url":null,"abstract":"Abstract Reproducibility of a deep-learning fully convolutional neural network is evaluated by training several times the same network on identical conditions (database, hyperparameters, and hardware) with nondeterministic graphics processing unit operations. The network is trained to model three typical time–space-evolving physical systems in two dimensions: heat, Burgers’, and wave equations. The behavior of the networks is evaluated on both recursive and nonrecursive tasks. Significant changes in models’ properties (weights and feature fields) are observed. When tested on various benchmarks, these models systematically return estimations with a high level of deviation, especially for the recurrent analysis which strongly amplifies variability due to the nondeterminism. Trainings performed with double floating-point precision provide slightly better estimations and a significant reduction of the variability of both the network parameters and its testing error range.","PeriodicalId":34169,"journal":{"name":"DataCentric Engineering","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47807129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Principal component density estimation for scenario generation using normalizing flows
Eike Cramer, A. Mitsos, R. Tempone, M. Dahmen
DOI: 10.1017/dce.2022.7 | Published: 2021-04-21

Abstract Neural-network-based learning of the distribution of non-dispatchable renewable electricity generation from sources such as photovoltaics (PV) and wind, as well as of load demands, has recently gained attention. Normalizing flow density models are particularly well suited for this task due to training through direct log-likelihood maximization. However, research from the field of image generation has shown that standard normalizing flows can only learn smeared-out versions of manifold distributions. Previous works on normalizing-flow-based scenario generation do not address this issue, and the smeared-out distributions result in the sampling of noisy time series. In this paper, we exploit the isometry of principal component analysis (PCA), which sets up the normalizing flow in a lower-dimensional space while maintaining direct and computationally efficient likelihood maximization. We train the resulting principal component flow (PCF) on data of PV and wind power generation as well as load demand in Germany in the years 2013–2015. The results show that the PCF preserves critical features of the original distributions, such as the probability density and frequency behavior of the time series. The application of the PCF is, however, not limited to renewable power generation but extends to any dataset, time series or otherwise, that can be efficiently reduced using PCA.
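The PCF construction can be summarized in a few lines: because the truncated PCA basis has orthonormal columns, the change of variables contributes no log-determinant term, so a flow trained by maximum likelihood in the reduced space remains directly comparable to one trained in full space. A minimal sketch, with the flow itself left as a placeholder (the data and dimensions are stand-ins, not the paper's):

```python
import numpy as np

X = np.random.default_rng(1).standard_normal((1000, 96))  # stand-in time series
mu = X.mean(axis=0)
_, _, Vt = np.linalg.svd(X - mu, full_matrices=False)
W = Vt[:10].T                  # orthonormal PCA basis: W.T @ W = I (isometry)
Z = (X - mu) @ W               # train the normalizing flow on Z, not on X;
                               # orthonormality adds no log-det correction

# After training, sampling inverts the chain:
# z = flow.sample(n)           # placeholder for the trained density model
# x = z @ W.T + mu             # lift reduced samples back to full time series
```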
{"title":"Principal component density estimation for scenario generation using normalizing flows","authors":"Eike Cramer, A. Mitsos, R. Tempone, M. Dahmen","doi":"10.1017/dce.2022.7","DOIUrl":"https://doi.org/10.1017/dce.2022.7","url":null,"abstract":"Abstract Neural networks-based learning of the distribution of non-dispatchable renewable electricity generation from sources, such as photovoltaics (PV) and wind as well as load demands, has recently gained attention. Normalizing flow density models are particularly well suited for this task due to the training through direct log-likelihood maximization. However, research from the field of image generation has shown that standard normalizing flows can only learn smeared-out versions of manifold distributions. Previous works on normalizing flow-based scenario generation do not address this issue, and the smeared-out distributions result in the sampling of noisy time series. In this paper, we exploit the isometry of the principal component analysis (PCA), which sets up the normalizing flow in a lower-dimensional space while maintaining the direct and computationally efficient likelihood maximization. We train the resulting principal component flow (PCF) on data of PV and wind power generation as well as load demand in Germany in the years 2013–2015. The results of this investigation show that the PCF preserves critical features of the original distributions, such as the probability density and frequency behavior of the time series. The application of the PCF is, however, not limited to renewable power generation but rather extends to any dataset, time series, or otherwise, which can be efficiently reduced using PCA.","PeriodicalId":34169,"journal":{"name":"DataCentric Engineering","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46938143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Feature extraction and artificial neural networks for the on-the-fly classification of high-dimensional thermochemical spaces in adaptive-chemistry simulations
G. D’Alessio, A. Cuoci, A. Parente
DOI: 10.1017/dce.2021.2 | Published: 2021-04-12

Abstract The integration of Artificial Neural Networks (ANNs) and Feature Extraction (FE) in the context of the Sample-Partitioning Adaptive Reduced Chemistry approach was investigated in this work, to increase the on-the-fly classification accuracy for very large thermochemical states. The proposed methodology was first compared with an on-the-fly classifier based on the Principal Component Analysis reconstruction error, as well as with a standard ANN (s-ANN) classifier operating on the full thermochemical space, for the adaptive simulation of a steady laminar flame fed with a nitrogen-diluted stream of n-heptane in air. The numerical simulations were carried out with a kinetic mechanism accounting for 172 species and 6,067 reactions, which includes the chemistry of Polycyclic Aromatic Hydrocarbons (PAHs) up to C$_{20}$. Among all the aforementioned classifiers, the one combining an FE step with an ANN proved the most efficient for the classification of high-dimensional spaces, leading to a higher speed-up factor and higher accuracy of the adaptive simulation in the description of the PAH and soot-precursor chemistry. Finally, the investigation of the classifier’s performance was extended to flames with boundary conditions different from those of the training flame, obtained by imposing a higher Reynolds number or time-dependent sinusoidal perturbations. Satisfactory results were observed on all the test flames.

Impact Statement Existing methodologies for the simulation of multidimensional flames with detailed kinetic mechanisms are time-consuming because of the large number of chemical species and reactions involved. This aspect has prompted the development of approaches to reduce the computational requirements of computational fluid dynamics simulations of reacting flows. Among them, adaptive chemistry is worth mentioning, as it allows complex kinetic mechanisms to be used only where needed. In this work, an artificial neural network architecture with a prior encoding step via Principal Component Analysis was integrated into the Sample-Partitioning Adaptive Reduced Chemistry approach, to increase the on-the-fly classification accuracy when high-dimensional spaces are considered. Its performance was compared with that of other supervised classifiers, operating on the full thermochemical space, in terms of speed-up with respect to the detailed simulation and accuracy in the description of Polycyclic Aromatic Hydrocarbon species.
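As a rough illustration of the FE + ANN classifier (a PCA encoding step followed by a neural classifier), scikit-learn stands in for the paper's implementation; the dimensions and labels below are synthetic:

```python
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA
from sklearn.neural_network import MLPClassifier

rng = np.random.default_rng(2)
X = rng.standard_normal((5000, 173))     # thermochemical states: T + 172 species
y = rng.integers(0, 8, size=5000)        # labels from a prior clustering step

clf = make_pipeline(
    StandardScaler(),
    PCA(n_components=20),                # feature-extraction (encoding) step
    MLPClassifier(hidden_layer_sizes=(64,), max_iter=300),
)
clf.fit(X, y)
labels = clf.predict(X[:10])             # on-the-fly classification per cell
```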
{"title":"Feature extraction and artificial neural networks for the on-the-fly classification of high-dimensional thermochemical spaces in adaptive-chemistry simulations","authors":"G. D’Alessio, A. Cuoci, A. Parente","doi":"10.1017/dce.2021.2","DOIUrl":"https://doi.org/10.1017/dce.2021.2","url":null,"abstract":"Abstract The integration of Artificial Neural Networks (ANNs) and Feature Extraction (FE) in the context of the Sample- Partitioning Adaptive Reduced Chemistry approach was investigated in this work, to increase the on-the-fly classification accuracy for very large thermochemical states. The proposed methodology was firstly compared with an on-the-fly classifier based on the Principal Component Analysis reconstruction error, as well as with a standard ANN (s-ANN) classifier, operating on the full thermochemical space, for the adaptive simulation of a steady laminar flame fed with a nitrogen-diluted stream of n-heptane in air. The numerical simulations were carried out with a kinetic mechanism accounting for 172 species and 6,067 reactions, which includes the chemistry of Polycyclic Aromatic Hydrocarbons (PAHs) up to C$ {}_{20} $. Among all the aforementioned classifiers, the one exploiting the combination of an FE step with ANN proved to be more efficient for the classification of high-dimensional spaces, leading to a higher speed-up factor and a higher accuracy of the adaptive simulation in the description of the PAH and soot-precursor chemistry. Finally, the investigation of the classifier’s performances was also extended to flames with different boundary conditions with respect to the training one, obtained imposing a higher Reynolds number or time-dependent sinusoidal perturbations. Satisfying results were observed on all the test flames. Impact Statement The existing methodologies for the simulation of multidimensional flames with detailed kinetic mechanisms are time-consuming because of the large number of involved chemical species and reactions. This aspect has prompted the development of approaches to reduce the computational requirements of computational fluid dynamics simulations of reacting flows. Among them, adaptive chemistry is worth mentioning, as it allows to use complex kinetic mechanisms only where needed. In this work, an artificial neural network architecture with a prior encoding step via Principal Component Analysis was integrated in the Sample-Partitioning Adaptive Reduced Chemistry approach, to increase the on-the-fly classification accuracy when high-dimensional spaces are considered. Its performances were compared with others supervised classifiers, operating on the full thermochemical space, in terms of speed-up with respect to the detailed simulation and accuracy in the description of Polycyclic Aromatic Hydrocarbon species.","PeriodicalId":34169,"journal":{"name":"DataCentric Engineering","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1017/dce.2021.2","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41836917","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The study of variability in engineering design—An appreciation and a retrospective
T. Davis
DOI: 10.1017/dce.2021.3 | Published: 2021-03-29

Abstract We explore the concept of parameter design applied to the production of glass beads in the manufacture of metal-encapsulated transistors. The main motivation is to complete the analysis hinted at in the original publication by Jim Morrison in 1957, which was an early example of discussing the idea of transmitted variation in engineering design and an influential paper in the development of analytic parameter design as a data-centric engineering activity. Parameter design is a secondary design activity focused on selecting the nominals of the design variables to achieve the required target performance and simultaneously reduce the variance around the target. Although the 1957 paper is not recent, its approach to engineering design is modern.

Impact Statement This paper draws attention to a 1957 publication by Jim Morrison and illustrates the concept of parameter design (a secondary design activity between concept design and tolerance design). The 1957 paper was the first in the English language to discuss parameter design and is an early example of data-centric engineering. This paper illustrates that the obvious or intuitive solutions to design optimization can be wrong, even in the simplest of cases, motivating the need for careful data-centered analysis when solving engineering problems.
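A toy numerical illustration of transmitted variation (not Morrison's actual glass-bead model): to first order, $\mathrm{Var}(Y) \approx f'(x_0)^2\,\mathrm{Var}(X)$, so choosing the nominal where the response is flat shrinks the variance transmitted to the output even though the input scatter is unchanged.

```python
import numpy as np

f = lambda x: x**3 - 6 * x**2 + 12 * x   # illustrative response; f'(x) = 3(x-2)^2
rng = np.random.default_rng(3)

for x0 in (1.0, 2.0):                    # two candidate nominals, same scatter
    x = rng.normal(x0, 0.05, 100_000)
    print(f"x0={x0}: std(Y)={f(x).std():.5f}")
# The nominal x0 = 2, where f'(x0) = 0, transmits far less variance to Y.
```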
{"title":"The study of variability in engineering design—An appreciation and a retrospective","authors":"T. Davis","doi":"10.1017/dce.2021.3","DOIUrl":"https://doi.org/10.1017/dce.2021.3","url":null,"abstract":"Abstract We explore the concept of parameter design applied to the production of glass beads in the manufacture of metal-encapsulated transistors. The main motivation is to complete the analysis hinted at in the original publication by Jim Morrison in 1957, which was an early example of discussing the idea of transmitted variation in engineering design, and an influential paper in the development of analytic parameter design as a data-centric engineering activity. Parameter design is a secondary design activity focused on selecting the nominals of the design variables to achieve the required target performance and to simultaneously reduce the variance around the target. Although the 1957 paper is not recent, its approach to engineering design is modern. Impact Statement This paper draws attention to a 1957 publication by Jim Morrison and illustrates the concept of parameter design (a secondary design activity between concept design and tolerance design). The 1957 paper was the first in the English language to discuss parameter design and is an early example of data-centric engineering. This paper illustrates that the obvious or intuitive solutions to design optimization can be wrong, even in the simplest of cases as illustrated here, motivating the need for careful data-centered analysis, when solving engineering problems.","PeriodicalId":34169,"journal":{"name":"DataCentric Engineering","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1017/dce.2021.3","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47483669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Digital twinning of self-sensing structures using the statistical finite element method
Eky Febrianto, Liam J. Butler, M. Girolami, F. Cirak
DOI: 10.1017/dce.2022.28 | Published: 2021-03-25
Abstract The monitoring of infrastructure assets using sensor networks is becoming increasingly prevalent. A digital twin in the form of a finite element (FE) model, as commonly used in design and construction, can help make sense of the copious amount of collected sensor data. This paper demonstrates the application of the statistical finite element method (statFEM), which provides a principled means of synthesizing data and physics-based models, in developing a digital twin of a self-sensing structure. As a case study, an instrumented steel railway bridge of $27.34\,\mathrm{m}$ length located along the West Coast Mainline near Staffordshire in the UK is considered. Using strain data captured from fiber Bragg grating sensors at 108 locations along the bridge superstructure, statFEM can predict the “true” system response while taking into account the uncertainties in sensor readings, applied loading, and FE model misspecification errors. Longitudinal strain distributions along the two main I-beams are both measured and modeled during the passage of a passenger train. The statFEM digital twin is able to generate reasonable strain distribution predictions at locations where no measurement data are available, including at several points along the main I-beams and on structural elements on which sensors are not even installed. The implications for long-term structural health monitoring and assessment include optimization of sensor placement and performing more reliable what-if analyses at locations and under loading scenarios for which no measurement data are available.
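The statFEM conditioning step can be illustrated with a small linear-Gaussian sketch, assuming the statistical model y = P u + d + e (FE prior plus model mismatch plus sensor noise); the sizes and covariances below are toy stand-ins, not the bridge model, and the scaling parameter of the full statFEM formulation is omitted:

```python
import numpy as np

n, m = 50, 8                                   # FE dofs, sensor count
rng = np.random.default_rng(4)
u_fe = rng.standard_normal(n)                  # FE prediction (prior mean)
C_u = 0.10 * np.eye(n)                         # prior covariance of FE solution
P = np.zeros((m, n))                           # observation operator: pick dofs
P[np.arange(m), rng.choice(n, m, replace=False)] = 1.0
C_d = 0.05 * np.eye(m)                         # model-mismatch covariance
C_e = 0.01 * np.eye(m)                         # sensor-noise covariance
y = P @ u_fe + rng.standard_normal(m) * 0.2    # synthetic strain readings

S = P @ C_u @ P.T + C_d + C_e                  # marginal covariance of the data
K = C_u @ P.T @ np.linalg.inv(S)               # gain
u_post = u_fe + K @ (y - P @ u_fe)             # data-conditioned response
C_post = C_u - K @ P @ C_u                     # posterior (reduced) uncertainty
```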
{"title":"Digital twinning of self-sensing structures using the statistical finite element method","authors":"Eky Febrianto, Liam J. Butler, M. Girolami, F. Cirak","doi":"10.1017/dce.2022.28","DOIUrl":"https://doi.org/10.1017/dce.2022.28","url":null,"abstract":"Abstract The monitoring of infrastructure assets using sensor networks is becoming increasingly prevalent. A digital twin in the form of a finite element (FE) model, as commonly used in design and construction, can help make sense of the copious amount of collected sensor data. This paper demonstrates the application of the statistical finite element method (statFEM), which provides a principled means of synthesizing data and physics-based models, in developing a digital twin of a self-sensing structure. As a case study, an instrumented steel railway bridge of $ 27.34hskip1.5pt mathrm{m} $ length located along the West Coast Mainline near Staffordshire in the UK is considered. Using strain data captured from fiber Bragg grating sensors at 108 locations along the bridge superstructure, statFEM can predict the “true” system response while taking into account the uncertainties in sensor readings, applied loading, and FE model misspecification errors. Longitudinal strain distributions along the two main I-beams are both measured and modeled during the passage of a passenger train. The statFEM digital twin is able to generate reasonable strain distribution predictions at locations where no measurement data are available, including at several points along the main I-beams and on structural elements on which sensors are not even installed. The implications for long-term structural health monitoring and assessment include optimization of sensor placement and performing more reliable what-if analyses at locations and under loading scenarios for which no measurement data are available.","PeriodicalId":34169,"journal":{"name":"DataCentric Engineering","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48740222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Recurrent neural network for end-to-end modeling of laminar-turbulent transition
M. Zafar, Meelan Choudhari, P. Paredes, Heng Xiao
DOI: 10.1017/dce.2021.11 | Published: 2021-03-25

Abstract Accurate prediction of laminar-turbulent transition is a critical element of computational fluid dynamics simulations for aerodynamic design across multiple flow regimes. Traditional methods of transition prediction cannot be easily extended to flow configurations where the transition process depends on a large set of parameters. In comparison, neural network methods allow higher-dimensional input features to be considered without compromising the efficiency and accuracy of traditional data-driven models. Neural network methods proposed earlier follow a cumbersome methodology of predicting instability growth rates over a broad range of frequencies, which are then processed to obtain the N-factor envelope and, from that, the transition location based on the correlating N-factor. This paper presents an end-to-end transition model based on a recurrent neural network, which sequentially processes the mean boundary-layer profiles along the surface of the aerodynamic body to directly predict the N-factor envelope and the transition location over a two-dimensional airfoil. The proposed transition model has been developed and assessed using a large database of 53 airfoils over a wide range of chord Reynolds numbers and angles of attack. The large universe of airfoils encountered in various applications poses additional difficulties, so we also provide insights on selecting training datasets from large amounts of available data. Although the proposed model has been analyzed for two-dimensional boundary layers in this paper, it can be easily generalized to other flows due to the embedded feature-extraction capability of the convolutional neural network in the model.
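A hedged sketch of the kind of model the abstract describes, with a convolutional encoder per boundary-layer station feeding a recurrent march along the surface; the layer sizes, channel counts, and input layout (two profile quantities sampled at 64 wall-normal points per station) are illustrative assumptions, not the paper's exact architecture:

```python
import torch
import torch.nn as nn

class TransitionRNN(nn.Module):
    def __init__(self, feat=32, hidden=64):
        super().__init__()
        self.encoder = nn.Sequential(          # per-station profile encoder
            nn.Conv1d(2, 16, 5, padding=2), nn.ReLU(),
            nn.AdaptiveAvgPool1d(8), nn.Flatten(),
            nn.Linear(16 * 8, feat),
        )
        self.rnn = nn.LSTM(feat, hidden, batch_first=True)
        self.head = nn.Linear(hidden, 1)       # local N-factor envelope value

    def forward(self, profiles):               # (batch, stations, 2, 64)
        b, s, c, n = profiles.shape
        z = self.encoder(profiles.reshape(b * s, c, n)).reshape(b, s, -1)
        h, _ = self.rnn(z)                     # sequential march along surface
        return self.head(h).squeeze(-1)        # N-factor at every station

model = TransitionRNN()
N_env = model(torch.randn(4, 120, 2, 64))      # 120 chordwise stations
```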
{"title":"Recurrent neural network for end-to-end modeling of laminar-turbulent transition","authors":"M. Zafar, Meelan Choudhari, P. Paredes, Heng Xiao","doi":"10.1017/dce.2021.11","DOIUrl":"https://doi.org/10.1017/dce.2021.11","url":null,"abstract":"Abstract Accurate prediction of laminar-turbulent transition is a critical element of computational fluid dynamics simulations for aerodynamic design across multiple flow regimes. Traditional methods of transition prediction cannot be easily extended to flow configurations where the transition process depends on a large set of parameters. In comparison, neural network methods allow higher dimensional input features to be considered without compromising the efficiency and accuracy of the traditional data-driven models. Neural network methods proposed earlier follow a cumbersome methodology of predicting instability growth rates over a broad range of frequencies, which are then processed to obtain the N-factor envelope, and then, the transition location based on the correlating N-factor. This paper presents an end-to-end transition model based on a recurrent neural network, which sequentially processes the mean boundary-layer profiles along the surface of the aerodynamic body to directly predict the N-factor envelope and the transition locations over a two-dimensional airfoil. The proposed transition model has been developed and assessed using a large database of 53 airfoils over a wide range of chord Reynolds numbers and angles of attack. The large universe of airfoils encountered in various applications causes additional difficulties. As such, we provide further insights on selecting training datasets from large amounts of available data. Although the proposed model has been analyzed for two-dimensional boundary layers in this paper, it can be easily generalized to other flows due to embedded feature extraction capability of convolutional neural network in the model.","PeriodicalId":34169,"journal":{"name":"DataCentric Engineering","volume":"438 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41315390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Model order reduction based on Runge–Kutta neural networks
Qinyu Zhuang, Juan M Lorenzi, H. Bungartz, D. Hartmann
DOI: 10.1017/dce.2021.15 | Published: 2021-03-25
Abstract Model order reduction (MOR) methods enable the generation of real-time-capable digital twins, with the potential to unlock various novel value streams in industry. While traditional projection-based methods are robust and accurate for linear problems, incorporating machine learning to deal with nonlinearity has become a new choice for reducing complex problems. These methods are independent of the numerical solver for the full-order model and preserve the nonintrusiveness of the whole workflow. They usually consist of two steps: dimension reduction by a projection-based method, followed by model reconstruction by a neural network (NN). In this work, we apply modifications to both steps and investigate their impact using three different simulation models. In all cases, proper orthogonal decomposition is used for dimension reduction. For this step, the effect of generating the snapshot database with constant input parameters is compared with that of using time-dependent input parameters. For the model reconstruction step, three types of NN architectures are compared: the multilayer perceptron (MLP), the explicit Euler NN (EENN), and the Runge–Kutta NN (RKNN). The MLPs learn the system state directly, whereas EENNs and RKNNs learn the derivative of the system state and predict the new state as a numerical integrator would. In the tests, RKNNs show their advantage as a network architecture informed by a higher-order numerical strategy.
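The distinction between the three architectures is easiest to see in code: the MLP regresses the next state directly, while the EENN and RKNN learn the time derivative and embed it in an explicit Euler or classical RK4 step. A minimal sketch with assumed layer sizes:

```python
import torch
import torch.nn as nn

def make_f(dim):                               # network approximating dx/dt
    return nn.Sequential(nn.Linear(dim, 64), nn.Tanh(), nn.Linear(64, dim))

class EENN(nn.Module):
    def __init__(self, dim, dt):
        super().__init__()
        self.f, self.dt = make_f(dim), dt
    def forward(self, x):                      # x_{k+1} = x_k + dt * f(x_k)
        return x + self.dt * self.f(x)

class RKNN(nn.Module):
    def __init__(self, dim, dt):
        super().__init__()
        self.f, self.dt = make_f(dim), dt
    def forward(self, x):                      # four RK4 stages share one f
        k1 = self.f(x)
        k2 = self.f(x + 0.5 * self.dt * k1)
        k3 = self.f(x + 0.5 * self.dt * k2)
        k4 = self.f(x + self.dt * k3)
        return x + self.dt / 6.0 * (k1 + 2 * k2 + 2 * k3 + k4)

x = torch.randn(16, 20)                        # e.g. 20 POD coefficients
x_next = RKNN(20, dt=0.01)(x)                  # one step of the reduced model
```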
{"title":"Model order reduction based on Runge–Kutta neural networks","authors":"Qinyu Zhuang, Juan M Lorenzi, H. Bungartz, D. Hartmann","doi":"10.1017/dce.2021.15","DOIUrl":"https://doi.org/10.1017/dce.2021.15","url":null,"abstract":"Abstract Model order reduction (MOR) methods enable the generation of real-time-capable digital twins, with the potential to unlock various novel value streams in industry. While traditional projection-based methods are robust and accurate for linear problems, incorporating machine learning to deal with nonlinearity becomes a new choice for reducing complex problems. These kinds of methods are independent to the numerical solver for the full order model and keep the nonintrusiveness of the whole workflow. Such methods usually consist of two steps. The first step is the dimension reduction by a projection-based method, and the second is the model reconstruction by a neural network (NN). In this work, we apply some modifications for both steps respectively and investigate how they are impacted by testing with three different simulation models. In all cases Proper orthogonal decomposition is used for dimension reduction. For this step, the effects of generating the snapshot database with constant input parameters is compared with time-dependent input parameters. For the model reconstruction step, three types of NN architectures are compared: multilayer perceptron (MLP), explicit Euler NN (EENN), and Runge–Kutta NN (RKNN). The MLPs learn the system state directly, whereas EENNs and RKNNs learn the derivative of system state and predict the new state as a numerical integrator. In the tests, RKNNs show their advantage as the network architecture informed by higher-order numerical strategy.","PeriodicalId":34169,"journal":{"name":"DataCentric Engineering","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42433722","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Microseismic event detection in large heterogeneous velocity models using Bayesian multimodal nested sampling
Saptarshi Das, M. Hobson, F. Feroz, Xi Chen, S. Phadke, J. Goudswaard, D. Hohl
DOI: 10.1017/dce.2021.1 | Published: 2021-02-26
Abstract In passive seismic and microseismic monitoring, identifying and characterizing events against a strong noise background is a challenging task. Most of the established methods for geophysical inversion are likely to yield many false event detections, and the most advanced of these schemes require thousands of computationally demanding forward elastic-wave propagation simulations. Here we train and use an ensemble of Gaussian process surrogate meta-models, or proxy emulators, to accelerate the generation of accurate template seismograms from random microseismic event locations. In the presence of multiple microseismic events occurring at different spatial locations with arbitrary amplitude and origin time, and in the presence of noise, an inference algorithm needs to navigate an objective function or likelihood landscape of highly complex shape, perhaps with multiple modes and narrow curving degeneracies. This is a challenging computational task even for state-of-the-art Bayesian sampling algorithms. In this paper, we propose a novel method for detecting multiple microseismic events in a strong noise background using Bayesian inference, in particular the Multimodal Nested Sampling (MultiNest) algorithm. The method not only provides the posterior samples for the 5D spatio-temporal-amplitude inference of the real microseismic events, by inverting the seismic traces at multiple surface receivers, but also computes the Bayesian evidence, or marginal likelihood, which permits hypothesis testing for discriminating true vs. false event detections.

Impact Statement Bayesian evidence-based reasoning is helpful in distinguishing real microseismic events from environmental noise. The geophysical challenge here is the high computational burden of simulating noiseless template seismic responses for explosive-type events and combining them with different amplitudes and origin times. We use Gaussian process based surrogate models as proxies for the multi-receiver seismic responses used in the Bayesian detection of microseismic events in a heterogeneous marine velocity model. We use the MultiNest sampler for Bayesian inference since, in the presence of multiple events, the likelihood surface becomes multimodal. From the sampled points, a density-based clustering algorithm is employed to filter out each microseismic event for improved mode separation and to obtain the posterior distribution of each event in a joint 5D space of amplitude, origin time, and three spatial coordinates. The choice of the resolution parameter of the MultiNest sampler (Nlive) is also crucial for obtaining accurate inference within reasonable computational time and resources, and has been compared for two different scenarios (Nlive = 300, 500). A data analytics pipeline is proposed, starting from GPU-based simulation of microseismic events, to training a surrogate model for cheaper likelihood calculation, followed by 5D posterior inference for simultaneous detection of multiple microseismic events.
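A hedged sketch of the likelihood construction the abstract implies: a Gaussian process surrogate emulates the multi-receiver template seismogram as a function of source location, and candidate events are superposed with their own amplitudes and origin times before a Gaussian misfit is evaluated. A nested sampler such as MultiNest would call log_like repeatedly; every name, shape, and kernel choice here is illustrative, not the authors' implementation.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor

rng = np.random.default_rng(5)
train_xyz = rng.uniform(0.0, 1.0, (200, 3))          # sampled source locations
train_seis = rng.standard_normal((200, 4 * 256))     # stand-in for simulated templates
gp = GaussianProcessRegressor().fit(train_xyz, train_seis)  # offline surrogate

def log_like(events, data, dt_samp, sigma=0.1):
    """events: per-event tuples (x, y, z, amplitude, t0); Gaussian noise model."""
    pred = np.zeros_like(data)
    for x, y, z, amp, t0 in events:
        template = gp.predict(np.array([[x, y, z]]))[0].reshape(data.shape)
        pred += amp * np.roll(template, int(round(t0 / dt_samp)), axis=-1)
    return -0.5 * np.sum((data - pred) ** 2) / sigma**2

data = rng.standard_normal((4, 256))                 # 4 receivers x 256 samples
print(log_like([(0.2, 0.5, 0.3, 1.0, 0.04)], data, dt_samp=0.002))
```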
{"title":"Microseismic event detection in large heterogeneous velocity models using Bayesian multimodal nested sampling","authors":"Saptarshi Das, M. Hobson, F. Feroz, Xi Chen, S. Phadke, J. Goudswaard, D. Hohl","doi":"10.1017/dce.2021.1","DOIUrl":"https://doi.org/10.1017/dce.2021.1","url":null,"abstract":"Abstract In passive seismic and microseismic monitoring, identifying and characterizing events in a strong noisy background is a challenging task. Most of the established methods for geophysical inversion are likely to yield many false event detections. The most advanced of these schemes require thousands of computationally demanding forward elastic-wave propagation simulations. Here we train and use an ensemble of Gaussian process surrogate meta-models, or proxy emulators, to accelerate the generation of accurate template seismograms from random microseismic event locations. In the presence of multiple microseismic events occurring at different spatial locations with arbitrary amplitude and origin time, and in the presence of noise, an inference algorithm needs to navigate an objective function or likelihood landscape of highly complex shape, perhaps with multiple modes and narrow curving degeneracies. This is a challenging computational task even for state-of-the-art Bayesian sampling algorithms. In this paper, we propose a novel method for detecting multiple microseismic events in a strong noise background using Bayesian inference, in particular, the Multimodal Nested Sampling (MultiNest) algorithm. The method not only provides the posterior samples for the 5D spatio-temporal-amplitude inference for the real microseismic events, by inverting the seismic traces in multiple surface receivers, but also computes the Bayesian evidence or the marginal likelihood that permits hypothesis testing for discriminating true vs. false event detection. Impact Statement Bayesian evidence-based reasoning is helpful in identifying real microseismic events as opposed to the environmental noise. The geophysical challenge here is the high-computational burden for simulating noiseless template seismic responses for explosive type events and combining them together having different amplitudes and origin times. We use Gaussian process based surrogate models as proxy for multi-receiver seismic responses to be used for the Bayesian detection of microseismic events in a heterogeneous marine velocity model. We used the MultiNest sampler for Bayesian inference since in the presence of multiple events, the likelihood surface becomes multimodal. From the sampled points, a density-based clustering algorithm is employed to filter out each microseismic event for improved mode separation and obtain the posterior distribution of each event in a joint 5D space of amplitude, origin time, and three spatial co-ordinates. Choice of the resolution parameter in MultiNest sampler (Nlive) is also crucial to obtain accurate inference within reasonable computational time and resources and have been compared for two different scenarios (Nlive = 300, 500). 
A data analytics pipeline is proposed in this paper, starting from GPU based simulation of microseismic events to training a surrogate model for cheaper likelihood calculation, followed by 5D posterior inference for simultaneous detection ","PeriodicalId":34169,"journal":{"name":"DataCentric Engineering","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1017/dce.2021.1","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41969051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}