Hugo Valayer, Nathalie Bartoli, Mauricio Castaño-Aguirre, R. Lafage, Thierry Lefebvre, A. F. López-Lopera, Sylvain Mouton
{"title":"使用稀疏高斯过程进行数据驱动空气动力学建模的 Python 工具箱","authors":"Hugo Valayer, Nathalie Bartoli, Mauricio Castaño-Aguirre, R. Lafage, Thierry Lefebvre, A. F. López-Lopera, Sylvain Mouton","doi":"10.3390/aerospace11040260","DOIUrl":null,"url":null,"abstract":"In aerodynamics, characterizing the aerodynamic behavior of aircraft typically requires a large number of observation data points. Real experiments can generate thousands of data points with suitable accuracy, but they are time-consuming and resource-intensive. Consequently, conducting real experiments at new input configurations might be impractical. To address this challenge, data-driven surrogate models have emerged as a cost-effective and time-efficient alternative. They provide simplified mathematical representations that approximate the output of interest. Models based on Gaussian Processes (GPs) have gained popularity in aerodynamics due to their ability to provide accurate predictions and quantify uncertainty while maintaining tractable execution times. To handle large datasets, sparse approximations of GPs have been further investigated to reduce the computational complexity of exact inference. In this paper, we revisit and adapt two classic sparse methods for GPs to address the specific requirements frequently encountered in aerodynamic applications. We compare different strategies for choosing the inducing inputs, which significantly impact the complexity reduction. We formally integrate our implementations into the open-source Python toolbox SMT, enabling the use of sparse methods across the GP regression pipeline. We demonstrate the performance of our Sparse GP (SGP) developments in a comprehensive 1D analytic example as well as in a real wind tunnel application with thousands of training data points.","PeriodicalId":505273,"journal":{"name":"Aerospace","volume":"34 12","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Python Toolbox for Data-Driven Aerodynamic Modeling Using Sparse Gaussian Processes\",\"authors\":\"Hugo Valayer, Nathalie Bartoli, Mauricio Castaño-Aguirre, R. Lafage, Thierry Lefebvre, A. F. López-Lopera, Sylvain Mouton\",\"doi\":\"10.3390/aerospace11040260\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In aerodynamics, characterizing the aerodynamic behavior of aircraft typically requires a large number of observation data points. Real experiments can generate thousands of data points with suitable accuracy, but they are time-consuming and resource-intensive. Consequently, conducting real experiments at new input configurations might be impractical. To address this challenge, data-driven surrogate models have emerged as a cost-effective and time-efficient alternative. They provide simplified mathematical representations that approximate the output of interest. Models based on Gaussian Processes (GPs) have gained popularity in aerodynamics due to their ability to provide accurate predictions and quantify uncertainty while maintaining tractable execution times. To handle large datasets, sparse approximations of GPs have been further investigated to reduce the computational complexity of exact inference. In this paper, we revisit and adapt two classic sparse methods for GPs to address the specific requirements frequently encountered in aerodynamic applications. We compare different strategies for choosing the inducing inputs, which significantly impact the complexity reduction. We formally integrate our implementations into the open-source Python toolbox SMT, enabling the use of sparse methods across the GP regression pipeline. We demonstrate the performance of our Sparse GP (SGP) developments in a comprehensive 1D analytic example as well as in a real wind tunnel application with thousands of training data points.\",\"PeriodicalId\":505273,\"journal\":{\"name\":\"Aerospace\",\"volume\":\"34 12\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Aerospace\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3390/aerospace11040260\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aerospace","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/aerospace11040260","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
在空气动力学中,描述飞机的空气动力学行为通常需要大量的观测数据点。实际实验可以生成数千个具有适当精度的数据点,但这些实验既耗时又耗费资源。因此,在新的输入配置下进行真实实验可能不切实际。为了应对这一挑战,数据驱动的代用模型应运而生,成为一种具有成本效益和时间效率的替代方法。这些模型提供了简化的数学表示方法,可近似于相关输出。基于高斯过程(GPs)的模型在空气动力学领域很受欢迎,因为它们能够提供准确的预测并量化不确定性,同时保持可控的执行时间。为了处理大型数据集,人们进一步研究了 GPs 的稀疏近似,以降低精确推理的计算复杂性。在本文中,我们重新审视并调整了 GPs 的两种经典稀疏方法,以满足空气动力学应用中经常遇到的特定要求。我们比较了选择诱导输入的不同策略,这些策略对降低复杂性有显著影响。我们将实现方法正式集成到开源 Python 工具箱 SMT 中,使稀疏方法的使用贯穿 GP 回归管道。我们在一个全面的一维分析示例中,以及在一个具有数千个训练数据点的真实风洞应用中,展示了我们的稀疏 GP (SGP) 开发成果的性能。
A Python Toolbox for Data-Driven Aerodynamic Modeling Using Sparse Gaussian Processes
In aerodynamics, characterizing the aerodynamic behavior of aircraft typically requires a large number of observation data points. Real experiments can generate thousands of data points with suitable accuracy, but they are time-consuming and resource-intensive. Consequently, conducting real experiments at new input configurations might be impractical. To address this challenge, data-driven surrogate models have emerged as a cost-effective and time-efficient alternative. They provide simplified mathematical representations that approximate the output of interest. Models based on Gaussian Processes (GPs) have gained popularity in aerodynamics due to their ability to provide accurate predictions and quantify uncertainty while maintaining tractable execution times. To handle large datasets, sparse approximations of GPs have been further investigated to reduce the computational complexity of exact inference. In this paper, we revisit and adapt two classic sparse methods for GPs to address the specific requirements frequently encountered in aerodynamic applications. We compare different strategies for choosing the inducing inputs, which significantly impact the complexity reduction. We formally integrate our implementations into the open-source Python toolbox SMT, enabling the use of sparse methods across the GP regression pipeline. We demonstrate the performance of our Sparse GP (SGP) developments in a comprehensive 1D analytic example as well as in a real wind tunnel application with thousands of training data points.