Xueren Li , Weijie Sun , Chao Qin , Yihuan Yan , Liwei Zhang , Jiyuan Tu
{"title":"Evaluation of supervised machine learning regression models for CFD-based surrogate modelling in indoor airflow field reconstruction","authors":"Xueren Li , Weijie Sun , Chao Qin , Yihuan Yan , Liwei Zhang , Jiyuan Tu","doi":"10.1016/j.buildenv.2024.112173","DOIUrl":null,"url":null,"abstract":"<div><div>Fast and reliable prediction of indoor airflow distribution is critical for indoor environment control. While neural networks (NN), often interchangeably referred to as Back Propagation Neural Networks (BPNNs), are popular for airflow predictions, optimising these models is challenging due to their ”black box” nature and complex network structures. This study explores alternative robust regression models, including decision-tree-based models (e.g., XGBoost, LightGBM, Random Forest) and Support Vector Regression (SVR), for predicting indoor airflow. Two BPNN structures were initially developed to evaluate feasibility of NN models. BPNN A was trained using airflow velocities from two inlets as input neurons to directly predict the airflow velocity distribution within the domain. BPNN B was trained additionally with spatial information, including space samples and boundary wall data. Higher-dimensional training structures of BPNN B were applied to decision tree-based models and SVR to assess their capability in predicting non-linear airflow patterns. Results indicated that BPNN A achieved the highest accuracy, while the inclusion of higher-dimensional data in BPNN B led to decreased accuracy. Among all decision-tree-based models, XGBoost demonstrated the greatest potential, achieving an R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> above 99.5% and predictive errors below 10%. XGBoost also outperformed both BPNN models in speed, being 15.78 times faster than BPNN A and 252 times faster than BPNN B. The interpretability of XGBoost was further explored by analysing feature importance, which helps identify the most influential input variables while predicting the airflow velocity. This analysis is expected to offer an enhanced understanding of boundary conditions leading to optimised indoor environment strategy.</div></div>","PeriodicalId":9273,"journal":{"name":"Building and Environment","volume":null,"pages":null},"PeriodicalIF":7.1000,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Building and Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360132324010151","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Fast and reliable prediction of indoor airflow distribution is critical for indoor environment control. While neural networks (NN), often interchangeably referred to as Back Propagation Neural Networks (BPNNs), are popular for airflow predictions, optimising these models is challenging due to their ”black box” nature and complex network structures. This study explores alternative robust regression models, including decision-tree-based models (e.g., XGBoost, LightGBM, Random Forest) and Support Vector Regression (SVR), for predicting indoor airflow. Two BPNN structures were initially developed to evaluate feasibility of NN models. BPNN A was trained using airflow velocities from two inlets as input neurons to directly predict the airflow velocity distribution within the domain. BPNN B was trained additionally with spatial information, including space samples and boundary wall data. Higher-dimensional training structures of BPNN B were applied to decision tree-based models and SVR to assess their capability in predicting non-linear airflow patterns. Results indicated that BPNN A achieved the highest accuracy, while the inclusion of higher-dimensional data in BPNN B led to decreased accuracy. Among all decision-tree-based models, XGBoost demonstrated the greatest potential, achieving an R above 99.5% and predictive errors below 10%. XGBoost also outperformed both BPNN models in speed, being 15.78 times faster than BPNN A and 252 times faster than BPNN B. The interpretability of XGBoost was further explored by analysing feature importance, which helps identify the most influential input variables while predicting the airflow velocity. This analysis is expected to offer an enhanced understanding of boundary conditions leading to optimised indoor environment strategy.
期刊介绍:
Building and Environment, an international journal, is dedicated to publishing original research papers, comprehensive review articles, editorials, and short communications in the fields of building science, urban physics, and human interaction with the indoor and outdoor built environment. The journal emphasizes innovative technologies and knowledge verified through measurement and analysis. It covers environmental performance across various spatial scales, from cities and communities to buildings and systems, fostering collaborative, multi-disciplinary research with broader significance.