基于叠加的集合模型预测PM2.5浓度

IF 4.6 2区数学 Q1 MATHEMATICS, APPLIED Applied and Computational Mathematics Pub Date : 2021-01-01 DOI:10.11648/j.acm.20211006.14

Haoyuan Zhang, Yilun Jin, Jiaxuan Shi, Shuai Zhang

{"title":"基于叠加的集合模型预测PM2.5浓度","authors":"Haoyuan Zhang, Yilun Jin, Jiaxuan Shi, Shuai Zhang","doi":"10.11648/j.acm.20211006.14","DOIUrl":null,"url":null,"abstract":": With the increasingly serious air pollution problem, PM2.5 concentration, as an effective indicator to evaluate air quality, has attracted extensive attention from all sectors of society. Accurate prediction of PM2.5 concentrations is of great significance in providing the public with early air pollution warning information to protect public health. With a decade of development, artificial intelligence technology has given birth to various prediction models with high-performance, in particular, brought new impetus to the prediction of PM2.5 concentrations. In this study, a stacking-based ensemble model with self-adaptive hyper-parameter optimization is proposed to solve the PM2.5 concentrations prediction problem. First, the raw data are preprocessed with the normalization method to reduce the influence of the different orders of magnitude of input variables on model performance. Second, the Bayesian optimization method is used to optimize the hyper-parameters of the base predictors to improve their performance. Finally, a stacking ensemble method is applied to integrate the optimized base predictors into an ensemble model for final prediction. In the experiments, two datasets from the air quality stations in different areas are tested with four metrics to evaluate the performance of the proposed model in PM2.5 concentration prediction. The experimental results show that the proposed model outperforms other baseline models in solving the PM2.5 concentrations prediction problem.","PeriodicalId":55503,"journal":{"name":"Applied and Computational Mathematics","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting PM2.5 Concentrations Using Stacking-based Ensemble Model\",\"authors\":\"Haoyuan Zhang, Yilun Jin, Jiaxuan Shi, Shuai Zhang\",\"doi\":\"10.11648/j.acm.20211006.14\",\"DOIUrl\":null,\"url\":null,\"abstract\":\": With the increasingly serious air pollution problem, PM2.5 concentration, as an effective indicator to evaluate air quality, has attracted extensive attention from all sectors of society. Accurate prediction of PM2.5 concentrations is of great significance in providing the public with early air pollution warning information to protect public health. With a decade of development, artificial intelligence technology has given birth to various prediction models with high-performance, in particular, brought new impetus to the prediction of PM2.5 concentrations. In this study, a stacking-based ensemble model with self-adaptive hyper-parameter optimization is proposed to solve the PM2.5 concentrations prediction problem. First, the raw data are preprocessed with the normalization method to reduce the influence of the different orders of magnitude of input variables on model performance. Second, the Bayesian optimization method is used to optimize the hyper-parameters of the base predictors to improve their performance. Finally, a stacking ensemble method is applied to integrate the optimized base predictors into an ensemble model for final prediction. In the experiments, two datasets from the air quality stations in different areas are tested with four metrics to evaluate the performance of the proposed model in PM2.5 concentration prediction. The experimental results show that the proposed model outperforms other baseline models in solving the PM2.5 concentrations prediction problem.\",\"PeriodicalId\":55503,\"journal\":{\"name\":\"Applied and Computational Mathematics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2021-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied and Computational Mathematics\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.11648/j.acm.20211006.14\",\"RegionNum\":2,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MATHEMATICS, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied and Computational Mathematics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.11648/j.acm.20211006.14","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}

引用次数: 0

摘要

随着大气污染问题的日益严重，PM2.5浓度作为评价空气质量的有效指标，受到了社会各界的广泛关注。准确预测PM2.5浓度对于为公众提供早期空气污染预警信息，保护公众健康具有重要意义。经过十年的发展，人工智能技术催生了各种高性能的预测模型，特别是为PM2.5浓度的预测带来了新的动力。本文提出了一种基于自适应超参数优化的叠加集成模型来解决PM2.5浓度预测问题。首先，对原始数据进行归一化预处理，降低输入变量不同数量级对模型性能的影响。其次，采用贝叶斯优化方法对基预测器的超参数进行优化，提高基预测器的性能;最后，采用叠加集成方法将优化后的基本预测量集成到集成模型中进行最终预测。在实验中，采用来自不同地区空气质量监测站的两个数据集，用四个指标来评估所提出的模型在PM2.5浓度预测中的性能。实验结果表明，该模型在解决PM2.5浓度预测问题上优于其他基线模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Predicting PM2.5 Concentrations Using Stacking-based Ensemble Model

: With the increasingly serious air pollution problem, PM2.5 concentration, as an effective indicator to evaluate air quality, has attracted extensive attention from all sectors of society. Accurate prediction of PM2.5 concentrations is of great significance in providing the public with early air pollution warning information to protect public health. With a decade of development, artificial intelligence technology has given birth to various prediction models with high-performance, in particular, brought new impetus to the prediction of PM2.5 concentrations. In this study, a stacking-based ensemble model with self-adaptive hyper-parameter optimization is proposed to solve the PM2.5 concentrations prediction problem. First, the raw data are preprocessed with the normalization method to reduce the influence of the different orders of magnitude of input variables on model performance. Second, the Bayesian optimization method is used to optimize the hyper-parameters of the base predictors to improve their performance. Finally, a stacking ensemble method is applied to integrate the optimized base predictors into an ensemble model for final prediction. In the experiments, two datasets from the air quality stations in different areas are tested with four metrics to evaluate the performance of the proposed model in PM2.5 concentration prediction. The experimental results show that the proposed model outperforms other baseline models in solving the PM2.5 concentrations prediction problem.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Applied and Computational Mathematics 数学-应用数学

CiteScore

8.80

自引率

5.00%

发文量

审稿时长

6 months

期刊介绍： Applied and Computational Mathematics (ISSN Online: 2328-5613, ISSN Print: 2328-5605) is a prestigious journal that focuses on the field of applied and computational mathematics. It is driven by the computational revolution and places a strong emphasis on innovative applied mathematics with potential for real-world applicability and practicality. The journal caters to a broad audience of applied mathematicians and scientists who are interested in the advancement of mathematical principles and practical aspects of computational mathematics. Researchers from various disciplines can benefit from the diverse range of topics covered in ACM. To ensure the publication of high-quality content, all research articles undergo a rigorous peer review process. This process includes an initial screening by the editors and anonymous evaluation by expert reviewers. This guarantees that only the most valuable and accurate research is published in ACM.