{"title":"确定人工智能模型的输入变量,以预测韩国首尔的高PM2.5浓度事件","authors":"Sang-Heon Kim, Moon-Soo Park","doi":"10.1007/s13143-023-00333-5","DOIUrl":null,"url":null,"abstract":"<div><p>The concentration of particulate matter (PMs) is governed by complex processes such as long-range transport, vertical diffusion, and local emissions. Therefore, thus it is relatively difficult to accurately forecast high PM concentration events. As the application of artificial intelligence (AI) techniques to air-quality prediction has increased, optimal input variables for AI models have become critical. The purpose of this study was to suggest combined and synoptic variables, in addition to conventional surface meteorological and air quality variables, for AI-based high PM event prediction models. In Seoul and four cities in China, the observed surface meteorological and air quality data, upper air meteorological data, planetary boundary layer height, and temperature gradients between the surface and 850 hPa were tested. The east–west geopotential index (EWGI) and Korean Region Blocking Index (KRBI) have been suggested as regional-scale blocking indices. A concentration-wind (CW) variable was introduced to represent the effects of long-range transport from China. The usefulness of the suggested variables was tested using random forest (RF) and support vector machine (SVM) for 2017–2020. As the forecasting days progressed, the importance of surface variables decreased, whereas those of the EWGI, KRBI, CW, and stability variables increased. The stability variables increased the accuracy, probability of detection, and F1 scores, while decreasing the false alarm rate on the 3‒5 forecasting days. EWGI and KRBI improved the prediction performance after the third forecast day, and CW was important for predicting the 3‒4 forecast days. Newly introduced variables, such as EWGI, KRBI, CW, and stability tended to increase the 1‒4 day forecast hit rate for high PM<sub>2.5</sub> events and were found to be useful input data for machine learning or artificial intelligence-based air quality prediction models.</p></div>","PeriodicalId":8556,"journal":{"name":"Asia-Pacific Journal of Atmospheric Sciences","volume":"59 5","pages":"607 - 623"},"PeriodicalIF":2.2000,"publicationDate":"2023-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Determination of Input variables for Artificial Intelligence Models to predict the High PM2.5 concentration events in Seoul, Korea\",\"authors\":\"Sang-Heon Kim, Moon-Soo Park\",\"doi\":\"10.1007/s13143-023-00333-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The concentration of particulate matter (PMs) is governed by complex processes such as long-range transport, vertical diffusion, and local emissions. Therefore, thus it is relatively difficult to accurately forecast high PM concentration events. As the application of artificial intelligence (AI) techniques to air-quality prediction has increased, optimal input variables for AI models have become critical. The purpose of this study was to suggest combined and synoptic variables, in addition to conventional surface meteorological and air quality variables, for AI-based high PM event prediction models. In Seoul and four cities in China, the observed surface meteorological and air quality data, upper air meteorological data, planetary boundary layer height, and temperature gradients between the surface and 850 hPa were tested. The east–west geopotential index (EWGI) and Korean Region Blocking Index (KRBI) have been suggested as regional-scale blocking indices. A concentration-wind (CW) variable was introduced to represent the effects of long-range transport from China. The usefulness of the suggested variables was tested using random forest (RF) and support vector machine (SVM) for 2017–2020. As the forecasting days progressed, the importance of surface variables decreased, whereas those of the EWGI, KRBI, CW, and stability variables increased. The stability variables increased the accuracy, probability of detection, and F1 scores, while decreasing the false alarm rate on the 3‒5 forecasting days. EWGI and KRBI improved the prediction performance after the third forecast day, and CW was important for predicting the 3‒4 forecast days. Newly introduced variables, such as EWGI, KRBI, CW, and stability tended to increase the 1‒4 day forecast hit rate for high PM<sub>2.5</sub> events and were found to be useful input data for machine learning or artificial intelligence-based air quality prediction models.</p></div>\",\"PeriodicalId\":8556,\"journal\":{\"name\":\"Asia-Pacific Journal of Atmospheric Sciences\",\"volume\":\"59 5\",\"pages\":\"607 - 623\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2023-08-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Asia-Pacific Journal of Atmospheric Sciences\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s13143-023-00333-5\",\"RegionNum\":4,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"METEOROLOGY & ATMOSPHERIC SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asia-Pacific Journal of Atmospheric Sciences","FirstCategoryId":"89","ListUrlMain":"https://link.springer.com/article/10.1007/s13143-023-00333-5","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
Determination of Input variables for Artificial Intelligence Models to predict the High PM2.5 concentration events in Seoul, Korea
The concentration of particulate matter (PMs) is governed by complex processes such as long-range transport, vertical diffusion, and local emissions. Therefore, thus it is relatively difficult to accurately forecast high PM concentration events. As the application of artificial intelligence (AI) techniques to air-quality prediction has increased, optimal input variables for AI models have become critical. The purpose of this study was to suggest combined and synoptic variables, in addition to conventional surface meteorological and air quality variables, for AI-based high PM event prediction models. In Seoul and four cities in China, the observed surface meteorological and air quality data, upper air meteorological data, planetary boundary layer height, and temperature gradients between the surface and 850 hPa were tested. The east–west geopotential index (EWGI) and Korean Region Blocking Index (KRBI) have been suggested as regional-scale blocking indices. A concentration-wind (CW) variable was introduced to represent the effects of long-range transport from China. The usefulness of the suggested variables was tested using random forest (RF) and support vector machine (SVM) for 2017–2020. As the forecasting days progressed, the importance of surface variables decreased, whereas those of the EWGI, KRBI, CW, and stability variables increased. The stability variables increased the accuracy, probability of detection, and F1 scores, while decreasing the false alarm rate on the 3‒5 forecasting days. EWGI and KRBI improved the prediction performance after the third forecast day, and CW was important for predicting the 3‒4 forecast days. Newly introduced variables, such as EWGI, KRBI, CW, and stability tended to increase the 1‒4 day forecast hit rate for high PM2.5 events and were found to be useful input data for machine learning or artificial intelligence-based air quality prediction models.
期刊介绍:
The Asia-Pacific Journal of Atmospheric Sciences (APJAS) is an international journal of the Korean Meteorological Society (KMS), published fully in English. It has started from 2008 by succeeding the KMS'' former journal, the Journal of the Korean Meteorological Society (JKMS), which published a total of 47 volumes as of 2011, in its time-honored tradition since 1965. Since 2008, the APJAS is included in the journal list of Thomson Reuters’ SCIE (Science Citation Index Expanded) and also in SCOPUS, the Elsevier Bibliographic Database, indicating the increased awareness and quality of the journal.