Xu Wang, Kai Zhang, Peishan Han, Meijia Wang, Xianjun Li, Yaqiong Zhang, Qiong Pan
{"title":"Application of gene expression programing in predicting the concentration of PM2.5 and PM10 in Xi’an, China: a preliminary study","authors":"Xu Wang, Kai Zhang, Peishan Han, Meijia Wang, Xianjun Li, Yaqiong Zhang, Qiong Pan","doi":"10.3389/fenvs.2024.1416765","DOIUrl":null,"url":null,"abstract":"Introduction: Traditional statistical methods cannot find quantitative relationship from environmental data.Methods: We selected gene expression programming (GEP) to study the relationship between pollutant gas and PM<jats:sub>2.5</jats:sub> (PM<jats:sub>10</jats:sub>). They were used to construct the relationship between pollutant gas and PM<jats:sub>2.5</jats:sub> (PM<jats:sub>10</jats:sub>) with environmental monitoring data of Xi’an, China. GEP could construct a formula to express the relationship between pollutant gas and PM<jats:sub>2.5</jats:sub> (PM<jats:sub>10</jats:sub>), which is more explainable. Back Propagation neural networks (BPNN) was used as the baseline method. Relevant data from January 1st 2021 to April 26th 2021 were used to train and validate the performance of the models from GEP and BPNN.Results: After the models of GEP and BPNN constructed, coefficient of determination and RMSE (Root Mean Squared Error) are used to evaluate the fitting degree and measure the effect power of pollutant gas on PM<jats:sub>2.5</jats:sub> (PM<jats:sub>10</jats:sub>). GEP achieved RMSE of [8.7365–14.6438] for PM<jats:sub>2.5</jats:sub>; RMSE of [13.2739–45.8769] for PM<jats:sub>10</jats:sub>, and BP neural networks achieved average RMSE of [13.8741–34.7682] for PM<jats:sub>2.5</jats:sub>; RMSE of [29.7327–52.8653] for PM<jats:sub>10</jats:sub>. Additionally, experimental results show that the influence power of pollutant gas on PM<jats:sub>2.5</jats:sub> (PM<jats:sub>10</jats:sub>) situates between −0.0704 and 0.6359 (between −0.3231 and 0.2242), and the formulas are obtained with GEP so that further analysis become possible. Then linear regression was employed to study which pollutant gas is more relevant to PM<jats:sub>2.5</jats:sub> (PM<jats:sub>10</jats:sub>), the result demonstrates CO (SO<jats:sub>2</jats:sub>, NO<jats:sub>2</jats:sub>) are more related to PM<jats:sub>2.5</jats:sub> (PM<jats:sub>10</jats:sub>).Discussion: The formulas produced by GEP can also provide a direct relationship between pollutant gas and PM<jats:sub>2.5</jats:sub> (PM<jats:sub>10</jats:sub>). Besides, GEP could model the trend of PM<jats:sub>2.5</jats:sub> and PM<jats:sub>10</jats:sub> (increase and decrease). All results show that GEP can be applied smoothly in environmental modelling.","PeriodicalId":12460,"journal":{"name":"Frontiers in Environmental Science","volume":"76 1","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Environmental Science","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.3389/fenvs.2024.1416765","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Traditional statistical methods cannot find quantitative relationship from environmental data.Methods: We selected gene expression programming (GEP) to study the relationship between pollutant gas and PM2.5 (PM10). They were used to construct the relationship between pollutant gas and PM2.5 (PM10) with environmental monitoring data of Xi’an, China. GEP could construct a formula to express the relationship between pollutant gas and PM2.5 (PM10), which is more explainable. Back Propagation neural networks (BPNN) was used as the baseline method. Relevant data from January 1st 2021 to April 26th 2021 were used to train and validate the performance of the models from GEP and BPNN.Results: After the models of GEP and BPNN constructed, coefficient of determination and RMSE (Root Mean Squared Error) are used to evaluate the fitting degree and measure the effect power of pollutant gas on PM2.5 (PM10). GEP achieved RMSE of [8.7365–14.6438] for PM2.5; RMSE of [13.2739–45.8769] for PM10, and BP neural networks achieved average RMSE of [13.8741–34.7682] for PM2.5; RMSE of [29.7327–52.8653] for PM10. Additionally, experimental results show that the influence power of pollutant gas on PM2.5 (PM10) situates between −0.0704 and 0.6359 (between −0.3231 and 0.2242), and the formulas are obtained with GEP so that further analysis become possible. Then linear regression was employed to study which pollutant gas is more relevant to PM2.5 (PM10), the result demonstrates CO (SO2, NO2) are more related to PM2.5 (PM10).Discussion: The formulas produced by GEP can also provide a direct relationship between pollutant gas and PM2.5 (PM10). Besides, GEP could model the trend of PM2.5 and PM10 (increase and decrease). All results show that GEP can be applied smoothly in environmental modelling.
期刊介绍:
Our natural world is experiencing a state of rapid change unprecedented in the presence of humans. The changes affect virtually all physical, chemical and biological systems on Earth. The interaction of these systems leads to tipping points, feedbacks and amplification of effects. In virtually all cases, the causes of environmental change can be traced to human activity through either direct interventions as a consequence of pollution, or through global warming from greenhouse case emissions. Well-formulated and internationally-relevant policies to mitigate the change, or adapt to the consequences, that will ensure our ability to thrive in the coming decades are badly needed. Without proper understanding of the processes involved, and deep understanding of the likely impacts of bad decisions or inaction, the security of food, water and energy is a risk. Left unchecked shortages of these basic commodities will lead to migration, global geopolitical tension and conflict. This represents the major challenge of our time. We are the first generation to appreciate the problem and we will be judged in future by our ability to determine and take the action necessary. Appropriate knowledge of the condition of our natural world, appreciation of the changes occurring, and predictions of how the future will develop are requisite to the definition and implementation of solutions.
Frontiers in Environmental Science publishes research at the cutting edge of knowledge of our natural world and its various intersections with society. It bridges between the identification and measurement of change, comprehension of the processes responsible, and the measures needed to reduce their impact. Its aim is to assist the formulation of policies, by offering sound scientific evidence on environmental science, that will lead to a more inhabitable and sustainable world for the generations to come.