A Comparison of Spatial and Nonspatial Methods in Statistical Modeling of
NO
2
: Prediction Accuracy, Uncertainty Quantification, and Model Interpretation
{"title":"A Comparison of Spatial and Nonspatial Methods in Statistical Modeling of \n \n \n \n \n NO\n \n \n 2\n \n \n \n : Prediction Accuracy, Uncertainty Quantification, and Model Interpretation","authors":"Meng Lu, Joaquin Cavieres, Paula Moraga","doi":"10.1111/gean.12356","DOIUrl":null,"url":null,"abstract":"<p><math>\n <semantics>\n <mrow>\n <msub>\n <mrow>\n <mtext>NO</mtext>\n </mrow>\n <mrow>\n <mn>2</mn>\n </mrow>\n </msub>\n </mrow>\n <annotation>$$ {\\mathrm{NO}}_2 $$</annotation>\n </semantics></math> is a traffic-related air pollutant. Ground <math>\n <semantics>\n <mrow>\n <msub>\n <mrow>\n <mtext>NO</mtext>\n </mrow>\n <mrow>\n <mn>2</mn>\n </mrow>\n </msub>\n </mrow>\n <annotation>$$ {\\mathrm{NO}}_2 $$</annotation>\n </semantics></math> monitoring stations measure <math>\n <semantics>\n <mrow>\n <msub>\n <mrow>\n <mtext>NO</mtext>\n </mrow>\n <mrow>\n <mn>2</mn>\n </mrow>\n </msub>\n </mrow>\n <annotation>$$ {\\mathrm{NO}}_2 $$</annotation>\n </semantics></math> concentrations at certain locations and statistical predictive methods have been developed to predict <math>\n <semantics>\n <mrow>\n <msub>\n <mrow>\n <mtext>NO</mtext>\n </mrow>\n <mrow>\n <mn>2</mn>\n </mrow>\n </msub>\n </mrow>\n <annotation>$$ {\\mathrm{NO}}_2 $$</annotation>\n </semantics></math> as a continuous surface. Among them, ensemble tree-based methods have shown to be powerful in capturing nonlinear relationships between <math>\n <semantics>\n <mrow>\n <msub>\n <mrow>\n <mtext>NO</mtext>\n </mrow>\n <mrow>\n <mn>2</mn>\n </mrow>\n </msub>\n </mrow>\n <annotation>$$ {\\mathrm{NO}}_2 $$</annotation>\n </semantics></math> measurements and geospatial predictors but it is unclear if the spatial structure of <math>\n <semantics>\n <mrow>\n <msub>\n <mrow>\n <mtext>NO</mtext>\n </mrow>\n <mrow>\n <mn>2</mn>\n </mrow>\n </msub>\n </mrow>\n <annotation>$$ {\\mathrm{NO}}_2 $$</annotation>\n </semantics></math> is also captured in the response-covariates relationships. We dive into the comparison between spatial and nonspatial data models accounting for prediction accuracy, model interpretation and uncertainty quantification. Moreover, we implement two new spatial and a nonspatial methods that have not been applied to air pollution mapping. We implemented our study using national ground station measurements of <math>\n <semantics>\n <mrow>\n <msub>\n <mrow>\n <mtext>NO</mtext>\n </mrow>\n <mrow>\n <mn>2</mn>\n </mrow>\n </msub>\n </mrow>\n <annotation>$$ {\\mathrm{NO}}_2 $$</annotation>\n </semantics></math> in Germany and the Netherlands of 2017. Our results indicate heterogeneous levels of importance of modeling the spatial process in different areas. The prediction intervals predicted with ensemble tree-based methods are more satisfactory than the geostatistical methods. The two new methods implemented each obtained better prediction accuracy compared to the original ensemble tree-based and stacking methods. The probabilistic distribution of the spatial random field estimated by the geostatistical methods could provide useful information for analyzing emission sources and the spatial process of observations.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 4","pages":"703-727"},"PeriodicalIF":3.3000,"publicationDate":"2023-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/gean.12356","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geographical Analysis","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/gean.12356","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOGRAPHY","Score":null,"Total":0}
引用次数: 2
Abstract
is a traffic-related air pollutant. Ground monitoring stations measure concentrations at certain locations and statistical predictive methods have been developed to predict as a continuous surface. Among them, ensemble tree-based methods have shown to be powerful in capturing nonlinear relationships between measurements and geospatial predictors but it is unclear if the spatial structure of is also captured in the response-covariates relationships. We dive into the comparison between spatial and nonspatial data models accounting for prediction accuracy, model interpretation and uncertainty quantification. Moreover, we implement two new spatial and a nonspatial methods that have not been applied to air pollution mapping. We implemented our study using national ground station measurements of in Germany and the Netherlands of 2017. Our results indicate heterogeneous levels of importance of modeling the spatial process in different areas. The prediction intervals predicted with ensemble tree-based methods are more satisfactory than the geostatistical methods. The two new methods implemented each obtained better prediction accuracy compared to the original ensemble tree-based and stacking methods. The probabilistic distribution of the spatial random field estimated by the geostatistical methods could provide useful information for analyzing emission sources and the spatial process of observations.
期刊介绍:
First in its specialty area and one of the most frequently cited publications in geography, Geographical Analysis has, since 1969, presented significant advances in geographical theory, model building, and quantitative methods to geographers and scholars in a wide spectrum of related fields. Traditionally, mathematical and nonmathematical articulations of geographical theory, and statements and discussions of the analytic paradigm are published in the journal. Spatial data analyses and spatial econometrics and statistics are strongly represented.