首页 > 最新文献

Geographical Analysis最新文献

英文 中文
Comparison of Moran's I and Geary's c in Multivariate Spatial Pattern Analysis 多元空间格局分析中Moran’s I与Geary’s c的比较
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-11-25 DOI: 10.1111/gean.12355
Jie Lin

This article compares multivariate spatial analysis methods that include not only multivariate covariance, but also spatial dependence of the data explicitly and simultaneously in model design by extending two univariate autocorrelation measures, namely Moran's I and Geary's c. The results derived from the simulation datasets indicate that the standard Moran component analysis is preferable to Geary component analysis as a tool for summarizing multivariate spatial structures. However, the generalized Geary principal component analysis developed in this study by adding variance into the optimization criterion and solved as a trace ratio optimization problem performs as well as, if not better than its counterpart the Moran principal component analysis does. With respect to the sensitivity in detecting subtle spatial structures, the choice of the appropriate tool is dependent on the correlation and variance of the spatial multivariate data. Finally, the four techniques are applied to the Social Determinants of Health dataset to analyze its multivariate spatial pattern. The two generalized methods detect more urban areas and higher autocorrelation structures than the other two standard methods, and provide more obvious contrast between urban and rural areas due to the large variance of the spatial component.

本文通过扩展Moran’s I和Geary’s c这两个单变量自相关测度,比较了多元空间分析方法在模型设计中不仅包含多元协方差,而且明确地同时包含数据的空间依赖性。模拟数据集的结果表明,标准Moran分量分析比Geary分量分析更适合作为总结多元空间结构的工具。然而,本研究提出的广义Geary主成分分析将方差加入到优化准则中,并将其作为一个迹比优化问题来解决,即使不优于其对应的Moran主成分分析,也表现得很好。在检测细微空间结构的灵敏度方面,适当工具的选择取决于空间多变量数据的相关性和方差。最后,将这四种技术应用于健康的社会决定因素数据集,分析其多元空间格局。与其他两种标准方法相比,两种广义方法能够检测到更多的城市区域和更高的自相关结构,并且由于空间分量的差异较大,提供了更明显的城乡对比。
{"title":"Comparison of Moran's I and Geary's c in Multivariate Spatial Pattern Analysis","authors":"Jie Lin","doi":"10.1111/gean.12355","DOIUrl":"10.1111/gean.12355","url":null,"abstract":"<p>This article compares multivariate spatial analysis methods that include not only multivariate covariance, but also spatial dependence of the data explicitly and simultaneously in model design by extending two univariate autocorrelation measures, namely Moran's <i>I</i> and Geary's <i>c</i>. The results derived from the simulation datasets indicate that the standard Moran component analysis is preferable to Geary component analysis as a tool for summarizing multivariate spatial structures. However, the generalized Geary principal component analysis developed in this study by adding variance into the optimization criterion and solved as a trace ratio optimization problem performs as well as, if not better than its counterpart the Moran principal component analysis does. With respect to the sensitivity in detecting subtle spatial structures, the choice of the appropriate tool is dependent on the correlation and variance of the spatial multivariate data. Finally, the four techniques are applied to the Social Determinants of Health dataset to analyze its multivariate spatial pattern. The two generalized methods detect more urban areas and higher autocorrelation structures than the other two standard methods, and provide more obvious contrast between urban and rural areas due to the large variance of the spatial component.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 4","pages":"685-702"},"PeriodicalIF":3.6,"publicationDate":"2022-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46219353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Testing Transferability: Quantitative Evaluation of Labor Market Area Definition Methods in Three Contrasting Countries 可转移性检验:三个对比国家劳动力市场区域界定方法的定量评价
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-11-10 DOI: 10.1111/gean.12353
José Manuel Casado-Díaz, Mike Coombes, Lucas Martínez-Bernabéu

Sub-national economic policies increasingly use labor market areas (LMAs) rather than administrative areas for analysis and implementation. How a set of LMAs was defined influences the results of such analyses, and so accurate policy delivery needs appropriately defined LMAs. Multinational bodies need comparable LMA definitions in many countries, calling for a definition method that is transferable across national boundaries. This article applies quantitative metrics to evaluate LMAs defined in three contrasting countries by three methods that represent the main methodological approaches. The deductive approach—based on a center and hinterland—is too inflexible to deal with differing geographical circumstances and cannot cope with statistical zones that are very small, or do not respect settlement structure. The alternative inductive methods tested define appropriate LMAs in each country, with the newer method performing slightly better in statistical terms. The article also exemplifies the usefulness of the metrics for comparisons of alternative regionalizations.

次国家经济政策越来越多地使用劳动力市场区域而不是行政区域进行分析和实施。如何定义一组LMA会影响此类分析的结果,因此准确的政策交付需要适当定义LMA。跨国机构需要在许多国家进行类似的LMA定义,要求采用可跨国界转移的定义方法。本文应用定量指标,通过代表主要方法论方法的三种方法,对三个对比国家的LMA进行评估。基于中心和腹地的演绎方法过于灵活,无法处理不同的地理环境,也无法处理非常小或不尊重定居点结构的统计区域。替代归纳法在每个国家测试确定了合适的LMA,新方法在统计方面表现稍好。这篇文章还举例说明了这些指标对替代区域化比较的有用性。
{"title":"Testing Transferability: Quantitative Evaluation of Labor Market Area Definition Methods in Three Contrasting Countries","authors":"José Manuel Casado-Díaz,&nbsp;Mike Coombes,&nbsp;Lucas Martínez-Bernabéu","doi":"10.1111/gean.12353","DOIUrl":"10.1111/gean.12353","url":null,"abstract":"<p>Sub-national economic policies increasingly use labor market areas (LMAs) rather than administrative areas for analysis and implementation. How a set of LMAs was defined influences the results of such analyses, and so accurate policy delivery needs appropriately defined LMAs. Multinational bodies need comparable LMA definitions in many countries, calling for a definition method that is transferable across national boundaries. This article applies quantitative metrics to evaluate LMAs defined in three contrasting countries by three methods that represent the main methodological approaches. The deductive approach—based on a center and hinterland—is too inflexible to deal with differing geographical circumstances and cannot cope with statistical zones that are very small, or do not respect settlement structure. The alternative inductive methods tested define appropriate LMAs in each country, with the newer method performing slightly better in statistical terms. The article also exemplifies the usefulness of the metrics for comparisons of alternative regionalizations.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 4","pages":"517-534"},"PeriodicalIF":3.6,"publicationDate":"2022-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/gean.12353","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49391392","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Rejoinder to the Commentaries on “A Route Map for Successful Applications of Geographically Weighted Regression” by Comber et al. (2022) 对Comber等人(2022)关于“地理加权回归成功应用路线图”评论的回复
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-11-05 DOI: 10.1111/gean.12352
Alexis Comber, Paul Harris, Chris Brunsdon

We are delighted that the RM paper has stimulated three coherent but diverse Commentaries from leading thinkers in this field (Fotheringham, 2022; Oshan, 2022; Wolf, 2022). Each of these contains robust critiques of the proposed RM and suggest alternative but diverse sets of considerations. We consider each of these in turn and provide a rejoinder by way of response.

We thank the authors of these commentaries for their efforts, and for taking the time to consider our article in detail. In general, we are pleased to see these—part of our motivation here was to initiate discussion on approaches to modeling spatial non-stationarity in regression models. By setting out one way to proceed through our RM, we intended to make an opening move. One thing we observe from these responses is that there is perhaps a spectrum for motivation for using these kind of models—at one end, an approach that is strongly motivated by underlying theories, and at the other, a more exploratory approach. One also has to consider the idea of data analysis as compromise—the reality of modern data collection is frequently that of “big data” where datasets are large, but quality and suitability assurance are not to the standards achieved by carefully designed surveys or experiments. In many cases, geographical fluctuations in models may be a consequence of this, and spatially varying coefficient methods may act as “spatial detectives” by shedding light on spatial inconsistencies and biases in the data collection, rather than direct measurements of a true underlying process. This suggests the need for a kind of “deep inference” where processes under investigation and the process of data collection are considered in equal measure, requiring consideration of underlying process theories, in addition to issues relating to the act of data exploration—perhaps suggesting that the spectrum referred to earlier is something to be scanned, rather than choosing a specific viewpoint from which to carry out analysis.

As we stated earlier, the approach outlined in the GWR RM by Comber et al. (2022a) is not intended to be a strict set of immutable rules, but more of an exemplar of what could be done to respond to a specific research context, and acknowledging that a degree of ‘fuzziness’ in modeling strategies is inevitable. The replies to our article have been useful in considering potential alternative research contexts, and how they may interact with this kind of fuzziness. We look forward to the debate advancing.

我们很高兴RM论文激发了该领域领先思想家的三个连贯但不同的评论(Fotheringham, 2022;Oshan, 2022;狼,2022)。其中每一个都包含了对提议的RM的强有力的批评,并提出了不同的考虑因素。我们依次考虑这些问题,并以回应的方式提出反驳。我们感谢这些评论的作者的努力,并感谢他们花时间详细考虑我们的文章。总的来说,我们很高兴看到这些——我们在这里的部分动机是开始讨论回归模型中空间非平稳性建模的方法。通过设定一种方式来通过我们的RM,我们打算做一个开局。我们从这些回应中观察到的一件事是,使用这些模型的动机可能是有一个范围的——一端是一种受到潜在理论强烈推动的方法,另一端是一种更具探索性的方法。人们还必须考虑到数据分析的想法是一种妥协——现代数据收集的现实往往是“大数据”,其中数据集很大,但质量和适用性保证不能达到精心设计的调查或实验所达到的标准。在许多情况下,模型的地理波动可能是这种情况的结果,空间变化系数方法可以作为"空间侦探",揭示数据收集中的空间不一致和偏差,而不是直接测量真正的基本过程。这表明需要一种“深度推理”,即在调查过程和数据收集过程中同等考虑,除了与数据探索行为相关的问题外,还需要考虑潜在的过程理论——也许这表明前面提到的频谱是需要扫描的东西,而不是选择一个特定的观点来进行分析。正如我们之前所述,Comber等人(2022a)在GWR RM中概述的方法并不是一套严格的不可变规则,而是更多的是针对特定研究背景可以采取的措施的范例,并承认建模策略中的一定程度的“模糊性”是不可避免的。对我们文章的回复在考虑潜在的替代研究背景以及它们如何与这种模糊性相互作用方面非常有用。我们期待着辩论继续进行。
{"title":"A Rejoinder to the Commentaries on “A Route Map for Successful Applications of Geographically Weighted Regression” by Comber et al. (2022)","authors":"Alexis\u0000 Comber,&nbsp;Paul Harris,&nbsp;Chris Brunsdon","doi":"10.1111/gean.12352","DOIUrl":"10.1111/gean.12352","url":null,"abstract":"<p>We are delighted that the RM paper has stimulated three coherent but diverse Commentaries from leading thinkers in this field (Fotheringham, <span>2022</span>; Oshan, <span>2022</span>; Wolf, <span>2022</span>). Each of these contains robust critiques of the proposed RM and suggest alternative but diverse sets of considerations. We consider each of these in turn and provide a rejoinder by way of response.</p><p>We thank the authors of these commentaries for their efforts, and for taking the time to consider our article in detail. In general, we are pleased to see these—part of our motivation here was to initiate discussion on approaches to modeling spatial non-stationarity in regression models. By setting out one way to proceed through our RM, we intended to make an opening move. One thing we observe from these responses is that there is perhaps a spectrum for motivation for using these kind of models—at one end, an approach that is strongly motivated by underlying theories, and at the other, a more exploratory approach. One also has to consider the idea of data analysis as compromise—the reality of modern data collection is frequently that of “big data” where datasets are large, but quality and suitability assurance are not to the standards achieved by carefully designed surveys or experiments. In many cases, geographical fluctuations in models may be a consequence of this, and spatially varying coefficient methods may act as “spatial detectives” by shedding light on spatial inconsistencies and biases in the data collection, rather than direct measurements of a true underlying process. This suggests the need for a kind of “deep inference” where processes under investigation <i>and</i> the process of data collection are considered in equal measure, requiring consideration of underlying process theories, in addition to issues relating to the act of data exploration—perhaps suggesting that the spectrum referred to earlier is something to be scanned, rather than choosing a specific viewpoint from which to carry out analysis.</p><p>As we stated earlier, the approach outlined in the GWR RM by Comber et al. (<span>2022a</span>) is not intended to be a strict set of immutable rules, but more of an exemplar of what could be done to respond to a specific research context, and acknowledging that a degree of ‘fuzziness’ in modeling strategies is inevitable. The replies to our article have been useful in considering potential alternative research contexts, and how they may interact with this kind of fuzziness. We look forward to the debate advancing.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 1","pages":"198-202"},"PeriodicalIF":3.6,"publicationDate":"2022-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/gean.12352","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41833176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Simulation Study to Explore Inference about Global Moran's I with Random Spatial Indexes 基于随机空间指数的全球Moran’s I推理模拟研究
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-10-17 DOI: 10.1111/gean.12349
René Westerholt

Inference procedures for spatial autocorrelation statistics assume that the underlying configurations of spatial units are fixed. However, sometimes this assumption can be disadvantageous, for example, when analyzing social media posts or moving objects. This article examines for the case of point geometries how a change from fixed to random spatial indexes affects inferences about global Moran's I, a popular spatial autocorrelation measure. Homogeneous and inhomogeneous Matérn and Thomas cluster processes are studied and for each of these processes, 10,000 random point patterns are simulated for investigating three aspects that are key in an inferential context: the null distributions of I when the underlying geometries are varied; the effect of the latter on critical values used to reject null hypotheses; and how the presence of point processes affects the statistical power of Moran's I. The results show that point processes affect all three characteristics. Inferences about spatial structure in relevant application contexts may therefore be different from conventional inferences when this additional source of randomness is taken into account.

空间自相关统计的推理程序假设空间单元的底层配置是固定的。然而,有时这种假设可能是不利的,例如,在分析社交媒体帖子或移动物体时。本文研究了点几何的情况下,从固定到随机的空间索引的变化如何影响关于全局Moran's I的推断,这是一种流行的空间自相关度量。研究了齐次和非齐次mat和托马斯聚类过程,并对这些过程中的每一个进行了模拟,模拟了10,000个随机点模式,以调查在推理环境中关键的三个方面:当底层几何形状变化时I的零分布;后者对用于拒绝零假设的临界值的影响;以及点过程的存在如何影响莫兰i的统计能力。结果表明,点过程影响所有三个特征。因此,当考虑到这种额外的随机性来源时,有关相关应用环境中空间结构的推断可能与常规推断不同。
{"title":"A Simulation Study to Explore Inference about Global Moran's I with Random Spatial Indexes","authors":"René Westerholt","doi":"10.1111/gean.12349","DOIUrl":"10.1111/gean.12349","url":null,"abstract":"<p>Inference procedures for spatial autocorrelation statistics assume that the underlying configurations of spatial units are fixed. However, sometimes this assumption can be disadvantageous, for example, when analyzing social media posts or moving objects. This article examines for the case of point geometries how a change from fixed to random spatial indexes affects inferences about global Moran's I, a popular spatial autocorrelation measure. Homogeneous and inhomogeneous Matérn and Thomas cluster processes are studied and for each of these processes, 10,000 random point patterns are simulated for investigating three aspects that are key in an inferential context: the null distributions of I when the underlying geometries are varied; the effect of the latter on critical values used to reject null hypotheses; and how the presence of point processes affects the statistical power of Moran's I. The results show that point processes affect all three characteristics. Inferences about spatial structure in relevant application contexts may therefore be different from conventional inferences when this additional source of randomness is taken into account.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 4","pages":"621-650"},"PeriodicalIF":3.6,"publicationDate":"2022-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/gean.12349","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45247627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A Hybrid Approach for Mass Valuation of Residential Properties through Geographic Information Systems and Machine Learning Integration 基于地理信息系统和机器学习集成的住宅物业大规模估值混合方法
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-10-14 DOI: 10.1111/gean.12350
Muhammed Oguzhan Mete, Tahsin Yomralioglu

Geographic Information Systems (GIS) and Machine Learning methods are now widely used in mass property valuation using the physical attributes of properties. However, locational criteria, such as as proximity to important places, sea or forest views, flat topography are just some of the spatial factors that affect property values and, to date, these have been insufficiently used as part of the valuation process. In this study, a hybrid approach is developed by integrating GIS and Machine Learning for mass valuation of residential properties. GIS-based Nominal Valuation Method was applied to carry out proximity, terrain, and visibility analyses using Ordnance Survey and OpenStreetMap data, than land value map of Great Britain was produced. Spatial criteria scores obtained from the GIS analyses were included in the price prediction process in which global and spatially clustered local regression models are built for England and Wales using Price Paid Data and Energy Performance Certificates data. Results showed that adding locational factors to the property price data and applying a novel nominally weighted spatial clustering algorithm for creating a local regression increased the prediction accuracy by about 45%. It also demonstrated that Random Forest was the most accurate ensemble model.

地理信息系统(GIS)和机器学习方法现在广泛用于利用财产的物理属性进行大规模财产评估。然而,位置标准,如靠近重要地点、海景或森林景观、平坦地形等只是影响财产价值的一些空间因素,迄今为止,这些因素在估价过程中没有得到充分利用。在本研究中,通过集成GIS和机器学习开发了一种混合方法,用于住宅物业的大规模估值。利用地形测量和OpenStreetMap数据,采用基于gis的标称估价方法对英国土地进行接近性、地形和可见度分析,绘制了英国土地价值图。从GIS分析中获得的空间标准分数被包括在价格预测过程中,其中使用价格支付数据和能源绩效证书数据为英格兰和威尔士建立了全球和空间聚类的局部回归模型。结果表明,在房价数据中加入位置因素,并应用一种新的名义加权空间聚类算法创建局部回归,预测精度提高了约45%。它还证明了随机森林是最准确的集合模型。
{"title":"A Hybrid Approach for Mass Valuation of Residential Properties through Geographic Information Systems and Machine Learning Integration","authors":"Muhammed Oguzhan Mete,&nbsp;Tahsin Yomralioglu","doi":"10.1111/gean.12350","DOIUrl":"10.1111/gean.12350","url":null,"abstract":"<p>Geographic Information Systems (GIS) and Machine Learning methods are now widely used in mass property valuation using the physical attributes of properties. However, locational criteria, such as as proximity to important places, sea or forest views, flat topography are just some of the spatial factors that affect property values and, to date, these have been insufficiently used as part of the valuation process. In this study, a hybrid approach is developed by integrating GIS and Machine Learning for mass valuation of residential properties. GIS-based Nominal Valuation Method was applied to carry out proximity, terrain, and visibility analyses using Ordnance Survey and OpenStreetMap data, than land value map of Great Britain was produced. Spatial criteria scores obtained from the GIS analyses were included in the price prediction process in which global and spatially clustered local regression models are built for England and Wales using Price Paid Data and Energy Performance Certificates data. Results showed that adding locational factors to the property price data and applying a novel nominally weighted spatial clustering algorithm for creating a local regression increased the prediction accuracy by about 45%. It also demonstrated that Random Forest was the most accurate ensemble model.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 4","pages":"535-559"},"PeriodicalIF":3.6,"publicationDate":"2022-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43311551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Three Common Machine Learning Algorithms Neither Enhance Prediction Accuracy Nor Reduce Spatial Autocorrelation in Residuals: An Analysis of Twenty-five Socioeconomic Data Sets 三种常见的机器学习算法既不能提高预测精度,也不能降低残差的空间自相关:对25个社会经济数据集的分析
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-10-13 DOI: 10.1111/gean.12351
Insang Song, Daehyun Kim

Machine learning (ML) is being applied in an increasing volume of geographical research. However, the aspects of spatial autocorrelation (SAC) in the residuals produced by ML models have been understudied compared to the benefit of ML, namely, reduction of prediction errors. In this study, we examined the relationship between predictive accuracy and the reduction in the residual SAC for 597 variables from 25 geographical socio-economic data sets using spatial and nonspatial cross-validation of three ML algorithms such as random forests, support vector machine, and artificial neural network (ANN) to provide an extensive empirical diagnosis—but not a definitive theory—of the relationship between SAC and ML. Our results highlighted that the ML algorithms with tuned hyperparameters yielded marginal predictive accuracy gains and the minimal decreases in residual SAC. ANN revealed lower accuracy and higher reduction in the residual SAC than others. This implies ML algorithms in geographical research in socio-economic domains would not always result in higher prediction accuracy. We suggest that ML in geographical research should be cautiously employed when the main objective is related to the residual SAC. We also showed that spatial cross-validation neither improves predictive accuracy substantially nor reduce the residual SAC effectively.

机器学习(ML)正在越来越多的地理研究中得到应用。然而,与机器学习的优势(即减少预测误差)相比,机器学习模型产生的残差中的空间自相关(SAC)方面的研究还不够充分。在这项研究中,我们使用三种机器学习算法(随机森林,支持向量机,和人工神经网络(ANN)提供了广泛的经验诊断-但不是确定的理论- SAC和ML之间的关系。我们的结果强调,具有调谐超参数的ML算法产生了边际预测精度增益和残余SAC的最小减少。与其他方法相比,人工神经网络显示出较低的准确率和较高的残余SAC减少率。这意味着在社会经济领域的地理研究中的ML算法并不总是导致更高的预测精度。我们建议在地理研究中,当主要目标与剩余SAC相关时,应谨慎使用ML。我们还发现,空间交叉验证既不能显著提高预测精度,也不能有效降低剩余SAC。
{"title":"Three Common Machine Learning Algorithms Neither Enhance Prediction Accuracy Nor Reduce Spatial Autocorrelation in Residuals: An Analysis of Twenty-five Socioeconomic Data Sets","authors":"Insang Song,&nbsp;Daehyun Kim","doi":"10.1111/gean.12351","DOIUrl":"10.1111/gean.12351","url":null,"abstract":"<p>Machine learning (ML) is being applied in an increasing volume of geographical research. However, the aspects of spatial autocorrelation (SAC) in the residuals produced by ML models have been understudied compared to the benefit of ML, namely, reduction of prediction errors. In this study, we examined the relationship between predictive accuracy and the reduction in the residual SAC for 597 variables from 25 geographical socio-economic data sets using spatial and nonspatial cross-validation of three ML algorithms such as random forests, support vector machine, and artificial neural network (ANN) to provide an extensive empirical diagnosis—but not a definitive theory—of the relationship between SAC and ML. Our results highlighted that the ML algorithms with tuned hyperparameters yielded marginal predictive accuracy gains and the minimal decreases in residual SAC. ANN revealed lower accuracy and higher reduction in the residual SAC than others. This implies ML algorithms in geographical research in socio-economic domains would not always result in higher prediction accuracy. We suggest that ML in geographical research should be cautiously employed when the main objective is related to the residual SAC. We also showed that spatial cross-validation neither improves predictive accuracy substantially nor reduce the residual SAC effectively.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 4","pages":"585-620"},"PeriodicalIF":3.6,"publicationDate":"2022-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48895872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
City Size Distribution Analyses Based on the Concept of Entropy Competition 基于熵竞争概念的城市规模分布分析
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-10-02 DOI: 10.1111/gean.12348
Antonio Sanchirico, Giovanna Andrulli, Mauro Fiorentino

The present work pursues theoretical and empirical objectives. With regards to the former, it is demonstrated that the natural tendency to uniformity of both the probability distribution of a city to have a certain number of inhabitants and that of a person to reside in a town of a given number of citizens leads to a competition between their information entropies, which provides the power law distribution as the most probable one for city size. It is also shown that Zipf's law reflects the significant control of the existence of interconnections between cities on the self-organization of their size. With regards to the empirical objectives, based on population data of European countries and Italian municipalities, the theoretical approach proposed is validated. At the Italian scale, city distribution is shown to be a power law for cities above 10,000 inhabitants. In the 20 Italian regions, the breakpoint in the distribution is generally lower. Finally, the geographical control on city distribution is discussed based on the results achieved in some regions.

目前的工作追求理论和实证目标。对于前者,证明了拥有一定数量居民的城市的概率分布和一个人居住在拥有一定数量公民的城镇的概率分布的自然一致性倾向导致它们的信息熵之间的竞争,这提供了幂律分布作为城市规模最可能的分布。Zipf定律反映了城市间相互联系的存在对城市规模自组织的显著控制。在实证目标方面,基于欧洲各国和意大利各市的人口数据,对提出的理论方法进行了验证。在意大利的规模上,对于人口超过1万的城市,城市分布呈现幂次规律。在意大利的20个地区中,分布的断点普遍较低。最后,结合部分地区的研究成果,对城市布局的地理调控进行了探讨。
{"title":"City Size Distribution Analyses Based on the Concept of Entropy Competition","authors":"Antonio Sanchirico,&nbsp;Giovanna Andrulli,&nbsp;Mauro Fiorentino","doi":"10.1111/gean.12348","DOIUrl":"10.1111/gean.12348","url":null,"abstract":"<p>The present work pursues theoretical and empirical objectives. With regards to the former, it is demonstrated that the natural tendency to uniformity of both the probability distribution of a city to have a certain number of inhabitants and that of a person to reside in a town of a given number of citizens leads to a competition between their information entropies, which provides the power law distribution as the most probable one for city size. It is also shown that Zipf's law reflects the significant control of the existence of interconnections between cities on the self-organization of their size. With regards to the empirical objectives, based on population data of European countries and Italian municipalities, the theoretical approach proposed is validated. At the Italian scale, city distribution is shown to be a power law for cities above 10,000 inhabitants. In the 20 Italian regions, the breakpoint in the distribution is generally lower. Finally, the geographical control on city distribution is discussed based on the results achieved in some regions.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 4","pages":"560-584"},"PeriodicalIF":3.6,"publicationDate":"2022-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/gean.12348","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41884316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comment on “A Route Map for Successful Applications of Geographically-Weighted Regression”: The Alternative Expressway to Defensible Regression-Based Local Modeling 对“地理加权回归成功应用路线图”的评析:基于可防御回归的局部建模的替代高速公路
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-09-27 DOI: 10.1111/gean.12347
A. Stewart Fotheringham
{"title":"A Comment on “A Route Map for Successful Applications of Geographically-Weighted Regression”: The Alternative Expressway to Defensible Regression-Based Local Modeling","authors":"A. Stewart Fotheringham","doi":"10.1111/gean.12347","DOIUrl":"10.1111/gean.12347","url":null,"abstract":"","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 1","pages":"191-197"},"PeriodicalIF":3.6,"publicationDate":"2022-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47604932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Navigating the Methodological Landscape in Spatial Analysis: A Comment on “A Route Map for Successful Applications of Geographically-Weighted Regression” 导航空间分析的方法论景观:对“地理加权回归成功应用的路线图”的评论
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-09-17 DOI: 10.1111/gean.12345
Taylor M. Oshan

The development of “route maps” for spatial analytical methods is a pursuit with important ramifications. Comber et al. propose a route map to guide applications of geographically weighted regression consisting of a three-step primary pathway and a series of secondary arterials. This comment first highlights some concerns about the underlying “map” (i.e., experimental setup and assumptions) and then with the proposed “route” (i.e., core decisions and evaluation criteria). It closes by suggesting a more general focus on identifying modeling issues with the highest impact and facilitating consensus-building, which could improve the future production of route maps for navigating the methodological landscape in spatial analysis.

空间分析方法的“路线图”的发展是一项具有重要分支的追求。Comber等人提出了一个路线图,以指导地理加权回归的应用,该路线图由三步主要途径和一系列次要动脉组成。这个评论首先强调了对潜在“地图”(即实验设置和假设)的一些关注,然后是对提议的“路线”(即核心决策和评估标准)的关注。最后,报告建议将更广泛的重点放在确定影响最大的建模问题和促进建立共识上,这可以改善未来绘制路线图的工作,以便在空间分析的方法学景观中导航。
{"title":"Navigating the Methodological Landscape in Spatial Analysis: A Comment on “A Route Map for Successful Applications of Geographically-Weighted Regression”","authors":"Taylor M. Oshan","doi":"10.1111/gean.12345","DOIUrl":"10.1111/gean.12345","url":null,"abstract":"<p>The development of “route maps” for spatial analytical methods is a pursuit with important ramifications. Comber et al. propose a route map to guide applications of geographically weighted regression consisting of a three-step primary pathway and a series of secondary arterials. This comment first highlights some concerns about the underlying “map” (i.e., experimental setup and assumptions) and then with the proposed “route” (i.e., core decisions and evaluation criteria). It closes by suggesting a more general focus on identifying modeling issues with the highest impact and facilitating consensus-building, which could improve the future production of route maps for navigating the methodological landscape in spatial analysis.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 1","pages":"179-183"},"PeriodicalIF":3.6,"publicationDate":"2022-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/gean.12345","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42662805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The Right to Rule by Thumb: A Comment on Epistemology in “A Route Map for Successful Applications of Geographically-Weighted Regression” 拇指统治权——评《地理加权回归成功应用路线图》中的认识论
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-09-15 DOI: 10.1111/gean.12346
Levi John Wolf

Comber et al. provide an important contribution to the future of quantitative geography and Geographical Analysis. The contribution is chiefly in their development of a “GWR Route Map,” a diagram showing the sequence of analytical steps that “successful” specification searches in local modeling tend to follow. Geographically weighted techniques have been rapidly expanding, both in terms of complexity, users, and disciplinary reach. With geographically weighted methods now in so many more analysts' hands, any new rule of thumb will have a major imprint. But, by what right does the thumb rule the analysts? That is, what “counts” as valid knowledge about local models in general? In the following comment, I argue that we probably should use theory, not route maps to decide specifications. But, if we are pressed to build route maps, we sorely need better epistemological foundations for them. I discuss a few previous examples of strongly grounded route maps and offer a few paths to these better grounds as well as two ways to the exit.

Comber等人对定量地理学和地理分析的未来做出了重要贡献。贡献主要在于他们开发了“GWR路线图”,这是一个图表,显示了局部建模中“成功的”规范搜索倾向于遵循的分析步骤的顺序。地理加权技术在复杂性、用户和学科范围方面都在迅速扩展。随着越来越多的分析师掌握了地理加权方法,任何新的经验法则都将产生重大影响。但是,凭什么经验可以统治分析师呢?也就是说,一般来说,什么“算作”关于局部模型的有效知识?在下面的评论中,我认为我们可能应该使用理论,而不是路线图来决定规格。但是,如果我们被迫构建路线图,我们迫切需要更好的认识论基础。我讨论了之前的一些强接地路线图的例子,并提供了一些通往这些更好的地面的路径以及两种出口方式。
{"title":"The Right to Rule by Thumb: A Comment on Epistemology in “A Route Map for Successful Applications of Geographically-Weighted Regression”","authors":"Levi John Wolf","doi":"10.1111/gean.12346","DOIUrl":"10.1111/gean.12346","url":null,"abstract":"<p>Comber et al. provide an important contribution to the future of quantitative geography and <i>Geographical Analysis</i>. The contribution is chiefly in their development of a “GWR Route Map,” a diagram showing the sequence of analytical steps that “successful” specification searches in local modeling tend to follow. Geographically weighted techniques have been rapidly expanding, both in terms of complexity, users, and disciplinary reach. With geographically weighted methods now in so many more analysts' hands, any new rule of thumb will have a major imprint. But, by what right does the thumb rule the analysts? That is, what “counts” as valid knowledge about local models in general? In the following comment, I argue that we probably should use theory, not route maps to decide specifications. But, if we are pressed to build route maps, we sorely need better epistemological foundations for them. I discuss a few previous examples of strongly grounded route maps and offer a few paths to these better grounds as well as two ways to the exit.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 1","pages":"184-190"},"PeriodicalIF":3.6,"publicationDate":"2022-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41583846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Geographical Analysis
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1