首页 > 最新文献

Geographical Analysis最新文献

英文 中文
Understanding Population Decline Trajectories in Spain using Sequence Analysis 利用序列分析了解西班牙人口下降轨迹
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2023-01-16 DOI: 10.1111/gean.12357
Miguel González-Leonardo, Niall Newsham, Francisco Rowe

Population decline is a key contemporary demographic challenge. Previous work has measured the national extent of population decline, and we know that it is more acute in Japan and Eastern Europe and is set to accelerate across many industrialized countries. Yet, little is known about the population trajectories leading to current trends of depopulation and their underpinning demographic and contextual factors. To address this gap, we aim to identify and characterize the different trajectories of depopulation in Spain from 2000 to 2020 at the small area level using sequence analysis, spatial autocorrelation analysis, decomposition techniques, and multinomial logistic modeling. We show that while Spain recorded an overall 17.2% national population growth between 2000 and 2020, 63% of municipalities experienced depopulation. We identify six trajectories of population decline, with a well-defined northwest-south divide. These trajectories include mostly rural municipalities, but also certain small- and medium-sized cities. Natural decline comprises the main demographic component underpinning differences in the extent of depopulation across trajectories, and international migration plays an important role in explaining transitions to decline since the financial crisis of 2008. Small and old populations, and, to a lesser extent, remoteness from cities are key features characterizing areas of high decline.

人口下降是当代人口学面临的一个关键挑战。先前的工作已经衡量了全国人口下降的程度,我们知道,日本和东欧的人口下降更为严重,许多工业化国家的人口下降速度也将加快。然而,人们对导致当前人口减少趋势的人口轨迹及其背后的人口和环境因素知之甚少。为了解决这一差距,我们的目标是使用序列分析、空间自相关分析、分解技术和多项逻辑建模,在小面积水平上识别和表征2000年至2020年西班牙人口减少的不同轨迹。我们发现,尽管西班牙在2000年至2020年间的全国人口总体增长率为17.2%,但63%的城市人口减少。我们确定了人口下降的六个轨迹,其中有一个明确的西北-南分界线。这些轨迹主要包括农村城市,但也包括某些中小城市。自然衰退是支撑各轨迹人口减少程度差异的主要人口组成部分,国际移民在解释自2008年金融危机以来向衰退过渡方面发挥着重要作用。人口稀少和老龄化,以及在较小程度上远离城市,是人口高度下降地区的主要特征。
{"title":"Understanding Population Decline Trajectories in Spain using Sequence Analysis","authors":"Miguel González-Leonardo,&nbsp;Niall Newsham,&nbsp;Francisco Rowe","doi":"10.1111/gean.12357","DOIUrl":"10.1111/gean.12357","url":null,"abstract":"<p>Population decline is a key contemporary demographic challenge. Previous work has measured the national extent of population decline, and we know that it is more acute in Japan and Eastern Europe and is set to accelerate across many industrialized countries. Yet, little is known about the population trajectories leading to current trends of depopulation and their underpinning demographic and contextual factors. To address this gap, we aim to identify and characterize the different trajectories of depopulation in Spain from 2000 to 2020 at the small area level using sequence analysis, spatial autocorrelation analysis, decomposition techniques, and multinomial logistic modeling. We show that while Spain recorded an overall 17.2% national population growth between 2000 and 2020, 63% of municipalities experienced depopulation. We identify six trajectories of population decline, with a well-defined northwest-south divide. These trajectories include mostly rural municipalities, but also certain small- and medium-sized cities. Natural decline comprises the main demographic component underpinning differences in the extent of depopulation across trajectories, and international migration plays an important role in explaining transitions to decline since the financial crisis of 2008. Small and old populations, and, to a lesser extent, remoteness from cities are key features characterizing areas of high decline.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 4","pages":"495-516"},"PeriodicalIF":3.6,"publicationDate":"2023-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/gean.12357","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47656312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Prediction of Bike-sharing Trip Counts: Comparing Parametric Spatial Regression Models to a Geographically Weighted XGBoost Algorithm 共享单车出行次数的预测:参数空间回归模型与地理加权XGBoost算法的比较
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-11-29 DOI: 10.1111/gean.12354
Katja Schimohr, Philipp Doebler, Joachim Scheiner

Regression models are commonly applied in the analysis of transportation data. This research aims at broadening the range of methods used for this task by modeling the spatial distribution of bike-sharing trips in Cologne, Germany, applying both parametric regression models and a modified machine learning approach while incorporating measures to account for spatial autocorrelation. Independent variables included in the models consist of land use types, elements of the transport system and sociodemographic characteristics. Out of several regression models with different underlying distributions, a Tweedie generalized additive model is chosen by its values for AIC, RMSE, and sMAPE to be compared to an XGBoost model. To consider spatial relationships, spatial splines are included in the Tweedie model, while the estimations of the XGBoost model are modified using a geographically weighted regression. Both methods entail certain advantages: while XGBoost leads to far better values regarding RMSE and sMAPE and therefore to a better model fit, the Tweedie model allows an easier interpretation of the influence of the independent variables including spatial effects.

回归模型是交通数据分析中常用的一种方法。本研究旨在通过对德国科隆共享单车出行的空间分布进行建模,扩大用于该任务的方法范围,同时应用参数回归模型和改进的机器学习方法,同时纳入考虑空间自相关的措施。模型中包括的独立变量包括土地利用类型、运输系统要素和社会人口特征。在几种具有不同底层分布的回归模型中,根据AIC、RMSE和sMAPE的值选择Tweedie广义加性模型与XGBoost模型进行比较。为了考虑空间关系,Tweedie模型中包含了空间样条,而XGBoost模型的估计使用地理加权回归进行了修改。这两种方法都有一定的优势:虽然XGBoost在RMSE和sMAPE方面可以得到更好的值,因此可以得到更好的模型拟合,但Tweedie模型可以更容易地解释包括空间效应在内的自变量的影响。
{"title":"Prediction of Bike-sharing Trip Counts: Comparing Parametric Spatial Regression Models to a Geographically Weighted XGBoost Algorithm","authors":"Katja Schimohr,&nbsp;Philipp Doebler,&nbsp;Joachim Scheiner","doi":"10.1111/gean.12354","DOIUrl":"10.1111/gean.12354","url":null,"abstract":"<p>Regression models are commonly applied in the analysis of transportation data. This research aims at broadening the range of methods used for this task by modeling the spatial distribution of bike-sharing trips in Cologne, Germany, applying both parametric regression models and a modified machine learning approach while incorporating measures to account for spatial autocorrelation. Independent variables included in the models consist of land use types, elements of the transport system and sociodemographic characteristics. Out of several regression models with different underlying distributions, a Tweedie generalized additive model is chosen by its values for AIC, RMSE, and sMAPE to be compared to an XGBoost model. To consider spatial relationships, spatial splines are included in the Tweedie model, while the estimations of the XGBoost model are modified using a geographically weighted regression. Both methods entail certain advantages: while XGBoost leads to far better values regarding RMSE and sMAPE and therefore to a better model fit, the Tweedie model allows an easier interpretation of the influence of the independent variables including spatial effects.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 4","pages":"651-684"},"PeriodicalIF":3.6,"publicationDate":"2022-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/gean.12354","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43120370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of Moran's I and Geary's c in Multivariate Spatial Pattern Analysis 多元空间格局分析中Moran’s I与Geary’s c的比较
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-11-25 DOI: 10.1111/gean.12355
Jie Lin

This article compares multivariate spatial analysis methods that include not only multivariate covariance, but also spatial dependence of the data explicitly and simultaneously in model design by extending two univariate autocorrelation measures, namely Moran's I and Geary's c. The results derived from the simulation datasets indicate that the standard Moran component analysis is preferable to Geary component analysis as a tool for summarizing multivariate spatial structures. However, the generalized Geary principal component analysis developed in this study by adding variance into the optimization criterion and solved as a trace ratio optimization problem performs as well as, if not better than its counterpart the Moran principal component analysis does. With respect to the sensitivity in detecting subtle spatial structures, the choice of the appropriate tool is dependent on the correlation and variance of the spatial multivariate data. Finally, the four techniques are applied to the Social Determinants of Health dataset to analyze its multivariate spatial pattern. The two generalized methods detect more urban areas and higher autocorrelation structures than the other two standard methods, and provide more obvious contrast between urban and rural areas due to the large variance of the spatial component.

本文通过扩展Moran’s I和Geary’s c这两个单变量自相关测度,比较了多元空间分析方法在模型设计中不仅包含多元协方差,而且明确地同时包含数据的空间依赖性。模拟数据集的结果表明,标准Moran分量分析比Geary分量分析更适合作为总结多元空间结构的工具。然而,本研究提出的广义Geary主成分分析将方差加入到优化准则中,并将其作为一个迹比优化问题来解决,即使不优于其对应的Moran主成分分析,也表现得很好。在检测细微空间结构的灵敏度方面,适当工具的选择取决于空间多变量数据的相关性和方差。最后,将这四种技术应用于健康的社会决定因素数据集,分析其多元空间格局。与其他两种标准方法相比,两种广义方法能够检测到更多的城市区域和更高的自相关结构,并且由于空间分量的差异较大,提供了更明显的城乡对比。
{"title":"Comparison of Moran's I and Geary's c in Multivariate Spatial Pattern Analysis","authors":"Jie Lin","doi":"10.1111/gean.12355","DOIUrl":"10.1111/gean.12355","url":null,"abstract":"<p>This article compares multivariate spatial analysis methods that include not only multivariate covariance, but also spatial dependence of the data explicitly and simultaneously in model design by extending two univariate autocorrelation measures, namely Moran's <i>I</i> and Geary's <i>c</i>. The results derived from the simulation datasets indicate that the standard Moran component analysis is preferable to Geary component analysis as a tool for summarizing multivariate spatial structures. However, the generalized Geary principal component analysis developed in this study by adding variance into the optimization criterion and solved as a trace ratio optimization problem performs as well as, if not better than its counterpart the Moran principal component analysis does. With respect to the sensitivity in detecting subtle spatial structures, the choice of the appropriate tool is dependent on the correlation and variance of the spatial multivariate data. Finally, the four techniques are applied to the Social Determinants of Health dataset to analyze its multivariate spatial pattern. The two generalized methods detect more urban areas and higher autocorrelation structures than the other two standard methods, and provide more obvious contrast between urban and rural areas due to the large variance of the spatial component.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 4","pages":"685-702"},"PeriodicalIF":3.6,"publicationDate":"2022-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46219353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Testing Transferability: Quantitative Evaluation of Labor Market Area Definition Methods in Three Contrasting Countries 可转移性检验:三个对比国家劳动力市场区域界定方法的定量评价
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-11-10 DOI: 10.1111/gean.12353
José Manuel Casado-Díaz, Mike Coombes, Lucas Martínez-Bernabéu

Sub-national economic policies increasingly use labor market areas (LMAs) rather than administrative areas for analysis and implementation. How a set of LMAs was defined influences the results of such analyses, and so accurate policy delivery needs appropriately defined LMAs. Multinational bodies need comparable LMA definitions in many countries, calling for a definition method that is transferable across national boundaries. This article applies quantitative metrics to evaluate LMAs defined in three contrasting countries by three methods that represent the main methodological approaches. The deductive approach—based on a center and hinterland—is too inflexible to deal with differing geographical circumstances and cannot cope with statistical zones that are very small, or do not respect settlement structure. The alternative inductive methods tested define appropriate LMAs in each country, with the newer method performing slightly better in statistical terms. The article also exemplifies the usefulness of the metrics for comparisons of alternative regionalizations.

次国家经济政策越来越多地使用劳动力市场区域而不是行政区域进行分析和实施。如何定义一组LMA会影响此类分析的结果,因此准确的政策交付需要适当定义LMA。跨国机构需要在许多国家进行类似的LMA定义,要求采用可跨国界转移的定义方法。本文应用定量指标,通过代表主要方法论方法的三种方法,对三个对比国家的LMA进行评估。基于中心和腹地的演绎方法过于灵活,无法处理不同的地理环境,也无法处理非常小或不尊重定居点结构的统计区域。替代归纳法在每个国家测试确定了合适的LMA,新方法在统计方面表现稍好。这篇文章还举例说明了这些指标对替代区域化比较的有用性。
{"title":"Testing Transferability: Quantitative Evaluation of Labor Market Area Definition Methods in Three Contrasting Countries","authors":"José Manuel Casado-Díaz,&nbsp;Mike Coombes,&nbsp;Lucas Martínez-Bernabéu","doi":"10.1111/gean.12353","DOIUrl":"10.1111/gean.12353","url":null,"abstract":"<p>Sub-national economic policies increasingly use labor market areas (LMAs) rather than administrative areas for analysis and implementation. How a set of LMAs was defined influences the results of such analyses, and so accurate policy delivery needs appropriately defined LMAs. Multinational bodies need comparable LMA definitions in many countries, calling for a definition method that is transferable across national boundaries. This article applies quantitative metrics to evaluate LMAs defined in three contrasting countries by three methods that represent the main methodological approaches. The deductive approach—based on a center and hinterland—is too inflexible to deal with differing geographical circumstances and cannot cope with statistical zones that are very small, or do not respect settlement structure. The alternative inductive methods tested define appropriate LMAs in each country, with the newer method performing slightly better in statistical terms. The article also exemplifies the usefulness of the metrics for comparisons of alternative regionalizations.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 4","pages":"517-534"},"PeriodicalIF":3.6,"publicationDate":"2022-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/gean.12353","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49391392","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Rejoinder to the Commentaries on “A Route Map for Successful Applications of Geographically Weighted Regression” by Comber et al. (2022) 对Comber等人(2022)关于“地理加权回归成功应用路线图”评论的回复
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-11-05 DOI: 10.1111/gean.12352
Alexis Comber, Paul Harris, Chris Brunsdon

We are delighted that the RM paper has stimulated three coherent but diverse Commentaries from leading thinkers in this field (Fotheringham, 2022; Oshan, 2022; Wolf, 2022). Each of these contains robust critiques of the proposed RM and suggest alternative but diverse sets of considerations. We consider each of these in turn and provide a rejoinder by way of response.

We thank the authors of these commentaries for their efforts, and for taking the time to consider our article in detail. In general, we are pleased to see these—part of our motivation here was to initiate discussion on approaches to modeling spatial non-stationarity in regression models. By setting out one way to proceed through our RM, we intended to make an opening move. One thing we observe from these responses is that there is perhaps a spectrum for motivation for using these kind of models—at one end, an approach that is strongly motivated by underlying theories, and at the other, a more exploratory approach. One also has to consider the idea of data analysis as compromise—the reality of modern data collection is frequently that of “big data” where datasets are large, but quality and suitability assurance are not to the standards achieved by carefully designed surveys or experiments. In many cases, geographical fluctuations in models may be a consequence of this, and spatially varying coefficient methods may act as “spatial detectives” by shedding light on spatial inconsistencies and biases in the data collection, rather than direct measurements of a true underlying process. This suggests the need for a kind of “deep inference” where processes under investigation and the process of data collection are considered in equal measure, requiring consideration of underlying process theories, in addition to issues relating to the act of data exploration—perhaps suggesting that the spectrum referred to earlier is something to be scanned, rather than choosing a specific viewpoint from which to carry out analysis.

As we stated earlier, the approach outlined in the GWR RM by Comber et al. (2022a) is not intended to be a strict set of immutable rules, but more of an exemplar of what could be done to respond to a specific research context, and acknowledging that a degree of ‘fuzziness’ in modeling strategies is inevitable. The replies to our article have been useful in considering potential alternative research contexts, and how they may interact with this kind of fuzziness. We look forward to the debate advancing.

我们很高兴RM论文激发了该领域领先思想家的三个连贯但不同的评论(Fotheringham, 2022;Oshan, 2022;狼,2022)。其中每一个都包含了对提议的RM的强有力的批评,并提出了不同的考虑因素。我们依次考虑这些问题,并以回应的方式提出反驳。我们感谢这些评论的作者的努力,并感谢他们花时间详细考虑我们的文章。总的来说,我们很高兴看到这些——我们在这里的部分动机是开始讨论回归模型中空间非平稳性建模的方法。通过设定一种方式来通过我们的RM,我们打算做一个开局。我们从这些回应中观察到的一件事是,使用这些模型的动机可能是有一个范围的——一端是一种受到潜在理论强烈推动的方法,另一端是一种更具探索性的方法。人们还必须考虑到数据分析的想法是一种妥协——现代数据收集的现实往往是“大数据”,其中数据集很大,但质量和适用性保证不能达到精心设计的调查或实验所达到的标准。在许多情况下,模型的地理波动可能是这种情况的结果,空间变化系数方法可以作为"空间侦探",揭示数据收集中的空间不一致和偏差,而不是直接测量真正的基本过程。这表明需要一种“深度推理”,即在调查过程和数据收集过程中同等考虑,除了与数据探索行为相关的问题外,还需要考虑潜在的过程理论——也许这表明前面提到的频谱是需要扫描的东西,而不是选择一个特定的观点来进行分析。正如我们之前所述,Comber等人(2022a)在GWR RM中概述的方法并不是一套严格的不可变规则,而是更多的是针对特定研究背景可以采取的措施的范例,并承认建模策略中的一定程度的“模糊性”是不可避免的。对我们文章的回复在考虑潜在的替代研究背景以及它们如何与这种模糊性相互作用方面非常有用。我们期待着辩论继续进行。
{"title":"A Rejoinder to the Commentaries on “A Route Map for Successful Applications of Geographically Weighted Regression” by Comber et al. (2022)","authors":"Alexis\u0000 Comber,&nbsp;Paul Harris,&nbsp;Chris Brunsdon","doi":"10.1111/gean.12352","DOIUrl":"10.1111/gean.12352","url":null,"abstract":"<p>We are delighted that the RM paper has stimulated three coherent but diverse Commentaries from leading thinkers in this field (Fotheringham, <span>2022</span>; Oshan, <span>2022</span>; Wolf, <span>2022</span>). Each of these contains robust critiques of the proposed RM and suggest alternative but diverse sets of considerations. We consider each of these in turn and provide a rejoinder by way of response.</p><p>We thank the authors of these commentaries for their efforts, and for taking the time to consider our article in detail. In general, we are pleased to see these—part of our motivation here was to initiate discussion on approaches to modeling spatial non-stationarity in regression models. By setting out one way to proceed through our RM, we intended to make an opening move. One thing we observe from these responses is that there is perhaps a spectrum for motivation for using these kind of models—at one end, an approach that is strongly motivated by underlying theories, and at the other, a more exploratory approach. One also has to consider the idea of data analysis as compromise—the reality of modern data collection is frequently that of “big data” where datasets are large, but quality and suitability assurance are not to the standards achieved by carefully designed surveys or experiments. In many cases, geographical fluctuations in models may be a consequence of this, and spatially varying coefficient methods may act as “spatial detectives” by shedding light on spatial inconsistencies and biases in the data collection, rather than direct measurements of a true underlying process. This suggests the need for a kind of “deep inference” where processes under investigation <i>and</i> the process of data collection are considered in equal measure, requiring consideration of underlying process theories, in addition to issues relating to the act of data exploration—perhaps suggesting that the spectrum referred to earlier is something to be scanned, rather than choosing a specific viewpoint from which to carry out analysis.</p><p>As we stated earlier, the approach outlined in the GWR RM by Comber et al. (<span>2022a</span>) is not intended to be a strict set of immutable rules, but more of an exemplar of what could be done to respond to a specific research context, and acknowledging that a degree of ‘fuzziness’ in modeling strategies is inevitable. The replies to our article have been useful in considering potential alternative research contexts, and how they may interact with this kind of fuzziness. We look forward to the debate advancing.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 1","pages":"198-202"},"PeriodicalIF":3.6,"publicationDate":"2022-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/gean.12352","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41833176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Simulation Study to Explore Inference about Global Moran's I with Random Spatial Indexes 基于随机空间指数的全球Moran’s I推理模拟研究
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-10-17 DOI: 10.1111/gean.12349
René Westerholt

Inference procedures for spatial autocorrelation statistics assume that the underlying configurations of spatial units are fixed. However, sometimes this assumption can be disadvantageous, for example, when analyzing social media posts or moving objects. This article examines for the case of point geometries how a change from fixed to random spatial indexes affects inferences about global Moran's I, a popular spatial autocorrelation measure. Homogeneous and inhomogeneous Matérn and Thomas cluster processes are studied and for each of these processes, 10,000 random point patterns are simulated for investigating three aspects that are key in an inferential context: the null distributions of I when the underlying geometries are varied; the effect of the latter on critical values used to reject null hypotheses; and how the presence of point processes affects the statistical power of Moran's I. The results show that point processes affect all three characteristics. Inferences about spatial structure in relevant application contexts may therefore be different from conventional inferences when this additional source of randomness is taken into account.

空间自相关统计的推理程序假设空间单元的底层配置是固定的。然而,有时这种假设可能是不利的,例如,在分析社交媒体帖子或移动物体时。本文研究了点几何的情况下,从固定到随机的空间索引的变化如何影响关于全局Moran's I的推断,这是一种流行的空间自相关度量。研究了齐次和非齐次mat和托马斯聚类过程,并对这些过程中的每一个进行了模拟,模拟了10,000个随机点模式,以调查在推理环境中关键的三个方面:当底层几何形状变化时I的零分布;后者对用于拒绝零假设的临界值的影响;以及点过程的存在如何影响莫兰i的统计能力。结果表明,点过程影响所有三个特征。因此,当考虑到这种额外的随机性来源时,有关相关应用环境中空间结构的推断可能与常规推断不同。
{"title":"A Simulation Study to Explore Inference about Global Moran's I with Random Spatial Indexes","authors":"René Westerholt","doi":"10.1111/gean.12349","DOIUrl":"10.1111/gean.12349","url":null,"abstract":"<p>Inference procedures for spatial autocorrelation statistics assume that the underlying configurations of spatial units are fixed. However, sometimes this assumption can be disadvantageous, for example, when analyzing social media posts or moving objects. This article examines for the case of point geometries how a change from fixed to random spatial indexes affects inferences about global Moran's I, a popular spatial autocorrelation measure. Homogeneous and inhomogeneous Matérn and Thomas cluster processes are studied and for each of these processes, 10,000 random point patterns are simulated for investigating three aspects that are key in an inferential context: the null distributions of I when the underlying geometries are varied; the effect of the latter on critical values used to reject null hypotheses; and how the presence of point processes affects the statistical power of Moran's I. The results show that point processes affect all three characteristics. Inferences about spatial structure in relevant application contexts may therefore be different from conventional inferences when this additional source of randomness is taken into account.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 4","pages":"621-650"},"PeriodicalIF":3.6,"publicationDate":"2022-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/gean.12349","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45247627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A Hybrid Approach for Mass Valuation of Residential Properties through Geographic Information Systems and Machine Learning Integration 基于地理信息系统和机器学习集成的住宅物业大规模估值混合方法
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-10-14 DOI: 10.1111/gean.12350
Muhammed Oguzhan Mete, Tahsin Yomralioglu

Geographic Information Systems (GIS) and Machine Learning methods are now widely used in mass property valuation using the physical attributes of properties. However, locational criteria, such as as proximity to important places, sea or forest views, flat topography are just some of the spatial factors that affect property values and, to date, these have been insufficiently used as part of the valuation process. In this study, a hybrid approach is developed by integrating GIS and Machine Learning for mass valuation of residential properties. GIS-based Nominal Valuation Method was applied to carry out proximity, terrain, and visibility analyses using Ordnance Survey and OpenStreetMap data, than land value map of Great Britain was produced. Spatial criteria scores obtained from the GIS analyses were included in the price prediction process in which global and spatially clustered local regression models are built for England and Wales using Price Paid Data and Energy Performance Certificates data. Results showed that adding locational factors to the property price data and applying a novel nominally weighted spatial clustering algorithm for creating a local regression increased the prediction accuracy by about 45%. It also demonstrated that Random Forest was the most accurate ensemble model.

地理信息系统(GIS)和机器学习方法现在广泛用于利用财产的物理属性进行大规模财产评估。然而,位置标准,如靠近重要地点、海景或森林景观、平坦地形等只是影响财产价值的一些空间因素,迄今为止,这些因素在估价过程中没有得到充分利用。在本研究中,通过集成GIS和机器学习开发了一种混合方法,用于住宅物业的大规模估值。利用地形测量和OpenStreetMap数据,采用基于gis的标称估价方法对英国土地进行接近性、地形和可见度分析,绘制了英国土地价值图。从GIS分析中获得的空间标准分数被包括在价格预测过程中,其中使用价格支付数据和能源绩效证书数据为英格兰和威尔士建立了全球和空间聚类的局部回归模型。结果表明,在房价数据中加入位置因素,并应用一种新的名义加权空间聚类算法创建局部回归,预测精度提高了约45%。它还证明了随机森林是最准确的集合模型。
{"title":"A Hybrid Approach for Mass Valuation of Residential Properties through Geographic Information Systems and Machine Learning Integration","authors":"Muhammed Oguzhan Mete,&nbsp;Tahsin Yomralioglu","doi":"10.1111/gean.12350","DOIUrl":"10.1111/gean.12350","url":null,"abstract":"<p>Geographic Information Systems (GIS) and Machine Learning methods are now widely used in mass property valuation using the physical attributes of properties. However, locational criteria, such as as proximity to important places, sea or forest views, flat topography are just some of the spatial factors that affect property values and, to date, these have been insufficiently used as part of the valuation process. In this study, a hybrid approach is developed by integrating GIS and Machine Learning for mass valuation of residential properties. GIS-based Nominal Valuation Method was applied to carry out proximity, terrain, and visibility analyses using Ordnance Survey and OpenStreetMap data, than land value map of Great Britain was produced. Spatial criteria scores obtained from the GIS analyses were included in the price prediction process in which global and spatially clustered local regression models are built for England and Wales using Price Paid Data and Energy Performance Certificates data. Results showed that adding locational factors to the property price data and applying a novel nominally weighted spatial clustering algorithm for creating a local regression increased the prediction accuracy by about 45%. It also demonstrated that Random Forest was the most accurate ensemble model.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 4","pages":"535-559"},"PeriodicalIF":3.6,"publicationDate":"2022-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43311551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Three Common Machine Learning Algorithms Neither Enhance Prediction Accuracy Nor Reduce Spatial Autocorrelation in Residuals: An Analysis of Twenty-five Socioeconomic Data Sets 三种常见的机器学习算法既不能提高预测精度,也不能降低残差的空间自相关:对25个社会经济数据集的分析
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-10-13 DOI: 10.1111/gean.12351
Insang Song, Daehyun Kim

Machine learning (ML) is being applied in an increasing volume of geographical research. However, the aspects of spatial autocorrelation (SAC) in the residuals produced by ML models have been understudied compared to the benefit of ML, namely, reduction of prediction errors. In this study, we examined the relationship between predictive accuracy and the reduction in the residual SAC for 597 variables from 25 geographical socio-economic data sets using spatial and nonspatial cross-validation of three ML algorithms such as random forests, support vector machine, and artificial neural network (ANN) to provide an extensive empirical diagnosis—but not a definitive theory—of the relationship between SAC and ML. Our results highlighted that the ML algorithms with tuned hyperparameters yielded marginal predictive accuracy gains and the minimal decreases in residual SAC. ANN revealed lower accuracy and higher reduction in the residual SAC than others. This implies ML algorithms in geographical research in socio-economic domains would not always result in higher prediction accuracy. We suggest that ML in geographical research should be cautiously employed when the main objective is related to the residual SAC. We also showed that spatial cross-validation neither improves predictive accuracy substantially nor reduce the residual SAC effectively.

机器学习(ML)正在越来越多的地理研究中得到应用。然而,与机器学习的优势(即减少预测误差)相比,机器学习模型产生的残差中的空间自相关(SAC)方面的研究还不够充分。在这项研究中,我们使用三种机器学习算法(随机森林,支持向量机,和人工神经网络(ANN)提供了广泛的经验诊断-但不是确定的理论- SAC和ML之间的关系。我们的结果强调,具有调谐超参数的ML算法产生了边际预测精度增益和残余SAC的最小减少。与其他方法相比,人工神经网络显示出较低的准确率和较高的残余SAC减少率。这意味着在社会经济领域的地理研究中的ML算法并不总是导致更高的预测精度。我们建议在地理研究中,当主要目标与剩余SAC相关时,应谨慎使用ML。我们还发现,空间交叉验证既不能显著提高预测精度,也不能有效降低剩余SAC。
{"title":"Three Common Machine Learning Algorithms Neither Enhance Prediction Accuracy Nor Reduce Spatial Autocorrelation in Residuals: An Analysis of Twenty-five Socioeconomic Data Sets","authors":"Insang Song,&nbsp;Daehyun Kim","doi":"10.1111/gean.12351","DOIUrl":"10.1111/gean.12351","url":null,"abstract":"<p>Machine learning (ML) is being applied in an increasing volume of geographical research. However, the aspects of spatial autocorrelation (SAC) in the residuals produced by ML models have been understudied compared to the benefit of ML, namely, reduction of prediction errors. In this study, we examined the relationship between predictive accuracy and the reduction in the residual SAC for 597 variables from 25 geographical socio-economic data sets using spatial and nonspatial cross-validation of three ML algorithms such as random forests, support vector machine, and artificial neural network (ANN) to provide an extensive empirical diagnosis—but not a definitive theory—of the relationship between SAC and ML. Our results highlighted that the ML algorithms with tuned hyperparameters yielded marginal predictive accuracy gains and the minimal decreases in residual SAC. ANN revealed lower accuracy and higher reduction in the residual SAC than others. This implies ML algorithms in geographical research in socio-economic domains would not always result in higher prediction accuracy. We suggest that ML in geographical research should be cautiously employed when the main objective is related to the residual SAC. We also showed that spatial cross-validation neither improves predictive accuracy substantially nor reduce the residual SAC effectively.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 4","pages":"585-620"},"PeriodicalIF":3.6,"publicationDate":"2022-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48895872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
City Size Distribution Analyses Based on the Concept of Entropy Competition 基于熵竞争概念的城市规模分布分析
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-10-02 DOI: 10.1111/gean.12348
Antonio Sanchirico, Giovanna Andrulli, Mauro Fiorentino

The present work pursues theoretical and empirical objectives. With regards to the former, it is demonstrated that the natural tendency to uniformity of both the probability distribution of a city to have a certain number of inhabitants and that of a person to reside in a town of a given number of citizens leads to a competition between their information entropies, which provides the power law distribution as the most probable one for city size. It is also shown that Zipf's law reflects the significant control of the existence of interconnections between cities on the self-organization of their size. With regards to the empirical objectives, based on population data of European countries and Italian municipalities, the theoretical approach proposed is validated. At the Italian scale, city distribution is shown to be a power law for cities above 10,000 inhabitants. In the 20 Italian regions, the breakpoint in the distribution is generally lower. Finally, the geographical control on city distribution is discussed based on the results achieved in some regions.

目前的工作追求理论和实证目标。对于前者,证明了拥有一定数量居民的城市的概率分布和一个人居住在拥有一定数量公民的城镇的概率分布的自然一致性倾向导致它们的信息熵之间的竞争,这提供了幂律分布作为城市规模最可能的分布。Zipf定律反映了城市间相互联系的存在对城市规模自组织的显著控制。在实证目标方面,基于欧洲各国和意大利各市的人口数据,对提出的理论方法进行了验证。在意大利的规模上,对于人口超过1万的城市,城市分布呈现幂次规律。在意大利的20个地区中,分布的断点普遍较低。最后,结合部分地区的研究成果,对城市布局的地理调控进行了探讨。
{"title":"City Size Distribution Analyses Based on the Concept of Entropy Competition","authors":"Antonio Sanchirico,&nbsp;Giovanna Andrulli,&nbsp;Mauro Fiorentino","doi":"10.1111/gean.12348","DOIUrl":"10.1111/gean.12348","url":null,"abstract":"<p>The present work pursues theoretical and empirical objectives. With regards to the former, it is demonstrated that the natural tendency to uniformity of both the probability distribution of a city to have a certain number of inhabitants and that of a person to reside in a town of a given number of citizens leads to a competition between their information entropies, which provides the power law distribution as the most probable one for city size. It is also shown that Zipf's law reflects the significant control of the existence of interconnections between cities on the self-organization of their size. With regards to the empirical objectives, based on population data of European countries and Italian municipalities, the theoretical approach proposed is validated. At the Italian scale, city distribution is shown to be a power law for cities above 10,000 inhabitants. In the 20 Italian regions, the breakpoint in the distribution is generally lower. Finally, the geographical control on city distribution is discussed based on the results achieved in some regions.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 4","pages":"560-584"},"PeriodicalIF":3.6,"publicationDate":"2022-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/gean.12348","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41884316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comment on “A Route Map for Successful Applications of Geographically-Weighted Regression”: The Alternative Expressway to Defensible Regression-Based Local Modeling 对“地理加权回归成功应用路线图”的评析:基于可防御回归的局部建模的替代高速公路
IF 3.6 3区 地球科学 Q1 GEOGRAPHY Pub Date : 2022-09-27 DOI: 10.1111/gean.12347
A. Stewart Fotheringham
{"title":"A Comment on “A Route Map for Successful Applications of Geographically-Weighted Regression”: The Alternative Expressway to Defensible Regression-Based Local Modeling","authors":"A. Stewart Fotheringham","doi":"10.1111/gean.12347","DOIUrl":"10.1111/gean.12347","url":null,"abstract":"","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"55 1","pages":"191-197"},"PeriodicalIF":3.6,"publicationDate":"2022-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47604932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Geographical Analysis
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1