Lucas Spierenburg, Sander van Cranenburgh, O. Cats
{"title":"一种过滤小尺度空间波动的区域化方法","authors":"Lucas Spierenburg, Sander van Cranenburgh, O. Cats","doi":"10.5194/agile-giss-3-61-2022","DOIUrl":null,"url":null,"abstract":"Abstract. Regionalization is the process of aggregating contiguous spatial units to form areas that are homogeneous with respect to one or a set of variables. It is useful when studying spatial phenomena or when designing region-based policies, as it allows to unravel the latent spatial structure of a dataset. However, this task is challenging when small-scale fluctuations in the data interfere with the phenomenon of interest. In such circumstances, regionalization techniques are prone to overfitting small-scale fluctuations, and producing erratic regions. This paper presents a regionalization method robust to small-scale variations that is particularly relevant when handling demographic data. Fluctuations are filtered out using a weighted spatial average before applying agglomerative clustering. The method is tested against a conventional agglomerative clustering approach on a fine-resolution demographic dataset, for a set of indicators quantifying: the ability to identify large-scale spatial patterns, the homogeneity of the regions produced, and the spatial regularity of these regions. These indicators have been computed for the two methods for a number of clusters ranging from 2 to 101, and results show that the proposed approach performs better than conventional agglomerative clustering more than 90% of the time at identifying large-scale patterns, and produces more regular regions 96% of the time.\n","PeriodicalId":116168,"journal":{"name":"AGILE: GIScience Series","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A regionalization method filtering out small-scale spatial fluctuations\",\"authors\":\"Lucas Spierenburg, Sander van Cranenburgh, O. Cats\",\"doi\":\"10.5194/agile-giss-3-61-2022\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract. Regionalization is the process of aggregating contiguous spatial units to form areas that are homogeneous with respect to one or a set of variables. It is useful when studying spatial phenomena or when designing region-based policies, as it allows to unravel the latent spatial structure of a dataset. However, this task is challenging when small-scale fluctuations in the data interfere with the phenomenon of interest. In such circumstances, regionalization techniques are prone to overfitting small-scale fluctuations, and producing erratic regions. This paper presents a regionalization method robust to small-scale variations that is particularly relevant when handling demographic data. Fluctuations are filtered out using a weighted spatial average before applying agglomerative clustering. The method is tested against a conventional agglomerative clustering approach on a fine-resolution demographic dataset, for a set of indicators quantifying: the ability to identify large-scale spatial patterns, the homogeneity of the regions produced, and the spatial regularity of these regions. These indicators have been computed for the two methods for a number of clusters ranging from 2 to 101, and results show that the proposed approach performs better than conventional agglomerative clustering more than 90% of the time at identifying large-scale patterns, and produces more regular regions 96% of the time.\\n\",\"PeriodicalId\":116168,\"journal\":{\"name\":\"AGILE: GIScience Series\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"AGILE: GIScience Series\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5194/agile-giss-3-61-2022\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"AGILE: GIScience Series","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5194/agile-giss-3-61-2022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A regionalization method filtering out small-scale spatial fluctuations
Abstract. Regionalization is the process of aggregating contiguous spatial units to form areas that are homogeneous with respect to one or a set of variables. It is useful when studying spatial phenomena or when designing region-based policies, as it allows to unravel the latent spatial structure of a dataset. However, this task is challenging when small-scale fluctuations in the data interfere with the phenomenon of interest. In such circumstances, regionalization techniques are prone to overfitting small-scale fluctuations, and producing erratic regions. This paper presents a regionalization method robust to small-scale variations that is particularly relevant when handling demographic data. Fluctuations are filtered out using a weighted spatial average before applying agglomerative clustering. The method is tested against a conventional agglomerative clustering approach on a fine-resolution demographic dataset, for a set of indicators quantifying: the ability to identify large-scale spatial patterns, the homogeneity of the regions produced, and the spatial regularity of these regions. These indicators have been computed for the two methods for a number of clusters ranging from 2 to 101, and results show that the proposed approach performs better than conventional agglomerative clustering more than 90% of the time at identifying large-scale patterns, and produces more regular regions 96% of the time.