Margaret A. Lawrimore , Georgina M. Sanchez , Cayla Cothron , Mirela G. Tulbure , Todd K. BenDor , Ross K. Meentemeyer
{"title":"利用机器学习创建空间上完整的分区地图","authors":"Margaret A. Lawrimore , Georgina M. Sanchez , Cayla Cothron , Mirela G. Tulbure , Todd K. BenDor , Ross K. Meentemeyer","doi":"10.1016/j.compenvurbsys.2024.102157","DOIUrl":null,"url":null,"abstract":"<div><p>Zoning regulates land use and intensity of urban development at the county and municipal level in the United States, promoting economic growth, community health, and environmental preservation. However, limited availability of zoning data at scale hinders regional assessments of regulations and coordinated resilience planning efforts. In this study, we developed an open-source, replicable, and transferable framework to predict spatially complete zoning in areas where zoning information is publicly unavailable. We applied a Hierarchical Random Forest algorithm to predict multilevel zoning districts, including three core districts (<em>residential</em>, <em>non-residential</em>, <em>mixed use</em>) and 13 sub-districts. To mimic real-world data accessibility challenges, we evaluated two models: one filling gaps within a county (within-county) and the other extrapolating for counties with no available data (between-county). We tested our models statewide in North Carolina (NC), USA, and developed the State's first comprehensive zoning map. We found strong predictive performance for our within-county model (∼99% accuracy; macro averaged F1 score of ∼0.97) irrespective of district breakdown (i.e., core and sub). However, our between-county model performance was lower and varied depending on the training counties sampled and the district breakdown considered (19–90% accuracy; macro averaged F1 score of 0.105–0.451). Our framework provides spatially complete zoning maps for previously inaccessible locations, enabling researchers and planners to conduct large-scale comprehensive zoning assessments.</p></div>","PeriodicalId":48241,"journal":{"name":"Computers Environment and Urban Systems","volume":"112 ","pages":"Article 102157"},"PeriodicalIF":7.1000,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Creating spatially complete zoning maps using machine learning\",\"authors\":\"Margaret A. Lawrimore , Georgina M. Sanchez , Cayla Cothron , Mirela G. Tulbure , Todd K. BenDor , Ross K. Meentemeyer\",\"doi\":\"10.1016/j.compenvurbsys.2024.102157\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Zoning regulates land use and intensity of urban development at the county and municipal level in the United States, promoting economic growth, community health, and environmental preservation. However, limited availability of zoning data at scale hinders regional assessments of regulations and coordinated resilience planning efforts. In this study, we developed an open-source, replicable, and transferable framework to predict spatially complete zoning in areas where zoning information is publicly unavailable. We applied a Hierarchical Random Forest algorithm to predict multilevel zoning districts, including three core districts (<em>residential</em>, <em>non-residential</em>, <em>mixed use</em>) and 13 sub-districts. To mimic real-world data accessibility challenges, we evaluated two models: one filling gaps within a county (within-county) and the other extrapolating for counties with no available data (between-county). We tested our models statewide in North Carolina (NC), USA, and developed the State's first comprehensive zoning map. We found strong predictive performance for our within-county model (∼99% accuracy; macro averaged F1 score of ∼0.97) irrespective of district breakdown (i.e., core and sub). However, our between-county model performance was lower and varied depending on the training counties sampled and the district breakdown considered (19–90% accuracy; macro averaged F1 score of 0.105–0.451). Our framework provides spatially complete zoning maps for previously inaccessible locations, enabling researchers and planners to conduct large-scale comprehensive zoning assessments.</p></div>\",\"PeriodicalId\":48241,\"journal\":{\"name\":\"Computers Environment and Urban Systems\",\"volume\":\"112 \",\"pages\":\"Article 102157\"},\"PeriodicalIF\":7.1000,\"publicationDate\":\"2024-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers Environment and Urban Systems\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0198971524000863\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL STUDIES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers Environment and Urban Systems","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0198971524000863","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL STUDIES","Score":null,"Total":0}
引用次数: 0
摘要
分区对美国县市一级的土地使用和城市发展强度进行管理,促进经济增长、社区健康和环境保护。然而,规模化分区数据的可用性有限,阻碍了区域法规评估和协调复原力规划工作。在本研究中,我们开发了一个开源、可复制、可转移的框架,用于预测未公开分区信息地区的空间完整分区。我们采用层次随机森林算法预测多级分区,包括三个核心区(、、)和 13 个子区。为了模拟现实世界中数据获取方面的挑战,我们评估了两个模型:一个是填补县内空白的模型(县内模型),另一个是推断无可用数据的县(县间模型)。我们在美国北卡罗来纳州(NC)全州范围内测试了我们的模型,并绘制了该州第一张综合分区地图。我们发现,无论地区细分(即核心区和次级区)如何,县内模型都具有很强的预测性能(准确率达 99%;宏观平均 F1 得分达 0.97)。然而,我们的县域间模型性能较低,且因抽样的培训县和考虑的地区细分而异(准确率为 19-90%;宏观平均 F1 得分为 0.105-0.451)。我们的框架为以前无法到达的地点提供了空间上完整的分区地图,使研究人员和规划人员能够进行大规模的综合分区评估。
Creating spatially complete zoning maps using machine learning
Zoning regulates land use and intensity of urban development at the county and municipal level in the United States, promoting economic growth, community health, and environmental preservation. However, limited availability of zoning data at scale hinders regional assessments of regulations and coordinated resilience planning efforts. In this study, we developed an open-source, replicable, and transferable framework to predict spatially complete zoning in areas where zoning information is publicly unavailable. We applied a Hierarchical Random Forest algorithm to predict multilevel zoning districts, including three core districts (residential, non-residential, mixed use) and 13 sub-districts. To mimic real-world data accessibility challenges, we evaluated two models: one filling gaps within a county (within-county) and the other extrapolating for counties with no available data (between-county). We tested our models statewide in North Carolina (NC), USA, and developed the State's first comprehensive zoning map. We found strong predictive performance for our within-county model (∼99% accuracy; macro averaged F1 score of ∼0.97) irrespective of district breakdown (i.e., core and sub). However, our between-county model performance was lower and varied depending on the training counties sampled and the district breakdown considered (19–90% accuracy; macro averaged F1 score of 0.105–0.451). Our framework provides spatially complete zoning maps for previously inaccessible locations, enabling researchers and planners to conduct large-scale comprehensive zoning assessments.
期刊介绍:
Computers, Environment and Urban Systemsis an interdisciplinary journal publishing cutting-edge and innovative computer-based research on environmental and urban systems, that privileges the geospatial perspective. The journal welcomes original high quality scholarship of a theoretical, applied or technological nature, and provides a stimulating presentation of perspectives, research developments, overviews of important new technologies and uses of major computational, information-based, and visualization innovations. Applied and theoretical contributions demonstrate the scope of computer-based analysis fostering a better understanding of environmental and urban systems, their spatial scope and their dynamics.