Peiran Zhang , Haonan Guo , Fabiano L. Ribeiro , Pavel L. Kirillov , Alla G. Makhrova , Ziyou Gao , Liang Gao
{"title":"Quantifying contributions of geographical features to urban GDP outputs via interpretable machine learning","authors":"Peiran Zhang , Haonan Guo , Fabiano L. Ribeiro , Pavel L. Kirillov , Alla G. Makhrova , Ziyou Gao , Liang Gao","doi":"10.1016/j.scs.2025.106185","DOIUrl":null,"url":null,"abstract":"<div><div>Urban scaling laws, which assume homogeneous population interactions, traditionally describe the relationship between urban population and GDP. However, this approach often overlooks the complexity of urban environments, particularly geographical features such as land use, road networks, and points of interest, which significantly shape urban economies. To address this gap, we propose an interpretable machine learning framework that quantifies the impact of urban geographical features (UGFs) on economic outputs (GDP) across five countries: the USA, Brazil, Nigeria, China, and India. Our study can be summarized in three parts: (1) Using the CatBoost algorithm for GDP estimation, which achieves an average <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.96 across countries, we demonstrate the substantial effects of UGFs (2) The Shapley Additive Explanations (SHAP) method is employed to quantify feature contributions on GDP, revealing that UGFs account for 45% to 89% variance, with influences differing across and within countries. (3) By classifying cities based on feature contribution vectors, we show that cities with similar GDP levels often exhibit analogous contributions from both population and UGFs, suggesting that shared strategies could be applied to cities with comparable economic profiles. Our findings provide valuable insights into the role of UGFs in shaping GDP, advancing the understanding of how UGFs influence economic development, and offering policymakers more informed suggestions. Furthermore, this framework opens new opportunities to integrate diverse urban features into urban studies through machine learning, enhancing our understanding of the complexity of urban systems.</div></div>","PeriodicalId":48659,"journal":{"name":"Sustainable Cities and Society","volume":"121 ","pages":"Article 106185"},"PeriodicalIF":10.5000,"publicationDate":"2025-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sustainable Cities and Society","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210670725000630","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Urban scaling laws, which assume homogeneous population interactions, traditionally describe the relationship between urban population and GDP. However, this approach often overlooks the complexity of urban environments, particularly geographical features such as land use, road networks, and points of interest, which significantly shape urban economies. To address this gap, we propose an interpretable machine learning framework that quantifies the impact of urban geographical features (UGFs) on economic outputs (GDP) across five countries: the USA, Brazil, Nigeria, China, and India. Our study can be summarized in three parts: (1) Using the CatBoost algorithm for GDP estimation, which achieves an average of 0.96 across countries, we demonstrate the substantial effects of UGFs (2) The Shapley Additive Explanations (SHAP) method is employed to quantify feature contributions on GDP, revealing that UGFs account for 45% to 89% variance, with influences differing across and within countries. (3) By classifying cities based on feature contribution vectors, we show that cities with similar GDP levels often exhibit analogous contributions from both population and UGFs, suggesting that shared strategies could be applied to cities with comparable economic profiles. Our findings provide valuable insights into the role of UGFs in shaping GDP, advancing the understanding of how UGFs influence economic development, and offering policymakers more informed suggestions. Furthermore, this framework opens new opportunities to integrate diverse urban features into urban studies through machine learning, enhancing our understanding of the complexity of urban systems.
期刊介绍:
Sustainable Cities and Society (SCS) is an international journal that focuses on fundamental and applied research to promote environmentally sustainable and socially resilient cities. The journal welcomes cross-cutting, multi-disciplinary research in various areas, including:
1. Smart cities and resilient environments;
2. Alternative/clean energy sources, energy distribution, distributed energy generation, and energy demand reduction/management;
3. Monitoring and improving air quality in built environment and cities (e.g., healthy built environment and air quality management);
4. Energy efficient, low/zero carbon, and green buildings/communities;
5. Climate change mitigation and adaptation in urban environments;
6. Green infrastructure and BMPs;
7. Environmental Footprint accounting and management;
8. Urban agriculture and forestry;
9. ICT, smart grid and intelligent infrastructure;
10. Urban design/planning, regulations, legislation, certification, economics, and policy;
11. Social aspects, impacts and resiliency of cities;
12. Behavior monitoring, analysis and change within urban communities;
13. Health monitoring and improvement;
14. Nexus issues related to sustainable cities and societies;
15. Smart city governance;
16. Decision Support Systems for trade-off and uncertainty analysis for improved management of cities and society;
17. Big data, machine learning, and artificial intelligence applications and case studies;
18. Critical infrastructure protection, including security, privacy, forensics, and reliability issues of cyber-physical systems.
19. Water footprint reduction and urban water distribution, harvesting, treatment, reuse and management;
20. Waste reduction and recycling;
21. Wastewater collection, treatment and recycling;
22. Smart, clean and healthy transportation systems and infrastructure;