首页 > 最新文献

International Journal of Geographical Information Science最新文献

英文 中文
Estimating urban noise along road network from street view imagery 根据街景图像估计道路网沿线的城市噪音
1区 地球科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-11-03 DOI: 10.1080/13658816.2023.2274475
Jing Huang, Teng Fei, Yuhao Kang, Jun Li, Ziyu Liu, Guofeng Wu
AbstractEstimating road traffic noise is essential for examining the quality of sounding environment and mitigating such a non-negligible pollutant in urban areas. However, existing estimated models often have limited applicability to specific traffic conditions, while the required parameters may not be readily available for city-wide collection. This paper proposes a data-driven approach for measuring road-level acoustic information of traffic with street view imagery. Specifically, we utilize portable vehicle-equipped hardware for in-situ noise acquisition and employ a deep learning model ResNet to learn high-level visual features from street view images that are closely associated with road traffic noise. The ResNet captures meaningful patterns from the input data, and the output probability vectors are then fed into a Random-Forest regression algorithm to quantitatively estimate the noise in decibels for different road segments. The MAE and RMSE of the DCNN-RF model are 2.01 and 2.71, respectively. Additionally, we employ a gradient-weighted Class Active Mapping approach to visually interpret our deep learning model and explore the significant elements in streetscapes that contribute to the model's estimations. Our proposed framework facilitates low-cost and fine-scale road traffic noise estimations and sheds light on how auditory information could be inferred from street imagery, which may benefit practices in geography and urban planning.Keywords: Road traffic noisestreet view imagerydeep learningbuild environmenturban planning AcknowledgmentThe authors would like to thank Urli for the valuable advice provided during the initial stages of the experiment and Mr. Mengze Gao for designing and 3D-printing the enclosure for the data acquisition device. Thanks to the financial support from the National Natural Science Foundation of China [42271476]; the Wuhan University 351 Talent Program, 2020; the State Key Laboratory of Resources and Environmental Information System [2023OPEN007] and the Guangdong Science and Technology Strategic Innovation Fund [2020B1212030009].Disclosure statementNo potential conflict of interest was reported by the author(s).Data and codes availability statementThe original in-situ road traffic noise data with geographic coordinates collected by the experimental vehicle using our portable device, as well as the road traffic noise estimation model and some sample street view images used for demonstration are available at https://github.com/kellyhuang313/traffic-noise-estimation. Instructions for executing the code are provided in the README.txt.Notes1 https://www.openstreetmap.org/Additional informationFundingThis work was supported by the National Natural Science Foundation of China [42271476]; the Wuhan University 351 Talent Program, 2020; and the Guangdong Science and Technology Strategic Innovation Fund [2020B1212030009]. This work was also supported by State Key Laboratory of Resources and Environmental Information Sys
摘要道路交通噪声是检测城市测深环境质量和减轻城市交通噪声污染的重要手段。然而,现有的估计模型通常对特定交通状况的适用性有限,而所需的参数可能无法在全市范围内收集。本文提出了一种数据驱动的基于街景图像的道路声信息测量方法。具体来说,我们利用配备便携式车辆的硬件进行现场噪声采集,并使用深度学习模型ResNet从街景图像中学习与道路交通噪声密切相关的高级视觉特征。ResNet从输入数据中捕获有意义的模式,然后将输出概率向量输入随机森林回归算法,以定量估计不同路段的分贝噪声。DCNN-RF模型的MAE和RMSE分别为2.01和2.71。此外,我们采用梯度加权类主动映射方法来直观地解释我们的深度学习模型,并探索街景中有助于模型估计的重要元素。我们提出的框架有助于低成本和精细尺度的道路交通噪声估计,并阐明了如何从街道图像中推断听觉信息,这可能有利于地理和城市规划的实践。关键词:道路交通噪声街景图像深度学习建筑环境城市规划致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢致谢国家自然科学基金资助[42271476];武汉大学351人才计划(2020);资源与环境信息系统国家重点实验室[2023OPEN007]和广东省科技战略创新基金[2020B1212030009]。披露声明作者未报告潜在的利益冲突。数据和代码可用性声明实验车辆使用我们的便携式设备收集的原始地理坐标的现场道路交通噪声数据,以及用于演示的道路交通噪声估计模型和一些示例街景图像可在https://github.com/kellyhuang313/traffic-noise-estimation上获得。在README.txt中提供了执行代码的说明。注1 https://www.openstreetmap.org/Additional信息资助国家自然科学基金[42271476];武汉大学351人才计划(2020);广东省科技战略创新基金[2020B1212030009]。资源与环境信息系统国家重点实验室[2023OPEN007]资助。作者简介黄静,武汉大学资源与环境科学学院硕士研究生。主要研究方向为城市地理时空数据分析。她的贡献包括开发交通噪声估计模型,算法实现,进行案例研究,撰写论文稿件。滕飞,武汉大学资源与环境科学学院地图学与地理信息科学副教授,主要从事城市地理大数据与生态遥感研究。他为现场交通噪声数据采集的便携式设备的构思,概念化,设计和手稿修改做出了贡献。康宇豪,南卡罗来纳大学地理系GISense实验室助理教授。主要研究方向为以人为本的地理空间数据科学、地理信息科学、GeoAI、城市视觉智能等。他对方法的发展,以及对手稿的审查和编辑做出了贡献。李俊毕业于中国武汉大学资源与环境科学学院。他的研究方向是地理空间分析。他对街景图像和交通噪声数据的数据处理做出了贡献。刘子玉毕业于武汉大学资源与环境科学学院。她最近的工作重点是利用街景图像准确估计道路PV产量。在这项工作中,她对街景图像的数据收集和整理做出了贡献。吴国峰,中国深圳大学城市信息系教授。主要研究方向为遥感在自然资源与生态环境中的应用。他是这篇论文发表的共同负责人。
{"title":"Estimating urban noise along road network from street view imagery","authors":"Jing Huang, Teng Fei, Yuhao Kang, Jun Li, Ziyu Liu, Guofeng Wu","doi":"10.1080/13658816.2023.2274475","DOIUrl":"https://doi.org/10.1080/13658816.2023.2274475","url":null,"abstract":"AbstractEstimating road traffic noise is essential for examining the quality of sounding environment and mitigating such a non-negligible pollutant in urban areas. However, existing estimated models often have limited applicability to specific traffic conditions, while the required parameters may not be readily available for city-wide collection. This paper proposes a data-driven approach for measuring road-level acoustic information of traffic with street view imagery. Specifically, we utilize portable vehicle-equipped hardware for in-situ noise acquisition and employ a deep learning model ResNet to learn high-level visual features from street view images that are closely associated with road traffic noise. The ResNet captures meaningful patterns from the input data, and the output probability vectors are then fed into a Random-Forest regression algorithm to quantitatively estimate the noise in decibels for different road segments. The MAE and RMSE of the DCNN-RF model are 2.01 and 2.71, respectively. Additionally, we employ a gradient-weighted Class Active Mapping approach to visually interpret our deep learning model and explore the significant elements in streetscapes that contribute to the model's estimations. Our proposed framework facilitates low-cost and fine-scale road traffic noise estimations and sheds light on how auditory information could be inferred from street imagery, which may benefit practices in geography and urban planning.Keywords: Road traffic noisestreet view imagerydeep learningbuild environmenturban planning AcknowledgmentThe authors would like to thank Urli for the valuable advice provided during the initial stages of the experiment and Mr. Mengze Gao for designing and 3D-printing the enclosure for the data acquisition device. Thanks to the financial support from the National Natural Science Foundation of China [42271476]; the Wuhan University 351 Talent Program, 2020; the State Key Laboratory of Resources and Environmental Information System [2023OPEN007] and the Guangdong Science and Technology Strategic Innovation Fund [2020B1212030009].Disclosure statementNo potential conflict of interest was reported by the author(s).Data and codes availability statementThe original in-situ road traffic noise data with geographic coordinates collected by the experimental vehicle using our portable device, as well as the road traffic noise estimation model and some sample street view images used for demonstration are available at https://github.com/kellyhuang313/traffic-noise-estimation. Instructions for executing the code are provided in the README.txt.Notes1 https://www.openstreetmap.org/Additional informationFundingThis work was supported by the National Natural Science Foundation of China [42271476]; the Wuhan University 351 Talent Program, 2020; and the Guangdong Science and Technology Strategic Innovation Fund [2020B1212030009]. This work was also supported by State Key Laboratory of Resources and Environmental Information Sys","PeriodicalId":14162,"journal":{"name":"International Journal of Geographical Information Science","volume":"26 7","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135819359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiscale spatially varying coefficient modelling using a Geographical Gaussian Process GAM 基于地理高斯过程的多尺度空间变系数建模
1区 地球科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-27 DOI: 10.1080/13658816.2023.2270285
Alexis Comber, Paul Harris, Chris Brunsdon
This paper proposes a novel spatially varying coefficient (SVC) regression through a Geographical Gaussian Process GAM (GGP-GAM): a Generalized Additive Model (GAM) with Gaussian Process (GP) splines parameterised at observation locations. A GGP-GAM was applied to multiple simulated coefficient datasets exhibiting varying degrees of spatial heterogeneity and out-performed the SVC brand-leader, Multiscale Geographically Weighted Regression (MGWR), under a range of fit metrics. Both were then applied to a Brexit case study and compared, with MGWR marginally out-performing GGP-GAM. The theoretical frameworks and implementation of both approaches are discussed: GWR models calibrate multiple models whereas GAMs provide a full single model; GAMs can automatically penalise local collinearity; GWR-based approaches are computationally more demanding; MGWR is still only for Gaussian responses; MGWR bandwidths are intuitive indicators of spatial heterogeneity. GGP-GAM calibration and tuning are also discussed and areas of future work are identified, including the creation of a user-friendly package to support model creation and coefficient mapping, and to facilitate ease of comparison with alternate SVC models. A final observation that GGP-GAMs have the potential to overcome some of the long-standing reservations about GWR-based regression methods and to elevate the perception of SVCs amongst the broader community.
本文通过地理高斯过程GAM (GGP-GAM)提出了一种新的空间变系数(SVC)回归方法:高斯过程(GP)样条在观测位置参数化的广义加性模型(GAM)。将GGP-GAM应用于具有不同程度空间异质性的多个模拟系数数据集,并在一系列拟合指标下优于SVC品牌领导者多尺度地理加权回归(MGWR)。然后将两者应用于英国脱欧案例研究并进行比较,MGWR略优于GGP-GAM。讨论了两种方法的理论框架和实现:GWR模型校准多个模型,而GAMs提供完整的单一模型;GAMs可以自动惩罚局部共线性;基于gwr的方法在计算上要求更高;MGWR仍然只适用于高斯响应;MGWR带宽是空间异质性的直观指标。还讨论了GGP-GAM的校准和调整,并确定了未来工作的领域,包括创建一个用户友好的软件包来支持模型创建和系数映射,并促进与备选SVC模型的比较。最后观察到GGP-GAMs有潜力克服一些长期以来对基于gwr的回归方法的保留意见,并在更广泛的社区中提高对SVCs的认识。
{"title":"Multiscale spatially varying coefficient modelling using a Geographical Gaussian Process GAM","authors":"Alexis Comber, Paul Harris, Chris Brunsdon","doi":"10.1080/13658816.2023.2270285","DOIUrl":"https://doi.org/10.1080/13658816.2023.2270285","url":null,"abstract":"This paper proposes a novel spatially varying coefficient (SVC) regression through a Geographical Gaussian Process GAM (GGP-GAM): a Generalized Additive Model (GAM) with Gaussian Process (GP) splines parameterised at observation locations. A GGP-GAM was applied to multiple simulated coefficient datasets exhibiting varying degrees of spatial heterogeneity and out-performed the SVC brand-leader, Multiscale Geographically Weighted Regression (MGWR), under a range of fit metrics. Both were then applied to a Brexit case study and compared, with MGWR marginally out-performing GGP-GAM. The theoretical frameworks and implementation of both approaches are discussed: GWR models calibrate multiple models whereas GAMs provide a full single model; GAMs can automatically penalise local collinearity; GWR-based approaches are computationally more demanding; MGWR is still only for Gaussian responses; MGWR bandwidths are intuitive indicators of spatial heterogeneity. GGP-GAM calibration and tuning are also discussed and areas of future work are identified, including the creation of a user-friendly package to support model creation and coefficient mapping, and to facilitate ease of comparison with alternate SVC models. A final observation that GGP-GAMs have the potential to overcome some of the long-standing reservations about GWR-based regression methods and to elevate the perception of SVCs amongst the broader community.","PeriodicalId":14162,"journal":{"name":"International Journal of Geographical Information Science","volume":"156 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136261882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The impact of spatial scale on layout learning and individual evacuation behavior in indoor fires: single-scale learning perspectives 空间尺度对室内火灾中布局学习和个体疏散行为的影响:单尺度学习视角
1区 地球科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-26 DOI: 10.1080/13658816.2023.2271956
Jun Zhu, Pei Dang, Jinbin Zhang, Yungang Cao, Jianlin Wu, Weilian Li, Ya Hu, Jigang You
{"title":"The impact of spatial scale on layout learning and individual evacuation behavior in indoor fires: single-scale learning perspectives","authors":"Jun Zhu, Pei Dang, Jinbin Zhang, Yungang Cao, Jianlin Wu, Weilian Li, Ya Hu, Jigang You","doi":"10.1080/13658816.2023.2271956","DOIUrl":"https://doi.org/10.1080/13658816.2023.2271956","url":null,"abstract":"","PeriodicalId":14162,"journal":{"name":"International Journal of Geographical Information Science","volume":"31 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134908076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CoYangCZ: a new spatial interpolation method for nonstationary multivariate spatial processes CoYangCZ:一种新的非平稳多元空间过程插值方法
1区 地球科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-13 DOI: 10.1080/13658816.2023.2268665
Qiliang Liu, Yongchuan Zhu, Jie Yang, Xiancheng Mao, Min Deng
AbstractIn multivariate spatial interpolation, the accuracy of a variable of interest can be improved using ancillary variables. Although geostatistical methods are widely used for multivariate spatial interpolation, these methods usually require second-order stationary assumption of spatial processes, which is difficult to satisfy in practice. We developed a new multivariate spatial interpolation method based on Yang-Chizhong filtering (CoYangCZ) to overcome this limitation. CoYangCZ does not solve the multivariate spatial interpolation problem from a purely statistical point of view but integrates geometry and statistics-based strategies. First, we used a weighted moving average method based on binomial coefficients (i.e. Yang-Chizhong filtering) to fit the spatial autocorrelation structure of each spatial variable from a geometric perspective. We then quantified the spatial autocorrelation of each spatial variable and the correlations between different spatial variables by analyzing the variances of different spatial variables. Finally, we obtain the best linear unbiased estimators at the unsampled locations. Experiments on air pollution and meteorological datasets show that CoYangCZ has a higher interpolation accuracy than cokriging, regression kriging, gradient plus-inverse distance squared, sequential Gaussian co-simulation, and the kriging convolutional network. CoYangCZ can adapt to second-order non-stationary spatial processes; therefore, it has a wider scope of application than purely statistical methods.Keywords: Multivariate spatial processesspatial interpolationYang Chizhong filteringgeostatistics AcknowledgementsWe gratefully acknowledge the comments from the editor and the reviewers.Author contributionsQiliang Liu, Yongchuan Zhu, and Jie Yang conceived and designed the presented idea. Yongchuan Zhu and Jie Yang implemented the experiments and analysed the results. Qiliang Liu and Yongchuan Zhu wrote the manuscript. Xiancheng Mao and Min Deng reviewed the manuscript, and provided comments.Disclosure statementNo potential conflict of interest was reported by the author(s).Data and codes availability statementThe findings of this study are backed by data and codes that can be found on ‘figshare.com’, with the identifier at the public link: https://doi.org/10.6084/m9.figshare.24230179.Additional informationFundingThis study was funded through support from National Natural Science Foundation of China (NSFC) [No. 42271484 and 41971353] and Natural Science Foundation of Hunan Province [No. 2021JJ20058].Notes on contributorsQiliang LiuQiliang Liu is currently a professor at Central South University, Hunan, China. His research interests focus on multi-scale spatio-temporal data mining and spatiotemporal statistics. He has published more than 30 peer-reviewed journal articles in these areas.Yongchuan ZhuYongchuan Zhu is currently a postgraduate student at Central South University and his research interests focus on spatial statistics.Jie Yan
摘要在多元空间插值中,利用辅助变量可以提高感兴趣变量的精度。虽然地统计学方法被广泛用于多元空间插值,但这些方法通常需要对空间过程进行二阶平稳假设,这在实践中很难得到满足。为了克服这一局限性,我们提出了一种基于杨-池中滤波(CoYangCZ)的多元空间插值方法。CoYangCZ不是单纯从统计角度解决多元空间插值问题,而是将几何与统计相结合。首先,采用基于二项式系数的加权移动平均方法(即yang - chzhong滤波),从几何角度拟合各空间变量的空间自相关结构。然后,通过分析不同空间变量的方差,量化各空间变量的空间自相关性和不同空间变量之间的相关性。最后,我们得到了在未采样位置的最佳线性无偏估计量。在大气污染和气象数据集上的实验表明,CoYangCZ插值精度高于cokriging、回归kriging、梯度加逆距离平方、顺序高斯联合模拟和kriging卷积网络。CoYangCZ能够适应二阶非平稳空间过程;因此,它比单纯的统计方法具有更广泛的适用范围。关键词:多元空间过程空间插值杨赤忠滤波地质统计学感谢编辑和审稿人的意见。作者:刘四亮、朱永川、杨洁构思并设计了本文的思路。朱永川和杨洁进行了实验并分析了结果。刘启亮和朱永川撰写了手稿。毛宪成、邓敏审稿,并提出意见。披露声明作者未报告潜在的利益冲突。数据和代码可用性声明本研究的数据和代码可以在“figshare.com”上找到,公共链接的标识符为:https://doi.org/10.6084/m9.figshare.24230179.Additional information。湖南省自然科学基金项目[41971484和41971353];2021 jj20058]。作者简介刘其亮,现任中南大学教授。主要研究方向为多尺度时空数据挖掘和时空统计。他在这些领域发表了30多篇同行评议的期刊文章。朱永川,中南大学研究生,主要研究方向为空间统计。杨杰,中南大学博士研究生,主要研究方向为时空统计。毛贤成,现任中南大学教授。主要研究方向为三维地质建模和矿产远景制图。邓敏现任中南大学教授,中南大学地球科学与信息物理学院副院长。主要研究方向为地图综合、时空数据分析与挖掘。
{"title":"CoYangCZ: a new spatial interpolation method for nonstationary multivariate spatial processes","authors":"Qiliang Liu, Yongchuan Zhu, Jie Yang, Xiancheng Mao, Min Deng","doi":"10.1080/13658816.2023.2268665","DOIUrl":"https://doi.org/10.1080/13658816.2023.2268665","url":null,"abstract":"AbstractIn multivariate spatial interpolation, the accuracy of a variable of interest can be improved using ancillary variables. Although geostatistical methods are widely used for multivariate spatial interpolation, these methods usually require second-order stationary assumption of spatial processes, which is difficult to satisfy in practice. We developed a new multivariate spatial interpolation method based on Yang-Chizhong filtering (CoYangCZ) to overcome this limitation. CoYangCZ does not solve the multivariate spatial interpolation problem from a purely statistical point of view but integrates geometry and statistics-based strategies. First, we used a weighted moving average method based on binomial coefficients (i.e. Yang-Chizhong filtering) to fit the spatial autocorrelation structure of each spatial variable from a geometric perspective. We then quantified the spatial autocorrelation of each spatial variable and the correlations between different spatial variables by analyzing the variances of different spatial variables. Finally, we obtain the best linear unbiased estimators at the unsampled locations. Experiments on air pollution and meteorological datasets show that CoYangCZ has a higher interpolation accuracy than cokriging, regression kriging, gradient plus-inverse distance squared, sequential Gaussian co-simulation, and the kriging convolutional network. CoYangCZ can adapt to second-order non-stationary spatial processes; therefore, it has a wider scope of application than purely statistical methods.Keywords: Multivariate spatial processesspatial interpolationYang Chizhong filteringgeostatistics AcknowledgementsWe gratefully acknowledge the comments from the editor and the reviewers.Author contributionsQiliang Liu, Yongchuan Zhu, and Jie Yang conceived and designed the presented idea. Yongchuan Zhu and Jie Yang implemented the experiments and analysed the results. Qiliang Liu and Yongchuan Zhu wrote the manuscript. Xiancheng Mao and Min Deng reviewed the manuscript, and provided comments.Disclosure statementNo potential conflict of interest was reported by the author(s).Data and codes availability statementThe findings of this study are backed by data and codes that can be found on ‘figshare.com’, with the identifier at the public link: https://doi.org/10.6084/m9.figshare.24230179.Additional informationFundingThis study was funded through support from National Natural Science Foundation of China (NSFC) [No. 42271484 and 41971353] and Natural Science Foundation of Hunan Province [No. 2021JJ20058].Notes on contributorsQiliang LiuQiliang Liu is currently a professor at Central South University, Hunan, China. His research interests focus on multi-scale spatio-temporal data mining and spatiotemporal statistics. He has published more than 30 peer-reviewed journal articles in these areas.Yongchuan ZhuYongchuan Zhu is currently a postgraduate student at Central South University and his research interests focus on spatial statistics.Jie Yan","PeriodicalId":14162,"journal":{"name":"International Journal of Geographical Information Science","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135858084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extending regionalization algorithms to explore spatial process heterogeneity 扩展区域化算法探索空间过程异质性
1区 地球科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-13 DOI: 10.1080/13658816.2023.2266493
Hao Guo, Andre Python, Yu Liu
AbstractIn spatial regression models, spatial heterogeneity may be considered with either continuous or discrete specifications. The latter is related to delineation of spatially connected regions with homogeneous relationships between variables (spatial regimes). Although various regionalization algorithms have been proposed and studied in the field of spatial analytics, methods to optimize spatial regimes have been largely unexplored. In this paper, we propose two new algorithms for spatial regime delineation, two-stage K-Models and Regional-K-Models. We also extend the classic Automatic Zoning Procedure to a spatial regression context. The proposed algorithms are applied to a series of synthetic datasets and two real-world datasets. Results indicate that all three algorithms achieve superior or comparable performance to existing approaches, while the two-stage K-Models algorithm largely outperforms existing approaches on model fitting, region reconstruction and coefficient estimation. Our work enriches the spatial analytics toolbox to explore spatial heterogeneous processes.Keywords: Regionalizationspatial heterogeneityspatial regimespatial regression NotesAcknowledgmentsThe authors thank members of Spatial Analysis Group, Spatio-temporal Social Sensing Lab for helpful discussion. The constructive comments from anonymous reviewers are gratefully acknowledged.Data and codes availability statementThe data and codes that support the findings of this study are available at https://github.com/Nithouson/regreg.Disclosure statementThe authors declare that they have no conflict of interest.Notes1 For example, the population in each region is required to be as similar as possible or above a predefined value (see Duque et al. (Citation2012), Folch and Spielman (Citation2014), Wei et al. (Citation2021)).2 Note that the optimization of spatial regimes differs from Openshaw (Citation1978), where spatial units are aggregated into areas, and each area is treated as an observation in a global regression model.3 Note that in EquationEquation 1(1) L(R)=∑j=1p∑1≤i1<i2≤nI[ui1,ui2∈Rj]||xi1−xi2||2,(1) , the number of considered unit pairs in the sum is ∑j=1M(|Rj|2), which is smaller if |Rj|(j=1,…,M) are close to each other. Hence the objective function might favor solutions whose regions have similar numbers of units.4 Throughout the paper, we describe the case of lattice data (spatial data on areal units). Our approach is also applicable to point observation data after building adjacency (with k-nearest neighbors (KNN) or Delaunay triangulation, for example).5 This usually happens when min_obs is close to n/p, where p is the number of regions. Given min_obs≪n/p, this issue does not cause problems, as observed in our experiments.6 If a region with inadequate units has two or more neighboring regions, we select the neighbor which minimizes the total SSR after the merge.7 When min_obs is too large or K is too small (close to p), exceptions may occur that the number of
在空间回归模型中,空间异质性可以考虑连续或离散规格。后者与空间连接区域的描述有关,这些区域具有变量之间的均匀关系(空间制度)。虽然在空间分析领域已经提出和研究了各种区划算法,但优化空间制度的方法在很大程度上尚未得到探索。本文提出了两种新的空间状态描述算法:两阶段k -模型和区域k -模型。我们还将经典的自动分区程序扩展到空间回归环境。提出的算法应用于一系列合成数据集和两个真实数据集。结果表明,这三种算法的性能都优于或与现有方法相当,而两阶段K-Models算法在模型拟合、区域重建和系数估计方面大大优于现有方法。我们的工作丰富了空间分析工具箱,以探索空间异构过程。关键词:区域化;空间异质性;空间制度;感谢匿名审稿人提出的建设性意见。数据和代码可用性声明支持本研究结果的数据和代码可从https://github.com/Nithouson/regreg.Disclosure获取声明作者声明无利益冲突。注1例如,要求每个地区的人口尽可能接近或高于预定义值(参见Duque et al. (Citation2012), Folch and Spielman (Citation2014), Wei et al. (Citation2021))请注意,空间制度的优化不同于Openshaw (Citation1978),在Openshaw中,空间单元被聚合成区域,每个区域被视为全局回归模型中的一个观测值注意,在等式1(1)L(R)=∑j=1p∑1≤i1<i2≤nI[ui1,ui2∈Rj]||xi1−xi2||2,(1)中,求和中考虑的单位对个数为∑j=1M(|Rj|2),当|Rj|(j=1,…,M)彼此接近时,考虑的单位对个数越小。因此,目标函数可能倾向于具有相似单元数的区域的解在整篇论文中,我们描述了点阵数据(面单位上的空间数据)的情况。我们的方法也适用于建立邻接关系后的点观测数据(例如,与k近邻(KNN)或Delaunay三角剖分)这通常发生在min_obs接近n/p时,其中p是区域的数量。鉴于min_obs≪n/p,正如我们在实验中观察到的那样,这个问题不会造成问题如果单元不足的区域有两个或两个以上的相邻区域,我们选择合并后总SSR最小的相邻区域当min_obs太大或K太小(接近p)时,可能会出现区域数量小于p的例外情况,因此算法无法通过合并'微集群'来产生所需的区域数量。这个问题可以通过调整min_obs和K.8来解决。系数向量的OLS估计为β=(XTX) - 1XTy,其中X为自变量的nrx (m+1)矩阵,y为因变量的n维向量。在这里,通过添加一个常数为1的自变量,将截距包含在β中。通过应用Sherman-Morrison公式(Bartlett Citation1951)来更新(XTX)−1项,可以将时间复杂度从O(m2(nr+m))降低到O(m(nr+m))设βi,j表示系数βi在区域Rj中的值。在每次模拟中,列表(−2,−1,0,1,2)被随机洗牌两次,分别用作(β1,1,…,β1,5)和(β2,1,…,β2,5)Helbich等人(Citation2013)也对GWR系数进行了主成分分析。跳过这一步,因为在我们的实验中不需要降维在K-Models算法的两个阶段可以使用不同的min_obs值。这里min_obs=10用于合并阶段,而分区阶段的min_obs是本文中自变量的数量加1即使考虑平均SSR而不是最低SSR,两阶段k模型和AZP的表现也始终优于GWR-Skater和Skater-reg;Regional-K-Models与skater - regg相当,优于gwr - skater在我们的机器上,GWR估计没有在30分钟内完成在一台CPU为Intel酷睿i5-1135G7 (2.40 GHz)、内存为16GB的计算机上对King County房价数据集进行了实验。 基金资助:国家自然科学基金项目(41830645,42271426,41971331,82273731)、智慧广州时空信息云平台建设项目(GZIT2016-A5-147)和国家重点研发计划项目(2021YFC2701905)。郭浩,北京大学遥感与地理信息系统研究所博士研究生。他于2020年获得北京大学地理信息科学学士学位和数学双学士学位。主要研究方向为空间分析、地理空间人工智能和空间优化。Andre Python,浙江大学数据科学中心ZJU100青年统计学教授。他在瑞士弗里堡大学获得学士学位和硕士学位,在英国圣安德鲁斯大学获得博士学位。他开发并应用空间模型和可解释的机器学习算法,以更好地理解观察到的空间现象模式背后的机制。刘宇,现任北京大学遥感与地理信息系统研究所gisscience博雅教授。他分别于1994年、1997年和2003年获得北京大学学士学位、硕士学位和博士学位。主要研究方向为基于大地理数据的人文社会科学。
{"title":"Extending regionalization algorithms to explore spatial process heterogeneity","authors":"Hao Guo, Andre Python, Yu Liu","doi":"10.1080/13658816.2023.2266493","DOIUrl":"https://doi.org/10.1080/13658816.2023.2266493","url":null,"abstract":"AbstractIn spatial regression models, spatial heterogeneity may be considered with either continuous or discrete specifications. The latter is related to delineation of spatially connected regions with homogeneous relationships between variables (spatial regimes). Although various regionalization algorithms have been proposed and studied in the field of spatial analytics, methods to optimize spatial regimes have been largely unexplored. In this paper, we propose two new algorithms for spatial regime delineation, two-stage K-Models and Regional-K-Models. We also extend the classic Automatic Zoning Procedure to a spatial regression context. The proposed algorithms are applied to a series of synthetic datasets and two real-world datasets. Results indicate that all three algorithms achieve superior or comparable performance to existing approaches, while the two-stage K-Models algorithm largely outperforms existing approaches on model fitting, region reconstruction and coefficient estimation. Our work enriches the spatial analytics toolbox to explore spatial heterogeneous processes.Keywords: Regionalizationspatial heterogeneityspatial regimespatial regression NotesAcknowledgmentsThe authors thank members of Spatial Analysis Group, Spatio-temporal Social Sensing Lab for helpful discussion. The constructive comments from anonymous reviewers are gratefully acknowledged.Data and codes availability statementThe data and codes that support the findings of this study are available at https://github.com/Nithouson/regreg.Disclosure statementThe authors declare that they have no conflict of interest.Notes1 For example, the population in each region is required to be as similar as possible or above a predefined value (see Duque et al. (Citation2012), Folch and Spielman (Citation2014), Wei et al. (Citation2021)).2 Note that the optimization of spatial regimes differs from Openshaw (Citation1978), where spatial units are aggregated into areas, and each area is treated as an observation in a global regression model.3 Note that in EquationEquation 1(1) L(R)=∑j=1p∑1≤i1&lt;i2≤nI[ui1,ui2∈Rj]||xi1−xi2||2,(1) , the number of considered unit pairs in the sum is ∑j=1M(|Rj|2), which is smaller if |Rj|(j=1,…,M) are close to each other. Hence the objective function might favor solutions whose regions have similar numbers of units.4 Throughout the paper, we describe the case of lattice data (spatial data on areal units). Our approach is also applicable to point observation data after building adjacency (with k-nearest neighbors (KNN) or Delaunay triangulation, for example).5 This usually happens when min_obs is close to n/p, where p is the number of regions. Given min_obs≪n/p, this issue does not cause problems, as observed in our experiments.6 If a region with inadequate units has two or more neighboring regions, we select the neighbor which minimizes the total SSR after the merge.7 When min_obs is too large or K is too small (close to p), exceptions may occur that the number of","PeriodicalId":14162,"journal":{"name":"International Journal of Geographical Information Science","volume":"130 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135859064","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Graph based embedding learning of trajectory data for transportation mode recognition by fusing sequence and dependency relations 基于图的轨迹数据嵌入学习,融合序列和依赖关系进行交通模式识别
1区 地球科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-11 DOI: 10.1080/13658816.2023.2268668
Wenhao Yu, Guanwen Wang
AbstractAs an important task in spatial data mining, trajectory transportation mode recognition can reflect various individual behaviors and traveling patterns in urban space. As trajectory is essentially a sequence, many scholars use the sequence inference models to mine the information in trajectory data. However, such methods often ignored the spatial correlation between trajectory points and implemented the evaluation based only on representative feature statistics selected in the trajectory data preprocessing stage, thus have difficulties in acquiring high-order traveling pattern features. In this study, we propose a novel ensemble recognition method for representing trajectory data with the graph structure based on sequence and dependency relations. This method integrates the sequence of trajectory points and the correlation between characteristic points of a travel path into a fused graph convolutional network to obtain semantic feature information at multiple levels. We validate our proposed method with experiments on the trajectory benchmark dataset from the Microsoft GeoLife project. The results demonstrated that our proposed graph network outperforms other baseline methods in the transportation mode recognition task of trajectories. This method can help to discover the movement patterns of urban residents, and further provide effective assistance for the management of cities.Keywords: Trajectory datagraph convolution networktransportation mode recognitionfeature extractionfeature fusion AcknowledgmentsThe authors are grateful to the associate editor, Urska Demsar, and the anonymous referees for their valuable comments and suggestions. The project was supported by the National Natural Science Foundation of China (42371446 and 42071442) and by the Fundamental Research Funds for the Central Universities, China University of Geosciences (Wuhan) (No.CUG170640). This research was also supported by Meituan.Author contributionsWenhao Yu: Conceptualization, methodology, formal analysis, validation, writing—original draft preparation, writing—review and editing, supervision, project administration, funding acquisition; Guanwen Wang: Methodology, validation, formal analysis, investigation, writing—original draft preparation, writing—review and editing, visualization. All authors have read and agreed to the published version of the manuscript.Disclosure statementNo potential conflict of interest was reported by the author(s).Data and codes availability statementThe data and codes that support the findings of this study are available with a DOI at (https://doi.org/10.6084/m9.figshare.21608310).Additional informationNotes on contributorsWenhao YuWenhao Yu received the B.S. and Ph.D. degrees in Geoinformatics from the Wuhan University, Wuhan, China, in 2010 and 2015, respectively. He is a professor at China University of Geosciences, Wuhan, China (CUG). His research interests include spatial data mining, map generalization, and deep learning.Guanwen
摘要轨道交通模式识别是空间数据挖掘中的一项重要任务,能够反映城市空间中个体的各种行为和出行模式。由于轨迹本质上是一个序列,许多学者使用序列推理模型来挖掘轨迹数据中的信息。然而,这类方法往往忽略了轨迹点之间的空间相关性,仅根据轨迹数据预处理阶段选择的代表性特征统计量进行评价,难以获得高阶旅行模式特征。在本研究中,我们提出了一种新的基于序列和依赖关系的图结构来表示轨迹数据的集成识别方法。该方法将轨迹点的序列和行走路径特征点之间的相关性整合到融合图卷积网络中,获得多层次的语义特征信息。我们在Microsoft GeoLife项目的轨迹基准数据集上验证了我们提出的方法。结果表明,我们提出的图网络在轨道运输模式识别任务中优于其他基线方法。这种方法有助于发现城市居民的流动模式,进一步为城市的管理提供有效的帮助。关键词:轨迹数据图卷积网络运输模式识别特征提取特征融合致谢感谢副主编Urska Demsar和匿名审稿人提出的宝贵意见和建议。国家自然科学基金(42371446和42071442)和中国地质大学(武汉)中央高校基本科研业务费专项资金(cug170640)资助。这项研究也得到了美团的支持。作者:于文浩:构想、方法论、形式分析、验证、撰写-原稿、撰写-审编、监督、项目管理、资金获取;王冠文:方法论,验证,形式分析,调查,写作-原稿准备,写作-审查和编辑,可视化。所有作者都已阅读并同意稿件的出版版本。披露声明作者未报告潜在的利益冲突。数据和代码可用性声明支持本研究结果的数据和代码可通过DOI (https://doi.org/10.6084/m9.figshare.21608310).Additional information)获取。作者说明:于文豪于2010年和2015年分别获得武汉大学地理信息学学士学位和博士学位。中国地质大学(武汉)教授。主要研究方向为空间数据挖掘、地图泛化和深度学习。王冠文,中国地质大学(武汉)地理与信息工程学院硕士研究生。她的研究兴趣包括深度学习和空间数据挖掘。
{"title":"Graph based embedding learning of trajectory data for transportation mode recognition by fusing sequence and dependency relations","authors":"Wenhao Yu, Guanwen Wang","doi":"10.1080/13658816.2023.2268668","DOIUrl":"https://doi.org/10.1080/13658816.2023.2268668","url":null,"abstract":"AbstractAs an important task in spatial data mining, trajectory transportation mode recognition can reflect various individual behaviors and traveling patterns in urban space. As trajectory is essentially a sequence, many scholars use the sequence inference models to mine the information in trajectory data. However, such methods often ignored the spatial correlation between trajectory points and implemented the evaluation based only on representative feature statistics selected in the trajectory data preprocessing stage, thus have difficulties in acquiring high-order traveling pattern features. In this study, we propose a novel ensemble recognition method for representing trajectory data with the graph structure based on sequence and dependency relations. This method integrates the sequence of trajectory points and the correlation between characteristic points of a travel path into a fused graph convolutional network to obtain semantic feature information at multiple levels. We validate our proposed method with experiments on the trajectory benchmark dataset from the Microsoft GeoLife project. The results demonstrated that our proposed graph network outperforms other baseline methods in the transportation mode recognition task of trajectories. This method can help to discover the movement patterns of urban residents, and further provide effective assistance for the management of cities.Keywords: Trajectory datagraph convolution networktransportation mode recognitionfeature extractionfeature fusion AcknowledgmentsThe authors are grateful to the associate editor, Urska Demsar, and the anonymous referees for their valuable comments and suggestions. The project was supported by the National Natural Science Foundation of China (42371446 and 42071442) and by the Fundamental Research Funds for the Central Universities, China University of Geosciences (Wuhan) (No.CUG170640). This research was also supported by Meituan.Author contributionsWenhao Yu: Conceptualization, methodology, formal analysis, validation, writing—original draft preparation, writing—review and editing, supervision, project administration, funding acquisition; Guanwen Wang: Methodology, validation, formal analysis, investigation, writing—original draft preparation, writing—review and editing, visualization. All authors have read and agreed to the published version of the manuscript.Disclosure statementNo potential conflict of interest was reported by the author(s).Data and codes availability statementThe data and codes that support the findings of this study are available with a DOI at (https://doi.org/10.6084/m9.figshare.21608310).Additional informationNotes on contributorsWenhao YuWenhao Yu received the B.S. and Ph.D. degrees in Geoinformatics from the Wuhan University, Wuhan, China, in 2010 and 2015, respectively. He is a professor at China University of Geosciences, Wuhan, China (CUG). His research interests include spatial data mining, map generalization, and deep learning.Guanwen ","PeriodicalId":14162,"journal":{"name":"International Journal of Geographical Information Science","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136211026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Predicting households’ residential mobility trajectories with geographically localized interpretable model-agnostic explanation (GLIME) 基于地理定位可解释模型不可知解释(GLIME)的家庭居住迁移轨迹预测
1区 地球科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-10 DOI: 10.1080/13658816.2023.2264921
Chanwoo Jin, Sohyun Park, Hui Jeong Ha, Jinhyung Lee, Junghwan Kim, Johan Hutchenreuther, Atsushi Nara
AbstractHuman mobility analytics using artificial intelligence (AI) has gained significant attention with advancements in computational power and the availability of high-resolution spatial data. However, the application of deep learning in social sciences and human geography remains limited, primarily due to concerns with model explainability. In this study, we employ an explainable GeoAI approach called geographically localized interpretable model-agnostic explanation (GLIME) to explore human mobility patterns over large spatial and temporal extents. Specifically, we develop a two-layered long short-term memory (LSTM) model capable of predicting individual-level residential mobility patterns across the United States from 2012 to 2019. We leverage GLIME to provide geographical perspectives and interpret deep neural networks at the state level. The results reveal that GLIME enables spatially explicit interpretations of local impacts attributed to different variables. Our findings underscore the significance of considering path dependency in residential mobility dynamics. While the prediction of complex human spatial decision-making processes still presents challenges, this research demonstrates the utility of deep neural networks and explainable GeoAI to support human dynamics understanding. It sets the stage for further finely tuned investigations in the future, promising deep insights into intricate mobility phenomena.Keywords: Explainable GeoAImodel-agnostic explanationlong short-term memory (LSTM)trajectory predictionresidential mobility AcknowledgementsAny opinions, findings, and conclusions or recommendations expressed in this material are those of the author and do not necessarily reflect the views of the National Science Foundation.Disclosure statementNo potential conflict of interest was reported by the author(s).Data and codes availability statementThe data, codes, and instructions that support the findings of this study are available on figshare at https://doi.org/10.6084/m9.figshare.21543549.v1Notes1 We have 75 variables in total as categorical variables including state and housing type are input as dummy variables into the models.Additional informationFundingThis research was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (Grant No. RS-2022-00165821) and the Faculty of Social Science at Western University. This work was also supported in part by the National Science Foundation under Grant No. 2031407.Notes on contributorsChanwoo JinChanwoo Jin is an assistant professor in the Department of Humanities and Social Sciences at Northwest Missouri State University. He holds a PhD in Geography at the University of California, Santa Barbara/San Diego State University (Joint Doctoral Program). His main research interests include big spatiotemporal data analysis, Geospatial Artificial Intelligence (GeoAI), human mobility and urban dynamics.Sohyun ParkSohyun Park is an assistant professo
随着计算能力的提高和高分辨率空间数据的可用性,使用人工智能(AI)的人类移动性分析得到了极大的关注。然而,深度学习在社会科学和人文地理学中的应用仍然有限,主要是由于对模型可解释性的担忧。在这项研究中,我们采用了一种可解释的GeoAI方法,称为地理本地化可解释模型不可知论解释(GLIME),以探索大空间和时间范围内的人类流动模式。具体而言,我们开发了一个双层长短期记忆(LSTM)模型,该模型能够预测2012年至2019年美国个人层面的住宅流动模式。我们利用GLIME提供地理视角,并在州一级解释深度神经网络。结果表明,GLIME能够对归因于不同变量的局部影响进行空间显式解释。我们的研究结果强调了在住宅流动动力学中考虑路径依赖的重要性。虽然预测复杂的人类空间决策过程仍然存在挑战,但本研究证明了深度神经网络和可解释的GeoAI在支持人类动力学理解方面的效用。它为未来进一步精细的研究奠定了基础,有望深入了解复杂的移动现象。关键词:可解释地理模型不可知解释长短期记忆(LSTM)轨迹预测居住流动性致谢本材料中表达的任何观点、发现、结论或建议均为作者的观点,并不一定反映美国国家科学基金会的观点。披露声明作者未报告潜在的利益冲突。数据和代码可用性声明支持本研究结果的数据、代码和说明可在https://doi.org/10.6084/m9.figshare.21543549.v1Notes1上获得。我们总共有75个变量作为分类变量,包括状态和住房类型,作为虚拟变量输入到模型中。本研究由韩国国家研究基金会(NRF)资助,由韩国政府(MSIT)资助(批准号:RS-2022-00165821)和西部大学社会科学学院。这项工作也得到了国家科学基金的部分支持,资助号为2031407。本文作者Jin chanwoo是西北密苏里州立大学人文与社会科学系的助理教授。他拥有加州大学圣巴巴拉分校/圣地亚哥州立大学(联合博士项目)地理学博士学位。主要研究方向为大时空数据分析、地理空间人工智能(GeoAI)、人类移动与城市动态。Sohyun Park是韩国乔治梅森大学计算与数据科学助理教授。她的研究重点是人员和货物的流动与当地环境的相互作用。她还对通过定量方法探索地理空间数据感兴趣。Hui Jeong Ha是西方大学地理与环境系的博士生。她擅长人类运动的时空分析以及城市和社区变化的研究。她的工作旨在为地理知识发现创造新的方法,并开发支持人类运动研究的开源软件工具。Jinhyung Lee,美国西部大学地理与环境学系助理教授。他的研究兴趣主要集中在利用地理信息科学、空间分析和时间地理学方法研究城市交通。具体来说,他的目标是开发新的分析和建模技术,以支持人类在空间和时间上的流动性和可达性的研究。Junghwan Kim是弗吉尼亚理工大学地理系的助理教授,他的研究兴趣包括人类流动性(例如,旅行行为和可达性),环境健康,地理空间数据科学方法的应用以及地理空间数据隐私/伦理。john Hutchenreuther是西方大学地理与环境系的博士生。他的研究兴趣是利用地理信息科学、空间分析和时间序列数据来探索社区动态和城市形态。目前,他的研究重点是交通对社区演变的影响。Atsushi Nara是圣地亚哥州立大学地理系副教授,也是移动时代人类动力学中心的副主任。他拥有亚利桑那州立大学地理学博士学位。 主要研究方向为地理计算、时空数据分析与建模、人体动力学与运动行为、复杂自适应系统等。
{"title":"Predicting households’ residential mobility trajectories with geographically localized interpretable model-agnostic explanation (GLIME)","authors":"Chanwoo Jin, Sohyun Park, Hui Jeong Ha, Jinhyung Lee, Junghwan Kim, Johan Hutchenreuther, Atsushi Nara","doi":"10.1080/13658816.2023.2264921","DOIUrl":"https://doi.org/10.1080/13658816.2023.2264921","url":null,"abstract":"AbstractHuman mobility analytics using artificial intelligence (AI) has gained significant attention with advancements in computational power and the availability of high-resolution spatial data. However, the application of deep learning in social sciences and human geography remains limited, primarily due to concerns with model explainability. In this study, we employ an explainable GeoAI approach called geographically localized interpretable model-agnostic explanation (GLIME) to explore human mobility patterns over large spatial and temporal extents. Specifically, we develop a two-layered long short-term memory (LSTM) model capable of predicting individual-level residential mobility patterns across the United States from 2012 to 2019. We leverage GLIME to provide geographical perspectives and interpret deep neural networks at the state level. The results reveal that GLIME enables spatially explicit interpretations of local impacts attributed to different variables. Our findings underscore the significance of considering path dependency in residential mobility dynamics. While the prediction of complex human spatial decision-making processes still presents challenges, this research demonstrates the utility of deep neural networks and explainable GeoAI to support human dynamics understanding. It sets the stage for further finely tuned investigations in the future, promising deep insights into intricate mobility phenomena.Keywords: Explainable GeoAImodel-agnostic explanationlong short-term memory (LSTM)trajectory predictionresidential mobility AcknowledgementsAny opinions, findings, and conclusions or recommendations expressed in this material are those of the author and do not necessarily reflect the views of the National Science Foundation.Disclosure statementNo potential conflict of interest was reported by the author(s).Data and codes availability statementThe data, codes, and instructions that support the findings of this study are available on figshare at https://doi.org/10.6084/m9.figshare.21543549.v1Notes1 We have 75 variables in total as categorical variables including state and housing type are input as dummy variables into the models.Additional informationFundingThis research was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (Grant No. RS-2022-00165821) and the Faculty of Social Science at Western University. This work was also supported in part by the National Science Foundation under Grant No. 2031407.Notes on contributorsChanwoo JinChanwoo Jin is an assistant professor in the Department of Humanities and Social Sciences at Northwest Missouri State University. He holds a PhD in Geography at the University of California, Santa Barbara/San Diego State University (Joint Doctoral Program). His main research interests include big spatiotemporal data analysis, Geospatial Artificial Intelligence (GeoAI), human mobility and urban dynamics.Sohyun ParkSohyun Park is an assistant professo","PeriodicalId":14162,"journal":{"name":"International Journal of Geographical Information Science","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136357441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Geo-knowledge-guided GPT models improve the extraction of location descriptions from disaster-related social media messages 地理知识引导的GPT模型改进了从与灾害相关的社交媒体信息中提取位置描述的方法
1区 地球科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-09 DOI: 10.1080/13658816.2023.2266495
Yingjie Hu, Gengchen Mai, Chris Cundy, Kristy Choi, Ni Lao, Wei Liu, Gaurish Lakhanpal, Ryan Zhenqi Zhou, Kenneth Joseph
Social media messages posted by people during natural disasters often contain important location descriptions, such as the locations of victims. Recent research has shown that many of these location descriptions go beyond simple place names, such as city names and street names, and are difficult to extract using typical named entity recognition (NER) tools. While advanced machine learning models could be trained, they require large labeled training datasets that can be time-consuming and labor-intensive to create. In this work, we propose a method that fuses geo-knowledge of location descriptions and a Generative Pre-trained Transformer (GPT) model, such as ChatGPT and GPT-4. The result is a geo-knowledge-guided GPT model that can accurately extract location descriptions from disaster-related social media messages. Also, only 22 training examples encoding geo-knowledge are used in our method. We conduct experiments to compare this method with nine alternative approaches on a dataset of tweets from Hurricane Harvey. Our method demonstrates an over 40% improvement over typically used NER approaches. The experiment results also show that geo-knowledge is indispensable for guiding the behavior of GPT models. The extracted location descriptions can help disaster responders reach victims more quickly and may even save lives.
人们在自然灾害期间发布的社交媒体信息通常包含重要的位置描述,例如受害者的位置。最近的研究表明,许多这些位置描述超出了简单的地名,如城市名称和街道名称,并且难以使用典型的命名实体识别(NER)工具提取。虽然可以训练先进的机器学习模型,但它们需要大型标记训练数据集,创建这些数据集既耗时又费力。在这项工作中,我们提出了一种融合位置描述的地理知识和生成预训练变压器(GPT)模型的方法,如ChatGPT和GPT-4。结果是一个地理知识引导的GPT模型,可以从与灾害相关的社交媒体信息中准确提取位置描述。此外,我们的方法只使用了22个编码地理知识的训练样例。我们在哈维飓风的推特数据集上进行了实验,将这种方法与九种替代方法进行了比较。我们的方法比通常使用的NER方法提高了40%以上。实验结果还表明,地质知识对于指导GPT模型的行为是不可或缺的。提取的位置描述可以帮助救灾人员更快地找到受害者,甚至可能挽救生命。
{"title":"Geo-knowledge-guided GPT models improve the extraction of location descriptions from disaster-related social media messages","authors":"Yingjie Hu, Gengchen Mai, Chris Cundy, Kristy Choi, Ni Lao, Wei Liu, Gaurish Lakhanpal, Ryan Zhenqi Zhou, Kenneth Joseph","doi":"10.1080/13658816.2023.2266495","DOIUrl":"https://doi.org/10.1080/13658816.2023.2266495","url":null,"abstract":"Social media messages posted by people during natural disasters often contain important location descriptions, such as the locations of victims. Recent research has shown that many of these location descriptions go beyond simple place names, such as city names and street names, and are difficult to extract using typical named entity recognition (NER) tools. While advanced machine learning models could be trained, they require large labeled training datasets that can be time-consuming and labor-intensive to create. In this work, we propose a method that fuses geo-knowledge of location descriptions and a Generative Pre-trained Transformer (GPT) model, such as ChatGPT and GPT-4. The result is a geo-knowledge-guided GPT model that can accurately extract location descriptions from disaster-related social media messages. Also, only 22 training examples encoding geo-knowledge are used in our method. We conduct experiments to compare this method with nine alternative approaches on a dataset of tweets from Hurricane Harvey. Our method demonstrates an over 40% improvement over typically used NER approaches. The experiment results also show that geo-knowledge is indispensable for guiding the behavior of GPT models. The extracted location descriptions can help disaster responders reach victims more quickly and may even save lives.","PeriodicalId":14162,"journal":{"name":"International Journal of Geographical Information Science","volume":"266 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135043452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
CATS: Conditional Adversarial Trajectory Synthesis for privacy-preserving trajectory data publication using deep learning approaches CATS:使用深度学习方法进行隐私保护轨迹数据发布的条件对抗轨迹综合
1区 地球科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-09 DOI: 10.1080/13658816.2023.2262550
Jinmeng Rao, Song Gao, Sijia Zhu
AbstractThe prevalence of ubiquitous location-aware devices and mobile Internet enables us to collect massive individual-level trajectory dataset from users. Such trajectory big data bring new opportunities to human mobility research but also raise public concerns with regard to location privacy. In this work, we present the Conditional Adversarial Trajectory Synthesis (CATS), a deep-learning-based GeoAI methodological framework for privacy-preserving trajectory data generation and publication. CATS applies K-anonymity to the underlying spatiotemporal distributions of human movements, which provides a distributional-level strong privacy guarantee. By leveraging conditional adversarial training on K-anonymized human mobility matrices, trajectory global context learning using the attention-based mechanism, and recurrent bipartite graph matching of adjacent trajectory points, CATS is able to reconstruct trajectory topology from conditionally sampled locations and generate high-quality individual-level synthetic trajectory data, which can serve as supplements or alternatives to raw data for privacy-preserving trajectory data publication. The experiment results on over 90k GPS trajectories show that our method has a better performance in privacy preservation, spatiotemporal characteristic preservation, and downstream utility compared with baseline methods, which brings new insights into privacy-preserving human mobility research using generative AI techniques and explores data ethics issues in GIScience.Keywords: Geoprivacygenerative adversarial networkhuman mobilityGeoAIsynthetic data generation AcknowledgmentThe authors acknowledge the funding support provided by the American Family Insurance Data Science Institute Funding Initiative at the University of Wisconsin-Madison. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the funder(s).Disclosure statementNo potential conflict of interest was reported by the author(s).Data and codes availability statementThe data and codes that support the findings of this study are available at the following link on figshare: https://doi.org/10.6084/m9.figshare.20760970. It is worth noting that due to the non-disclosure agreement with the data provider, we are not releasing the original individual-level GPS trajectory data but sharing the k-anonymized aggregated human mobility data used in our experiments.Additional informationNotes on contributorsJinmeng RaoJinmeng Rao is a research scientist at Mineral Earth Sciences. He received his PhD degree from the Department of Geography, University of Wisconsin-Madison. His research interests include GeoAI, Privacy-Preserving AI, and Location Privacy.Song GaoSong Gao is an associate professor in GIScience at the Department of Geography, University of Wisconsin-Madison. He holds a PhD in Geography at the University of California, Santa Barbara. His main research intere
摘要无处不在的位置感知设备和移动互联网的普及,使我们能够从用户那里收集海量的个人层面的轨迹数据。这种轨迹大数据为人类移动研究带来了新的机遇,但也引起了公众对位置隐私的关注。在这项工作中,我们提出了条件对抗轨迹合成(CATS),这是一种基于深度学习的GeoAI方法框架,用于隐私保护轨迹数据的生成和发布。CATS将k -匿名应用于人类运动的底层时空分布,提供了分布级的强隐私保证。通过利用k匿名人类移动矩阵的条件对抗训练、基于注意机制的轨迹全局上下文学习以及相邻轨迹点的循环二部图匹配,CATS能够从有条件采样的位置重建轨迹拓扑,并生成高质量的个人级合成轨迹数据,这些数据可以作为原始数据的补充或替代,用于保护隐私的轨迹数据发布。在超过90k的GPS轨迹上的实验结果表明,与基线方法相比,我们的方法在隐私保护、时空特征保护和下游效用方面具有更好的性能,为利用生成式人工智能技术保护隐私的人类移动研究提供了新的见解,并探讨了GIScience中的数据伦理问题。关键词:地理隐私生成对抗网络人类移动性地理合成数据生成致谢作者感谢威斯康星大学麦迪逊分校美国家庭保险数据科学研究所资助计划提供的资金支持。本材料中表达的任何意见、发现、结论或建议均为作者的意见,并不一定反映资助者的观点。披露声明作者未报告潜在的利益冲突。数据和代码可用性声明支持本研究结果的数据和代码可从figshare的以下链接获得:https://doi.org/10.6084/m9.figshare.20760970。值得注意的是,由于与数据提供商的保密协议,我们不会发布原始的个人层面的GPS轨迹数据,而是共享我们实验中使用的k匿名聚合的人类移动数据。作者简介饶金梦,矿物地球科学研究所研究员。他获得美国威斯康星大学麦迪逊分校地理系博士学位。他的研究兴趣包括GeoAI、隐私保护AI和位置隐私。高松,美国威斯康星大学麦迪逊分校地理系地理科学副教授。他拥有加州大学圣巴巴拉分校地理学博士学位。他的主要研究兴趣包括基于地点的地理信息系统、地理空间数据科学和GeoAI方法在人类移动和社会感知方面的应用。朱思佳,哥伦比亚大学数据科学专业硕士研究生。她在威斯康星大学麦迪逊分校获得统计学和经济学学士学位。
{"title":"CATS: Conditional Adversarial Trajectory Synthesis for privacy-preserving trajectory data publication using deep learning approaches","authors":"Jinmeng Rao, Song Gao, Sijia Zhu","doi":"10.1080/13658816.2023.2262550","DOIUrl":"https://doi.org/10.1080/13658816.2023.2262550","url":null,"abstract":"AbstractThe prevalence of ubiquitous location-aware devices and mobile Internet enables us to collect massive individual-level trajectory dataset from users. Such trajectory big data bring new opportunities to human mobility research but also raise public concerns with regard to location privacy. In this work, we present the Conditional Adversarial Trajectory Synthesis (CATS), a deep-learning-based GeoAI methodological framework for privacy-preserving trajectory data generation and publication. CATS applies K-anonymity to the underlying spatiotemporal distributions of human movements, which provides a distributional-level strong privacy guarantee. By leveraging conditional adversarial training on K-anonymized human mobility matrices, trajectory global context learning using the attention-based mechanism, and recurrent bipartite graph matching of adjacent trajectory points, CATS is able to reconstruct trajectory topology from conditionally sampled locations and generate high-quality individual-level synthetic trajectory data, which can serve as supplements or alternatives to raw data for privacy-preserving trajectory data publication. The experiment results on over 90k GPS trajectories show that our method has a better performance in privacy preservation, spatiotemporal characteristic preservation, and downstream utility compared with baseline methods, which brings new insights into privacy-preserving human mobility research using generative AI techniques and explores data ethics issues in GIScience.Keywords: Geoprivacygenerative adversarial networkhuman mobilityGeoAIsynthetic data generation AcknowledgmentThe authors acknowledge the funding support provided by the American Family Insurance Data Science Institute Funding Initiative at the University of Wisconsin-Madison. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the funder(s).Disclosure statementNo potential conflict of interest was reported by the author(s).Data and codes availability statementThe data and codes that support the findings of this study are available at the following link on figshare: https://doi.org/10.6084/m9.figshare.20760970. It is worth noting that due to the non-disclosure agreement with the data provider, we are not releasing the original individual-level GPS trajectory data but sharing the k-anonymized aggregated human mobility data used in our experiments.Additional informationNotes on contributorsJinmeng RaoJinmeng Rao is a research scientist at Mineral Earth Sciences. He received his PhD degree from the Department of Geography, University of Wisconsin-Madison. His research interests include GeoAI, Privacy-Preserving AI, and Location Privacy.Song GaoSong Gao is an associate professor in GIScience at the Department of Geography, University of Wisconsin-Madison. He holds a PhD in Geography at the University of California, Santa Barbara. His main research intere","PeriodicalId":14162,"journal":{"name":"International Journal of Geographical Information Science","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135142073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Understanding and extending the geographical detector model under a linear regression framework 理解和扩展线性回归框架下的地理探测器模型
1区 地球科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-09 DOI: 10.1080/13658816.2023.2266497
Hang Zhang, Guanpeng Dong, Jinfeng Wang, Tong-Lin Zhang, Xiaoyu Meng, Dongyang Yang, Yong Liu, Binbin Lu
The Geographical Detector Model (GDM) is a popular statistical toolkit for geographical attribution analysis. Despite the striking resemblance of the q-statistic in GDM to the R-squared in linear regression models, their explicit connection has not yet been established. This study proves that the q-statistic reduces into the R-squared under a linear regression framework. Under linear regression and moderate-to-strong spatial autocorrelation, Monte Carlo simulation results show that the GDM tends to underestimate the importance of variables. In addition, an almost perfect power law relationship is present between the percentage bias and the degree of the spatial autocorrelations, indicating the presence of fast uplifting bias in response to increasing levels of spatial autocorrelations. We propose an integrated approach for variable importance quantification by bringing together the spatial econometrics model and the game theory based-Shapley value method. By applying our proposed methodology to a case study of land desertification in African, it is found human activity tends to affect land desertification both directly and indirectly. However, such effects appear to be underestimated or undistinguished in the classic GDM.
地理探测器模型(GDM)是一种流行的用于地理归因分析的统计工具。尽管GDM中的q统计量与线性回归模型中的r平方惊人地相似,但它们之间的明确联系尚未建立。本研究证明了在线性回归框架下,q统计量约化为r平方。在线性回归和中强空间自相关条件下,蒙特卡罗模拟结果表明,GDM倾向于低估变量的重要性。此外,偏差百分比与空间自相关程度之间存在几乎完美的幂律关系,表明随着空间自相关水平的提高,存在快速上升的偏差。本文将空间计量经济学模型与基于博弈论的shapley值方法相结合,提出了一种综合的变量重要性量化方法。通过将本文提出的方法应用于非洲土地沙漠化的案例研究,发现人类活动倾向于直接和间接地影响土地沙漠化。然而,这种影响在经典GDM中似乎被低估或未被区分。
{"title":"Understanding and extending the geographical detector model under a linear regression framework","authors":"Hang Zhang, Guanpeng Dong, Jinfeng Wang, Tong-Lin Zhang, Xiaoyu Meng, Dongyang Yang, Yong Liu, Binbin Lu","doi":"10.1080/13658816.2023.2266497","DOIUrl":"https://doi.org/10.1080/13658816.2023.2266497","url":null,"abstract":"The Geographical Detector Model (GDM) is a popular statistical toolkit for geographical attribution analysis. Despite the striking resemblance of the q-statistic in GDM to the R-squared in linear regression models, their explicit connection has not yet been established. This study proves that the q-statistic reduces into the R-squared under a linear regression framework. Under linear regression and moderate-to-strong spatial autocorrelation, Monte Carlo simulation results show that the GDM tends to underestimate the importance of variables. In addition, an almost perfect power law relationship is present between the percentage bias and the degree of the spatial autocorrelations, indicating the presence of fast uplifting bias in response to increasing levels of spatial autocorrelations. We propose an integrated approach for variable importance quantification by bringing together the spatial econometrics model and the game theory based-Shapley value method. By applying our proposed methodology to a case study of land desertification in African, it is found human activity tends to affect land desertification both directly and indirectly. However, such effects appear to be underestimated or undistinguished in the classic GDM.","PeriodicalId":14162,"journal":{"name":"International Journal of Geographical Information Science","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135141958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Geographical Information Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1