The way residents perceive safety plays an important role in how they use public spaces, and it informs city planning and public policy. Recent studies have combined street view images and advanced computer vision techniques to measure human safety perceptions of urban environments. Despite their success, such studies have often overlooked the specific environmental visual factors that draw human attention and trigger people’s feelings of safety perceptions. In this study, we introduce a computational framework that enriches the existing body of literature on place perception by using eye-tracking systems with street view images and explainable AI approaches. Eye-tracking systems measure what users are looking at and how long they engage with specific environmental elements. This allows us to explore the nuance of which visual environmental factors influence human safety perceptions. We conducted our research and recruited volunteers in Helsingborg, Sweden. By examining participants’ focus on specific features using Mean Object Ratio in Highlighted Regions (MoRH) and Mean Object Hue (MoH), we identified key visual elements that attract human attention when perceiving safe environments. For instance, certain urban infrastructure (e.g., stairways and signboards) and public space (flags and chairs) features draw more human attention while the sky is less relevant in influencing safety perceptions. These insights offer a more human-centered understanding of which urban features influence human safety perceptions. Furthermore, we compared the real human attention from eye-tracking systems with attention maps obtained from eXplainable Artificial Intelligence (XAI) results. Several XAI models were tested, and we observed that XGradCAM and EigenCAM most closely align with human safety perceptual patterns. Our framework provides a valuable approach to enhance the interpretability and trustworthiness of XAI models by comparing them with empirically observed human behavior data. This study demonstrates the limitations of previous place perception studies that solely rely on street view images and computer vision techniques, which may not comprehensively capture the nuanced human experiences and behaviors at place. The inclusion of technologies such as eye-tracking not only deepens our comprehension of human subjective experiences, but also contributes to the development of safer environments and communities.
扫码关注我们
求助内容:
应助结果提醒方式:
