{"title":"利用正交视觉输入对微型邻域进行可扩展的多模态评估","authors":"Miroslav Despotovic, Wolfgang A. Brunauer","doi":"10.1007/s10901-024-10153-2","DOIUrl":null,"url":null,"abstract":"<p>The features of the micro-location and in particular the micro-neighborhood that residents perceive on a daily basis have a considerable influence on the quality of living and also on housing prices. For automated valuation models (AVMs), the use of micro-neighborhood information would be beneficial, as incorporating additional spatial effects into the price estimate could potentially reduce the empirical error. However, measuring related features is difficult, as they must first be defined and then collected, which is extremely challenging at such a small spatial level. In this study, we investigate the extent to which the quality of micro-neighborhoods can be assessed holistically using multiple data modalities. We design a scalable approach using alternative data (images and text), with the potential to expand coverage to other urban regions. To achieve this, we propose a multimodal deep learning architecture that integrates both textual and visual inputs and fuses this information. In addition, we introduce a training strategy that enables a targeted fusion of orthogonal visual representations of the residential area within the model architecture. In our experiments, we test and compare different unimodal models with our multimodal architectures. The results demonstrate that the multimodal model with targeted fusion of the orthogonal visual inputs achieves the best performance and also improves the prediction accuracy for underrepresented location quality classes.</p>","PeriodicalId":47558,"journal":{"name":"Journal of Housing and the Built Environment","volume":"59 1","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Scalable multimodal assessment of the micro-neighborhood using orthogonal visual inputs\",\"authors\":\"Miroslav Despotovic, Wolfgang A. Brunauer\",\"doi\":\"10.1007/s10901-024-10153-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The features of the micro-location and in particular the micro-neighborhood that residents perceive on a daily basis have a considerable influence on the quality of living and also on housing prices. For automated valuation models (AVMs), the use of micro-neighborhood information would be beneficial, as incorporating additional spatial effects into the price estimate could potentially reduce the empirical error. However, measuring related features is difficult, as they must first be defined and then collected, which is extremely challenging at such a small spatial level. In this study, we investigate the extent to which the quality of micro-neighborhoods can be assessed holistically using multiple data modalities. We design a scalable approach using alternative data (images and text), with the potential to expand coverage to other urban regions. To achieve this, we propose a multimodal deep learning architecture that integrates both textual and visual inputs and fuses this information. In addition, we introduce a training strategy that enables a targeted fusion of orthogonal visual representations of the residential area within the model architecture. In our experiments, we test and compare different unimodal models with our multimodal architectures. The results demonstrate that the multimodal model with targeted fusion of the orthogonal visual inputs achieves the best performance and also improves the prediction accuracy for underrepresented location quality classes.</p>\",\"PeriodicalId\":47558,\"journal\":{\"name\":\"Journal of Housing and the Built Environment\",\"volume\":\"59 1\",\"pages\":\"\"},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2024-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Housing and the Built Environment\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://doi.org/10.1007/s10901-024-10153-2\",\"RegionNum\":3,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENVIRONMENTAL STUDIES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Housing and the Built Environment","FirstCategoryId":"96","ListUrlMain":"https://doi.org/10.1007/s10901-024-10153-2","RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL STUDIES","Score":null,"Total":0}
Scalable multimodal assessment of the micro-neighborhood using orthogonal visual inputs
The features of the micro-location and in particular the micro-neighborhood that residents perceive on a daily basis have a considerable influence on the quality of living and also on housing prices. For automated valuation models (AVMs), the use of micro-neighborhood information would be beneficial, as incorporating additional spatial effects into the price estimate could potentially reduce the empirical error. However, measuring related features is difficult, as they must first be defined and then collected, which is extremely challenging at such a small spatial level. In this study, we investigate the extent to which the quality of micro-neighborhoods can be assessed holistically using multiple data modalities. We design a scalable approach using alternative data (images and text), with the potential to expand coverage to other urban regions. To achieve this, we propose a multimodal deep learning architecture that integrates both textual and visual inputs and fuses this information. In addition, we introduce a training strategy that enables a targeted fusion of orthogonal visual representations of the residential area within the model architecture. In our experiments, we test and compare different unimodal models with our multimodal architectures. The results demonstrate that the multimodal model with targeted fusion of the orthogonal visual inputs achieves the best performance and also improves the prediction accuracy for underrepresented location quality classes.
期刊介绍:
The Journal of Housing and the Built Environment is a scholarly journal presenting the results of scientific research and new developments in policy and practice to a diverse readership of specialists, practitioners and policy-makers. This refereed journal covers the fields of housing, spatial planning, building and urban development. The journal guarantees high scientific quality by a double blind review procedure. Next to that, the editorial board discusses each article as well. Leading scholars in the field of housing, spatial planning and urban development publish regularly in Journal of Housing and the Built Environment. The journal publishes articles from scientists all over the world, both Western and non-Western, providing a truly international platform for developments in both theory and practice in the fields of housing, spatial planning, building and urban development.
Journal of Housing and the Built Environment (HBE) has a wide scope and includes all topics dealing with people-environment relations. Topics concern social relations within the built environment as well as the physicals component of the built environment. As such the journal brings together social science and engineering. HBE is of interest for scientists like housing researchers, social geographers, (urban) planners and architects. Furthermore it presents a forum for practitioners to present their experiences in new developments on policy and practice. Because of its unique structure of research articles and policy and practice contributions, HBE provides a forum where science and practice can be confronted. Finally, each volume of HBE contains one special issue, in which recent developments on one particular topic are discussed in depth.
The aim of Journal of Housing and the Built Environment is to give international exposure to recent research and policy and practice developments on the built environment and thereby open up a forum wherein re searchers can exchange ideas and develop contacts. In this way HBE seeks to enhance the quality of research in the field and disseminate the results to a wider audience. Its scope is intended to interest scientists as well as policy-makers, both in government and in organizations dealing with housing and urban issues.