Perceptions of Edinburgh: Capturing Neighbourhood Characteristics by Clustering Geoparsed Local News

Andreas Grivas, Claire Grover, Richard Tobin, Clare Llewellyn, Eleojo Oluwaseun Abubakar, Chunyu Zheng, Chris Dibben, Alan Marshall, Jamie Pearce, Beatrice Alex
{"title":"Perceptions of Edinburgh: Capturing Neighbourhood Characteristics by Clustering Geoparsed Local News","authors":"Andreas Grivas, Claire Grover, Richard Tobin, Clare Llewellyn, Eleojo Oluwaseun Abubakar, Chunyu Zheng, Chris Dibben, Alan Marshall, Jamie Pearce, Beatrice Alex","doi":"arxiv-2409.11505","DOIUrl":null,"url":null,"abstract":"The communities that we live in affect our health in ways that are complex\nand hard to define. Moreover, our understanding of the place-based processes\naffecting health and inequalities is limited. This undermines the development\nof robust policy interventions to improve local health and well-being. News\nmedia provides social and community information that may be useful in health\nstudies. Here we propose a methodology for characterising neighbourhoods by\nusing local news articles. More specifically, we show how we can use Natural\nLanguage Processing (NLP) to unlock further information about neighbourhoods by\nanalysing, geoparsing and clustering news articles. Our work is novel because\nwe combine street-level geoparsing tailored to the locality with clustering of\nfull news articles, enabling a more detailed examination of neighbourhood\ncharacteristics. We evaluate our outputs and show via a confluence of evidence,\nboth from a qualitative and a quantitative perspective, that the themes we\nextract from news articles are sensible and reflect many characteristics of the\nreal world. This is significant because it allows us to better understand the\neffects of neighbourhoods on health. Our findings on neighbourhood\ncharacterisation using news data will support a new generation of place-based\nresearch which examines a wider set of spatial processes and how they affect\nhealth, enabling new epidemiological research.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"31 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Information Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11505","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The communities that we live in affect our health in ways that are complex and hard to define. Moreover, our understanding of the place-based processes affecting health and inequalities is limited. This undermines the development of robust policy interventions to improve local health and well-being. News media provides social and community information that may be useful in health studies. Here we propose a methodology for characterising neighbourhoods by using local news articles. More specifically, we show how we can use Natural Language Processing (NLP) to unlock further information about neighbourhoods by analysing, geoparsing and clustering news articles. Our work is novel because we combine street-level geoparsing tailored to the locality with clustering of full news articles, enabling a more detailed examination of neighbourhood characteristics. We evaluate our outputs and show via a confluence of evidence, both from a qualitative and a quantitative perspective, that the themes we extract from news articles are sensible and reflect many characteristics of the real world. This is significant because it allows us to better understand the effects of neighbourhoods on health. Our findings on neighbourhood characterisation using news data will support a new generation of place-based research which examines a wider set of spatial processes and how they affect health, enabling new epidemiological research.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
感知爱丁堡:通过聚类地方新闻捕捉邻里特征
我们生活的社区对我们的健康有着复杂而难以界定的影响。此外,我们对基于地方的影响健康和不平等的过程的了解也很有限。这不利于制定强有力的政策干预措施,以改善当地的健康和福祉。新闻媒体提供的社会和社区信息可能对健康研究有用。在此,我们提出了一种利用本地新闻报道来描述社区特征的方法。更具体地说,我们展示了如何利用自然语言处理(NLP)技术,通过对新闻报道进行分析、地理解析和聚类,进一步挖掘邻里信息。我们的工作很新颖,因为我们将根据当地情况定制的街道级地理解析与完整新闻文章的聚类相结合,从而能够更详细地检查街区特征。我们从定性和定量的角度评估了我们的成果,并通过一系列证据表明,我们从新闻文章中提取的主题是合理的,反映了现实世界的许多特征。这一点意义重大,因为它能让我们更好地了解社区对健康的影响。我们利用新闻数据进行邻里特征描述的研究结果将支持新一代基于地点的研究,该研究将探讨更广泛的空间过程及其对健康的影响,从而开展新的流行病学研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Decoding Style: Efficient Fine-Tuning of LLMs for Image-Guided Outfit Recommendation with Preference Retrieve, Annotate, Evaluate, Repeat: Leveraging Multimodal LLMs for Large-Scale Product Retrieval Evaluation Active Reconfigurable Intelligent Surface Empowered Synthetic Aperture Radar Imaging FLARE: Fusing Language Models and Collaborative Architectures for Recommender Enhancement Basket-Enhanced Heterogenous Hypergraph for Price-Sensitive Next Basket Recommendation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1