Andreas Grivas, Claire Grover, Richard Tobin, Clare Llewellyn, Eleojo Oluwaseun Abubakar, Chunyu Zheng, Chris Dibben, Alan Marshall, Jamie Pearce, Beatrice Alex
{"title":"Perceptions of Edinburgh: Capturing Neighbourhood Characteristics by Clustering Geoparsed Local News","authors":"Andreas Grivas, Claire Grover, Richard Tobin, Clare Llewellyn, Eleojo Oluwaseun Abubakar, Chunyu Zheng, Chris Dibben, Alan Marshall, Jamie Pearce, Beatrice Alex","doi":"arxiv-2409.11505","DOIUrl":null,"url":null,"abstract":"The communities that we live in affect our health in ways that are complex\nand hard to define. Moreover, our understanding of the place-based processes\naffecting health and inequalities is limited. This undermines the development\nof robust policy interventions to improve local health and well-being. News\nmedia provides social and community information that may be useful in health\nstudies. Here we propose a methodology for characterising neighbourhoods by\nusing local news articles. More specifically, we show how we can use Natural\nLanguage Processing (NLP) to unlock further information about neighbourhoods by\nanalysing, geoparsing and clustering news articles. Our work is novel because\nwe combine street-level geoparsing tailored to the locality with clustering of\nfull news articles, enabling a more detailed examination of neighbourhood\ncharacteristics. We evaluate our outputs and show via a confluence of evidence,\nboth from a qualitative and a quantitative perspective, that the themes we\nextract from news articles are sensible and reflect many characteristics of the\nreal world. This is significant because it allows us to better understand the\neffects of neighbourhoods on health. Our findings on neighbourhood\ncharacterisation using news data will support a new generation of place-based\nresearch which examines a wider set of spatial processes and how they affect\nhealth, enabling new epidemiological research.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"31 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Information Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11505","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The communities that we live in affect our health in ways that are complex
and hard to define. Moreover, our understanding of the place-based processes
affecting health and inequalities is limited. This undermines the development
of robust policy interventions to improve local health and well-being. News
media provides social and community information that may be useful in health
studies. Here we propose a methodology for characterising neighbourhoods by
using local news articles. More specifically, we show how we can use Natural
Language Processing (NLP) to unlock further information about neighbourhoods by
analysing, geoparsing and clustering news articles. Our work is novel because
we combine street-level geoparsing tailored to the locality with clustering of
full news articles, enabling a more detailed examination of neighbourhood
characteristics. We evaluate our outputs and show via a confluence of evidence,
both from a qualitative and a quantitative perspective, that the themes we
extract from news articles are sensible and reflect many characteristics of the
real world. This is significant because it allows us to better understand the
effects of neighbourhoods on health. Our findings on neighbourhood
characterisation using news data will support a new generation of place-based
research which examines a wider set of spatial processes and how they affect
health, enabling new epidemiological research.