首页 > 最新文献

International Journal for Population Data Science最新文献

英文 中文
Transport and Mobility Segregation in Urban Spaces 城市空间中的交通和流动隔离
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2268
Nandini Iyer, Ronaldo Menezes, Hugo Barbosa
Introduction & BackgroundPublic transportation is one of many factors that influence the level of disadvantage in a city. By facilitating movement within urban areas, transit systems can democratise accessibility to resources, while also fostering social integration among individuals from different areas and sociodemographic backgrounds. Conversely, inequalities in transport services can hinder individuals from fulfilling their travel demands. In this work, we explore socioeconomic segregation in cities from the perspective of their transit systems and how it intersects with segregation levels on a residential and employment level. Objectives & ApproachIn our analyses, we combine socioeconomic data from the 2020 American Community Survey with amenity visitation patterns from anonymised mobile phone traces, provided by SafeGraph, to estimate the mobility flows between areas (i.e., Census Block Groups - CBGs) in a given city. We define a CBG's segregation level using the Index of Concentration at the Extremes, which ranges from -1 to 1, reflecting extreme concentration of individuals from low and high income groups, respectively. Moreover, we retrieve General Transit Feed Specification and OpenStreetMap data to construct transit-pedestrian networks for various US cities. Relevance to Digital FootprintsWe leverage digital footprints, in the form of mobility flows between CBGs, to estimate the socioeconomic composition of different public transport routes within a city. By combining digital footprints with the respective economic breakdowns of trip origins, and transit-pedestrian networks, we can develop a better understanding of how segregated individuals are throughout various contexts of urban life. ResultsWhile segregation still exists in the transport and amenity dimensions, our findings suggest that individuals are exposed to the highest magnitudes of segregation in the residential dimension, with amenity and transit segregation allowing for potential avenues for reducing experiential segregation. However, we observe that the transit service in many cities hinders individuals in low-income neighbourhoods from accessing areas characterised by more affluent socioeconomic backgrounds. Conclusions & ImplicationsThese results underscore research that reveals how mobility patterns in neighbourhoods with a high concentration of underprivileged demographics, be it immigrant or ethnic minorities, tend to have more constrained activity spaces than their privileged counterparts. Although it is unclear whether mobility patterns are influenced by segregation levels of neighbourhoods, it is apparent that by limiting exposure to different types of neighbourhoods, transit systems impose constraints on the activity space and urban experience of individuals, namely those without access to personal vehicles. We highlight the benefit of analysing segregation as a spatio-temporal experience rather than a static variable, showing
介绍,公共交通是影响城市弱势程度的众多因素之一。通过促进城市地区内的流动,交通系统可以使资源的可及性民主化,同时也促进来自不同地区和社会人口背景的个人之间的社会融合。相反,交通服务方面的不平等可能阻碍个人满足其旅行需求。在这项工作中,我们从城市交通系统的角度探讨了城市的社会经济隔离,以及它如何与住宅和就业层面的隔离水平相交。 目标,方法在我们的分析中,我们将2020年美国社区调查的社会经济数据与SafeGraph提供的匿名移动电话痕迹的便利设施访问模式相结合,以估计给定城市中区域(即人口普查街区组- CBGs)之间的流动流量。我们使用极端浓度指数(Index of Concentration at the Extremes)来定义CBG的隔离水平,其范围从-1到1,分别反映了低收入群体和高收入群体个人的极端集中。此外,我们检索了通用交通馈送规范和OpenStreetMap数据,以构建美国各个城市的交通-行人网络。 与数字足迹的相关性我们利用数字足迹,以cbg之间的流动流的形式,来估计城市中不同公共交通路线的社会经济构成。通过将数字足迹与出行来源和公交-行人网络的各自经济细分相结合,我们可以更好地理解在城市生活的各种背景下,个体是如何被隔离的。 结果:虽然交通和舒适维度上的隔离仍然存在,但我们的研究结果表明,居民在居住维度上的隔离程度最高,而舒适和交通隔离为减少体验性隔离提供了潜在的途径。然而,我们观察到,许多城市的公共交通服务阻碍了低收入社区的个人进入社会经济背景更富裕的地区。结论,这些结果强调了一项研究,该研究揭示了贫困人口(无论是移民还是少数民族)高度集中的社区的流动性模式,往往比特权群体的活动空间更受限制。虽然尚不清楚流动模式是否受到社区隔离程度的影响,但很明显,通过限制与不同类型社区的接触,交通系统对个人(即无法获得个人车辆的人)的活动空间和城市体验施加了限制。我们强调了将隔离作为一种时空体验而不是静态变量进行分析的好处,展示了如何将流动性作为一种工具来尝试和克服住宅隔离。此外,确定过境系统内的不平等现象是提供更好的过境服务的第一步,特别是对来自特别脆弱人口群体的个人。最终,通过确定交通基础设施是如何使隔离永久化的,我们将采取许多步骤中的第一步,将交通重新构想为城市领域中的包容点。
{"title":"Transport and Mobility Segregation in Urban Spaces","authors":"Nandini Iyer, Ronaldo Menezes, Hugo Barbosa","doi":"10.23889/ijpds.v8i3.2268","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2268","url":null,"abstract":"Introduction & BackgroundPublic transportation is one of many factors that influence the level of disadvantage in a city. By facilitating movement within urban areas, transit systems can democratise accessibility to resources, while also fostering social integration among individuals from different areas and sociodemographic backgrounds. Conversely, inequalities in transport services can hinder individuals from fulfilling their travel demands. In this work, we explore socioeconomic segregation in cities from the perspective of their transit systems and how it intersects with segregation levels on a residential and employment level.
 Objectives & ApproachIn our analyses, we combine socioeconomic data from the 2020 American Community Survey with amenity visitation patterns from anonymised mobile phone traces, provided by SafeGraph, to estimate the mobility flows between areas (i.e., Census Block Groups - CBGs) in a given city. We define a CBG's segregation level using the Index of Concentration at the Extremes, which ranges from -1 to 1, reflecting extreme concentration of individuals from low and high income groups, respectively. Moreover, we retrieve General Transit Feed Specification and OpenStreetMap data to construct transit-pedestrian networks for various US cities.
 Relevance to Digital FootprintsWe leverage digital footprints, in the form of mobility flows between CBGs, to estimate the socioeconomic composition of different public transport routes within a city. By combining digital footprints with the respective economic breakdowns of trip origins, and transit-pedestrian networks, we can develop a better understanding of how segregated individuals are throughout various contexts of urban life.
 ResultsWhile segregation still exists in the transport and amenity dimensions, our findings suggest that individuals are exposed to the highest magnitudes of segregation in the residential dimension, with amenity and transit segregation allowing for potential avenues for reducing experiential segregation. However, we observe that the transit service in many cities hinders individuals in low-income neighbourhoods from accessing areas characterised by more affluent socioeconomic backgrounds.
 Conclusions & ImplicationsThese results underscore research that reveals how mobility patterns in neighbourhoods with a high concentration of underprivileged demographics, be it immigrant or ethnic minorities, tend to have more constrained activity spaces than their privileged counterparts. Although it is unclear whether mobility patterns are influenced by segregation levels of neighbourhoods, it is apparent that by limiting exposure to different types of neighbourhoods, transit systems impose constraints on the activity space and urban experience of individuals, namely those without access to personal vehicles. We highlight the benefit of analysing segregation as a spatio-temporal experience rather than a static variable, showing ","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135154081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
What can transactional data reveal about the prevalence of menstrual pain in England? 交易数据能揭示英国月经疼痛的普遍性吗?
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2272
Torty Sivill, Vanja Ljevar Ljevar, James Goulding, Anya Skatova
Introduction & BackgroundIt has been reported that up to 91% of those who menstruate experience associated pain. Despite its ubiquity, the prevalence of menstrual pain has been under researched due to stigma, disregard from medical professionals and a lack of data. It has also been reported that different demographics experience menstrual pain differently yet the impact of socio-demographic factors on menstrual pain remains to be explored on a national scale due to data scarcity. Objectives & ApproachIn this study, we propose one way to overcome this data barrier, using a novel measure of menstrual pain extracted from supermarket shopping data. We use these national datasets to identify individual customer behaviour patterns. Specifically, we use transactions involving both a pain and menstrual item as a proxy measure for menstrual pain. We investigate national menstrual pain sales and whether there are significant differences between deprived and less deprived areas of England. Relevance to Digital FootprintsThis paper brings together data from multiple sources, to provide a population level analysis of the prevalence of menstrual pain England. We use transactional data from a pharmaceutical retailer to develop a novel proxy measure for menstrual pain. We use various machine learning algorithms to explore the relationship between transactional data and various data sources pertaining to social deprivation. ResultsOur findings indicate that there is a high prevalence of menstrual pain with at least 26.7% of customers who purchase menstrual items also purchasing pain relief simultaneously. These customers are nearly four times more likely to purchase pain relief with a menstrual item than they are without. In addition, our results indicate a significant geographical disparity between menstrual pain transactions. We examine the relationship between a variety of deprivation factors and regional menstrual pain transactions and find average regional income has the highest predictive impact on menstrual pain sales. Contrary to what would expected from previous research, customers from the region with the lowest regional income were a third less likely (32%) to make a menstrual pain transaction than those from the highest income region. Conclusions & ImplicationsThis work motivates further research into the national prevalence of menstrual pain to understand why this regional disparity exists and whether it is a consequence of "period poverty". A better understanding of the sociodemographic factors associated with menstrual pain will help healthcare professionals stratify patients by risk, and could inform strategies to predict and prevent menstrual pain and its adverse impacts.
介绍,据报道,高达91%的经期女性经历过相关疼痛。尽管月经疼痛无处不在,但由于耻辱感、医疗专业人员的忽视和缺乏数据,人们一直在研究月经疼痛的普遍性。也有报道称,不同的人口对月经疼痛的体验不同,但由于数据缺乏,社会人口因素对月经疼痛的影响仍有待在全国范围内探索。 目标,在这项研究中,我们提出了一种克服这一数据障碍的方法,使用从超市购物数据中提取的一种新的月经疼痛测量方法。我们使用这些国家数据集来识别个人客户的行为模式。具体来说,我们使用涉及疼痛和月经项目的交易作为月经疼痛的代理度量。我们调查了全国经期镇痛药的销售情况,以及英格兰贫困地区和欠贫困地区之间是否存在显著差异。 与数字足迹相关本文汇集了来自多个来源的数据,以提供英国月经疼痛患病率的人口水平分析。我们使用来自医药零售商的交易数据来开发一种新的月经疼痛代理测量方法。我们使用各种机器学习算法来探索事务数据与与社会剥夺相关的各种数据源之间的关系。 结果调查结果显示,中国消费者对经期疼痛的患病率较高,至少26.7%的消费者在购买经期用品的同时也购买了镇痛药。这些顾客购买带有月经用品的止痛药的可能性几乎是不带月经用品的四倍。此外,我们的结果表明月经疼痛交易之间存在显著的地理差异。我们研究了各种剥夺因素与区域经痛交易之间的关系,发现平均区域收入对经痛销售具有最高的预测影响。与之前的研究预期相反,来自地区收入最低地区的客户进行经期疼痛交易的可能性(32%)比来自收入最高地区的客户低三分之一。结论,这项工作激发了对全国经期疼痛患病率的进一步研究,以了解为什么存在这种地区差异,以及这是否是“经期贫困”的结果。更好地了解与经期疼痛相关的社会人口因素将有助于医疗保健专业人员根据风险对患者进行分层,并可以为预测和预防经期疼痛及其不利影响的策略提供信息。
{"title":"What can transactional data reveal about the prevalence of menstrual pain in England?","authors":"Torty Sivill, Vanja Ljevar Ljevar, James Goulding, Anya Skatova","doi":"10.23889/ijpds.v8i3.2272","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2272","url":null,"abstract":"Introduction & BackgroundIt has been reported that up to 91% of those who menstruate experience associated pain. Despite its ubiquity, the prevalence of menstrual pain has been under researched due to stigma, disregard from medical professionals and a lack of data. It has also been reported that different demographics experience menstrual pain differently yet the impact of socio-demographic factors on menstrual pain remains to be explored on a national scale due to data scarcity.
 Objectives & ApproachIn this study, we propose one way to overcome this data barrier, using a novel measure of menstrual pain extracted from supermarket shopping data. We use these national datasets to identify individual customer behaviour patterns. Specifically, we use transactions involving both a pain and menstrual item as a proxy measure for menstrual pain. We investigate national menstrual pain sales and whether there are significant differences between deprived and less deprived areas of England.
 Relevance to Digital FootprintsThis paper brings together data from multiple sources, to provide a population level analysis of the prevalence of menstrual pain England. We use transactional data from a pharmaceutical retailer to develop a novel proxy measure for menstrual pain. We use various machine learning algorithms to explore the relationship between transactional data and various data sources pertaining to social deprivation.
 ResultsOur findings indicate that there is a high prevalence of menstrual pain with at least 26.7% of customers who purchase menstrual items also purchasing pain relief simultaneously. These customers are nearly four times more likely to purchase pain relief with a menstrual item than they are without. In addition, our results indicate a significant geographical disparity between menstrual pain transactions. We examine the relationship between a variety of deprivation factors and regional menstrual pain transactions and find average regional income has the highest predictive impact on menstrual pain sales. Contrary to what would expected from previous research, customers from the region with the lowest regional income were a third less likely (32%) to make a menstrual pain transaction than those from the highest income region.
 Conclusions & ImplicationsThis work motivates further research into the national prevalence of menstrual pain to understand why this regional disparity exists and whether it is a consequence of \"period poverty\". A better understanding of the sociodemographic factors associated with menstrual pain will help healthcare professionals stratify patients by risk, and could inform strategies to predict and prevent menstrual pain and its adverse impacts.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"121 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135154234","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Seasonal purchase of antihistamines and ovarian cancer risk in the Cancer Loyalty Card Study (CLOCS): results from an observational case-control study 癌症忠诚卡研究(CLOCS)中抗组胺药的季节性购买与卵巢癌风险:一项观察性病例对照研究的结果
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2292
Hannah Brewer, Qianhui Jiang, Sudha Sundar, Yasemin Hirst, James Flanagan
Introduction & BackgroundAntihistamine use has been associated with a reduction in ovarian cancer incidence. Herein, we investigate antihistamine exposure in relation to ovarian cancer risk using a novel data resource by examining purchase histories from retailer loyalty card data. Objectives & ApproachParticipants from the Cancer Loyalty Card Study (CLOCS) included ovarian cancer patients (cases, n=153) and women without a diagnosis of ovarian cancer (controls, n=120). Up to 6 years of purchase history was retrieved from two participating high street retailers from 2014-2022. Logistic regression was used to estimate the odds ratio (OR) and 95% confidence intervals for ovarian cancer associated with antihistamine purchases, adjusting for confounders. The association was stratified by season of purchase, age, histology, and family history. Relevance to Digital FootprintsThis study is one of the first to utilise transaction data from high street retailers to investigate associations with cancer risk, based on what participants are buying. ResultsEver purchasing antihistamines was not significantly associated with ovarian cancer overall in this small study (OR=0.68 (0.39-1.19)). However, antihistamine purchases were significantly associated with reduced ovarian cancer risk when purchased only in spring and/or summer (OR=0.37 (0.17-0.82)) and in non-serous ovarian cancer (OR=0.41 (0.18-0.93)) in stratified analyses. Conclusions & ImplicationsAntihistamine purchase is associated with reduced ovarian cancer risk when purchased seasonally. However, larger studies are required to understand the mechanisms of reduced ovarian cancer risk related to seasonal purchases of antihistamines and allergies.
介绍,背景:抗组胺药的使用与卵巢癌发病率的降低有关。在此,我们研究抗组胺暴露与卵巢癌风险的关系,使用一种新的数据资源,通过检查零售商会员卡数据中的购买历史。 目标,癌症忠诚卡研究(CLOCS)的参与者包括卵巢癌患者(病例,n=153)和未诊断为卵巢癌的女性(对照组,n=120)。从两家参与的高街零售商那里检索了2014-2022年长达6年的购买历史。使用逻辑回归估计与抗组胺购买相关的卵巢癌的比值比(OR)和95%置信区间,调整混杂因素。根据购买季节、年龄、组织学和家族史对相关性进行分层。 与数字足迹的相关性这项研究是第一个利用高街零售商的交易数据来调查与癌症风险的关系的研究之一,基于参与者正在购买的东西。结果在这项小型研究中,sever购买抗组胺药与卵巢癌总体上没有显著相关性(OR=0.68(0.39-1.19))。然而,在分层分析中,仅在春季和/或夏季购买抗组胺药(or =0.37(0.17-0.82))和在非浆液性卵巢癌中购买抗组胺药(or =0.41(0.18-0.93))与降低卵巢癌风险显著相关。结论,季节性购买组胺与降低卵巢癌风险有关。然而,需要更大规模的研究来了解与季节性购买抗组胺药和过敏有关的降低卵巢癌风险的机制。
{"title":"Seasonal purchase of antihistamines and ovarian cancer risk in the Cancer Loyalty Card Study (CLOCS): results from an observational case-control study","authors":"Hannah Brewer, Qianhui Jiang, Sudha Sundar, Yasemin Hirst, James Flanagan","doi":"10.23889/ijpds.v8i3.2292","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2292","url":null,"abstract":"Introduction & BackgroundAntihistamine use has been associated with a reduction in ovarian cancer incidence. Herein, we investigate antihistamine exposure in relation to ovarian cancer risk using a novel data resource by examining purchase histories from retailer loyalty card data.
 Objectives & ApproachParticipants from the Cancer Loyalty Card Study (CLOCS) included ovarian cancer patients (cases, n=153) and women without a diagnosis of ovarian cancer (controls, n=120). Up to 6 years of purchase history was retrieved from two participating high street retailers from 2014-2022. Logistic regression was used to estimate the odds ratio (OR) and 95% confidence intervals for ovarian cancer associated with antihistamine purchases, adjusting for confounders. The association was stratified by season of purchase, age, histology, and family history.
 Relevance to Digital FootprintsThis study is one of the first to utilise transaction data from high street retailers to investigate associations with cancer risk, based on what participants are buying.
 ResultsEver purchasing antihistamines was not significantly associated with ovarian cancer overall in this small study (OR=0.68 (0.39-1.19)). However, antihistamine purchases were significantly associated with reduced ovarian cancer risk when purchased only in spring and/or summer (OR=0.37 (0.17-0.82)) and in non-serous ovarian cancer (OR=0.41 (0.18-0.93)) in stratified analyses.
 Conclusions & ImplicationsAntihistamine purchase is associated with reduced ovarian cancer risk when purchased seasonally. However, larger studies are required to understand the mechanisms of reduced ovarian cancer risk related to seasonal purchases of antihistamines and allergies.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"2011 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135153277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring spatial patterns of vulnerability using linked health data 利用相关卫生数据探索脆弱性的空间格局
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2280
Abigail Brake, Daniel Birks, Mark Mon-Williams, Sam Relins
Introduction & BackgroundThe types of challenges police and ambulance services deal with often overlap, for instance supporting those who suffer from mental ill-health. Research has shown that emergency service problems often concentrate, but also that some individuals who come to the attention of one service may not be as visible to another despite their overlap in roles. Objectives & ApproachThis study explored how routinely collected 999 data may reveal insights into how these services support potentially vulnerable populations. We argue that better understanding the nature and distribution of vulnerability-related calls may help to inform future preventative or harm reduction-based interventions. We analysed administrative data provided by Yorkshire Ambulance Service for the Bradford region through the Connected Bradford research database, posing the following questions: (1) can 999 call data provide insights into vulnerability-related incidents attended by ambulances?; (2) where and when are these incidents most prevalent?; and (3) what are the spatial patterns of calls and patient home locations associated with them? Relevance to Digital FootprintsWe first select calls associated with nine callout reasons indicative of vulnerability. Patients can choose to share their data with each healthcare service they use, so we harnessed this digital footprint to analyse the spatial distribution of call locations (at postcode sector level) and patient home location (at MSOA level). ResultsResults indicate substantial concentrations of vulnerability-related calls in multiple postcode sectors including the City Centre (where we estimate 18% of calls may be vulnerability-related) and several other areas which are associated with deprivation. Exploring flows of people from their home location to incident location we also see substantial spatial variation in the locations in which patients involved in these types of incidents reside. Conclusions & ImplicationsThese analyses represent initial efforts to better understand how vulnerable groups are supported by public services, and have the potential to inform future resource allocation and targeting of upstream interventions.
介绍,背景警察和救护车服务处理的挑战类型往往重叠,例如支持患有精神疾病的人。研究表明,紧急服务问题往往是集中的,但也表明,一些引起一个服务部门注意的个人,尽管他们的角色重叠,但对另一个服务部门来说,可能并不那么明显。目标,本研究探讨了常规收集的999数据如何揭示这些服务如何支持潜在弱势群体的见解。我们认为,更好地了解脆弱性相关呼叫的性质和分布可能有助于为未来的预防或减少伤害的干预提供信息。我们通过连接布拉德福德研究数据库分析了约克郡救护车服务为布拉德福德地区提供的行政数据,提出了以下问题:(1)999呼叫数据是否可以为救护车参与的漏洞相关事件提供见解?(2)这些事件在何时何地最普遍?(3)呼叫的空间模式和与之相关的患者家庭位置是什么? 与数字足迹的相关性我们首先选择与表明脆弱性的九个调出原因相关的呼叫。患者可以选择与他们使用的每个医疗保健服务共享他们的数据,因此我们利用这些数字足迹来分析呼叫位置(在邮政编码扇区级别)和患者家庭位置(在MSOA级别)的空间分布。结果表明,与脆弱性相关的电话大量集中在多个邮政编码部门,包括市中心(我们估计18%的电话可能与脆弱性有关)和其他几个与剥夺相关的地区。通过研究人们从他们的家到事件发生地的流动情况,我们还发现,涉及这类事件的患者所居住的地点存在很大的空间差异。结论,这些分析代表了更好地了解弱势群体如何得到公共服务支持的初步努力,并有可能为未来的资源分配和上游干预措施的目标提供信息。
{"title":"Exploring spatial patterns of vulnerability using linked health data","authors":"Abigail Brake, Daniel Birks, Mark Mon-Williams, Sam Relins","doi":"10.23889/ijpds.v8i3.2280","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2280","url":null,"abstract":"Introduction & BackgroundThe types of challenges police and ambulance services deal with often overlap, for instance supporting those who suffer from mental ill-health. Research has shown that emergency service problems often concentrate, but also that some individuals who come to the attention of one service may not be as visible to another despite their overlap in roles.
 Objectives & ApproachThis study explored how routinely collected 999 data may reveal insights into how these services support potentially vulnerable populations. We argue that better understanding the nature and distribution of vulnerability-related calls may help to inform future preventative or harm reduction-based interventions. We analysed administrative data provided by Yorkshire Ambulance Service for the Bradford region through the Connected Bradford research database, posing the following questions: (1) can 999 call data provide insights into vulnerability-related incidents attended by ambulances?; (2) where and when are these incidents most prevalent?; and (3) what are the spatial patterns of calls and patient home locations associated with them?
 Relevance to Digital FootprintsWe first select calls associated with nine callout reasons indicative of vulnerability. Patients can choose to share their data with each healthcare service they use, so we harnessed this digital footprint to analyse the spatial distribution of call locations (at postcode sector level) and patient home location (at MSOA level).
 ResultsResults indicate substantial concentrations of vulnerability-related calls in multiple postcode sectors including the City Centre (where we estimate 18% of calls may be vulnerability-related) and several other areas which are associated with deprivation. Exploring flows of people from their home location to incident location we also see substantial spatial variation in the locations in which patients involved in these types of incidents reside.
 Conclusions & ImplicationsThese analyses represent initial efforts to better understand how vulnerable groups are supported by public services, and have the potential to inform future resource allocation and targeting of upstream interventions.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"218 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135153434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Attitudes towards Sharing Digital Footprint Data: a Discrete Choice Experiment 对共享数字足迹数据的态度:一个离散选择实验
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2287
Rebecca McDonald, Anya Skatova, Carsten Maple
Introduction & BackgroundDigital footprints data are key for the economy, underpinning business models and service provision. This information can also bring benefit to public good, yet sharing of digital footprints data are predicated on individual attitudes which in term depend on the value these data have to consumers. In this study, we investigated how individuals make decisions about sharing their digital footprints data, as well as which features of the data sharing scenario affect their decision to share the data. Objectives & ApproachWe used responses from a nationally representative sample of 2,087 UK residents to estimate public preferences towards sharing different types of digital footprint data in scenarios with different features. The main part of our experiment consisted of a Discrete Choice Experiment which allows the relative importance of the different features of data sharing scenarios to be established, revealing the tradeoffs participants make between them. Participants made a series of choices between two hypothetical data sharing scenario options or could “opt out” by choosing neither specified option. For example, we examined the differences in responses when data are shared for different purposes (e.g., for research vs private benefit), as well as when data are shared with more or less granular details about identity or location. The data were analysed using a logistic regression with an alternative-specific constant. Relevance to Digital FootprintsWe focused on understanding whether varied features of six different types of digital footprints data - namely banking transactions, electricity use at home, retail loyalty cards use, browsing history, social media, and physical activity data - affect people’s decision whether to share these data. ResultsParticipants were more likely to share their data with a university for academic research than with a private company or government. Participants were also most reluctant to share data alongside their personal identity. Participants were concerned with the recipient of the data and their purpose in requesting it; whether the data would be shared along with their location and if so, to what specificity; and with the level of aggregation of the data (i.e. whether it would be shared in fine detail or as a monthly summary). In addition, we demonstrated the importance of the type of data to be shared, with people most reluctant to share bank transactions data, but relatively unconcerned about sharing their physical activity, electricity use and loyalty cards data. Conclusions & ImplicationsWe contribute by highlighting the trade-offs individuals are willing to make between different elements of a data sharing situation, and the relative importance of these different aspects. We also demonstrate that individuals’ have positive attitudes to share digital footprints data for research benefiting public good. By integrating these preferences into ethic
介绍,数字足迹数据是经济的关键,支撑着商业模式和服务提供。这些信息也可以为公共利益带来好处,然而数字足迹数据的共享是基于个人态度的,这取决于这些数据对消费者的价值。在这项研究中,我们调查了个人如何做出共享其数字足迹数据的决定,以及数据共享场景的哪些特征影响了他们共享数据的决定。 目标,我们使用了来自2087名英国居民的全国代表性样本的回复,以估计公众在不同特征的场景中对共享不同类型的数字足迹数据的偏好。我们实验的主要部分包括一个离散选择实验,该实验允许建立数据共享场景的不同特征的相对重要性,揭示参与者在它们之间做出的权衡。参与者在两个假设的数据共享场景选项之间做出一系列选择,或者可以不选择任何指定选项而“选择退出”。例如,我们检查了出于不同目的共享数据时(例如,用于研究与私人利益)的反应差异,以及当数据与更多或更少的关于身份或位置的粒度细节共享时。使用具有替代特定常数的逻辑回归对数据进行分析。 与数字足迹的相关性我们专注于了解六种不同类型的数字足迹数据(即银行交易、家庭用电、零售会员卡使用、浏览历史、社交媒体和体育活动数据)的不同特征是否会影响人们是否分享这些数据的决定。 结果:与私人公司或政府相比,参与者更有可能与大学分享他们的数据进行学术研究。参与者也最不愿意在分享个人身份的同时分享数据。参加者关心资料的接收人及其索取资料的目的;这些数据是否会与他们的位置一起共享,如果是,具体到什么程度;以及数据的聚合程度(即,是详细共享还是按月汇总)。此外,我们证明了共享数据类型的重要性,人们最不愿意共享银行交易数据,但相对不关心共享他们的身体活动,用电量和会员卡数据。 结论,我们通过强调个人愿意在数据共享情况的不同元素之间做出的权衡,以及这些不同方面的相对重要性来贡献。我们还表明,个人对分享数字足迹数据以进行有利于公共利益的研究持积极态度。通过将这些偏好整合到道德和负责任的研究模型中,我们可以创建更公平、更平衡的数据共享框架,最终帮助人们对个人数字足迹数据做出更好的选择。
{"title":"Attitudes towards Sharing Digital Footprint Data: a Discrete Choice Experiment","authors":"Rebecca McDonald, Anya Skatova, Carsten Maple","doi":"10.23889/ijpds.v8i3.2287","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2287","url":null,"abstract":"Introduction & BackgroundDigital footprints data are key for the economy, underpinning business models and service provision. This information can also bring benefit to public good, yet sharing of digital footprints data are predicated on individual attitudes which in term depend on the value these data have to consumers. In this study, we investigated how individuals make decisions about sharing their digital footprints data, as well as which features of the data sharing scenario affect their decision to share the data.
 Objectives & ApproachWe used responses from a nationally representative sample of 2,087 UK residents to estimate public preferences towards sharing different types of digital footprint data in scenarios with different features. The main part of our experiment consisted of a Discrete Choice Experiment which allows the relative importance of the different features of data sharing scenarios to be established, revealing the tradeoffs participants make between them. Participants made a series of choices between two hypothetical data sharing scenario options or could “opt out” by choosing neither specified option. For example, we examined the differences in responses when data are shared for different purposes (e.g., for research vs private benefit), as well as when data are shared with more or less granular details about identity or location. The data were analysed using a logistic regression with an alternative-specific constant.
 Relevance to Digital FootprintsWe focused on understanding whether varied features of six different types of digital footprints data - namely banking transactions, electricity use at home, retail loyalty cards use, browsing history, social media, and physical activity data - affect people’s decision whether to share these data.
 ResultsParticipants were more likely to share their data with a university for academic research than with a private company or government. Participants were also most reluctant to share data alongside their personal identity. Participants were concerned with the recipient of the data and their purpose in requesting it; whether the data would be shared along with their location and if so, to what specificity; and with the level of aggregation of the data (i.e. whether it would be shared in fine detail or as a monthly summary). In addition, we demonstrated the importance of the type of data to be shared, with people most reluctant to share bank transactions data, but relatively unconcerned about sharing their physical activity, electricity use and loyalty cards data.
 Conclusions & ImplicationsWe contribute by highlighting the trade-offs individuals are willing to make between different elements of a data sharing situation, and the relative importance of these different aspects. We also demonstrate that individuals’ have positive attitudes to share digital footprints data for research benefiting public good. By integrating these preferences into ethic","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135153272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Carbon foot printing school meals: data linkage and engagement activity 碳足迹学校餐:数据链接和参与活动
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2294
Alexandra Dalton, Emily Ennis, Melinda Green, Michelle A Morris
Introduction & BackgroundFood production is a substantial contributor to greenhouse gas emissions and climate change. A more sustainable diet is often a healthier one, so making lower carbon food choices serves to benefit the planet and person. In order to understand the carbon footprint of food choices, linkage of recipe information to carbon footprint data and transaction records is required. To inform positive change insights from data linkage must be communicated to the target audience, in this case schoolchildren. Objectives & ApproachSchool dinner recipe information and transaction records for school meals at five schools for a six week period were acquired. Carbon footprint estimates were calculated for each recipe, using published data. An automated dashboard was created in order that these calculations could be replicated by catering teams. Carbon footprints were appended to the school transaction records for meal choices. An interactive web game was created in ‘top trump’s' style using a selection of the recipes, with carbon footprint and popularity ranking, generated from the transaction records. Relevance to Digital FootprintsTransactional meal sales data from schools are digital footprint data. In this work we link these digital footprint data to detailed recipe information with estimated carbon footprints from an open data source. ResultsThe Consumer Data Research Centre Carbon Calculator and The Planet Plates game were created. The Carbon Calculator is being used in a number of settings to support food procurement and recipe development. The Planet Plates game has been used in Leeds Schools to empower schoolchildren to make positive changes to lower the carbon footprint of their meal choices. The children were engaged with all the activities and not only learned about sustainability of their food choices, but about how data they generate can be used anonymously for public good. Conclusions & ImplicationsData linkage of digital footprint data is a powerful tool for behaviour change to tackle some of the world’s most pressing challenges. Methods and insights should be shared widely and made accessible to a range of stakeholders wherever possible.
介绍,粮食生产是温室气体排放和气候变化的重要因素。更可持续的饮食往往更健康,所以选择低碳食物有利于地球和人类。为了了解食物选择的碳足迹,需要将食谱信息与碳足迹数据和交易记录联系起来。为了告知积极的变化,必须将数据链接的见解传达给目标受众,在这种情况下是学童。 目标,获取了五所学校六周内的学校晚餐食谱信息和交易记录。使用已公布的数据,计算出每种配方的碳足迹估计值。为了让餐饮团队可以复制这些计算,我们创建了一个自动仪表板。碳足迹被附加到学校的膳食选择交易记录中。我们以“top trump’s”风格制作了一款互动网页游戏,使用了一系列食谱,并根据交易记录生成了碳足迹和受欢迎程度排名。与数字足迹相关学校的交易膳食销售数据是数字足迹数据。在这项工作中,我们将这些数字足迹数据与来自开放数据源的估计碳足迹的详细配方信息联系起来。结果:消费者数据研究中心碳计算器和行星盘游戏被创建。碳计算器在许多情况下被用于支持食品采购和配方开发。“星球餐盘”游戏已经在利兹的学校里使用,让学生们做出积极的改变,以降低他们选择膳食的碳足迹。孩子们参与了所有的活动,不仅了解了他们选择的食物的可持续性,还了解了他们产生的数据如何匿名用于公益事业。结论,数字足迹数据的数据链接是改变行为以应对世界上一些最紧迫挑战的有力工具。方法和见解应广泛分享,并尽可能使一系列利益攸关方能够获得。
{"title":"Carbon foot printing school meals: data linkage and engagement activity","authors":"Alexandra Dalton, Emily Ennis, Melinda Green, Michelle A Morris","doi":"10.23889/ijpds.v8i3.2294","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2294","url":null,"abstract":"Introduction & BackgroundFood production is a substantial contributor to greenhouse gas emissions and climate change. A more sustainable diet is often a healthier one, so making lower carbon food choices serves to benefit the planet and person. In order to understand the carbon footprint of food choices, linkage of recipe information to carbon footprint data and transaction records is required. To inform positive change insights from data linkage must be communicated to the target audience, in this case schoolchildren.
 Objectives & ApproachSchool dinner recipe information and transaction records for school meals at five schools for a six week period were acquired. Carbon footprint estimates were calculated for each recipe, using published data. An automated dashboard was created in order that these calculations could be replicated by catering teams. Carbon footprints were appended to the school transaction records for meal choices. An interactive web game was created in ‘top trump’s' style using a selection of the recipes, with carbon footprint and popularity ranking, generated from the transaction records.
 Relevance to Digital FootprintsTransactional meal sales data from schools are digital footprint data. In this work we link these digital footprint data to detailed recipe information with estimated carbon footprints from an open data source.
 ResultsThe Consumer Data Research Centre Carbon Calculator and The Planet Plates game were created. The Carbon Calculator is being used in a number of settings to support food procurement and recipe development. The Planet Plates game has been used in Leeds Schools to empower schoolchildren to make positive changes to lower the carbon footprint of their meal choices. The children were engaged with all the activities and not only learned about sustainability of their food choices, but about how data they generate can be used anonymously for public good.
 Conclusions & ImplicationsData linkage of digital footprint data is a powerful tool for behaviour change to tackle some of the world’s most pressing challenges. Methods and insights should be shared widely and made accessible to a range of stakeholders wherever possible.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"170 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135154383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SynthEco - A multi-layered digital ecosystem for analysing complex human behaviour in context SynthEco -一个多层次的数字生态系统,用于分析环境中复杂的人类行为
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2285
Antonia Gieschen, Catherine Paquet, Raja Sengupta, Anna-Liisa Aunio, Fares Belkhiria, Shawn Brown, Laurette Dube
Introduction & BackgroundHuman behaviour is multi-faceted and complex, with different dimensions interacting and impacting each other and individuals operating in an environmental context. In order to understand this behaviour better, the combination of data from different sources is useful to uncover some of those interactions and complexities. We present a multi-layered digital ecosystem based on a data platform providing statistically representative synthetic population derived from census data at different geo-spatial granularity, which we call SynthEco. This platform is enriched with individual data stemming from cohorts and cross-sectional surveys and geo-scanning of different layers of socio-environmental actors and conditions to create a complex digital ecosystem. Objectives & ApproachThe objective of SynthEco is to allow for the analysis of behaviour, as well as health and wellbeing outcomes, through the integration of cohort and cross-sectional data into a geospatially anchored synthetic population embedded into environmental data which is forming the backdrop. We demonstrate the use of this platform on the example of Montreal, Canada. The synthetic population is first generated from census data using iterative proportional fitting, which allows for the creation of a population data set that is artificial yet statistically representative for a given geospatial granularity, such as a city. Each individual household is assigned a geospatial location, which allows for the consideration of their surrounding environment including enterprises or institutions such as schools, hospitals and the local food environment. Through fuzzy matching and statistical extrapolation, different cohort and cross-sectional survey data are then merged to individual records, in order to describe them in more detail. This includes health, as well as financial wellbeing or social environment descriptors. Relevance to Digital FootprintsThere are two important points made through the presented work in relation to Digital Footprints data: the first is the technical approach to merging multiple datasets describing different dimensions of interacting human characteristics and behaviour by anchoring them into a synthetic population through fuzzy record matching. The second is the consideration of a spatial dimension when describing human behaviour. This is especially important when describing behaviour within local environments, such as the interaction with local food outlets. ResultsRecent work in this context includes an analysis of the food environment in Montreal, Canada. It introduces a way of utilising the synthetic population to predict the healthfulness of their local environment in terms of healthy food outlets, as well as providing a platform for the analysis of food environment surveillance and intervention simulations. For this purpose, the healthfulness of different census tract regions in Montreal is calculated to identify food de
介绍,人类行为是多方面和复杂的,不同的维度相互作用和影响,个人在环境背景下运作。为了更好地理解这种行为,来自不同来源的数据组合有助于揭示其中的一些交互和复杂性。我们提出了一个基于数据平台的多层数字生态系统,该平台提供从不同地理空间粒度的人口普查数据中提取的具有统计代表性的合成人口,我们称之为SynthEco。该平台丰富了来自队列和横断面调查的个人数据,以及对不同层次的社会环境行为者和条件的地理扫描,以创建一个复杂的数字生态系统。 目标,SynthEco的目标是通过将队列和横断面数据整合到嵌入环境数据的地理空间锚定的合成人口中,从而分析行为以及健康和福祉结果。环境数据正在形成背景。我们以加拿大蒙特利尔为例说明了该平台的使用。首先使用迭代比例拟合从人口普查数据生成合成人口,这允许创建人口数据集,该数据集是人工的,但在统计上代表给定的地理空间粒度,例如城市。每个家庭都被分配了一个地理空间位置,以便考虑其周围环境,包括企业或机构,如学校、医院和当地食品环境。然后,通过模糊匹配和统计外推,将不同队列和横断面调查数据合并到个人记录中,以便更详细地描述它们。这包括健康,以及财务状况或社会环境描述符。与数字足迹相关通过所介绍的与数字足迹数据相关的工作,有两个要点:首先是通过模糊记录匹配将描述交互人类特征和行为的不同维度的多个数据集锚定到合成人口中,从而合并多个数据集的技术方法。第二个是在描述人类行为时对空间维度的考虑。这在描述当地环境中的行为时尤其重要,例如与当地食品商店的互动。 最近在这方面的工作包括对加拿大蒙特利尔食品环境的分析。它引入了一种利用合成人口来预测其当地环境在健康食品销售点方面的健康状况的方法,并为食品环境监测和干预模拟分析提供了一个平台。为此,对蒙特利尔不同人口普查区的健康状况进行了计算,以确定通过修订零售食品环境指数定义的食物沙漠、食物沼泽和健康区域。我们测试了不同的机器学习方法,然后使用来自各自人口普查区的合成人口的人口普查变量来预测这些健康分数,获得了大约0.53到0.60的准确性分数。这表明,人口普查数据在解释当地食品环境的健康状况方面具有有限的预测能力,这对于当地决策者无法获得零售商信息的情况尤其重要。未来的工作可以扩展这种方法,包括进一步描述人口的数据,这些数据来自综合队列和调查数据,这可以提高预测的准确性或帮助确定关注的领域。结论,所提出的SynthEco平台将个体视为嵌套在系统的模块化系统中的代理,试图捕获内部系统和过程以及个体在其中操作的环境。因此,该平台能够应用计算系统建模来分析环境中的个体人类行为。正如在更健康的食品环境背景下使用SynthEco的例子所表明的那样,该方法特别适用于对地方干预战略感兴趣的从业人员和政策制定者,并确定与健康和福祉的不同层面相关的目标政策领域。
{"title":"SynthEco - A multi-layered digital ecosystem for analysing complex human behaviour in context","authors":"Antonia Gieschen, Catherine Paquet, Raja Sengupta, Anna-Liisa Aunio, Fares Belkhiria, Shawn Brown, Laurette Dube","doi":"10.23889/ijpds.v8i3.2285","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2285","url":null,"abstract":"Introduction & BackgroundHuman behaviour is multi-faceted and complex, with different dimensions interacting and impacting each other and individuals operating in an environmental context. In order to understand this behaviour better, the combination of data from different sources is useful to uncover some of those interactions and complexities. We present a multi-layered digital ecosystem based on a data platform providing statistically representative synthetic population derived from census data at different geo-spatial granularity, which we call SynthEco. This platform is enriched with individual data stemming from cohorts and cross-sectional surveys and geo-scanning of different layers of socio-environmental actors and conditions to create a complex digital ecosystem.
 Objectives & ApproachThe objective of SynthEco is to allow for the analysis of behaviour, as well as health and wellbeing outcomes, through the integration of cohort and cross-sectional data into a geospatially anchored synthetic population embedded into environmental data which is forming the backdrop. We demonstrate the use of this platform on the example of Montreal, Canada. The synthetic population is first generated from census data using iterative proportional fitting, which allows for the creation of a population data set that is artificial yet statistically representative for a given geospatial granularity, such as a city. Each individual household is assigned a geospatial location, which allows for the consideration of their surrounding environment including enterprises or institutions such as schools, hospitals and the local food environment. Through fuzzy matching and statistical extrapolation, different cohort and cross-sectional survey data are then merged to individual records, in order to describe them in more detail. This includes health, as well as financial wellbeing or social environment descriptors.
 Relevance to Digital FootprintsThere are two important points made through the presented work in relation to Digital Footprints data: the first is the technical approach to merging multiple datasets describing different dimensions of interacting human characteristics and behaviour by anchoring them into a synthetic population through fuzzy record matching. The second is the consideration of a spatial dimension when describing human behaviour. This is especially important when describing behaviour within local environments, such as the interaction with local food outlets.
 ResultsRecent work in this context includes an analysis of the food environment in Montreal, Canada. It introduces a way of utilising the synthetic population to predict the healthfulness of their local environment in terms of healthy food outlets, as well as providing a platform for the analysis of food environment surveillance and intervention simulations. For this purpose, the healthfulness of different census tract regions in Montreal is calculated to identify food de","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135154389","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Longitudinal reliability of Twitter sentiment for measuring mental health and well-being in a UK birth cohort 推特情绪对衡量英国出生队列心理健康和幸福的纵向可靠性
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2278
Nina Di Cara, Oliver Di Davis, Claire Haworth
Introduction & BackgroundSocial media data is increasingly recognised as an important source of behavioural data. It can provide insights into patterns of life and how individuals and groups are feeling. However, many studies into social media’s relationship to mental health and well-being have suffered from poorly developed ground-truth data, which relies on assumed ground-truth labels and data from single timepoints. This means that the accuracy of models at future timepoints cannot be assessed. Collecting Twitter data from cohorts provides a solution to this issue, given the many years of high quality data that can be used as ground truth. Cohorts can also benefit from the higher-resolution data provided by social media that can supplement their traditional data collection methods. Objectives & ApproachWe used Twitter data that has been collected with consent from two generations of the Avon Longitudinal Study of Parents and Children (ALSPAC) (N=656). The data is linked to two surveys completed in April-May 2020 and May-July 2020 for validated outcome measures of anxiety, depression, and general well-being. Using the LIWC and VADER sentiment algorithms, the sentiment categories most highly associated with each outcome were used to develop a multiple regression model for each of anxiety, depression and general well-being using the first survey timepoint. Error from these models in predicting the second timepoint allowed us to assess how well different outcomes are predicted by demographic group. Relevance to Digital FootprintsDigital footprint data can complement traditional data sources to provide a more nuanced view of health inequalities. These data are typically less timely to collect than traditional data collection methods (census, survey) allowing a more reactive response to emergent issues such as the cost-of-living crisis. ResultsThis study illustrates how the collection of digital footprint data can be integrated into existing long-term studies which can be used to provide multiple points of ground-truth data. Conclusions & ImplicationsThis study has shown that the collection and integration of Twitter data into cohort studies is feasible, and that cohort data provides multiple ground-truth options. This time series data is important for assessing the potential feasibility of mental health inference from online behavioural data, which this study shows may vary across personal characteristics. In future research we plan to link subsequent surveys from ALSPAC to provide more ground truth time points and explore the temporal stability of predictions, and impacts of model drift on performance.
介绍,社交媒体数据越来越被认为是行为数据的重要来源。它可以提供对生活模式的洞察,以及个人和群体的感受。然而,许多关于社交媒体与心理健康和幸福关系的研究都受到了基础事实数据不完善的影响,这些数据依赖于假设的基础事实标签和单一时间点的数据。这意味着模型在未来时间点的准确性无法评估。从群组中收集Twitter数据为这个问题提供了一个解决方案,因为多年的高质量数据可以作为基础事实。群体还可以从社交媒体提供的高分辨率数据中受益,这些数据可以补充他们传统的数据收集方法。 目标,我们使用了雅芳父母与儿童纵向研究(ALSPAC)中两代人(N=656)在征得同意的情况下收集的Twitter数据。这些数据与2020年4月至5月和2020年5月至7月完成的两项调查有关,这些调查旨在验证焦虑、抑郁和总体幸福感的结果测量。使用LIWC和VADER情绪算法,使用与每个结果相关度最高的情绪类别,在第一个调查时间点为焦虑、抑郁和一般幸福感建立多元回归模型。这些模型在预测第二个时间点时的误差使我们能够评估不同人口群体对不同结果的预测程度。与数字足迹的相关性数字足迹数据可以补充传统的数据来源,对卫生不平等现象提供更细致入微的看法。与传统的数据收集方法(人口普查、调查)相比,这些数据的收集通常不及时,因此可以对诸如生活成本危机等紧急问题做出更被动的反应。结果:本研究说明了如何将数字足迹数据的收集整合到现有的长期研究中,从而提供多点地面真值数据。 结论,本研究表明,将Twitter数据收集和整合到队列研究中是可行的,并且队列数据提供了多个基本事实选项。该时间序列数据对于评估从在线行为数据推断心理健康的潜在可行性非常重要,该研究表明,在线行为数据可能因个人特征而异。在未来的研究中,我们计划将ALSPAC的后续调查联系起来,以提供更多的地面真实时间点,并探索预测的时间稳定性,以及模型漂移对性能的影响。
{"title":"Longitudinal reliability of Twitter sentiment for measuring mental health and well-being in a UK birth cohort","authors":"Nina Di Cara, Oliver Di Davis, Claire Haworth","doi":"10.23889/ijpds.v8i3.2278","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2278","url":null,"abstract":"Introduction & BackgroundSocial media data is increasingly recognised as an important source of behavioural data. It can provide insights into patterns of life and how individuals and groups are feeling. However, many studies into social media’s relationship to mental health and well-being have suffered from poorly developed ground-truth data, which relies on assumed ground-truth labels and data from single timepoints. This means that the accuracy of models at future timepoints cannot be assessed.
 Collecting Twitter data from cohorts provides a solution to this issue, given the many years of high quality data that can be used as ground truth. Cohorts can also benefit from the higher-resolution data provided by social media that can supplement their traditional data collection methods.
 Objectives & ApproachWe used Twitter data that has been collected with consent from two generations of the Avon Longitudinal Study of Parents and Children (ALSPAC) (N=656). The data is linked to two surveys completed in April-May 2020 and May-July 2020 for validated outcome measures of anxiety, depression, and general well-being.
 Using the LIWC and VADER sentiment algorithms, the sentiment categories most highly associated with each outcome were used to develop a multiple regression model for each of anxiety, depression and general well-being using the first survey timepoint. Error from these models in predicting the second timepoint allowed us to assess how well different outcomes are predicted by demographic group.
 Relevance to Digital FootprintsDigital footprint data can complement traditional data sources to provide a more nuanced view of health inequalities. These data are typically less timely to collect than traditional data collection methods (census, survey) allowing a more reactive response to emergent issues such as the cost-of-living crisis.
 ResultsThis study illustrates how the collection of digital footprint data can be integrated into existing long-term studies which can be used to provide multiple points of ground-truth data.
 Conclusions & ImplicationsThis study has shown that the collection and integration of Twitter data into cohort studies is feasible, and that cohort data provides multiple ground-truth options. This time series data is important for assessing the potential feasibility of mental health inference from online behavioural data, which this study shows may vary across personal characteristics.
 In future research we plan to link subsequent surveys from ALSPAC to provide more ground truth time points and explore the temporal stability of predictions, and impacts of model drift on performance.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135154390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Behavioural entropy as an individual difference 作为个体差异的行为熵
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2289
Neo Poon, James Goulding, Anya Skatova
Introduction & BackgroundThe availability of digital footprints data have provided new and invaluable opportunities for personality psychologists. One way to study individual differences with digital footprints data is through the lens of entropy, which is a measure of the degree of randomness of a probabilistic system. When applied to individual behaviour, entropy captures how predictable an individual’s (e.g., shopping) pattern of behaviour is over time. In this study, we proposed that entropy can be conceptualised as a proxy measure of Openness, a Big Five personality trait. We further studied entropy’s associations with external behavioural outcome, namely the voting outcomes of the 2016 EU ‘Brexit’ referendum in the UK. This referendum asked UK citizens whether the UK should stay in the EU (vote Remain) or leave the EU (vote Leave). It has been demonstrated that Leave (or ‘Brexit’) vote was heavily influenced by attitudes towards immigration which is associated with values of being less ‘open’ to other cultures, and therefore we expected that entropy – or tendency to try new things – would be associated positively with voting Remain. Objectives & ApproachWith a massive data set (20,550,952 customers) provided by a large UK retail chain over a period of 2 years, we computed aggregated entropy for the Local Authority Districts (LADs). Further we investigated the relationships between entropy and personality traits, as well as between entropy and the referendum outcomes, at geographically aggregated levels. Relevance to Digital FootprintsThis study brought together digital footprints data with external sources. This study also identified population level insights by examining personality traits and their utility in predicting sociopolitical outcomes. ResultsResults of a linear regression model showed strong evidence supporting a positive relationship between entropy and Openness (b = 0.30, t = 3.30, p = .001), and a negative relationship between entropy and Neuroticism (b = -0.48, t = -3.53, p < .001). Further, entropy was associated with outcomes of the EU referendum in each LAD. Results of another linear regression model showed strong evidence supporting a positive relationship between the percentage of Remain votes and entropy (b = 0.28, t = 4.80, p < .001). Conclusions & ImplicationsThe relationship between Big Five trait Openness and entropy provided support that personality can be inferred from digital footprints data such as shopping history records. The positive relationship between entropy and the proportion of Remain vote demonstrated that people who are more open to new experiences voted Remain. Our findings have broader implications showing that it is possible to find associations between personality traits extrapolated from shopping data and real-world choices.
介绍,数字足迹数据的可用性为人格心理学家提供了新的和宝贵的机会。用数字足迹数据研究个体差异的一种方法是通过熵的视角,熵是对概率系统随机程度的衡量。当应用于个体行为时,熵捕获了个体(例如,购物)行为模式随时间的可预测性。在这项研究中,我们提出熵可以被概念化为开放性的代理测量,开放性是五大人格特质之一。我们进一步研究了熵与外部行为结果的关联,即2016年英国欧盟“脱欧”公投的投票结果。这次公投询问英国公民,英国应该留在欧盟(投票留欧)还是离开欧盟(投票脱欧)。已经证明,脱欧(或“脱欧”)投票严重受到对移民的态度的影响,这与对其他文化不那么“开放”的价值观有关,因此我们预计熵-或尝试新事物的倾向-将与投票留下积极相关。目标,方法:利用英国一家大型零售连锁店在2年期间提供的大量数据集(20,550,952名客户),我们计算了地方当局地区(LADs)的汇总熵。我们进一步调查了熵和人格特质之间的关系,以及熵和公投结果之间的关系,在地理上聚集的水平。与数字足迹相关这项研究将数字足迹数据与外部来源结合在一起。这项研究还通过考察人格特征及其在预测社会政治结果方面的效用,确定了人口水平的见解。结果线性回归模型结果显示,熵与开放性呈正相关(b = 0.30, t = 3.30, p = 0.001),熵与神经质呈负相关(b = -0.48, t = -3.53, p <措施)。此外,熵与每个LAD的欧盟公投结果相关。另一个线性回归模型的结果显示,有强有力的证据支持留欧票百分比与熵之间的正相关关系(b = 0.28, t = 4.80, p <措施)。结论,启示五大特征开放性和熵的关系为人格可以从购物历史记录等数字足迹数据中推断出来提供了支持。熵与留欧比例之间的正相关关系表明,对新体验更开放的人投了留欧票。我们的研究结果具有更广泛的意义,表明有可能从购物数据和现实世界的选择中推断出性格特征之间的联系。
{"title":"Behavioural entropy as an individual difference","authors":"Neo Poon, James Goulding, Anya Skatova","doi":"10.23889/ijpds.v8i3.2289","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2289","url":null,"abstract":"Introduction &amp; BackgroundThe availability of digital footprints data have provided new and invaluable opportunities for personality psychologists. One way to study individual differences with digital footprints data is through the lens of entropy, which is a measure of the degree of randomness of a probabilistic system. When applied to individual behaviour, entropy captures how predictable an individual’s (e.g., shopping) pattern of behaviour is over time. In this study, we proposed that entropy can be conceptualised as a proxy measure of Openness, a Big Five personality trait. We further studied entropy’s associations with external behavioural outcome, namely the voting outcomes of the 2016 EU ‘Brexit’ referendum in the UK. This referendum asked UK citizens whether the UK should stay in the EU (vote Remain) or leave the EU (vote Leave). It has been demonstrated that Leave (or ‘Brexit’) vote was heavily influenced by attitudes towards immigration which is associated with values of being less ‘open’ to other cultures, and therefore we expected that entropy – or tendency to try new things – would be associated positively with voting Remain. Objectives &amp; ApproachWith a massive data set (20,550,952 customers) provided by a large UK retail chain over a period of 2 years, we computed aggregated entropy for the Local Authority Districts (LADs). Further we investigated the relationships between entropy and personality traits, as well as between entropy and the referendum outcomes, at geographically aggregated levels. Relevance to Digital FootprintsThis study brought together digital footprints data with external sources. This study also identified population level insights by examining personality traits and their utility in predicting sociopolitical outcomes. ResultsResults of a linear regression model showed strong evidence supporting a positive relationship between entropy and Openness (b = 0.30, t = 3.30, p = .001), and a negative relationship between entropy and Neuroticism (b = -0.48, t = -3.53, p < .001). Further, entropy was associated with outcomes of the EU referendum in each LAD. Results of another linear regression model showed strong evidence supporting a positive relationship between the percentage of Remain votes and entropy (b = 0.28, t = 4.80, p < .001). Conclusions &amp; ImplicationsThe relationship between Big Five trait Openness and entropy provided support that personality can be inferred from digital footprints data such as shopping history records. The positive relationship between entropy and the proportion of Remain vote demonstrated that people who are more open to new experiences voted Remain. Our findings have broader implications showing that it is possible to find associations between personality traits extrapolated from shopping data and real-world choices.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135153433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Supermarket Transaction Records In Dietary Evaluation – the STRIDE validation study. 膳食评价中的超市交易记录- STRIDE验证研究。
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2267
Victoria Jenneson, Darren Greenwood, Graham Clarke, Timothy Rains, Becky Shute, Michelle Morris
Introduction & BackgroundSupermarket transactions leave a digital footprint which offers insight into dietary habits. Use of transactions in nutrition research has increased, but these data are rarely validated. The STRIDE (Supermarket Transaction Records In Dietary Evaluation) study compares dietary estimates from supermarket transactions with self-reported intake from an online Food Frequency Questionnaire (FFQ). Objectives & ApproachWorking with a large UK supermarket, loyalty card customers were recruited to one of four waves (accounting for seasonal dietary variation). Participants completed an online FFQ and consented to sharing their transaction records for one year during the study, and one year prior. The Bland-Altman method was used to calculate agreement and limits of agreement between transactions and intake for daily energy, sugar, total fat, saturated fat, protein and sodium (absolute and energy-adjusted). Relevance to Digital FootprintsSupermarket transactions are a form of digital footprints data with advantages over survey methods, with regards scalability and objectivity, for monitoring population-level diets. Results1,788 participants from four UK regions gave consent. 686 participants who completed the FFQ and made purchases during the same period, were included for analysis. Participants were mostly female (72%), with a mean age of 56 years (SD 13). A regression equation for agreement is presented for estimating intake from purchases. Agreement for absolute measures was poor overall, but higher for single-person households and households reporting a higher proportion of total food purchases from the study retailer. Agreement was stronger for energy-adjusted nutrient estimates, particularly fat, with purchase records under-estimating the proportion of total energy intake from fat by just 2%. Conclusions & ImplicationsThe STRIDE study found that household purchases from a single retailer were a poor proxy for individual-level nutrient intakes. However, close agreement on average energy-adjusted estimates suggests purchases are a good indicator of dietary composition. Supermarket transactions have utility for population dietary assessment, ecological studies, and identifying intervention targets based on dietary patterns. Digital footprint data from transactions can contribute to the design and monitoring of national and local-level interventions.
介绍,超市交易留下的数字足迹可以让人们了解饮食习惯。在营养研究中对交易的使用有所增加,但这些数据很少得到验证。STRIDE(饮食评估中的超市交易记录)研究比较了超市交易的饮食估计与在线食物频率问卷(FFQ)中自我报告的摄入量。目标,方法与英国一家大型超市合作,将会员卡顾客招募到四组(考虑到季节性饮食变化)中的一组。参与者完成了一份在线FFQ,并同意在研究期间和研究前一年分享他们的交易记录。使用Bland-Altman方法计算每日能量、糖、总脂肪、饱和脂肪、蛋白质和钠(绝对和能量调整)的交易与摄入量之间的一致性和一致性限制。与数字足迹相关超市交易是数字足迹数据的一种形式,在可扩展性和客观性方面优于调查方法,用于监测人口水平的饮食。 结果来自英国四个地区的1788名参与者表示同意。686名完成FFQ并在同一时期购物的参与者被纳入分析。参与者大多为女性(72%),平均年龄为56岁(SD 13)。提出了一个协议回归方程,用于估算购买所得。总体而言,绝对衡量标准的一致性很差,但对于单身家庭和从研究零售商处购买食品的比例较高的家庭来说,一致性更高。对于能量调整后的营养估计,尤其是脂肪,一致性更强,购买记录低估了脂肪总能量摄入的比例,仅为2%。结论,STRIDE的研究发现,家庭从单一零售商处购买的食物不能很好地代表个人水平的营养摄入量。然而,在平均能量调整估计上的接近一致表明,购买是饮食构成的良好指标。超市交易对人口饮食评估、生态研究和确定基于饮食模式的干预目标具有实用价值。来自交易的数字足迹数据有助于设计和监测国家和地方一级的干预措施。
{"title":"Supermarket Transaction Records In Dietary Evaluation – the STRIDE validation study.","authors":"Victoria Jenneson, Darren Greenwood, Graham Clarke, Timothy Rains, Becky Shute, Michelle Morris","doi":"10.23889/ijpds.v8i3.2267","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2267","url":null,"abstract":"Introduction &amp; BackgroundSupermarket transactions leave a digital footprint which offers insight into dietary habits. Use of transactions in nutrition research has increased, but these data are rarely validated. The STRIDE (Supermarket Transaction Records In Dietary Evaluation) study compares dietary estimates from supermarket transactions with self-reported intake from an online Food Frequency Questionnaire (FFQ).&#x0D; Objectives &amp; ApproachWorking with a large UK supermarket, loyalty card customers were recruited to one of four waves (accounting for seasonal dietary variation). Participants completed an online FFQ and consented to sharing their transaction records for one year during the study, and one year prior. The Bland-Altman method was used to calculate agreement and limits of agreement between transactions and intake for daily energy, sugar, total fat, saturated fat, protein and sodium (absolute and energy-adjusted).&#x0D; Relevance to Digital FootprintsSupermarket transactions are a form of digital footprints data with advantages over survey methods, with regards scalability and objectivity, for monitoring population-level diets.&#x0D; Results1,788 participants from four UK regions gave consent. 686 participants who completed the FFQ and made purchases during the same period, were included for analysis. Participants were mostly female (72%), with a mean age of 56 years (SD 13). A regression equation for agreement is presented for estimating intake from purchases. Agreement for absolute measures was poor overall, but higher for single-person households and households reporting a higher proportion of total food purchases from the study retailer. Agreement was stronger for energy-adjusted nutrient estimates, particularly fat, with purchase records under-estimating the proportion of total energy intake from fat by just 2%.&#x0D; Conclusions &amp; ImplicationsThe STRIDE study found that household purchases from a single retailer were a poor proxy for individual-level nutrient intakes. However, close agreement on average energy-adjusted estimates suggests purchases are a good indicator of dietary composition. Supermarket transactions have utility for population dietary assessment, ecological studies, and identifying intervention targets based on dietary patterns. Digital footprint data from transactions can contribute to the design and monitoring of national and local-level interventions.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135154082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal for Population Data Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1