首页 > 最新文献

International Journal for Population Data Science最新文献

英文 中文
Identifying drivers of food insecurity through linked data- the Priority Places for Food Index 通过关联数据确定粮食不安全的驱动因素-粮食优先地点指数
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2277
Francesca Pontin, Peter Baudains, Emily Ennis, Michelle Morris
Introduction & Background15.5% of all UK households are food insecure; either unable to afford to eat, skipping meals or reducing meal sizes despite being hungry. Drivers of food insecurity include both access to and affordability of food, with those most in need often left unable to access healthy and affordable food. Taking a place-based approach to understand the drivers of food insecurity allows for targeted support from government, third sector and private organisations to mitigate growing food insecurity in the UK. Objectives & ApproachThis research presents the methodology for the co-production and construction of the Priority Places for Food Index (PPFI) and supporting dashboard, co-designed by the Consumer Data Research Centre (CDRC) and consumer champions Which? in response to the 'cost-of-living crisis'. The PPFI equally weights measures of access to affordable food and indicators of barriers to affording food across seven domains; Proximity to supermarket retail facilities, Accessibility of supermarket retail facilities, Access to online deliveries, Proximity to non-supermarket food provision, Socio-economic barriers, Fuel Poverty and Family Food for support. The PPFI uses open data combing traditional census data metrics, with government data (e.g., Healthy start voucher and free-school meals uptake), digital footprints data (web-scraped delivery addresses and food bank item request data) and scaled-survey data (fuel poverty, propensity to shop online). Relevance to Digital FootprintsDigital footprint data can complement traditional data sources to provide a more nuanced view of health inequalities. These data are typically less timely to collect than traditional data collection methods (census, survey) allowing a more reactive response to emergent issues such as the cost-of-living crisis. ResultsThe PPFI interactive map and underlying data have been published via the CDRC https://priorityplaces.cdrc.ac.uk/. Conclusions & ImplicationsWe demonstrate the value of data linkage across individual and population level data to provide localised insight into food insecurity and identify where digital footprints data can improve gaps in the current evidence base. We also reflect on the value of co-production and stakeholder engagement in creating a policy ready interactive map which has facilitated the lobbying of targeted practical support and policy change to address food insecurity.
介绍,15.5%的英国家庭食品不安全;要么没钱吃饭,要么不吃饭,要么尽管饿了,但还是减少了饭量。粮食不安全的驱动因素包括获得粮食和负担得起粮食,最需要的人往往无法获得健康和负担得起的粮食。采取基于地点的方法来了解粮食不安全的驱动因素,可以从政府、第三部门和私营组织获得有针对性的支持,以缓解英国日益严重的粮食不安全问题。目标,本研究提出了由消费者数据研究中心(CDRC)和消费者冠军组织Which?以应对“生活成本危机”。PPFI在七个领域对获得负担得起的食物的衡量指标和负担得起的食物的障碍指标给予同等权重;靠近超市零售设施,超市零售设施的可及性,在线配送的可及性,非超市食品供应的可及性,社会经济障碍,燃料贫困和家庭食品支持。PPFI使用开放数据,将传统的人口普查数据指标与政府数据(例如,健康起步券和免费学校膳食的摄取情况)、数字足迹数据(网络抓取的送货地址和食品银行物品请求数据)和规模调查数据(燃料贫困、网上购物倾向)结合起来。与数字足迹的相关性数字足迹数据可以补充传统的数据来源,对卫生不平等现象提供更细致入微的看法。与传统的数据收集方法(人口普查、调查)相比,这些数据的收集通常不及时,因此可以对诸如生活成本危机等紧急问题做出更被动的反应。结果PPFI互动图和基础数据已通过CDRC https://priorityplaces.cdrc.ac.uk/. 发布;结论,我们展示了跨个人和人口层面数据的数据链接的价值,以提供对粮食不安全的本地化洞察,并确定数字足迹数据可以改善当前证据基础中的差距。我们还反思了合作制作和利益攸关方参与创建政策准备互动地图的价值,这有助于游说有针对性的实际支持和政策变革,以解决粮食不安全问题。
{"title":"Identifying drivers of food insecurity through linked data- the Priority Places for Food Index","authors":"Francesca Pontin, Peter Baudains, Emily Ennis, Michelle Morris","doi":"10.23889/ijpds.v8i3.2277","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2277","url":null,"abstract":"Introduction & Background15.5% of all UK households are food insecure; either unable to afford to eat, skipping meals or reducing meal sizes despite being hungry. Drivers of food insecurity include both access to and affordability of food, with those most in need often left unable to access healthy and affordable food. Taking a place-based approach to understand the drivers of food insecurity allows for targeted support from government, third sector and private organisations to mitigate growing food insecurity in the UK.
 Objectives & ApproachThis research presents the methodology for the co-production and construction of the Priority Places for Food Index (PPFI) and supporting dashboard, co-designed by the Consumer Data Research Centre (CDRC) and consumer champions Which? in response to the 'cost-of-living crisis'.
 The PPFI equally weights measures of access to affordable food and indicators of barriers to affording food across seven domains; Proximity to supermarket retail facilities, Accessibility of supermarket retail facilities, Access to online deliveries, Proximity to non-supermarket food provision, Socio-economic barriers, Fuel Poverty and Family Food for support. The PPFI uses open data combing traditional census data metrics, with government data (e.g., Healthy start voucher and free-school meals uptake), digital footprints data (web-scraped delivery addresses and food bank item request data) and scaled-survey data (fuel poverty, propensity to shop online).
 Relevance to Digital FootprintsDigital footprint data can complement traditional data sources to provide a more nuanced view of health inequalities. These data are typically less timely to collect than traditional data collection methods (census, survey) allowing a more reactive response to emergent issues such as the cost-of-living crisis.
 ResultsThe PPFI interactive map and underlying data have been published via the CDRC https://priorityplaces.cdrc.ac.uk/.
 Conclusions & ImplicationsWe demonstrate the value of data linkage across individual and population level data to provide localised insight into food insecurity and identify where digital footprints data can improve gaps in the current evidence base. We also reflect on the value of co-production and stakeholder engagement in creating a policy ready interactive map which has facilitated the lobbying of targeted practical support and policy change to address food insecurity.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135154246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting health related deprivation using loyalty card digital footprints 利用会员卡数字足迹预测与健康相关的剥夺
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2282
Gavin Long, Georgiana Nica-Avram, John Harvey, Roberto Mansilla, Simon Welham, Evgeniya Lukinova, James Goulding
Introduction & BackgroundIn England, The Indices of Deprivation (IoD) are a widely used and referenced measure to assess local levels of deprivation across a range of domains, including health and disability. However, due to their complex nature and the number of inputs required to generate these measures, they are only updated infrequently. Typically every 4-5 years, with the most recent versions released in 2019 and 2015. This study expands on previous research looking at the feasibility of using digital footprint data, in the form of retail loyalty card transactions, to predict local deprivation. This work focuses specifically on the health and disability subdomain of IoD. Our hypothesis is that retail behaviour relating to food purchases and their associated nutritional content, can be used to predict health deprivation. Objectives & ApproachThe work utilises loyalty card data from a large UK grocery retailer. Anonymised geo-location data for loyalty card members was used to assign retail grocery transactions to individual Lower Layer Super Output Areas (LSOAs) for each of the ten quarters in the study period (July 2019 - December 2021). A nutritional lookup was developed to enable the nutritional content of food transactions to be assigned to each LSOA. A number of metrics based on categories of food sold and their nutritional content were developed and used in a Machine Learning model, based on a Random Forest classifier, to predict areas with high levels of health related deprivation. Relevance to Digital FootprintsThis study uses data derived from digital footprint data of grocery transactions. It demonstrates the potential for utilising digital footprint data as a proxy for traditional demographic data without the need for expensive, both in terms of time and cost, surveying to be performed. ResultsThe random forest classifier was able to predict neighbourhoods (at the LSOA level) with the top 20% of health related deprivation. A high level of predictive power was identified (Overall accuracy 80%). SHAP (SHapley Additive exPlanations) and Model Class Reliance (MCR) were used to determine the importance of the input features. Areas with higher proportional spending on cigarettes and soft drinks and lower spending on fish, wine and fruit and vegtables were found to be associated with extreme levels of health deprivation. In terms of nutrition, two derived metrics, calories per pound spend and the obesogenicity of food purchased, were found to be important predictors of health deprivation. Conclusions & ImplicationsDigital footprint data on grocery purchases have been shown to be highly effective at predicting areas of extreme health related deprivation at the LSOA level. Features related to proportional spend on food categories and proportions of nutrients associated with these purchases were identified as optimal for predicting health related deprivation. The number of calories per pou
介绍,在英格兰,剥夺指数(IoD)是一种广泛使用和参考的衡量标准,用于评估包括健康和残疾在内的一系列领域的地方剥夺水平。但是,由于它们的复杂性和产生这些措施所需的投入的数量,它们只是不经常更新。通常每4-5年发布一次,最新版本在2019年和2015年发布。这项研究扩展了先前的研究,着眼于使用零售忠诚卡交易形式的数字足迹数据来预测当地贫困的可行性。这项工作特别侧重于IoD的健康和残疾子领域。我们的假设是,与食品购买相关的零售行为及其相关的营养成分,可以用来预测健康剥夺。目标,这项工作利用了英国一家大型杂货零售商的会员卡数据。在研究期间(2019年7月至2021年12月)的十个季度中,使用会员卡会员的匿名地理位置数据将零售杂货交易分配给单个低层超级输出区域(lsoa)。开发了营养查找功能,以便将食品交易的营养成分分配给每个LSOA。基于销售的食品类别及其营养成分开发了许多指标,并在基于随机森林分类器的机器学习模型中使用,以预测与健康相关的贫困程度较高的地区。与数字足迹相关本研究使用的数据来源于食品杂货交易的数字足迹数据。它显示了利用数字足迹数据作为传统人口数据的代理的潜力,而不需要进行昂贵的时间和成本调查。结果随机森林分类器能够预测健康相关剥夺前20%的社区(在LSOA级别)。确定了高水平的预测能力(总体准确率为80%)。SHAP (SHapley Additive explanatory)和模型类依赖(Model Class Reliance, MCR)被用来确定输入特征的重要性。研究发现,在香烟和软饮料上的支出比例较高,而在鱼、酒、水果和蔬菜上的支出比例较低的地区,健康状况被严重剥夺。在营养方面,两个衍生指标,每磅消耗的卡路里和购买的食物的致肥性,被发现是健康剥夺的重要预测因素。结论,影响食品杂货采购的数字足迹数据已被证明在预测LSOA层面上与健康相关的极端贫困地区方面非常有效。与食物类别的比例支出相关的特征以及与这些购买相关的营养素比例被确定为预测与健康相关的剥夺的最佳特征。 研究发现,在LSOA中,每磅消耗的卡路里数量,以及在较小程度上花费在香烟上的比例,是健康相关剥夺程度较高的最重要预测指标。所获得的高水平预测准确性为使用数字足迹数据作为传统剥夺措施的代理提供了潜力。与传统方法相比,这可以实现对健康状况较差的地区的快速和近乎实时的监测。这可以使早期干预措施得以实施,减轻与健康有关的剥夺的一些负面影响。
{"title":"Predicting health related deprivation using loyalty card digital footprints","authors":"Gavin Long, Georgiana Nica-Avram, John Harvey, Roberto Mansilla, Simon Welham, Evgeniya Lukinova, James Goulding","doi":"10.23889/ijpds.v8i3.2282","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2282","url":null,"abstract":"Introduction & BackgroundIn England, The Indices of Deprivation (IoD) are a widely used and referenced measure to assess local levels of deprivation across a range of domains, including health and disability. However, due to their complex nature and the number of inputs required to generate these measures, they are only updated infrequently. Typically every 4-5 years, with the most recent versions released in 2019 and 2015.
 This study expands on previous research looking at the feasibility of using digital footprint data, in the form of retail loyalty card transactions, to predict local deprivation. This work focuses specifically on the health and disability subdomain of IoD. Our hypothesis is that retail behaviour relating to food purchases and their associated nutritional content, can be used to predict health deprivation.
 Objectives & ApproachThe work utilises loyalty card data from a large UK grocery retailer. Anonymised geo-location data for loyalty card members was used to assign retail grocery transactions to individual Lower Layer Super Output Areas (LSOAs) for each of the ten quarters in the study period (July 2019 - December 2021). A nutritional lookup was developed to enable the nutritional content of food transactions to be assigned to each LSOA.
 A number of metrics based on categories of food sold and their nutritional content were developed and used in a Machine Learning model, based on a Random Forest classifier, to predict areas with high levels of health related deprivation.
 Relevance to Digital FootprintsThis study uses data derived from digital footprint data of grocery transactions. It demonstrates the potential for utilising digital footprint data as a proxy for traditional demographic data without the need for expensive, both in terms of time and cost, surveying to be performed.
 ResultsThe random forest classifier was able to predict neighbourhoods (at the LSOA level) with the top 20% of health related deprivation. A high level of predictive power was identified (Overall accuracy 80%). SHAP (SHapley Additive exPlanations) and Model Class Reliance (MCR) were used to determine the importance of the input features. Areas with higher proportional spending on cigarettes and soft drinks and lower spending on fish, wine and fruit and vegtables were found to be associated with extreme levels of health deprivation. In terms of nutrition, two derived metrics, calories per pound spend and the obesogenicity of food purchased, were found to be important predictors of health deprivation.
 Conclusions & ImplicationsDigital footprint data on grocery purchases have been shown to be highly effective at predicting areas of extreme health related deprivation at the LSOA level. Features related to proportional spend on food categories and proportions of nutrients associated with these purchases were identified as optimal for predicting health related deprivation.
 The number of calories per pou","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"224 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135154384","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Education for out-of-school children: Unpacking driving factors of vulnerability via digital-learning footprint data in East Africa 失学儿童的教育:通过东非的数字学习足迹数据揭示脆弱性的驱动因素
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2293
Bethany Huntington, Nicola Pitchford, James Goulding
Introduction & BackgroundApproximately 617 million children and adolescents worldwide do not possess the foundational skills to live healthy and productive lives. Sub-Saharan Africa is profoundly affected, resulting in social and financial dependency and raising vulnerability to forced marriage, female genital mutilation, and mental health issues. Contextual factors are considered critical in this crisis, yet have received little attention due to a currently insurmountable deficit in traditional census and survey data. Digital footprint data offers a potential route to filling this information gap - particularly in education- and how a learner’s environment can impact digital learning and future well-being. To address the crisis, the ‘Global Learning’ XPRIZE competition challenged teams to develop software empowering marginalised out-of-school children to learn literacy and numeracy skills. Five finalist teams tested their technology with 2041 children using handheld tablets in 172 remote villages in Tanzania. Objectives & ApproachOur study examined factors that can predict improvements in learning outcomes, building on the digital footprint data collected from the children participating in the device intervention in the form of app usage and locational and activity data. Additional geospatial features were engineered based on village coordinates, distance to local amenities, services and transport, variables serving as additional potential indicators of isolation and connectedness. These data were linked with child-level factors, including household composition and literacy levels. After comparative assessment of machine learning regression models, tree-based models (XGB, RF) were used to establish the optimal predictive performance for literacy and numeracy. Variable importance using SHAP was used to determine which specific contextual variables should be considered before deploying digital interventions to support education and well-being. Relevance to Digital FootprintsUtilising digital footprint data to quantify the influence of geospatial and contextual data in digital interventions can offer comprehensive insights to understand and address factors impacting learning outcomes in this context. ResultsPrior school attendance, home reading environments and high familial literacy were found to be predictors of higher learning outcomes after a technology-based learning intervention. Environments featuring an unemployed caregiver and few siblings were surprisingly consistent positive predictors, suggesting accompanying and focussed caregiver support as valuable for effective development via digital interventions. Proximity to police stations and health centres were revealed as key predictors, indicating the importance of social and physical connectedness in positive learning environments. Conclusions & ImplicationsTargeted improvement of EdTech provision with out-of-school learners promises to help
介绍,背景全世界约有6.17亿儿童和青少年不具备过健康和有益生活的基本技能。撒哈拉以南非洲受到深刻影响,造成社会和经济依赖,并增加了对强迫婚姻、切割女性生殖器官和精神健康问题的脆弱性。背景因素在这场危机中被认为是至关重要的,但由于传统人口普查和调查数据目前无法克服的缺陷,这些因素很少受到关注。数字足迹数据为填补这一信息缺口——特别是在教育领域——以及学习者的环境如何影响数字学习和未来福祉提供了一条潜在途径。为了解决这一危机,“全球学习”XPRIZE竞赛要求参赛队伍开发软件,帮助边缘化的失学儿童学习识字和算术技能。五个决赛团队在坦桑尼亚172个偏远村庄用手持平板电脑对2041名儿童进行了技术测试。目标,我们的研究基于从参与设备干预的儿童中收集的数字足迹数据,以应用程序使用和位置和活动数据的形式,研究了可以预测学习成果改善的因素。根据村庄坐标、到当地便利设施、服务和交通的距离以及作为隔离和连通性附加潜在指标的变量,设计了其他地理空间特征。这些数据与儿童层面的因素有关,包括家庭组成和文化水平。在对机器学习回归模型进行比较评估后,使用基于树的模型(XGB, RF)建立识字和计算的最佳预测性能。使用SHAP的变量重要性来确定在部署数字干预措施以支持教育和福祉之前应该考虑哪些特定的上下文变量。 与数字足迹的相关性利用数字足迹数据量化数字干预措施中地理空间和背景数据的影响,可以提供全面的见解,以了解和解决在这种情况下影响学习成果的因素。 结果学前教育出勤率、家庭阅读环境和高家庭文化水平是技术学习干预后较高学习成绩的预测因子。一个失业的照顾者和几个兄弟姐妹的环境是令人惊讶的一致的积极预测因素,这表明陪伴和集中的照顾者支持对于通过数字干预有效发展是有价值的。靠近警察局和保健中心是关键的预测因素,表明社会和身体联系在积极的学习环境中的重要性。结论,影响有针对性地改善为校外学习者提供的教育技术,有望帮助发展偏远村庄的学习成果,并减少对相关社会和健康问题的脆弱性。然而,只有在干预措施周围有适当的支持性环境时,才能取得积极成果。
{"title":"Education for out-of-school children: Unpacking driving factors of vulnerability via digital-learning footprint data in East Africa","authors":"Bethany Huntington, Nicola Pitchford, James Goulding","doi":"10.23889/ijpds.v8i3.2293","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2293","url":null,"abstract":"Introduction & BackgroundApproximately 617 million children and adolescents worldwide do not possess the foundational skills to live healthy and productive lives. Sub-Saharan Africa is profoundly affected, resulting in social and financial dependency and raising vulnerability to forced marriage, female genital mutilation, and mental health issues. Contextual factors are considered critical in this crisis, yet have received little attention due to a currently insurmountable deficit in traditional census and survey data. Digital footprint data offers a potential route to filling this information gap - particularly in education- and how a learner’s environment can impact digital learning and future well-being.
 To address the crisis, the ‘Global Learning’ XPRIZE competition challenged teams to develop software empowering marginalised out-of-school children to learn literacy and numeracy skills. Five finalist teams tested their technology with 2041 children using handheld tablets in 172 remote villages in Tanzania.
 Objectives & ApproachOur study examined factors that can predict improvements in learning outcomes, building on the digital footprint data collected from the children participating in the device intervention in the form of app usage and locational and activity data. Additional geospatial features were engineered based on village coordinates, distance to local amenities, services and transport, variables serving as additional potential indicators of isolation and connectedness. These data were linked with child-level factors, including household composition and literacy levels.
 After comparative assessment of machine learning regression models, tree-based models (XGB, RF) were used to establish the optimal predictive performance for literacy and numeracy. Variable importance using SHAP was used to determine which specific contextual variables should be considered before deploying digital interventions to support education and well-being.
 Relevance to Digital FootprintsUtilising digital footprint data to quantify the influence of geospatial and contextual data in digital interventions can offer comprehensive insights to understand and address factors impacting learning outcomes in this context.
 ResultsPrior school attendance, home reading environments and high familial literacy were found to be predictors of higher learning outcomes after a technology-based learning intervention. Environments featuring an unemployed caregiver and few siblings were surprisingly consistent positive predictors, suggesting accompanying and focussed caregiver support as valuable for effective development via digital interventions. Proximity to police stations and health centres were revealed as key predictors, indicating the importance of social and physical connectedness in positive learning environments.
 Conclusions & ImplicationsTargeted improvement of EdTech provision with out-of-school learners promises to help ","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135153267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Daily, Weekly and Monthly Variation in Lunch Time Calories 每日,每周和每月午餐时间卡路里的变化
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2288
Anya Skatova, Neil Stewart, Edward Flavahan, James Goulding
Introduction & BackgroundDespite the level of attention that healthy and unhealthy eating receive from academic research, policymakers and the wider public, objective data on food consumption is limited. This is because studies of individual eating patterns using food diaries are subject to underreporting, particularly by people who are overweight. For example, the UK population is estimated to consume between 30% to 50% more calories than they report in surveys. New data sources such as office canteen ordering systems and individual records of supermarket transactions recorded through supermarket loyalty or bonus cards offer larger and potentially more robust data on real world individual eating behaviours. Objectives & ApproachWe used 2,831,403 machine-recorded ‘meal deal’ transactions from 205,781 individuals over the course of one year from one of the UK’s largest suppliers of lunch time foods to investigate whether there is a relationship between patterns of choice and higher calorie consumption. A meal deal comprises three items; a main (e.g., a sandwich or a salad), a snack (e.g., crisps, fruit or a chocolate bar) and a drink (e.g., a smoothie or a bottle of water). In contrast to diary studies or aggregate transactional data from supermarkets, our dataset included “meal deal’ purchase which is highly likely to be made by an individual for their own consumption or soon afterwards. Relevance to Digital FootprintsLunch time food consumption can reflect the overall diet the individual is exposed to, helping to understand population level patterns of people’s food choices through a type of digital footprints data - shopping history records. ResultsControlling for gender, general index of variety in the choice of lunch food items, income and education, we found that individuals who vary in their calorie consumption most across the time of day, day of the week, and month of the year are the individuals who consume the greatest number of calories overall. These time sensitivity effects are large, together explaining a substantial amount of variance in calorie consumption. Time sensitivity effects are strongly correlated across all three time scales suggesting they measure a stable underlying trait. Conclusions & ImplicationsIndividuals vary calorific composition of their lunch over time of the day, day of the week and month of the year by 100 calories per meal between highest and lowest in sensitivity which is about 9% of the recommended amount of lunchtime calories. Those whose consumption varies the most with time consume the most calories, independently of income and gender. The variation in calories at all three time scales demonstrates the properties of an individual disposition. These findings can be used to understand why and when people make unhealthy food choices.
介绍,尽管健康和不健康饮食受到学术研究、政策制定者和广大公众的高度关注,但有关食品消费的客观数据有限。这是因为使用食物日记对个人饮食模式进行的研究容易被低估,尤其是对于超重的人。例如,据估计,英国人消耗的卡路里比调查中报告的要多30%到50%。新的数据来源,如办公室食堂订餐系统和通过超市忠诚卡或奖励卡记录的超市交易的个人记录,提供了关于现实世界中个人饮食行为的更大、可能更可靠的数据。目标,研究方法我们使用了机器记录的2831403笔来自英国最大的午餐时间食品供应商之一的205781个人的“用餐交易”记录,在一年的时间里调查了选择模式和高卡路里消耗之间是否存在关系。一顿饭包括三样东西;一份主菜(如三明治或沙拉),一份零食(如薯片、水果或巧克力棒)和一份饮料(如冰沙或一瓶水)。与日记研究或来自超市的汇总交易数据相比,我们的数据集包括“套餐交易”购买,这很可能是个人为自己消费或不久后消费而购买的。与数字足迹的相关性午餐时间的食物消费可以反映个人所接触的整体饮食,通过一种数字足迹数据-购物历史记录,帮助了解人们食物选择的人口水平模式。 结果:在控制了性别、午餐食物选择的总体多样性指数、收入和教育水平后,我们发现,在一天中的不同时间、一周中的某一天、一年中的某一个月里,卡路里消耗变化最大的个体,总体上消耗的卡路里数量最多。这些时间敏感性影响很大,共同解释了卡路里消耗的大量差异。时间敏感性效应在所有三个时间尺度上都有很强的相关性,这表明它们测量了一种稳定的潜在特征。 结论,每个人在一天中的不同时间,一周中的不同日子,一年中的不同月份,每顿饭的热量组成在敏感度最高和最低之间变化100卡路里,这大约是午餐热量推荐量的9%。摄入量随时间变化最大的人摄入的卡路里最多,与收入和性别无关。在所有三个时间尺度上卡路里的变化表明了个体性格的特性。这些发现可以用来理解人们为什么以及何时选择不健康的食物。
{"title":"Daily, Weekly and Monthly Variation in Lunch Time Calories","authors":"Anya Skatova, Neil Stewart, Edward Flavahan, James Goulding","doi":"10.23889/ijpds.v8i3.2288","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2288","url":null,"abstract":"Introduction & BackgroundDespite the level of attention that healthy and unhealthy eating receive from academic research, policymakers and the wider public, objective data on food consumption is limited. This is because studies of individual eating patterns using food diaries are subject to underreporting, particularly by people who are overweight. For example, the UK population is estimated to consume between 30% to 50% more calories than they report in surveys. New data sources such as office canteen ordering systems and individual records of supermarket transactions recorded through supermarket loyalty or bonus cards offer larger and potentially more robust data on real world individual eating behaviours.
 Objectives & ApproachWe used 2,831,403 machine-recorded ‘meal deal’ transactions from 205,781 individuals over the course of one year from one of the UK’s largest suppliers of lunch time foods to investigate whether there is a relationship between patterns of choice and higher calorie consumption. A meal deal comprises three items; a main (e.g., a sandwich or a salad), a snack (e.g., crisps, fruit or a chocolate bar) and a drink (e.g., a smoothie or a bottle of water). In contrast to diary studies or aggregate transactional data from supermarkets, our dataset included “meal deal’ purchase which is highly likely to be made by an individual for their own consumption or soon afterwards.
 Relevance to Digital FootprintsLunch time food consumption can reflect the overall diet the individual is exposed to, helping to understand population level patterns of people’s food choices through a type of digital footprints data - shopping history records.
 ResultsControlling for gender, general index of variety in the choice of lunch food items, income and education, we found that individuals who vary in their calorie consumption most across the time of day, day of the week, and month of the year are the individuals who consume the greatest number of calories overall. These time sensitivity effects are large, together explaining a substantial amount of variance in calorie consumption. Time sensitivity effects are strongly correlated across all three time scales suggesting they measure a stable underlying trait.
 Conclusions & ImplicationsIndividuals vary calorific composition of their lunch over time of the day, day of the week and month of the year by 100 calories per meal between highest and lowest in sensitivity which is about 9% of the recommended amount of lunchtime calories. Those whose consumption varies the most with time consume the most calories, independently of income and gender. The variation in calories at all three time scales demonstrates the properties of an individual disposition. These findings can be used to understand why and when people make unhealthy food choices.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"121 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135153268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Kids’ Environment and Health Cohort: a national, linked data resource for environmental child health research 儿童环境与健康队列:儿童环境健康研究的全国性关联数据资源
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2291
Alister Baird, Emilie Courtin, Steven Cummins, Samantha Hajna, Vahé Nafilyan, Alison Macfarlane, Jess Walkeden, Pia Hardelid
Introduction & BackgroundEvidence is mounting that children’s physical environment (e.g. in and around the home, school, and neighbourhood) is critical for their long-term health and education. Early life exposure to factors such as indoor and outdoor air pollution, or a lack of access to greenspaces are associated with the development of long-term health conditions such as asthma or mental health problems. Local and central government in England are implementing numerous policies to improve air quality and housing, and mitigate climate change. Further, England has seen large scale changes to local service provision (including childcare and libraries) due to austerity policies and the COVID-19 pandemic. Currently, there is no national, linked data resource for England that allows research into how the local environment impacts children’s health and education. Objectives & ApproachThe Kids’ Environment and Health Cohort will be a new, linked national data resource for England currently being developing by researchers from UCL, London School of Hygiene and Tropical Medicine, London School of Economics and Political Science, Brock University, and City, University of London in collaboration with the Office for National Statistics (ONS), and funded by Administrative Data Research-UK (ADR-UK). The Kids’ Environment and Health Cohort will be a de-identified and annually updated national birth cohort of all children born in England from 2006 onwards – around 10.5 million children until 2023. The cohort will be constructed using linked administrative data from vital registration (live and stillbirth, and death registration), Census (housing and socio-economic indicators), health (hospital contacts, mental health referrals, and community dispensing data), and education (key stage results, special educational needs, absenteeism). Environmental exposure data can be securely linked to the Cohort via longitudinal residential unique property reference numbers (UPRNs) and postcodes from the Personal Demographic Service, and school location from education records. Relevance to Digital FootprintsThe Kids’ Environment and Health Cohort will, for the first time, link health, education, Census and environmental data at national level in England. It will allow researchers to integrate data on local environments, including physical characteristics (such as temperature, building energy efficiency, or greenspace access) or the social environment (including proximity to food outlets, or services like libraries) with individual level data on health and education outcomes in children. This will be done using the ONS’s 5 safes framework, ensuring highest standards of data security and confidentiality. ResultsThe Kids’ Environment and Health Cohort will be constructed using administrative datasets, including national linked vital statistics, health, education and Census data from multiple data providers (ONS, NHS England and Department for Education)
介绍,背景越来越多的证据表明,儿童的物理环境(如家庭、学校和社区内部及其周围)对他们的长期健康和教育至关重要。生命早期暴露于室内外空气污染等因素,或缺乏进入绿色空间的机会,与哮喘或精神健康问题等长期健康状况的发展有关。英格兰的地方和中央政府正在实施许多政策,以改善空气质量和住房,并减缓气候变化。此外,由于紧缩政策和COVID-19大流行,英格兰的地方服务提供(包括儿童保育和图书馆)发生了大规模变化。目前,英格兰没有全国性的、相互关联的数据资源来研究当地环境对儿童健康和教育的影响。目标,儿童环境与健康队列将是英国一个新的、相互关联的国家数据资源,目前由伦敦大学学院、伦敦卫生与热带医学学院、伦敦政治经济学院、布洛克大学和伦敦城市大学的研究人员与英国国家统计局(ONS)合作开发,并由英国行政数据研究中心(ADR-UK)资助。“儿童环境和健康队列”将是一个不确定的、每年更新的全国出生队列,涵盖自2006年以来在英格兰出生的所有儿童——到2023年约有1050万儿童。该队列将使用相关的行政数据构建,这些数据来自生命登记(活产和死产以及死亡登记)、人口普查(住房和社会经济指标)、健康(医院联系、心理健康转诊和社区配药数据)和教育(关键阶段结果、特殊教育需求、缺勤)。环境暴露数据可以通过纵向住宅唯一财产参考号码(uprn)和个人人口统计服务的邮政编码,以及教育记录中的学校位置,安全地与队列相关联。与数字足迹相关儿童环境与健康队列将首次在英格兰全国范围内将健康、教育、人口普查和环境数据联系起来。它将允许研究人员将当地环境的数据,包括物理特征(如温度、建筑能效或绿地通道)或社会环境(包括食品销售点或图书馆等服务的邻近程度),与儿童健康和教育成果的个人层面数据整合在一起。这将使用国家统计局的5个安全框架来完成,确保最高标准的数据安全和机密性。 儿童环境与健康队列将使用管理数据集构建,包括来自多个数据提供商(ONS、NHS英格兰和教育部)的国家相关生命统计数据、健康、教育和人口普查数据,以及英格兰小区域环境数据。总之,这些数据集可以详细分析环境暴露对儿童健康和教育结果的影响,并进行稳健的混杂因素调整。儿童环境和健康队列将在国家统计局安全研究服务(SRS)中以不确定的格式提供。结论,儿童环境与健康队列将为研究人员提供安全访问整合环境和行政卫生与教育数据的国家数据资源,用于儿童公共卫生研究。
{"title":"The Kids’ Environment and Health Cohort: a national, linked data resource for environmental child health research","authors":"Alister Baird, Emilie Courtin, Steven Cummins, Samantha Hajna, Vahé Nafilyan, Alison Macfarlane, Jess Walkeden, Pia Hardelid","doi":"10.23889/ijpds.v8i3.2291","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2291","url":null,"abstract":"Introduction & BackgroundEvidence is mounting that children’s physical environment (e.g. in and around the home, school, and neighbourhood) is critical for their long-term health and education. Early life exposure to factors such as indoor and outdoor air pollution, or a lack of access to greenspaces are associated with the development of long-term health conditions such as asthma or mental health problems. Local and central government in England are implementing numerous policies to improve air quality and housing, and mitigate climate change. Further, England has seen large scale changes to local service provision (including childcare and libraries) due to austerity policies and the COVID-19 pandemic. Currently, there is no national, linked data resource for England that allows research into how the local environment impacts children’s health and education.
 Objectives & ApproachThe Kids’ Environment and Health Cohort will be a new, linked national data resource for England currently being developing by researchers from UCL, London School of Hygiene and Tropical Medicine, London School of Economics and Political Science, Brock University, and City, University of London in collaboration with the Office for National Statistics (ONS), and funded by Administrative Data Research-UK (ADR-UK). The Kids’ Environment and Health Cohort will be a de-identified and annually updated national birth cohort of all children born in England from 2006 onwards – around 10.5 million children until 2023. The cohort will be constructed using linked administrative data from vital registration (live and stillbirth, and death registration), Census (housing and socio-economic indicators), health (hospital contacts, mental health referrals, and community dispensing data), and education (key stage results, special educational needs, absenteeism). Environmental exposure data can be securely linked to the Cohort via longitudinal residential unique property reference numbers (UPRNs) and postcodes from the Personal Demographic Service, and school location from education records.
 Relevance to Digital FootprintsThe Kids’ Environment and Health Cohort will, for the first time, link health, education, Census and environmental data at national level in England. It will allow researchers to integrate data on local environments, including physical characteristics (such as temperature, building energy efficiency, or greenspace access) or the social environment (including proximity to food outlets, or services like libraries) with individual level data on health and education outcomes in children. This will be done using the ONS’s 5 safes framework, ensuring highest standards of data security and confidentiality.
 ResultsThe Kids’ Environment and Health Cohort will be constructed using administrative datasets, including national linked vital statistics, health, education and Census data from multiple data providers (ONS, NHS England and Department for Education)","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"458 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135153436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Studying insulin needs in Type 1 Diabetes by analysing the OpenAPS Data Commons 通过分析OpenAPS数据共享研究1型糖尿病的胰岛素需求
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2271
Isabella Degen, Kate Robson Brown, Zahraa S Abdallah
Introduction & BackgroundType 1 Diabetes (T1D) is a chronic condition where the body produces too little insulin, a hormone required to regulate blood glucose (BG). Finding the correct insulin dose and time remains a complex and as yet unsolved control task. Many factors such as food, exercise, stress, menstrual cycle, etc. change how much insulin is required. Most of these factors remain unobserved and/or underexplored. The NHS national diabetes audit shows that in 2020-21 only 9.8% of people with T1D had a NICE recommended glycated haemoglobin (HbA1c) test result of 48 mmol/mol or less to avoid complications due to diabetes. Objectives & ApproachOur research aims to improve the understanding of underexplored factors that drive changes in insulin needs of people with T1D. We use unsupervised time series pattern detection methods such as time series k-means, heatmaps and matrix profile to identify underexplored patterns. These are times when insulin on board (IOB) does not rise with more carbohydrates on board (COB) and times when BG does not fall with more IOB and/or rise with more COB as expected. In the future, we aim to use causal feature selection methods to exclude COB as a causal driver for these patterns and identify other factors as causal drivers. Relevance to Digital FootprintsWe use a digital footprints dataset of n=187 people who use an open-source automated insulin delivery system and have donated their data via the OpenAPS Data Commons and the OpenHumans.org platform. The data has been collected in real-life conditions and contains system logs, BG sensor data and various user-entered annotations. The richness of the data provides a unique opportunity to study what happens in real-life conditions but also poses challenges for many methods. These include inconsistencies between users, irregular sampling between devices and missing data. ResultsOur pattern detection methods can successfully identify underexplored patterns in insulin needs that are not directly driven by COB. These include months that require more/less insulin without eating more/less carbohydrates and times of day when blood glucose does not rise with more COB and/or fall with more IOB. Conclusions & ImplicationsWhile our current methods can identify underexplored patterns in insulin needs, the principal pattern identified remains the well-known pattern of the main meal COB spikes. This is not surprising given the frequency and impact of this pattern. We are working on methods that can identify both frequent and less frequent patterns as well as patterns arising from the irregularity of the sampling. Digital footprints datasets like the OpenAPS Data Commons are promising to help increase understanding of complex conditions such as T1D. More of this type of multi-year data is needed. Especially data that includes multiple different sensor readings to help identify causal drivers behind underexplored patterns of insulin needs. And we
介绍,背景1型糖尿病(T1D)是一种慢性疾病,机体产生的胰岛素太少,而胰岛素是调节血糖(BG)所需的激素。找到正确的胰岛素剂量和时间仍然是一项复杂且尚未解决的控制任务。许多因素,如食物、运动、压力、月经周期等,都会改变胰岛素的需要量。这些因素中的大多数仍未被观察到和/或未被充分探索。NHS全国糖尿病审计显示,在2020- 2021年,只有9.8%的T1D患者的糖化血红蛋白(HbA1c)检测结果为NICE推荐的48 mmol/mol或更低,以避免糖尿病引起的并发症。目标,我们的研究旨在提高对驱动T1D患者胰岛素需求变化的未被探索因素的理解。我们使用无监督的时间序列模式检测方法,如时间序列k-means、热图和矩阵剖面来识别未被探索的模式。这些是胰岛素携带量(IOB)没有随着碳水化合物携带量(COB)的增加而升高的时间,以及BG没有像预期的那样随着碳水化合物携带量(COB)的增加而下降和/或升高的时间。在未来,我们的目标是使用因果特征选择方法来排除COB作为这些模式的因果驱动因素,并确定其他因素作为因果驱动因素。 与数字足迹相关我们使用n=187人的数字足迹数据集,这些人使用开源自动胰岛素输送系统,并通过OpenAPS数据共享和OpenHumans.org平台捐赠了他们的数据。这些数据是在实际情况下收集的,包括系统日志、BG传感器数据和各种用户输入的注释。丰富的数据为研究现实生活中发生的情况提供了独特的机会,但也对许多方法提出了挑战。这些问题包括用户之间的不一致,设备之间的不规则采样和数据丢失。 结果我们的模式检测方法可以成功地识别出胰岛素需求中未被探索的模式,这些模式不是由COB直接驱动的。这些包括需要更多/更少胰岛素而不吃更多/更少碳水化合物的月份,以及一天中血糖不随COB增加而上升和/或随IOB增加而下降的时间。结论,虽然我们目前的方法可以确定胰岛素需求中未被探索的模式,但确定的主要模式仍然是众所周知的主餐COB峰值模式。考虑到这种模式的频率和影响,这并不奇怪。我们正在研究能够识别频繁和不频繁的模式以及因采样不规律而产生的模式的方法。像OpenAPS数据共享这样的数字足迹数据集有望帮助增加对T1D等复杂情况的理解。需要更多这类多年数据。特别是数据包括多个不同的传感器读数,以帮助确定未被探索的胰岛素需求模式背后的因果驱动因素。我们还需要能够很好地处理缺失和不规则采样数据的模式查找方法。
{"title":"Studying insulin needs in Type 1 Diabetes by analysing the OpenAPS Data Commons","authors":"Isabella Degen, Kate Robson Brown, Zahraa S Abdallah","doi":"10.23889/ijpds.v8i3.2271","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2271","url":null,"abstract":"Introduction & BackgroundType 1 Diabetes (T1D) is a chronic condition where the body produces too little insulin, a hormone required to regulate blood glucose (BG). Finding the correct insulin dose and time remains a complex and as yet unsolved control task. Many factors such as food, exercise, stress, menstrual cycle, etc. change how much insulin is required. Most of these factors remain unobserved and/or underexplored. The NHS national diabetes audit shows that in 2020-21 only 9.8% of people with T1D had a NICE recommended glycated haemoglobin (HbA1c) test result of 48 mmol/mol or less to avoid complications due to diabetes.
 Objectives & ApproachOur research aims to improve the understanding of underexplored factors that drive changes in insulin needs of people with T1D. We use unsupervised time series pattern detection methods such as time series k-means, heatmaps and matrix profile to identify underexplored patterns. These are times when insulin on board (IOB) does not rise with more carbohydrates on board (COB) and times when BG does not fall with more IOB and/or rise with more COB as expected. In the future, we aim to use causal feature selection methods to exclude COB as a causal driver for these patterns and identify other factors as causal drivers.
 Relevance to Digital FootprintsWe use a digital footprints dataset of n=187 people who use an open-source automated insulin delivery system and have donated their data via the OpenAPS Data Commons and the OpenHumans.org platform. The data has been collected in real-life conditions and contains system logs, BG sensor data and various user-entered annotations. The richness of the data provides a unique opportunity to study what happens in real-life conditions but also poses challenges for many methods. These include inconsistencies between users, irregular sampling between devices and missing data.
 ResultsOur pattern detection methods can successfully identify underexplored patterns in insulin needs that are not directly driven by COB. These include months that require more/less insulin without eating more/less carbohydrates and times of day when blood glucose does not rise with more COB and/or fall with more IOB.
 Conclusions & ImplicationsWhile our current methods can identify underexplored patterns in insulin needs, the principal pattern identified remains the well-known pattern of the main meal COB spikes. This is not surprising given the frequency and impact of this pattern. We are working on methods that can identify both frequent and less frequent patterns as well as patterns arising from the irregularity of the sampling. Digital footprints datasets like the OpenAPS Data Commons are promising to help increase understanding of complex conditions such as T1D. More of this type of multi-year data is needed. Especially data that includes multiple different sensor readings to help identify causal drivers behind underexplored patterns of insulin needs. And we ","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"170 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135154084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generating a Single Session Outcome Measure from Digital Mental Health Platform Footprints Using Natural Language Processing 使用自然语言处理从数字心理健康平台足迹生成单个会话结果测量
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2269
Gregor Milligan, Aynsley Bernard, Liz Dowthwaite, Elvira Perez Vallejos, James Goulding
Introduction & BackgroundThis work demonstrates the development of an Adult Session Wants and Needs Outcome Measure (Adult SWAN-OM) aimed at supporting service delivery within the digital mental health platform (DMHP), Qwell. Qwell is a DMHP commissioned by the United Kingdom’s National Health Service which provides access to an online community of peers, a team of experienced counsellors, and a cadre of emotional well-being practitioners. The service is anonymous at point of entry and free for users, and provides an extensive, person-centred approach which results in a wide range of user needs. Deriving outcome measures from the platform’s varied counselling sessions, aims to provide both insights into the want and needs of users and underpin improved mental health support. Objectives & ApproachThe objective of this research is to show how contemporary machine learning methods (Transformer Models and Contextualised Topic Modelling) may be combined with digital footprint data (in the form of seldom explored text data generated on DMHPs) to identify service user wants and needs. Specifically, with automated inference of patient outcomes currently scarce, we focus on the development of outcome measures in the context of ‘single sessions’, applying machine learning methods to extract topics related to the wants and needs of service users on DMHPs. Relevance to Digital FootprintsThe data used in this study consisted of transcripts between Qwell practitioners and service users (SU’s) (N=874) at conversation level (N=2323), a filter was applied to the dataset to ensure that focus the SUs elicitation of wants and needs fit into the criteria of a single session. Individuals in the final selected cohort (n=192) are not significantly different from the wider Qwell SU population in the study period in terms of age, gender or ethnicity; suggesting that the cohort is representative of the wider target population. This study shows the potential of mental health digital footprints data when providing insight into the wants and needs of DMHP SUs. Conclusions & ImplicationsVia this analysis of mental health digital footprint data, this work establishes a process for creating a new outcome measure through the computational analysis of transcript data, incorporating insights from clinical experts and individuals with lived experience of engaging with DMHPs with textual data analysis. This methodological approach of Transformer Models and Contextualized Topic Modelling enables the analysis of a considerable volume of data faster than manually reviewing transcripts. We offer suggestions for the refinement of automated methods, in collaboration with direct support and feedback from both clinicians and individuals with lived experience of DMHPs to enable the understanding of wants and needs of service users within DMHPs.
介绍,这项工作展示了成人会议需求结果测量(Adult SWAN-OM)的发展,旨在支持数字心理健康平台(DMHP)内的服务提供。Qwell是由英国国家卫生服务机构委托的DMHP,它提供了一个同龄人的在线社区,一个经验丰富的咨询师团队,以及一个情感健康从业人员的骨干。该服务在进入时是匿名的,对用户免费,并提供广泛的、以人为本的方法,从而满足广泛的用户需求。从该平台的各种咨询会议中得出结果措施,旨在提供对用户的需求和需求的见解,并巩固改进的心理健康支持。目标,本研究的目的是展示当代机器学习方法(变压器模型和情境化主题建模)如何与数字足迹数据(以dmhp上生成的很少探索的文本数据的形式)相结合,以识别服务用户的需求。具体来说,由于目前缺乏对患者结果的自动推断,我们专注于在“单次会议”的背景下开发结果度量,应用机器学习方法提取与DMHPs上服务用户的需求和需求相关的主题。与数字足迹的相关性本研究中使用的数据包括Qwell从业者和服务用户(SU) (N=874)在会话级别(N=2323)之间的转录本,对数据集应用了过滤器,以确保SU对需求和需求的关注符合单个会话的标准。在最终选择的队列(n=192)中,个体在年龄、性别或种族方面与研究期间更广泛的Qwell SU人群没有显著差异;这表明该队列代表了更广泛的目标人群。这项研究显示了心理健康数字足迹数据在洞察DMHP SUs的需求和需求方面的潜力。结论,通过对心理健康数字足迹数据的分析,这项工作建立了一个过程,通过对转录数据的计算分析,结合临床专家和具有与DMHPs进行文本数据分析的生活经验的个人的见解,创建了一个新的结果测量。Transformer Models和上下文化主题建模的这种方法学方法使得对大量数据的分析比手动检查抄本更快。我们为改进自动化方法提供建议,与临床医生和有DMHPs生活经验的个人合作,提供直接支持和反馈,以便了解DMHPs内服务用户的需求和需求。
{"title":"Generating a Single Session Outcome Measure from Digital Mental Health Platform Footprints Using Natural Language Processing","authors":"Gregor Milligan, Aynsley Bernard, Liz Dowthwaite, Elvira Perez Vallejos, James Goulding","doi":"10.23889/ijpds.v8i3.2269","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2269","url":null,"abstract":"Introduction & BackgroundThis work demonstrates the development of an Adult Session Wants and Needs Outcome Measure (Adult SWAN-OM) aimed at supporting service delivery within the digital mental health platform (DMHP), Qwell. Qwell is a DMHP commissioned by the United Kingdom’s National Health Service which provides access to an online community of peers, a team of experienced counsellors, and a cadre of emotional well-being practitioners. The service is anonymous at point of entry and free for users, and provides an extensive, person-centred approach which results in a wide range of user needs. Deriving outcome measures from the platform’s varied counselling sessions, aims to provide both insights into the want and needs of users and underpin improved mental health support. Objectives & ApproachThe objective of this research is to show how contemporary machine learning methods (Transformer Models and Contextualised Topic Modelling) may be combined with digital footprint data (in the form of seldom explored text data generated on DMHPs) to identify service user wants and needs. Specifically, with automated inference of patient outcomes currently scarce, we focus on the development of outcome measures in the context of ‘single sessions’, applying machine learning methods to extract topics related to the wants and needs of service users on DMHPs. Relevance to Digital FootprintsThe data used in this study consisted of transcripts between Qwell practitioners and service users (SU’s) (N=874) at conversation level (N=2323), a filter was applied to the dataset to ensure that focus the SUs elicitation of wants and needs fit into the criteria of a single session. Individuals in the final selected cohort (n=192) are not significantly different from the wider Qwell SU population in the study period in terms of age, gender or ethnicity; suggesting that the cohort is representative of the wider target population. This study shows the potential of mental health digital footprints data when providing insight into the wants and needs of DMHP SUs. Conclusions & ImplicationsVia this analysis of mental health digital footprint data, this work establishes a process for creating a new outcome measure through the computational analysis of transcript data, incorporating insights from clinical experts and individuals with lived experience of engaging with DMHPs with textual data analysis. This methodological approach of Transformer Models and Contextualized Topic Modelling enables the analysis of a considerable volume of data faster than manually reviewing transcripts. We offer suggestions for the refinement of automated methods, in collaboration with direct support and feedback from both clinicians and individuals with lived experience of DMHPs to enable the understanding of wants and needs of service users within DMHPs.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"170 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135154237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decoy Effects in a Massive Real-World Shopping Dataset 大规模真实世界购物数据集中的诱饵效应
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2286
Sean Devine, James Goulding, Anya Skatova, Ross Otto
Introduction & BackgroundA key premise of rational choice prescribes that decision-makers ought to ignore irrelevant, inferior alternative options. Consider for example the choice between two wines, where the value of an option is computed across two dimensions: quality and price. When deliberating about which wine to choose, one’s propensity to choose between two otherwise equally preferred wines should be not influenced by the introduction of a third clearly inferior option (being both of lower quality and more expensive than one of the original alternatives). Yet, a large body of work suggests that both people and animals routinely violate this premise in their decisions—in laboratory experiments, the introduction of irrelevant “decoys” into a choice set biases decision-making. However, these decoy effects are less understood in large-scale contexts of real-world decision-making, where choice sets can be large, and preference informed by consumers’ histories of experience. Objectives & ApproachWe explored whether the presence of irrelevant, “decoy” alternative options influenced wine purchases in a large real-world dataset of UK wine purchases. From shopping transaction records, we extracted all red and white wine purchases over a one month period. Our analyses examined 3.6M wine purchases made by 755,158 unique customers. Relevance to Digital FootprintsWe deployed shopping history data which is a popular example for digital footprints allowing us to track people’s choices and decisions over long periods of time. ResultsWe find that among pairs of wines that appear across many different contexts (i.e., stores with different product assortments) and trade off on quality and price, the presence of decoy options— similar options that were dominated by the focal option—made consumers more likely to purchase the focal option (a hallmark of the “attraction effect”). Furthermore, we find that sensitivity to this effect depended on consumers’ history of experience with the product, such that frequent shoppers were less likely to be sensitive to decoy effects in their purchase behaviour. Conclusions & ImplicationsWe examined whether real-world consumer decisions, evidenced in a large dataset of wine purchases in the United Kingdom, were subject to a canonical bias in multiattribute choice: the attraction effect. We found that wine purchases were systematically biased in favour of wines that dominated choice sets—a bias which was not observed when choice sets were not dominated. Together, these results extend laboratory-based accounts of decoy effects to real-world choices, and demonstrate how digital footprints data analysis can be linked to health, especially in terms of decision making which is associated with negative health outcomes.
介绍,理性选择的一个关键前提是,决策者应该忽略无关的、次等的备选方案。例如,考虑两种葡萄酒之间的选择,其中选项的价值是通过两个维度计算的:质量和价格。在考虑选择哪种葡萄酒时,一个人在两种同样喜欢的葡萄酒中做出选择的倾向不应该受到第三种明显较差的选择的影响(既比原来的选择质量更低,也比原来的选择更贵)。然而,大量的研究表明,人和动物在做决定时都经常违反这个前提——在实验室实验中,在一组选择中引入不相关的“诱饵”会使决策产生偏差。然而,这些诱饵效应在现实世界的大规模决策环境中不太容易理解,在现实世界中,选择集可能很大,而且消费者的偏好是由他们的经验历史决定的。 目标,我们在英国葡萄酒购买的大型真实数据集中探索了不相关的“诱饵”替代选项是否会影响葡萄酒购买。从购物交易记录中,我们提取了一个月内购买的所有红葡萄酒和白葡萄酒。我们的分析调查了755,158位独特客户购买的360万瓶葡萄酒。与数字足迹相关我们使用了购物历史数据,这是数字足迹的一个流行例子,使我们能够在很长一段时间内跟踪人们的选择和决定。结果我们发现,在出现在许多不同背景下(即,拥有不同产品种类的商店)并在质量和价格上进行权衡的成对葡萄酒中,诱饵选项的存在-由焦点选项主导的类似选项-使消费者更有可能购买焦点选项(“吸引力效应”的标志)。此外,我们发现对这种效应的敏感性取决于消费者使用产品的历史经验,因此经常购物者在购买行为中不太可能对诱饵效应敏感。 结论,我们研究了现实世界的消费者决策是否受到多属性选择的典型偏差:吸引力效应的影响,这一点在英国的大型葡萄酒购买数据集中得到了证明。我们发现,购买葡萄酒会系统性地偏向于选择集占主导地位的葡萄酒——当选择集不占主导地位时,这种偏见就不会被观察到。总之,这些结果将基于实验室的诱饵效应解释扩展到现实世界的选择,并展示了数字足迹数据分析如何与健康联系起来,特别是在与负面健康结果相关的决策方面。
{"title":"Decoy Effects in a Massive Real-World Shopping Dataset","authors":"Sean Devine, James Goulding, Anya Skatova, Ross Otto","doi":"10.23889/ijpds.v8i3.2286","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2286","url":null,"abstract":"Introduction & BackgroundA key premise of rational choice prescribes that decision-makers ought to ignore irrelevant, inferior alternative options. Consider for example the choice between two wines, where the value of an option is computed across two dimensions: quality and price. When deliberating about which wine to choose, one’s propensity to choose between two otherwise equally preferred wines should be not influenced by the introduction of a third clearly inferior option (being both of lower quality and more expensive than one of the original alternatives). Yet, a large body of work suggests that both people and animals routinely violate this premise in their decisions—in laboratory experiments, the introduction of irrelevant “decoys” into a choice set biases decision-making. However, these decoy effects are less understood in large-scale contexts of real-world decision-making, where choice sets can be large, and preference informed by consumers’ histories of experience.
 Objectives & ApproachWe explored whether the presence of irrelevant, “decoy” alternative options influenced wine purchases in a large real-world dataset of UK wine purchases. From shopping transaction records, we extracted all red and white wine purchases over a one month period. Our analyses examined 3.6M wine purchases made by 755,158 unique customers.
 Relevance to Digital FootprintsWe deployed shopping history data which is a popular example for digital footprints allowing us to track people’s choices and decisions over long periods of time.
 ResultsWe find that among pairs of wines that appear across many different contexts (i.e., stores with different product assortments) and trade off on quality and price, the presence of decoy options— similar options that were dominated by the focal option—made consumers more likely to purchase the focal option (a hallmark of the “attraction effect”). Furthermore, we find that sensitivity to this effect depended on consumers’ history of experience with the product, such that frequent shoppers were less likely to be sensitive to decoy effects in their purchase behaviour.
 Conclusions & ImplicationsWe examined whether real-world consumer decisions, evidenced in a large dataset of wine purchases in the United Kingdom, were subject to a canonical bias in multiattribute choice: the attraction effect. We found that wine purchases were systematically biased in favour of wines that dominated choice sets—a bias which was not observed when choice sets were not dominated. Together, these results extend laboratory-based accounts of decoy effects to real-world choices, and demonstrate how digital footprints data analysis can be linked to health, especially in terms of decision making which is associated with negative health outcomes.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135154385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigating the Relationship between Timing of Tweets and Mental Health, Well-being and Sleep Quality in a UK birth cohort 在英国出生队列中调查推特时间与心理健康、幸福和睡眠质量之间的关系
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2284
Oliver Davis, Nina Di Cara, Nello Cristianini, Claire Haworth
Introduction & BackgroundSocial media use has been proposed as a cause of worsening mental health and wellbeing over the last decade, but its role in mitigating some of the effects of social distancing during the pandemic showed that it also has the potential to improve these outcomes. Whilst existing research disagrees on the degree to which social media use harms or helps, there is growing consensus around the need to move from global measures of social media use to specific measures of types of social media use. These new measures can enable an exploration of proposed mechanisms and causal pathways linking social media use and mental health and wellbeing. A commonly proposed mechanism is nighttime social media use reducing sleep quality, and consequently harming mental health and wellbeing. Objectives & ApproachWe aimed to investigate the relationships between the time Twitter users post content and their mental health, wellbeing and sleep quality using direct measurements of Twitter use linked to standardised mental health measures in a well-characterized cohort. This study uses approximately 1.5 million Tweets harvested between January 2008 and March 2023 from 622 participants in the Avon Longitudinal Study of Parents and Children (ALSPAC). These Tweets have been linked to questionnaire data collected on six occasions spanning April 2019 to May 2021. These questionnaires included standard measures of depressive symptoms, anxiety symptoms, mental wellbeing and difficulty sleeping.We have taken two approaches to explore these relationships, using circular statistical methods novel to social media data analysis to account for day/night cycles. The first approach used mixed effect models to investigate the association between the time a Tweet was posted and the mental health, mental wellbeing and sleep quality of the poster. The second approach explored the relationships between the mean hour participants post Tweets in a given time period, and their mental health, mental wellbeing and sleep quality. Relevance to Digital FootprintsThis research is highly relevant to Digital Footprints, due to its use of data directly extracted from a social media site. The methodologies employed in analysing this alongside more traditional epidemiological survey data provides an example of how digital footprint data can complemented by high quality ground truths. ResultsThere was evidence that the timing of Twitter activity was predictive of the mental wellbeing and sleep quality of participants, even after adjustment for demographic, educational and socio-economic covariates. However, the hour a Tweet was posted at explained very little of the variation in the mental wellbeing or sleep quality of the participant who posted it (0.1% and less than 0.1% respectively). There was weak to no evidence that the timing of Twitter activity was predictive of the depressive and anxiety symptoms of participants. Conclusions & Implic
介绍,在过去十年中,社交媒体的使用被认为是导致心理健康和福祉恶化的一个原因,但它在缓解大流行期间社交距离的一些影响方面的作用表明,它也有可能改善这些结果。虽然现有的研究在社交媒体使用的危害或帮助程度上存在分歧,但越来越多的人认为,有必要从对社交媒体使用的全球衡量转向对社交媒体使用类型的具体衡量。这些新措施可以探索将社交媒体使用与心理健康和福祉联系起来的拟议机制和因果途径。一种普遍提出的机制是,夜间使用社交媒体会降低睡眠质量,从而损害心理健康和福祉。目标,方法:我们旨在调查Twitter用户发布内容的时间与他们的心理健康、幸福和睡眠质量之间的关系,在一个特征明确的队列中,使用与标准化心理健康措施相关的Twitter使用的直接测量。 这项研究使用了2008年1月至2023年3月期间从雅芳父母与儿童纵向研究(ALSPAC)的622名参与者中收集的大约150万条推文。这些推文与2019年4月至2021年5月期间六次收集的问卷数据有关。这些问卷包括抑郁症状、焦虑症状、心理健康和睡眠困难的标准测量。我们采用了两种方法来探索这些关系,使用循环统计方法来分析社交媒体数据,以解释昼夜周期。第一种方法使用混合效应模型来调查推特发布时间与发布者的心理健康、心理健康和睡眠质量之间的关系。第二种方法探讨了参与者在给定时间段内发布推文的平均小时数与他们的心理健康、心理健康和睡眠质量之间的关系。与数字足迹的相关性这项研究与数字足迹高度相关,因为它使用了直接从社交媒体网站提取的数据。与更传统的流行病学调查数据一起分析这一数据所采用的方法提供了一个例子,说明数字足迹数据如何与高质量的实地事实相辅相成。结果有证据表明,Twitter活动的时间可以预测参与者的心理健康和睡眠质量,即使在调整了人口统计学、教育和社会经济协变量之后也是如此。然而,推特发布的时间几乎不能解释发布推特的参与者的心理健康或睡眠质量的变化(分别为0.1%和不到0.1%)。几乎没有证据表明推特活动的时间可以预测参与者的抑郁和焦虑症状。 结论,虽然这项研究发现有证据表明,参与者在推特上发帖的时间可以预测他们的心理健康和睡眠质量,但这些模型解释的差异数量表明,这不是一个临床相关的风险因素。这项研究支持了文献中的观点,即社交媒体的使用对心理健康、幸福和睡眠质量的影响非常小,而且微不足道。
{"title":"Investigating the Relationship between Timing of Tweets and Mental Health, Well-being and Sleep Quality in a UK birth cohort","authors":"Oliver Davis, Nina Di Cara, Nello Cristianini, Claire Haworth","doi":"10.23889/ijpds.v8i3.2284","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2284","url":null,"abstract":"Introduction & BackgroundSocial media use has been proposed as a cause of worsening mental health and wellbeing over the last decade, but its role in mitigating some of the effects of social distancing during the pandemic showed that it also has the potential to improve these outcomes. Whilst existing research disagrees on the degree to which social media use harms or helps, there is growing consensus around the need to move from global measures of social media use to specific measures of types of social media use. These new measures can enable an exploration of proposed mechanisms and causal pathways linking social media use and mental health and wellbeing. A commonly proposed mechanism is nighttime social media use reducing sleep quality, and consequently harming mental health and wellbeing.
 Objectives & ApproachWe aimed to investigate the relationships between the time Twitter users post content and their mental health, wellbeing and sleep quality using direct measurements of Twitter use linked to standardised mental health measures in a well-characterized cohort.
 This study uses approximately 1.5 million Tweets harvested between January 2008 and March 2023 from 622 participants in the Avon Longitudinal Study of Parents and Children (ALSPAC). These Tweets have been linked to questionnaire data collected on six occasions spanning April 2019 to May 2021. These questionnaires included standard measures of depressive symptoms, anxiety symptoms, mental wellbeing and difficulty sleeping.We have taken two approaches to explore these relationships, using circular statistical methods novel to social media data analysis to account for day/night cycles. The first approach used mixed effect models to investigate the association between the time a Tweet was posted and the mental health, mental wellbeing and sleep quality of the poster. The second approach explored the relationships between the mean hour participants post Tweets in a given time period, and their mental health, mental wellbeing and sleep quality.
 Relevance to Digital FootprintsThis research is highly relevant to Digital Footprints, due to its use of data directly extracted from a social media site. The methodologies employed in analysing this alongside more traditional epidemiological survey data provides an example of how digital footprint data can complemented by high quality ground truths.
 ResultsThere was evidence that the timing of Twitter activity was predictive of the mental wellbeing and sleep quality of participants, even after adjustment for demographic, educational and socio-economic covariates. However, the hour a Tweet was posted at explained very little of the variation in the mental wellbeing or sleep quality of the participant who posted it (0.1% and less than 0.1% respectively). There was weak to no evidence that the timing of Twitter activity was predictive of the depressive and anxiety symptoms of participants.
 Conclusions & Implic","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"188 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135153270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Healthy foods, healthy sales? Cross-category spillover effects of a reward program promoting sales of fruit and vegetables 健康食品,健康销售?促进水果和蔬菜销售奖励计划的跨品类溢出效应
Pub Date : 2023-09-18 DOI: 10.23889/ijpds.v8i3.2270
Luca Panzone, Barbara Tocco, Ruzica Brecic, Matthew Gorton
Introduction & BackgroundGlobally, consumption of Fruit and Vegetables (F&V) remains below nutritional guidelines. Marketing healthy products can be complex for retailers, and a key challenge is the design of strategies that benefit retailers, e.g., through improved loyalty, and deliver progress on societal goals. Objectives & ApproachThis study evaluates a point-plus-cash frequency reward program where participants received points by purchasing selected F&V, redeemable against a reward (plush toys in the shape of F&V). We estimate the impact of the program using a difference-in-difference-in-difference model, which compares expenditures in several categories before, during, and after the promotional period, across two different years, and separately for consumers who redeemed a reward and those who did not. Identification includes weighting for the propensity scores, and using an instrumental variable approach. Relevance to Digital FootprintsThe data refers to grocery expenditure in five categories in the focal retailer for over 268,000 consumers, over 27 weeks for 2 years, as recorded through their loyalty card. ResultsThe reward program significantly increased expenditures in F&V in the focal retailer during the promotional period. However, results differed depending on reward redemption. For reward-redeemers, the program increased expenditures in F&V as well as in other food categories, an effect that persisted – at a declining rate – after the program stopped. The program had a short-lived effect on non-rewards redeemers, who only increased F&V expenditures during the promotional period. Conclusions & ImplicationsResults indicate that a loyalty program promoting sales of F&V can create win-win benefits to both society and the retailer: it increases expenditures on healthy foods (F&V), while improving overall loyalty (i.e., expenditures) to the retailer.
介绍,在全球范围内,水果和蔬菜(F&V)的消费量仍低于营养指南。对于零售商来说,健康产品的营销可能是复杂的,一个关键的挑战是设计有利于零售商的策略,例如,通过提高忠诚度,并在社会目标上取得进展。目标,这项研究评估了一个积分加现金的频率奖励计划,参与者通过购买选定的F&V来获得积分,兑换成奖励(F&V形状的毛绒玩具)。我们使用“差中差中差”模型来估计该计划的影响,该模型比较了促销期间之前,期间和之后的几个类别的支出,跨越两个不同的年份,并分别针对兑换奖励的消费者和没有兑换奖励的消费者。识别包括倾向得分的加权,并使用工具变量方法。 与数字足迹的相关性该数据指的是超过26.8万名消费者在重点零售商的五类杂货支出,为期两年,超过27周,通过他们的会员卡记录。结果奖励计划显著增加了重点零售商在促销期间的餐饮支出。然而,结果因奖励的不同而不同。对于奖励兑换者来说,该计划增加了食品和其他食品类别的支出,这种影响在计划停止后持续存在,但速度越来越慢。该计划对非奖励兑换者产生了短暂的影响,他们在促销期间只增加了餐饮支出。 结论,结果表明,促进食品V销售的忠诚计划可以为社会和零售商创造双赢的利益:它增加了健康食品(食品V)的支出,同时提高了对零售商的整体忠诚度(即支出)。
{"title":"Healthy foods, healthy sales? Cross-category spillover effects of a reward program promoting sales of fruit and vegetables","authors":"Luca Panzone, Barbara Tocco, Ruzica Brecic, Matthew Gorton","doi":"10.23889/ijpds.v8i3.2270","DOIUrl":"https://doi.org/10.23889/ijpds.v8i3.2270","url":null,"abstract":"Introduction & BackgroundGlobally, consumption of Fruit and Vegetables (F&V) remains below nutritional guidelines. Marketing healthy products can be complex for retailers, and a key challenge is the design of strategies that benefit retailers, e.g., through improved loyalty, and deliver progress on societal goals.
 Objectives & ApproachThis study evaluates a point-plus-cash frequency reward program where participants received points by purchasing selected F&V, redeemable against a reward (plush toys in the shape of F&V). We estimate the impact of the program using a difference-in-difference-in-difference model, which compares expenditures in several categories before, during, and after the promotional period, across two different years, and separately for consumers who redeemed a reward and those who did not. Identification includes weighting for the propensity scores, and using an instrumental variable approach.
 Relevance to Digital FootprintsThe data refers to grocery expenditure in five categories in the focal retailer for over 268,000 consumers, over 27 weeks for 2 years, as recorded through their loyalty card.
 ResultsThe reward program significantly increased expenditures in F&V in the focal retailer during the promotional period. However, results differed depending on reward redemption. For reward-redeemers, the program increased expenditures in F&V as well as in other food categories, an effect that persisted – at a declining rate – after the program stopped. The program had a short-lived effect on non-rewards redeemers, who only increased F&V expenditures during the promotional period.
 Conclusions & ImplicationsResults indicate that a loyalty program promoting sales of F&V can create win-win benefits to both society and the retailer: it increases expenditures on healthy foods (F&V), while improving overall loyalty (i.e., expenditures) to the retailer.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135154240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal for Population Data Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1