Machine learning on national shopping data reliably estimates childhood obesity prevalence and socio-economic deprivation

IF 6.8 · Zone 1 (Economics) · JCR Q1 AGRICULTURAL ECONOMICS & POLICY · Food Policy · Pub Date: 2025-02-01 · DOI: 10.1016/j.foodpol.2025.102826
Gavin Long, Georgiana Nica-Avram, John Harvey, Evgeniya Lukinova, Roberto Mansilla, Simon Welham, Gregor Engelmann, Elizabeth Dolan, Kuzivakwashe Makokoro, Michelle Thomas, Edward Powell, James Goulding
Food Policy, volume 131, Article 102826. Full text: https://www.sciencedirect.com/science/article/pii/S0306919225000302
Citations: 0

Abstract

Deprivation pushes people to choose cheap, calorie-dense foods instead of nutritious but expensive alternatives. Diseases resulting from these poor dietary choices, such as obesity, cardiovascular disease, and diabetes, place a significant burden on public health systems. Measuring nutritional insecurity is difficult to achieve at scale, so studying the relationship between nutritional outcomes and deprivation at a national level is very challenging. This makes it difficult to understand the effect of new policies or track changes over time. To address this challenge, we develop a machine learning approach using massive anonymised transactional data (4 million members and 2.5 billion transactions) in partnership with the retailer The Co-operative Group UK. We engineer a series of variables related to obesogenic diets, including a new measure called ‘Calorie-oriented purchasing’. These variables help illustrate how large-scale transactional data can discriminate between neighbourhoods most affected by deprivation and childhood obesity. Through comparative assessment of machine learning approaches, we find better performance from tree-based models (Random Forest, XGBoost), with a best accuracy of 0.88 for predicting deprivation and 0.79 for childhood obesity. Calorie-oriented purchasing emerges as a robust predictor of deprivation and childhood obesity at the census area level. Results show this approach can help summarise nutritional insecurity and support its spatio-temporal monitoring. We conclude with policy implications and recommend that retailers adopt new measures of national nutrition insecurity.
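The comparative assessment mentioned in the abstract ranks models by accuracy, which in its standard form is the share of cases (here, census areas) whose class is predicted correctly. The sketch below illustrates only that comparison step. The labels, the predictions, the model names used as dictionary keys, and the helper names `accuracy` and `best_model` are all made up for illustration; none of them come from the paper, and no model is actually trained.

```python
from typing import Dict, List, Tuple


def accuracy(y_true: List[int], y_pred: List[int]) -> float:
    """Share of areas whose class label was predicted correctly."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label lists must be non-empty and the same length")
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def best_model(
    y_true: List[int], predictions: Dict[str, List[int]]
) -> Tuple[str, Dict[str, float]]:
    """Score every candidate model and return the most accurate one."""
    scores = {name: accuracy(y_true, preds) for name, preds in predictions.items()}
    winner = max(scores, key=scores.get)
    return winner, scores


# Hypothetical labels for ten held-out census areas
# (1 = among the most deprived, 0 = not). Invented data, not from the study.
y_true = [1, 0, 1, 1, 0, 0, 1, 0, 1, 0]

# Hypothetical predictions from two candidate models (assumed names).
predictions = {
    "random_forest": [1, 0, 1, 1, 0, 0, 1, 0, 0, 0],
    "linear_baseline": [1, 0, 0, 1, 0, 1, 1, 0, 0, 0],
}

winner, scores = best_model(y_true, predictions)
print(winner, scores)
```

In practice the predictions would come from models fitted on area-level purchasing variables and evaluated on held-out areas; accuracy alone can also mislead when classes are imbalanced, so it is usually read alongside other metrics.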
Source journal
Food Policy (Management Science: Agricultural Economics & Policy)
CiteScore: 11.40
Self-citation rate: 4.60%
Articles published: 128
Review time: 62 days
About the journal: Food Policy is a multidisciplinary journal publishing original research and novel evidence on issues in the formulation, implementation, and evaluation of policies for the food sector in developing, transition, and advanced economies. Our main focus is on the economic and social aspects of food policy, and we prioritize empirical studies informing international food policy debates. Provided that articles make a clear and explicit contribution to food policy debates of international interest, we consider papers from any of the social sciences. Papers from other disciplines (e.g., law) will be considered only if they provide a key policy contribution and are written in a style that is accessible to a social science readership.