SynthEco -一个多层次的数字生态系统,用于分析环境中复杂的人类行为

Antonia Gieschen, Catherine Paquet, Raja Sengupta, Anna-Liisa Aunio, Fares Belkhiria, Shawn Brown, Laurette Dube
{"title":"SynthEco -一个多层次的数字生态系统,用于分析环境中复杂的人类行为","authors":"Antonia Gieschen, Catherine Paquet, Raja Sengupta, Anna-Liisa Aunio, Fares Belkhiria, Shawn Brown, Laurette Dube","doi":"10.23889/ijpds.v8i3.2285","DOIUrl":null,"url":null,"abstract":"Introduction & BackgroundHuman behaviour is multi-faceted and complex, with different dimensions interacting and impacting each other and individuals operating in an environmental context. In order to understand this behaviour better, the combination of data from different sources is useful to uncover some of those interactions and complexities. We present a multi-layered digital ecosystem based on a data platform providing statistically representative synthetic population derived from census data at different geo-spatial granularity, which we call SynthEco. This platform is enriched with individual data stemming from cohorts and cross-sectional surveys and geo-scanning of different layers of socio-environmental actors and conditions to create a complex digital ecosystem.
 Objectives & ApproachThe objective of SynthEco is to allow for the analysis of behaviour, as well as health and wellbeing outcomes, through the integration of cohort and cross-sectional data into a geospatially anchored synthetic population embedded into environmental data which is forming the backdrop. We demonstrate the use of this platform on the example of Montreal, Canada. The synthetic population is first generated from census data using iterative proportional fitting, which allows for the creation of a population data set that is artificial yet statistically representative for a given geospatial granularity, such as a city. Each individual household is assigned a geospatial location, which allows for the consideration of their surrounding environment including enterprises or institutions such as schools, hospitals and the local food environment. Through fuzzy matching and statistical extrapolation, different cohort and cross-sectional survey data are then merged to individual records, in order to describe them in more detail. This includes health, as well as financial wellbeing or social environment descriptors.
 Relevance to Digital FootprintsThere are two important points made through the presented work in relation to Digital Footprints data: the first is the technical approach to merging multiple datasets describing different dimensions of interacting human characteristics and behaviour by anchoring them into a synthetic population through fuzzy record matching. The second is the consideration of a spatial dimension when describing human behaviour. This is especially important when describing behaviour within local environments, such as the interaction with local food outlets.
 ResultsRecent work in this context includes an analysis of the food environment in Montreal, Canada. It introduces a way of utilising the synthetic population to predict the healthfulness of their local environment in terms of healthy food outlets, as well as providing a platform for the analysis of food environment surveillance and intervention simulations. For this purpose, the healthfulness of different census tract regions in Montreal is calculated to identify food deserts, food swamps, as well as healthy areas as defined through the Modified Retail Food Environment Index. We test different machine learning approaches to then predict these healthfulness scores using census variables from the synthetic population in their respective census tract, achieving accuracy scores of around 0.53 to 0.60. This demonstrates that census data has some limited predictive power in explaining the healthiness of the local food environment, which could be especially relevant for situations in which no information on the retailers is available to local policy makers. Future work can extend this approach to also include further data describing the population, stemming from the integrated cohorts and survey data, which could improve the prediction accuracy or help in identifying areas of concern.
 Conclusions & ImplicationsThe presented SynthEco platform views individuals as agents nested within modular systems of systems, trying to capture both internal systems and processes as well as environmental ones within which individuals are operating. The platform thus enables the application of computational systems modelling for the analysis of individual human behaviour in contexts. As demonstrated through the example of using SynthEco in the context of healthier food environments, the approach is especially relevant to practitioners and policy makers interested in local intervention strategies and identifying areas for targeted policy related to different dimensions of health and wellbeing.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SynthEco - A multi-layered digital ecosystem for analysing complex human behaviour in context\",\"authors\":\"Antonia Gieschen, Catherine Paquet, Raja Sengupta, Anna-Liisa Aunio, Fares Belkhiria, Shawn Brown, Laurette Dube\",\"doi\":\"10.23889/ijpds.v8i3.2285\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction & BackgroundHuman behaviour is multi-faceted and complex, with different dimensions interacting and impacting each other and individuals operating in an environmental context. In order to understand this behaviour better, the combination of data from different sources is useful to uncover some of those interactions and complexities. We present a multi-layered digital ecosystem based on a data platform providing statistically representative synthetic population derived from census data at different geo-spatial granularity, which we call SynthEco. This platform is enriched with individual data stemming from cohorts and cross-sectional surveys and geo-scanning of different layers of socio-environmental actors and conditions to create a complex digital ecosystem.
 Objectives & ApproachThe objective of SynthEco is to allow for the analysis of behaviour, as well as health and wellbeing outcomes, through the integration of cohort and cross-sectional data into a geospatially anchored synthetic population embedded into environmental data which is forming the backdrop. We demonstrate the use of this platform on the example of Montreal, Canada. The synthetic population is first generated from census data using iterative proportional fitting, which allows for the creation of a population data set that is artificial yet statistically representative for a given geospatial granularity, such as a city. Each individual household is assigned a geospatial location, which allows for the consideration of their surrounding environment including enterprises or institutions such as schools, hospitals and the local food environment. Through fuzzy matching and statistical extrapolation, different cohort and cross-sectional survey data are then merged to individual records, in order to describe them in more detail. This includes health, as well as financial wellbeing or social environment descriptors.
 Relevance to Digital FootprintsThere are two important points made through the presented work in relation to Digital Footprints data: the first is the technical approach to merging multiple datasets describing different dimensions of interacting human characteristics and behaviour by anchoring them into a synthetic population through fuzzy record matching. The second is the consideration of a spatial dimension when describing human behaviour. This is especially important when describing behaviour within local environments, such as the interaction with local food outlets.
 ResultsRecent work in this context includes an analysis of the food environment in Montreal, Canada. It introduces a way of utilising the synthetic population to predict the healthfulness of their local environment in terms of healthy food outlets, as well as providing a platform for the analysis of food environment surveillance and intervention simulations. For this purpose, the healthfulness of different census tract regions in Montreal is calculated to identify food deserts, food swamps, as well as healthy areas as defined through the Modified Retail Food Environment Index. We test different machine learning approaches to then predict these healthfulness scores using census variables from the synthetic population in their respective census tract, achieving accuracy scores of around 0.53 to 0.60. This demonstrates that census data has some limited predictive power in explaining the healthiness of the local food environment, which could be especially relevant for situations in which no information on the retailers is available to local policy makers. Future work can extend this approach to also include further data describing the population, stemming from the integrated cohorts and survey data, which could improve the prediction accuracy or help in identifying areas of concern.
 Conclusions & ImplicationsThe presented SynthEco platform views individuals as agents nested within modular systems of systems, trying to capture both internal systems and processes as well as environmental ones within which individuals are operating. The platform thus enables the application of computational systems modelling for the analysis of individual human behaviour in contexts. As demonstrated through the example of using SynthEco in the context of healthier food environments, the approach is especially relevant to practitioners and policy makers interested in local intervention strategies and identifying areas for targeted policy related to different dimensions of health and wellbeing.\",\"PeriodicalId\":132937,\"journal\":{\"name\":\"International Journal for Population Data Science\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal for Population Data Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23889/ijpds.v8i3.2285\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal for Population Data Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23889/ijpds.v8i3.2285","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

介绍,人类行为是多方面和复杂的,不同的维度相互作用和影响,个人在环境背景下运作。为了更好地理解这种行为,来自不同来源的数据组合有助于揭示其中的一些交互和复杂性。我们提出了一个基于数据平台的多层数字生态系统,该平台提供从不同地理空间粒度的人口普查数据中提取的具有统计代表性的合成人口,我们称之为SynthEco。该平台丰富了来自队列和横断面调查的个人数据,以及对不同层次的社会环境行为者和条件的地理扫描,以创建一个复杂的数字生态系统。 目标,SynthEco的目标是通过将队列和横断面数据整合到嵌入环境数据的地理空间锚定的合成人口中,从而分析行为以及健康和福祉结果。环境数据正在形成背景。我们以加拿大蒙特利尔为例说明了该平台的使用。首先使用迭代比例拟合从人口普查数据生成合成人口,这允许创建人口数据集,该数据集是人工的,但在统计上代表给定的地理空间粒度,例如城市。每个家庭都被分配了一个地理空间位置,以便考虑其周围环境,包括企业或机构,如学校、医院和当地食品环境。然后,通过模糊匹配和统计外推,将不同队列和横断面调查数据合并到个人记录中,以便更详细地描述它们。这包括健康,以及财务状况或社会环境描述符。与数字足迹相关通过所介绍的与数字足迹数据相关的工作,有两个要点:首先是通过模糊记录匹配将描述交互人类特征和行为的不同维度的多个数据集锚定到合成人口中,从而合并多个数据集的技术方法。第二个是在描述人类行为时对空间维度的考虑。这在描述当地环境中的行为时尤其重要,例如与当地食品商店的互动。 最近在这方面的工作包括对加拿大蒙特利尔食品环境的分析。它引入了一种利用合成人口来预测其当地环境在健康食品销售点方面的健康状况的方法,并为食品环境监测和干预模拟分析提供了一个平台。为此,对蒙特利尔不同人口普查区的健康状况进行了计算,以确定通过修订零售食品环境指数定义的食物沙漠、食物沼泽和健康区域。我们测试了不同的机器学习方法,然后使用来自各自人口普查区的合成人口的人口普查变量来预测这些健康分数,获得了大约0.53到0.60的准确性分数。这表明,人口普查数据在解释当地食品环境的健康状况方面具有有限的预测能力,这对于当地决策者无法获得零售商信息的情况尤其重要。未来的工作可以扩展这种方法,包括进一步描述人口的数据,这些数据来自综合队列和调查数据,这可以提高预测的准确性或帮助确定关注的领域。结论,所提出的SynthEco平台将个体视为嵌套在系统的模块化系统中的代理,试图捕获内部系统和过程以及个体在其中操作的环境。因此,该平台能够应用计算系统建模来分析环境中的个体人类行为。正如在更健康的食品环境背景下使用SynthEco的例子所表明的那样,该方法特别适用于对地方干预战略感兴趣的从业人员和政策制定者,并确定与健康和福祉的不同层面相关的目标政策领域。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
SynthEco - A multi-layered digital ecosystem for analysing complex human behaviour in context
Introduction & BackgroundHuman behaviour is multi-faceted and complex, with different dimensions interacting and impacting each other and individuals operating in an environmental context. In order to understand this behaviour better, the combination of data from different sources is useful to uncover some of those interactions and complexities. We present a multi-layered digital ecosystem based on a data platform providing statistically representative synthetic population derived from census data at different geo-spatial granularity, which we call SynthEco. This platform is enriched with individual data stemming from cohorts and cross-sectional surveys and geo-scanning of different layers of socio-environmental actors and conditions to create a complex digital ecosystem. Objectives & ApproachThe objective of SynthEco is to allow for the analysis of behaviour, as well as health and wellbeing outcomes, through the integration of cohort and cross-sectional data into a geospatially anchored synthetic population embedded into environmental data which is forming the backdrop. We demonstrate the use of this platform on the example of Montreal, Canada. The synthetic population is first generated from census data using iterative proportional fitting, which allows for the creation of a population data set that is artificial yet statistically representative for a given geospatial granularity, such as a city. Each individual household is assigned a geospatial location, which allows for the consideration of their surrounding environment including enterprises or institutions such as schools, hospitals and the local food environment. Through fuzzy matching and statistical extrapolation, different cohort and cross-sectional survey data are then merged to individual records, in order to describe them in more detail. This includes health, as well as financial wellbeing or social environment descriptors. Relevance to Digital FootprintsThere are two important points made through the presented work in relation to Digital Footprints data: the first is the technical approach to merging multiple datasets describing different dimensions of interacting human characteristics and behaviour by anchoring them into a synthetic population through fuzzy record matching. The second is the consideration of a spatial dimension when describing human behaviour. This is especially important when describing behaviour within local environments, such as the interaction with local food outlets. ResultsRecent work in this context includes an analysis of the food environment in Montreal, Canada. It introduces a way of utilising the synthetic population to predict the healthfulness of their local environment in terms of healthy food outlets, as well as providing a platform for the analysis of food environment surveillance and intervention simulations. For this purpose, the healthfulness of different census tract regions in Montreal is calculated to identify food deserts, food swamps, as well as healthy areas as defined through the Modified Retail Food Environment Index. We test different machine learning approaches to then predict these healthfulness scores using census variables from the synthetic population in their respective census tract, achieving accuracy scores of around 0.53 to 0.60. This demonstrates that census data has some limited predictive power in explaining the healthiness of the local food environment, which could be especially relevant for situations in which no information on the retailers is available to local policy makers. Future work can extend this approach to also include further data describing the population, stemming from the integrated cohorts and survey data, which could improve the prediction accuracy or help in identifying areas of concern. Conclusions & ImplicationsThe presented SynthEco platform views individuals as agents nested within modular systems of systems, trying to capture both internal systems and processes as well as environmental ones within which individuals are operating. The platform thus enables the application of computational systems modelling for the analysis of individual human behaviour in contexts. As demonstrated through the example of using SynthEco in the context of healthier food environments, the approach is especially relevant to practitioners and policy makers interested in local intervention strategies and identifying areas for targeted policy related to different dimensions of health and wellbeing.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Using novel data linkage of biobank data with administrative health data to inform genomic analysis for future precision medicine treatment of congenital heart disease Common governance model: a way to avoid data segregation between existing trusted research environment Federated learning for generating synthetic data: a scoping review Health Data Governance for Research Use in Alberta Establishment of a birth-to-education cohort of 1 million Palestinian refugees using electronic medical records and electronic education records
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1