首页 > 最新文献

International Journal of Population Data Science最新文献

英文 中文
Cohort profile: The SAIL long-term conditions e-cohort (SLTC cohort) investigating area-level changes in healthcare resource use in Wales. 队列概况:SAIL长期条件电子队列(SLTC队列)调查威尔士医疗保健资源使用的区域水平变化。
IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-06-17 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i1.2465
Timothy Osborne, Rowena Bailey, Amy Mizen, Richard Fry, Ronan A Lyons

Introduction: The prioritisation of acute cases of coronavirus during the pandemic caused significant disruption to non-urgent healthcare services, creating a backlog of undiagnosed and untreated individuals with long-term conditions. Previous research has explored the impact of the pandemic on long-term conditions in Wales, but not the geographic variation or underlying area-level characteristics associated with these changes.

Objectives: We created the SAIL long-term conditions e-cohort (SLTC cohort) within the Secure Anonymised Information Linkage (SAIL) Databank to describe changes in healthcare service use of individuals living with long-term conditions during the COVID-19 pandemic, and to facilitate future investigations into the underlying reasons for these changes.

Methods: Individuals were included in the cohort if they interacted with health services with a long-term condition between January 2017 and December 2022. Interactions were identified using primary and secondary care datasets within the SAIL Databank. We linked this interaction level data with individual, residence, and area-level demographic data. We calculated area-level age-sex-standardised rates of interactions, based on an individual's address at the time of interaction, for the 3 years pre-COVID-19 (2017-2019) and during-COVID-19 (2020-2022). Percentage changes in rates between these time periods were calculated, and we investigated the underlying area-level characteristics associated with these differences.

Results: The SLTC cohort contains 1,277,532 individuals. Age-sex standardised interaction rates varied by Welsh Index of Multiple Deprivation (WIMD) quintiles and Rural-Urban Classification. Areas in the most deprived WIMD quintile had the greatest median percentage decrease (23.5%) in primary care rates of interactions from pre- to during-COVID-19, and the least deprived overall WIMD quintile had the smallest (16.9%). Areas classified as 'Urban city & town in a sparse setting' had the greatest decrease in primary care interactions (29.7%), and 'Rural village' areas had the smallest decrease (17.1%). Secondary care rates of interactions showed less variation in rates of interactions between the two time periods.

Conclusion: We have created a cohort that links area-level characteristics and measures of healthcare resource use, in a study period that covers pre- and during-COVID-19, which will allow researchers to investigate geographic variation of changes in healthcare resource use over this time period and the underlying influences. This cohort can also be further linked to other area-level characteristics of interest, such as travel times to general practices, or access to green space measures.

导言:大流行期间对冠状病毒急性病例的优先处理对非紧急医疗服务造成了严重干扰,造成了长期疾病患者未确诊和未治疗的积压。以前的研究探讨了大流行对威尔士长期状况的影响,但没有探讨与这些变化相关的地理差异或潜在的区域特征。目的:我们在安全匿名信息链接(SAIL)数据库中创建了SAIL长期病情电子队列(SLTC队列),以描述COVID-19大流行期间患有长期疾病的个人在医疗保健服务使用方面的变化,并促进未来对这些变化的潜在原因的调查。方法:将2017年1月至2022年12月期间与长期患病的卫生服务机构互动的个体纳入队列。使用SAIL数据库中的初级和二级保健数据集确定相互作用。我们将这种互动水平的数据与个人、居住地和地区水平的人口统计数据联系起来。我们根据个人在互动时的地址计算了2019冠状病毒病前(2017-2019)和2019冠状病毒病期间(2020-2022)的区域级年龄-性别标准化互动率。我们计算了这些时间段之间的百分比变化,并调查了与这些差异相关的潜在区域特征。结果:SLTC队列包含1,277,532人。年龄-性别标准化的相互作用率因威尔士多重剥夺指数(WIMD)五分位数和城乡分类而异。在最贫困的WIMD五分位数地区,从covid -19之前到期间,初级保健相互作用率的中位数百分比下降幅度最大(23.5%),而最贫困的WIMD五分位数总体下降幅度最小(16.9%)。被归类为“稀疏环境中的城市和城镇”的地区,初级保健互动减少幅度最大(29.7%),而“农村”地区减少幅度最小(17.1%)。相互作用的二级保健率在两个时间段之间的相互作用率变化较小。结论:我们创建了一个队列,将区域层面的特征和医疗资源使用的措施联系起来,在一个涵盖covid -19之前和期间的研究期间,这将使研究人员能够调查这段时间内医疗资源使用变化的地理差异及其潜在影响。这个队列还可以进一步与其他感兴趣的区域级特征联系起来,例如到一般诊所的旅行时间,或获得绿色空间措施。
{"title":"Cohort profile: The SAIL long-term conditions e-cohort (SLTC cohort) investigating area-level changes in healthcare resource use in Wales.","authors":"Timothy Osborne, Rowena Bailey, Amy Mizen, Richard Fry, Ronan A Lyons","doi":"10.23889/ijpds.v10i1.2465","DOIUrl":"10.23889/ijpds.v10i1.2465","url":null,"abstract":"<p><strong>Introduction: </strong>The prioritisation of acute cases of coronavirus during the pandemic caused significant disruption to non-urgent healthcare services, creating a backlog of undiagnosed and untreated individuals with long-term conditions. Previous research has explored the impact of the pandemic on long-term conditions in Wales, but not the geographic variation or underlying area-level characteristics associated with these changes.</p><p><strong>Objectives: </strong>We created the SAIL long-term conditions e-cohort (SLTC cohort) within the Secure Anonymised Information Linkage (SAIL) Databank to describe changes in healthcare service use of individuals living with long-term conditions during the COVID-19 pandemic, and to facilitate future investigations into the underlying reasons for these changes.</p><p><strong>Methods: </strong>Individuals were included in the cohort if they interacted with health services with a long-term condition between January 2017 and December 2022. Interactions were identified using primary and secondary care datasets within the SAIL Databank. We linked this interaction level data with individual, residence, and area-level demographic data. We calculated area-level age-sex-standardised rates of interactions, based on an individual's address at the time of interaction, for the 3 years pre-COVID-19 (2017-2019) and during-COVID-19 (2020-2022). Percentage changes in rates between these time periods were calculated, and we investigated the underlying area-level characteristics associated with these differences.</p><p><strong>Results: </strong>The SLTC cohort contains 1,277,532 individuals. Age-sex standardised interaction rates varied by Welsh Index of Multiple Deprivation (WIMD) quintiles and Rural-Urban Classification. Areas in the most deprived WIMD quintile had the greatest median percentage decrease (23.5%) in primary care rates of interactions from pre- to during-COVID-19, and the least deprived overall WIMD quintile had the smallest (16.9%). Areas classified as 'Urban city & town in a sparse setting' had the greatest decrease in primary care interactions (29.7%), and 'Rural village' areas had the smallest decrease (17.1%). Secondary care rates of interactions showed less variation in rates of interactions between the two time periods.</p><p><strong>Conclusion: </strong>We have created a cohort that links area-level characteristics and measures of healthcare resource use, in a study period that covers pre- and during-COVID-19, which will allow researchers to investigate geographic variation of changes in healthcare resource use over this time period and the underlying influences. This cohort can also be further linked to other area-level characteristics of interest, such as travel times to general practices, or access to green space measures.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 1","pages":"2465"},"PeriodicalIF":2.2,"publicationDate":"2025-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12581028/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145446138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development and validation of a mortality risk prediction index score for adults living with HIV and multiple chronic comorbidities. 艾滋病毒和多种慢性合并症成人死亡风险预测指数评分的开发和验证
IF 1.6 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-06-10 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i2.2926
Viviane D Lima, Bronhilda T Takeh, Neil Faught, Hasan Nathani, Jielin Zhu, Scott Emerson, Katerina Dolguikh, Jason Trigg, Kate A Salters, Rolando Barrios, Julio S G Montaner

Introduction: Aging while living with HIV poses new challenges in clinical management, mainly due to the onset of multiple chronic comorbidities. Population-specific risk prediction indices considering comorbidities and other risk factors are essential to comprehensively characterise disease burden among PLWH. We developed and validated a mortality risk prediction index (MRPi) to predict the risk of one-year all-cause mortality among people living with HIV (PLWH).

Methods: Participants were ≥18 years and had initiated antiretroviral therapy (ART) between 01/2001 and 12/2018, in British Columbia, Canada. The index date was randomly selected between one-year post-ART initiation and the end of the follow-up. Participants were followed for at least one year from the index date until 12/2019, the last contact date, or the date of death (all-cause), whichever came first. The MRPi included 18 physical/mental comorbidities, demographic and clinical variables, and ranged from 0 (no risk) to 100 (highest risk).

Results: The final model demonstrated the highest discrimination (c-statistic 0.8355, 95% CI: 0.8187-0.8523 in the training dataset and 0.7965, 95% CI: 0.7664-0.8266 in the test dataset). The comorbidities with the highest weights in the MRPi were substance use disorders, metastatic solid tumors and non-AIDs defining cancers. For example, for an MRPi of 30, the predicted one-year all-cause mortality was 0.2%, while an MRPi of 50 had a predicted mortality of 2.3%.

Conclusions: The MRPi provides a promising tool to assess the risk of short-term mortality among PLWH in the modern ART era that can inform clinical practice and health policy decisions.

导言:老年艾滋病病毒感染者出现多种慢性合并症,给临床管理带来了新的挑战。考虑合并症和其他危险因素的人群特异性风险预测指标是全面表征PLWH疾病负担的必要条件。我们开发并验证了一种死亡率风险预测指数(MRPi),用于预测HIV感染者(PLWH)一年的全因死亡率风险。方法:参与者年龄≥18岁,在2001年1月至2018年12月期间在加拿大不列颠哥伦比亚省开始抗逆转录病毒治疗(ART)。指标日期随机选择在art治疗开始后1年至随访结束之间。参与者从索引日期开始至少随访一年,直到2019年12月12日,最后一次接触日期或死亡日期(全因),以先到者为准。MRPi包括18种身体/精神合并症、人口统计学和临床变量,范围从0(无风险)到100(最高风险)。结果:最终模型显示出最高的判别性(c-统计量0.8355,95% CI: 0.8187-0.8523,在训练数据集中为0.7965,95% CI: 0.7664-0.8266)。MRPi中权重最高的合并症是物质使用障碍、转移性实体瘤和非艾滋病定义的癌症。例如,MRPi为30时,预计一年的全因死亡率为0.2%,而MRPi为50时,预计死亡率为2.3%。结论:MRPi提供了一个有希望的工具来评估在现代ART时代PLWH的短期死亡风险,可以为临床实践和卫生政策决策提供信息。
{"title":"Development and validation of a mortality risk prediction index score for adults living with HIV and multiple chronic comorbidities.","authors":"Viviane D Lima, Bronhilda T Takeh, Neil Faught, Hasan Nathani, Jielin Zhu, Scott Emerson, Katerina Dolguikh, Jason Trigg, Kate A Salters, Rolando Barrios, Julio S G Montaner","doi":"10.23889/ijpds.v10i2.2926","DOIUrl":"10.23889/ijpds.v10i2.2926","url":null,"abstract":"<p><strong>Introduction: </strong>Aging while living with HIV poses new challenges in clinical management, mainly due to the onset of multiple chronic comorbidities. Population-specific risk prediction indices considering comorbidities and other risk factors are essential to comprehensively characterise disease burden among PLWH. We developed and validated a mortality risk prediction index (MRP<i>i</i>) to predict the risk of one-year all-cause mortality among people living with HIV (PLWH).</p><p><strong>Methods: </strong>Participants were ≥18 years and had initiated antiretroviral therapy (ART) between 01/2001 and 12/2018, in British Columbia, Canada. The index date was randomly selected between one-year post-ART initiation and the end of the follow-up. Participants were followed for at least one year from the index date until 12/2019, the last contact date, or the date of death (all-cause), whichever came first. The MRP<i>i</i> included 18 physical/mental comorbidities, demographic and clinical variables, and ranged from 0 (no risk) to 100 (highest risk).</p><p><strong>Results: </strong>The final model demonstrated the highest discrimination (c-statistic 0.8355, 95% CI: 0.8187-0.8523 in the training dataset and 0.7965, 95% CI: 0.7664-0.8266 in the test dataset). The comorbidities with the highest weights in the MRP<i>i</i> were substance use disorders, metastatic solid tumors and non-AIDs defining cancers. For example, for an MRP<i>i</i> of 30, the predicted one-year all-cause mortality was 0.2%, while an MRP<i>i</i> of 50 had a predicted mortality of 2.3%.</p><p><strong>Conclusions: </strong>The MRP<i>i</i> provides a promising tool to assess the risk of short-term mortality among PLWH in the modern ART era that can inform clinical practice and health policy decisions.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 2","pages":"2926"},"PeriodicalIF":1.6,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12212411/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144545179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Linking digital footprint data into longitudinal population studies. 将数字足迹数据与纵向人口研究联系起来。
IF 1.6 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-06-03 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i1.2946
Romana Burgess, Andy Boyd, Oliver Sp Davis, Louise Ac Millard, Mark Mumme, Sarah Robertson, Andy Skinner, Zhuoni Xiao, Anya Skatova

Background: Linking digital footprint data into longitudinal population studies (LPS) presents an opportunity to enrich our understanding of how digitally captured behaviours relate to health traits and disease. However, this linkage introduces significant methodological challenges that require systematic exploration.

Objectives: To develop a robust framework for successful digital footprint linkage into LPS, informed by discussions from a workshop from the Digital Footprints Conference 2024.

Methods: We propose a structured, four-stage framework to facilitate successful linkage of digital footprint data into LPS: (1) understand participant expectations and acceptability; (2) collect and link the data; (3) evaluate properties of the data; and (4) ensure secure and ethical access for research. This framework addresses the key methodological challenges identified at each stage, discussed through the lens of two LPS case studies: the Avon Longitudinal Study of Parents and Children and Generation Scotland.

Results: Key methodological challenges identified include privacy and confidentiality concerns, reliance on third-party platforms, data quality issues like missing data and measurement error. We also emphasize the role of trusted research environments and synthetic datasets in enabling secure, privacy-sensitive data sharing for research.

Conclusions: While the linkage digital footprint data to LPS remains in early stages, our framework provides a methodological foundation for overcoming current challenges. Through iterative refinement of these methods there is significant potential to advance population-level insights into health and wellbeing.

背景:将数字足迹数据与纵向人口研究(LPS)联系起来,为丰富我们对数字捕获的行为与健康特征和疾病之间的关系的理解提供了机会。然而,这种联系带来了需要系统探索的重大方法论挑战。目标:通过2024年数字足迹会议研讨会的讨论,为成功地将数字足迹链接到LPS开发一个强大的框架。方法:我们提出了一个结构化的四阶段框架,以促进数字足迹数据与LPS的成功联系:(1)了解参与者的期望和可接受性;(2)收集和链接数据;(3)评估数据的属性;(4)确保安全、符合伦理的研究获取途径。该框架解决了在每个阶段确定的关键方法挑战,并通过两个LPS案例研究进行了讨论:雅芳父母与儿童纵向研究和苏格兰一代。结果:确定的主要方法挑战包括隐私和保密问题,对第三方平台的依赖,数据丢失和测量误差等数据质量问题。我们还强调了可信的研究环境和合成数据集在实现安全、隐私敏感的研究数据共享中的作用。结论:虽然数字足迹数据与LPS的联系仍处于早期阶段,但我们的框架为克服当前的挑战提供了方法论基础。通过对这些方法的不断改进,有很大的潜力推进人口层面对健康和福祉的了解。
{"title":"Linking digital footprint data into longitudinal population studies.","authors":"Romana Burgess, Andy Boyd, Oliver Sp Davis, Louise Ac Millard, Mark Mumme, Sarah Robertson, Andy Skinner, Zhuoni Xiao, Anya Skatova","doi":"10.23889/ijpds.v10i1.2946","DOIUrl":"10.23889/ijpds.v10i1.2946","url":null,"abstract":"<p><strong>Background: </strong>Linking digital footprint data into longitudinal population studies (LPS) presents an opportunity to enrich our understanding of how digitally captured behaviours relate to health traits and disease. However, this linkage introduces significant methodological challenges that require systematic exploration.</p><p><strong>Objectives: </strong>To develop a robust framework for successful digital footprint linkage into LPS, informed by discussions from a workshop from the Digital Footprints Conference 2024.</p><p><strong>Methods: </strong>We propose a structured, four-stage framework to facilitate successful linkage of digital footprint data into LPS: (1) understand participant expectations and acceptability; (2) collect and link the data; (3) evaluate properties of the data; and (4) ensure secure and ethical access for research. This framework addresses the key methodological challenges identified at each stage, discussed through the lens of two LPS case studies: the Avon Longitudinal Study of Parents and Children and Generation Scotland.</p><p><strong>Results: </strong>Key methodological challenges identified include privacy and confidentiality concerns, reliance on third-party platforms, data quality issues like missing data and measurement error. We also emphasize the role of trusted research environments and synthetic datasets in enabling secure, privacy-sensitive data sharing for research.</p><p><strong>Conclusions: </strong>While the linkage digital footprint data to LPS remains in early stages, our framework provides a methodological foundation for overcoming current challenges. Through iterative refinement of these methods there is significant potential to advance population-level insights into health and wellbeing.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 1","pages":"2946"},"PeriodicalIF":1.6,"publicationDate":"2025-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12132027/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144217151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Individual, household structure, and socioeconomic predictors of COVID-19 testing and vaccination outcomes: a whole population linked data analysis. COVID-19检测和疫苗接种结果的个人、家庭结构和社会经济预测因素:与全人群相关的数据分析
IF 1.6 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-05-27 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i1.2930
Nicole Satherley, Andrew Sporle

Introduction: The COVID-19 pandemic produced social inequities in health outcomes between and within nations. Reported inequitable COVID-19 outcomes for ethnic minorities and indigenous peoples are likely to be associated in part because of poorer socioeconomic circumstances experienced by these populations. Understanding these associations within national populations is vital for future pandemic management.

Objective: This study explores the social inequity of COVID-19 outcomes within New Zealand over the first 3 years of the pandemic. We aimed to identify policy amenable socioeconomic factors associated with COVID-19 outcomes while adjusting for relevant individual factors and household structure. We also aimed to examine whether ethnic group differences are smaller when accounting for these socioeconomic factors and household structure.

Methods: Administrative individual-level data for the New Zealand population was analysed to assess COVID-19 health outcomes during 2020 - 2023. The association between individual (e.g. age, ethnicity, disability status), household structure (e.g. household composition) and socioeconomic (e.g. crowding, housing quality, deprivation) factors and four COVID-19 health outcomes - infection, hospitalisation, mortality, and vaccination status was assessed.

Results: Indigenous peoples and ethnic minorities experienced worse outcomes across most COVID-19 outcomes. Adjusting for household structure and socioeconomic factors reduced but did not eliminate these inequities between ethnic groups. Housing issues including high housing mobility, poor quality housing, and household crowding were associated with worse outcomes, as were disability status, no primary health care enrolment, lower household income and older age. The size of these effects also differed for different health outcomes.

Conclusions: Ethnic inequity was persistent and likely partly explained by policy-modifiable social factors, despite the relatively minor population health impacts of COVID-19 in New Zealand. We also demonstrate how a range of socioeconomic determinants predict COVID-19 outcomes in different ways.

导言:2019冠状病毒病大流行在国家之间和国家内部造成了卫生结果方面的社会不平等。少数民族和土著人民报告的COVID-19不公平结果可能部分与这些人群所经历的较差的社会经济环境有关。了解国家人群中的这些关联对于未来的大流行管理至关重要。目的:本研究探讨了新冠肺炎大流行头三年在新西兰的社会不平等现象。我们的目的是确定与COVID-19结果相关的政策适用的社会经济因素,同时调整相关的个人因素和家庭结构。我们还旨在研究在考虑这些社会经济因素和家庭结构时,族群差异是否较小。方法:分析新西兰人口行政层面的数据,以评估2020 - 2023年期间COVID-19的健康结果。评估了个人(如年龄、种族、残疾状况)、家庭结构(如家庭组成)和社会经济因素(如拥挤、住房质量、贫困)与COVID-19四种健康结果(感染、住院、死亡率和疫苗接种状况)之间的关系。结果:在大多数COVID-19结果中,土著人民和少数民族的结果较差。对家庭结构和社会经济因素的调整减少了但没有消除种族群体之间的不平等。住房问题,包括住房流动性高、住房质量差和家庭拥挤,以及残疾状况、没有初级保健登记、家庭收入较低和年龄较大,都与较差的结果有关。这些影响的大小也因不同的健康结果而不同。结论:尽管COVID-19对新西兰人口健康的影响相对较小,但种族不平等现象持续存在,可能部分归因于政策可改变的社会因素。我们还展示了一系列社会经济决定因素如何以不同方式预测COVID-19的结果。
{"title":"Individual, household structure, and socioeconomic predictors of COVID-19 testing and vaccination outcomes: a whole population linked data analysis.","authors":"Nicole Satherley, Andrew Sporle","doi":"10.23889/ijpds.v10i1.2930","DOIUrl":"10.23889/ijpds.v10i1.2930","url":null,"abstract":"<p><strong>Introduction: </strong>The COVID-19 pandemic produced social inequities in health outcomes between and within nations. Reported inequitable COVID-19 outcomes for ethnic minorities and indigenous peoples are likely to be associated in part because of poorer socioeconomic circumstances experienced by these populations. Understanding these associations within national populations is vital for future pandemic management.</p><p><strong>Objective: </strong>This study explores the social inequity of COVID-19 outcomes within New Zealand over the first 3 years of the pandemic. We aimed to identify policy amenable socioeconomic factors associated with COVID-19 outcomes while adjusting for relevant individual factors and household structure. We also aimed to examine whether ethnic group differences are smaller when accounting for these socioeconomic factors and household structure.</p><p><strong>Methods: </strong>Administrative individual-level data for the New Zealand population was analysed to assess COVID-19 health outcomes during 2020 - 2023. The association between individual (e.g. age, ethnicity, disability status), household structure (e.g. household composition) and socioeconomic (e.g. crowding, housing quality, deprivation) factors and four COVID-19 health outcomes - infection, hospitalisation, mortality, and vaccination status was assessed.</p><p><strong>Results: </strong>Indigenous peoples and ethnic minorities experienced worse outcomes across most COVID-19 outcomes. Adjusting for household structure and socioeconomic factors reduced but did not eliminate these inequities between ethnic groups. Housing issues including high housing mobility, poor quality housing, and household crowding were associated with worse outcomes, as were disability status, no primary health care enrolment, lower household income and older age. The size of these effects also differed for different health outcomes.</p><p><strong>Conclusions: </strong>Ethnic inequity was persistent and likely partly explained by policy-modifiable social factors, despite the relatively minor population health impacts of COVID-19 in New Zealand. We also demonstrate how a range of socioeconomic determinants predict COVID-19 outcomes in different ways.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 1","pages":"2930"},"PeriodicalIF":1.6,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12108692/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144162799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Note: Challenges when combining housing data from multiple sources to identify overcrowded households. 数据说明:在结合多个来源的住房数据以确定过度拥挤的家庭时存在挑战。
IF 1.6 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-05-20 eCollection Date: 2023-01-01 DOI: 10.23889/ijpds.v8i2.2927
Laura Scott, Yan Weigang, Marcella Ucci, Jessica Sheringham

Background: This project in one urban local authority in London (England) sought to assess the feasibility of generating locally-derived indices of overcrowding using data available to local councils on the population and their homes.We merged data at household level using the Unique Property Reference Number from publicly available Energy Performance Certificates and commercial property platforms, with data available to councils on the population and their housing characteristics, drawn from multiple sources including council tax bands and council housing databases. Multiple imputation was used to address missing data. Using the dataset, it was possible to generate two indices of overcrowding for households with dependent children, based on the bedroom standard and the space standard, which could be compared with nationally derived estimates.

Data challenges: We encountered three challenges with data. 1. Individuals in the population were excluded through linkage with household-level data. 2. Definitions of overcrowding are ambiguous and variably applied. 3. Many local areas face high proportions of missing household data, particularly numbers of bedrooms. We discuss how we addressed such problems and illustrate with a local example how they could affect estimates of overcrowding prevalence.

Lessons learned: Further clarity is needed in how bedrooms are defined to compare overcrowding prevalence generated locally and nationally. Access to national records on bedroom numbers would facilitate local areas to identify overcrowding in their own populations. Despite these challenges, we demonstrate it is feasible to generate overcrowding indices that can be useful for researchers and local policy makers seeking to develop or evaluate strategies to address household overcrowding.

背景:这个项目在伦敦(英格兰)的一个城市地方当局进行,目的是利用地方议会可获得的关于人口及其住房的数据,评估产生当地过度拥挤指数的可行性。我们使用来自公开的能源绩效证书和商业物业平台的唯一物业参考号码,将家庭层面的数据与议会可获得的人口及其住房特征数据合并,这些数据来自多个来源,包括议会税收等级和议会住房数据库。采用多重插值解决缺失数据。利用该数据集,可以根据卧室标准和空间标准,为有受抚养子女的家庭生成两个过度拥挤指数,这两个指数可以与国家得出的估计值进行比较。数据挑战:我们在数据方面遇到了三个挑战。1. 通过与家庭数据的联系,排除了人口中的个体。2. 过度拥挤的定义是模棱两可的,适用范围也不尽相同。3. 许多地方都面临着大量家庭数据缺失的问题,尤其是卧室数量。我们将讨论如何解决这些问题,并以一个当地的例子说明它们如何影响对过度拥挤流行程度的估计。经验教训:需要进一步明确如何定义卧室,以比较地方和全国产生的过度拥挤现象。获得关于卧室数量的全国记录将有助于当地地区确定其人口过度拥挤的情况。尽管存在这些挑战,我们证明了生成过度拥挤指数是可行的,这些指数可以为研究人员和当地政策制定者寻求制定或评估解决家庭过度拥挤问题的策略提供帮助。
{"title":"<i>Data Note</i>: Challenges when combining housing data from multiple sources to identify overcrowded households.","authors":"Laura Scott, Yan Weigang, Marcella Ucci, Jessica Sheringham","doi":"10.23889/ijpds.v8i2.2927","DOIUrl":"10.23889/ijpds.v8i2.2927","url":null,"abstract":"<p><strong>Background: </strong>This project in one urban local authority in London (England) sought to assess the feasibility of generating locally-derived indices of overcrowding using data available to local councils on the population and their homes.We merged data at household level using the Unique Property Reference Number from publicly available Energy Performance Certificates and commercial property platforms, with data available to councils on the population and their housing characteristics, drawn from multiple sources including council tax bands and council housing databases. Multiple imputation was used to address missing data. Using the dataset, it was possible to generate two indices of overcrowding for households with dependent children, based on the bedroom standard and the space standard, which could be compared with nationally derived estimates.</p><p><strong>Data challenges: </strong>We encountered three challenges with data. 1. Individuals in the population were excluded through linkage with household-level data. 2. Definitions of overcrowding are ambiguous and variably applied. 3. Many local areas face high proportions of missing household data, particularly numbers of bedrooms. We discuss how we addressed such problems and illustrate with a local example how they could affect estimates of overcrowding prevalence.</p><p><strong>Lessons learned: </strong>Further clarity is needed in how bedrooms are defined to compare overcrowding prevalence generated locally and nationally. Access to national records on bedroom numbers would facilitate local areas to identify overcrowding in their own populations. Despite these challenges, we demonstrate it is feasible to generate overcrowding indices that can be useful for researchers and local policy makers seeking to develop or evaluate strategies to address household overcrowding.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"8 2","pages":"2927"},"PeriodicalIF":1.6,"publicationDate":"2025-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12093136/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144119973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Open science and phenotyping in UK administrative health, education and social care data: the ECHILD phenotype code list repository. 开放科学和表型在英国行政卫生,教育和社会保健数据:ECHILD表型代码列表库。
IF 1.6 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-05-13 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i2.2943
Matthew A Jay, Kate Lewis, Difei Shi, Rebecca Langella, Tony Stone, Sorcha Ní Chobhthaigh, Ania Zylbersztejn, Ruth Blackburn, Katie Harron

Administrative health data, such as the Hospital Episode Statistics (HES), can be used to identify groups of people with a particular target condition, a process known as phenotyping. Clinical phenotypes are useful as exposures, covariates and outcomes in research studies using administrative data, including health data linked to other sources such as the Education and Child Health Insights from Linked Data (ECHILD) project. ECHILD brings together HES and other national health datasets with the National Pupil Database and children's social care data for all of England as a data asset that can be accessed by researchers at UK institutions. Because using linked administrative data is complex, the ECHILD team has created additional resources to improve the accessibility of ECHILD. One such initiative is the ECHILD Phenotype Code List Repository. The Repository is a fully open and searchable website containing phenotype code lists that can be used in ECHILD and beyond. As well as a primer on phenotyping, it includes summaries of each code list and R and Stata implementation scripts. The Repository was designed according to a set of principles to ensure that finding and using code lists is easy and standardised. The ECHILD Phenotype Code List Repository is a step forward in the findability and use of phenotype code lists in ECHILD and its constituent datasets.

行政卫生数据,如医院事件统计(HES),可用于识别具有特定目标条件的人群,这一过程称为表型。临床表型在使用管理数据(包括与其他来源相关的健康数据,如来自关联数据的教育和儿童健康洞察(ECHILD)项目)的研究中作为暴露、协变量和结果是有用的。ECHILD将HES和其他国家健康数据集与国家学生数据库和全英格兰儿童社会护理数据作为数据资产汇集在一起,可供英国机构的研究人员访问。由于使用链接的管理数据很复杂,ECHILD团队创建了额外的资源来改善ECHILD的可访问性。其中一个倡议是ECHILD表型代码列表存储库。该资源库是一个完全开放和可搜索的网站,包含表型代码列表,可用于ECHILD和其他。除了对表型的入门,它还包括每个代码列表以及R和Stata实现脚本的摘要。Repository是根据一组原则设计的,以确保查找和使用代码列表是容易和标准化的。ECHILD表型代码列表存储库是在ECHILD及其组成数据集中表型代码列表的可查找性和使用方面向前迈出的一步。
{"title":"Open science and phenotyping in UK administrative health, education and social care data: the ECHILD phenotype code list repository.","authors":"Matthew A Jay, Kate Lewis, Difei Shi, Rebecca Langella, Tony Stone, Sorcha Ní Chobhthaigh, Ania Zylbersztejn, Ruth Blackburn, Katie Harron","doi":"10.23889/ijpds.v10i2.2943","DOIUrl":"https://doi.org/10.23889/ijpds.v10i2.2943","url":null,"abstract":"<p><p>Administrative health data, such as the Hospital Episode Statistics (HES), can be used to identify groups of people with a particular target condition, a process known as phenotyping. Clinical phenotypes are useful as exposures, covariates and outcomes in research studies using administrative data, including health data linked to other sources such as the Education and Child Health Insights from Linked Data (ECHILD) project. ECHILD brings together HES and other national health datasets with the National Pupil Database and children's social care data for all of England as a data asset that can be accessed by researchers at UK institutions. Because using linked administrative data is complex, the ECHILD team has created additional resources to improve the accessibility of ECHILD. One such initiative is the ECHILD Phenotype Code List Repository. The Repository is a fully open and searchable website containing phenotype code lists that can be used in ECHILD and beyond. As well as a primer on phenotyping, it includes summaries of each code list and R and Stata implementation scripts. The Repository was designed according to a set of principles to ensure that finding and using code lists is easy and standardised. The ECHILD Phenotype Code List Repository is a step forward in the findability and use of phenotype code lists in ECHILD and its constituent datasets.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 2","pages":"2943"},"PeriodicalIF":1.6,"publicationDate":"2025-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12076273/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144079932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatio-temporal forecasting of COVID-19 cases in the Netherlands for source and contact tracing. 荷兰COVID-19病例的时空预测及其来源和接触者追踪
IF 1.6 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-05-07 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i1.2703
Max C Keuken, Jizzo R Bosdriesz, Anders Boyd, Elisabeth M den Boogert, Ivo K Joore, Nicole H T M Dukers-Muijrers, Gini van Rijckevorsel, Hannelore M Götz, Irene E Goverse, Mariska W F Petrignani, Stijn F H Raven, Susan van den Hof, Kirsten V C Wevers-de Boer, Maarten F Schim van der Loeff, Amy Matser

Source and contact tracing (SCT) is a core public health measure that is used to contain the spread of infectious diseases. It aims to identify a source of infection, and to advise those who have been exposed to this source. Due to the rapid increases in incidence of COVID-19 in the Netherlands, the capacity to conduct a full SCT quickly became insufficient. Therefore, the public health services (PHS) might benefit from a restricted strategy targeted to geographical regions where (predicted) case-to-case transmission is high. In this study, we set out to develop a prediction model for the number of COVID-19 cases per postal code within the Netherlands using geographic and demographic features. The study population consists of individuals residing in one of the participating nine Dutch PHS regions who tested positive for SARS-CoV-2 between 1 June 2020 and 27 February 2021. Using a machine learning random forest regression model, we predicted the top 100 postal codes with the highest number of cases with an accuracy of 49% for the current week, 42% for next week, and 44% for two weeks from present. In addition, the age groups of 20-39 and 40-64 years had a higher prediction accuracy than groups outside these age ranges. The developed model provides a starting point for targeted preventive SCT efforts that incorporate geospatial and demographic characteristics of a neighbourhood. It should nonetheless be noted that during the early stages of the outbreak, the number of available datapoints needed to inform such models are likely insufficient. Given the accuracy and data requirements of the developed model, it is unlikely that this class of models can play a pivotal role in informing policy during the early phases of a future epidemic.

传染源和接触者追踪(SCT)是一项用于控制传染病传播的核心公共卫生措施。其目的是确定传染源,并向接触过传染源的人提供建议。由于2019冠状病毒病在荷兰的发病率迅速增加,进行全面SCT的能力很快就变得不足。因此,公共卫生服务可能受益于一项针对(预计)病例间传播率高的地理区域的有限战略。在本研究中,我们利用地理和人口特征开发了一个预测模型,用于预测荷兰境内每个邮政编码的COVID-19病例数。研究人群包括居住在参与的九个荷兰小灵通地区之一的个人,他们在2020年6月1日至2021年2月27日期间对SARS-CoV-2检测呈阳性。使用机器学习随机森林回归模型,我们预测了案例数量最多的前100个邮政编码,本周的准确率为49%,下周的准确率为42%,两周后的准确率为44%。此外,20-39岁和40-64岁年龄组的预测准确率高于其他年龄组。开发的模型为结合社区地理空间和人口特征的有针对性的预防性SCT工作提供了起点。然而,应当指出,在疫情爆发的早期阶段,为这种模型提供信息所需的现有数据点数量可能不足。鉴于已开发模型的准确性和数据要求,这类模型不太可能在未来流行病的早期阶段为政策提供信息方面发挥关键作用。
{"title":"Spatio-temporal forecasting of COVID-19 cases in the Netherlands for source and contact tracing.","authors":"Max C Keuken, Jizzo R Bosdriesz, Anders Boyd, Elisabeth M den Boogert, Ivo K Joore, Nicole H T M Dukers-Muijrers, Gini van Rijckevorsel, Hannelore M Götz, Irene E Goverse, Mariska W F Petrignani, Stijn F H Raven, Susan van den Hof, Kirsten V C Wevers-de Boer, Maarten F Schim van der Loeff, Amy Matser","doi":"10.23889/ijpds.v10i1.2703","DOIUrl":"https://doi.org/10.23889/ijpds.v10i1.2703","url":null,"abstract":"<p><p>Source and contact tracing (SCT) is a core public health measure that is used to contain the spread of infectious diseases. It aims to identify a source of infection, and to advise those who have been exposed to this source. Due to the rapid increases in incidence of COVID-19 in the Netherlands, the capacity to conduct a full SCT quickly became insufficient. Therefore, the public health services (PHS) might benefit from a restricted strategy targeted to geographical regions where (predicted) case-to-case transmission is high. In this study, we set out to develop a prediction model for the number of COVID-19 cases per postal code within the Netherlands using geographic and demographic features. The study population consists of individuals residing in one of the participating nine Dutch PHS regions who tested positive for SARS-CoV-2 between 1 June 2020 and 27 February 2021. Using a machine learning random forest regression model, we predicted the top 100 postal codes with the highest number of cases with an accuracy of 49% for the current week, 42% for next week, and 44% for two weeks from present. In addition, the age groups of 20-39 and 40-64 years had a higher prediction accuracy than groups outside these age ranges. The developed model provides a starting point for targeted preventive SCT efforts that incorporate geospatial and demographic characteristics of a neighbourhood. It should nonetheless be noted that during the early stages of the outbreak, the number of available datapoints needed to inform such models are likely insufficient. Given the accuracy and data requirements of the developed model, it is unlikely that this class of models can play a pivotal role in informing policy during the early phases of a future epidemic.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 1","pages":"2703"},"PeriodicalIF":1.6,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12058245/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144040266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discovering linked data collections through a new national metadata platform. 通过新的国家元数据平台发现关联的数据集合。
IF 1.6 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-04-30 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i1.2461
Kate M Miller, Felicity S Flack, Merran B Smith, Vicki Bennett, Carina Ecremen Marshall

Background: Metadata plays a crucial role in the health research infrastructure ecosystem. Despite the abundance of metadata for data collections in Australia, the vast and diverse data custodian landscape poses challenges for linked data researchers to find relevant information for multiple data collections, often making it an arduous and time-intensive task.

Methods: The project comprised three phases: an initial scoping exercise to understand the current state of metadata and related best practice; a national consultation involving researchers, data linkage staff and data custodians to develop a high-fidelity prototype of a metadata platform; and a final build and implementation phase. The platform underwent several prototyping and testing cycles to refine the digital experience.

Results: Expert interviews confirmed that there is a wealth of metadata available, but it is difficult for researchers to access and evaluate. Consultations with researchers identified opportunities to standardise metadata across collections and provide a centralised platform to enhance the discoverability of data collections for research using linked data. High value platform features included searching, browsing and filtering capabilities, data item list metadata, standardised formats, sample data, and frequently asked questions. The final design and functionality reflected user consultations and data custodian input on feasibility.

Conclusion: The Population Health Research Network developed a metadata platform to enable researchers to evaluate the suitability of Australian data collections for linked data projects more effectively. The platform has standardised the way in which metadata is presented for data collections nationally. Improved metadata quality, readability and accessibility will save time and enhance the quality of applications for linked data.

背景:元数据在卫生研究基础设施生态系统中起着至关重要的作用。尽管澳大利亚的数据收集有丰富的元数据,但庞大而多样的数据托管环境给关联数据研究人员寻找多个数据收集的相关信息带来了挑战,这往往使其成为一项艰巨而耗时的任务。方法:该项目包括三个阶段:最初的范围界定工作,以了解元数据的当前状态和相关的最佳实践;由研究人员、数据联系工作人员和数据保管人参与的全国协商,以开发元数据平台的高保真原型;最后的构建和实现阶段。该平台经历了几个原型和测试周期,以完善数字体验。结果:专家访谈证实,有丰富的元数据可用,但研究人员难以访问和评估。与研究人员的磋商确定了跨集合的元数据标准化的机会,并提供了一个集中的平台,以增强使用关联数据进行研究的数据集合的可发现性。高价值的平台特性包括搜索、浏览和过滤功能、数据项列表元数据、标准化格式、样本数据和常见问题。最终的设计和功能反映了用户咨询和数据管理员对可行性的输入。结论:人口健康研究网络开发了一个元数据平台,使研究人员能够更有效地评估澳大利亚数据收集对关联数据项目的适用性。该平台标准化了为全国数据收集提供元数据的方式。改进的元数据质量、可读性和可访问性将节省时间并提高链接数据应用程序的质量。
{"title":"Discovering linked data collections through a new national metadata platform.","authors":"Kate M Miller, Felicity S Flack, Merran B Smith, Vicki Bennett, Carina Ecremen Marshall","doi":"10.23889/ijpds.v10i1.2461","DOIUrl":"https://doi.org/10.23889/ijpds.v10i1.2461","url":null,"abstract":"<p><strong>Background: </strong>Metadata plays a crucial role in the health research infrastructure ecosystem. Despite the abundance of metadata for data collections in Australia, the vast and diverse data custodian landscape poses challenges for linked data researchers to find relevant information for multiple data collections, often making it an arduous and time-intensive task.</p><p><strong>Methods: </strong>The project comprised three phases: an initial scoping exercise to understand the current state of metadata and related best practice; a national consultation involving researchers, data linkage staff and data custodians to develop a high-fidelity prototype of a metadata platform; and a final build and implementation phase. The platform underwent several prototyping and testing cycles to refine the digital experience.</p><p><strong>Results: </strong>Expert interviews confirmed that there is a wealth of metadata available, but it is difficult for researchers to access and evaluate. Consultations with researchers identified opportunities to standardise metadata across collections and provide a centralised platform to enhance the discoverability of data collections for research using linked data. High value platform features included searching, browsing and filtering capabilities, data item list metadata, standardised formats, sample data, and frequently asked questions. The final design and functionality reflected user consultations and data custodian input on feasibility.</p><p><strong>Conclusion: </strong>The Population Health Research Network developed a metadata platform to enable researchers to evaluate the suitability of Australian data collections for linked data projects more effectively. The platform has standardised the way in which metadata is presented for data collections nationally. Improved metadata quality, readability and accessibility will save time and enhance the quality of applications for linked data.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 1","pages":"2461"},"PeriodicalIF":1.6,"publicationDate":"2025-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12042732/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144020126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transparency in the existence, use, and output of a mental health data resource: a descriptive paper from the National Institute for Health and Care Research (NIHR) Maudsley Biomedical Research Centre (BRC) Clinical Record Interactive Search (CRIS) Platform. 心理健康数据资源存在、使用和输出的透明度:来自国家卫生与保健研究所(NIHR)莫兹利生物医学研究中心(BRC)临床记录互动搜索(CRIS)平台的一篇描述性论文。
IF 1.6 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-04-10 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i2.2945
Amelia Jewell, Matthew Broadbent, Claire Delaney-Pope, Megan Pritchard, Hannah Woods, Robert Stewart

Background: Transparency in the use of routinely collected mental health data for research is essential in maintaining public support and trust, as well as for supporting the sharing of information and data resources amongst the academic community. The National Institute for Health and Care Research (NIHR) Maudsley Biomedical Research Centre (BRC) Clinical Records Interactive Search (CRIS) enables a case register of deidentified mental health records from the South London and Maudsley NHS Foundation Trust (SLaM). CRIS supports mental health research across the lifespan from children and adolescents to older adults.

Aim: This paper aims to describe the activities which contribute to ensuring that transparency is maintained throughout the journey of data in CRIS: from data collection, through application in research, to dissemination of findings.

Approach: A communications plan is in place to support Patient and Public Involvement (PPI) and transparency initiatives for all CRIS stakeholders, including patients and carers, academic users, and the general public. Activities can be divided into three categories of transparency: existence, use, and output.

Discussion: There are challenges to maintaining transparency, including ensuring that activities are varied enough to reach all stakeholders, including harder to reach groups, and presenting information in a way that is appropriate for the relevant audience. However, greater transparency has led to more opportunities for researchers to engage with patients and the CRIS model is widely accepted by patients.

Conclusion: This paper set out to describe CRIS communications and transparency activities. We believe the material covered will be of interest to other providers of routinely collected data for research.

背景:为了维持公众的支持和信任,以及为了支持学术界之间的信息和数据资源共享,在使用常规收集的精神卫生数据进行研究方面保持透明度至关重要。国家健康与护理研究所(NIHR)莫兹利生物医学研究中心(BRC)临床记录互动搜索(CRIS)使来自南伦敦和莫兹利NHS基金会信托基金(SLaM)的未识别精神健康记录的病例登记册成为可能。CRIS支持从儿童、青少年到老年人的整个生命周期的心理健康研究。目的:本文旨在描述有助于确保在CRIS数据的整个过程中保持透明度的活动:从数据收集,通过研究应用,到发现的传播。方法:制定了一项沟通计划,以支持所有CRIS利益相关者(包括患者和护理人员、学术用户和公众)的患者和公众参与(PPI)和透明度倡议。活动的透明度可分为三类:存在、使用和输出。讨论:保持透明度存在挑战,包括确保活动的多样性足以覆盖所有利益相关者,包括更难覆盖的群体,以及以适合相关受众的方式呈现信息。然而,更大的透明度为研究人员提供了更多与患者接触的机会,CRIS模型被患者广泛接受。结论:本文旨在描述CRIS的沟通和透明度活动。我们相信所涵盖的材料将对其他常规收集研究数据的提供者感兴趣。
{"title":"Transparency in the existence, use, and output of a mental health data resource: a descriptive paper from the National Institute for Health and Care Research (NIHR) Maudsley Biomedical Research Centre (BRC) Clinical Record Interactive Search (CRIS) Platform.","authors":"Amelia Jewell, Matthew Broadbent, Claire Delaney-Pope, Megan Pritchard, Hannah Woods, Robert Stewart","doi":"10.23889/ijpds.v10i2.2945","DOIUrl":"https://doi.org/10.23889/ijpds.v10i2.2945","url":null,"abstract":"<p><strong>Background: </strong>Transparency in the use of routinely collected mental health data for research is essential in maintaining public support and trust, as well as for supporting the sharing of information and data resources amongst the academic community. The National Institute for Health and Care Research (NIHR) Maudsley Biomedical Research Centre (BRC) Clinical Records Interactive Search (CRIS) enables a case register of deidentified mental health records from the South London and Maudsley NHS Foundation Trust (SLaM). CRIS supports mental health research across the lifespan from children and adolescents to older adults.</p><p><strong>Aim: </strong>This paper aims to describe the activities which contribute to ensuring that transparency is maintained throughout the journey of data in CRIS: from data collection, through application in research, to dissemination of findings.</p><p><strong>Approach: </strong>A communications plan is in place to support Patient and Public Involvement (PPI) and transparency initiatives for all CRIS stakeholders, including patients and carers, academic users, and the general public. Activities can be divided into three categories of transparency: existence, use, and output.</p><p><strong>Discussion: </strong>There are challenges to maintaining transparency, including ensuring that activities are varied enough to reach all stakeholders, including harder to reach groups, and presenting information in a way that is appropriate for the relevant audience. However, greater transparency has led to more opportunities for researchers to engage with patients and the CRIS model is widely accepted by patients.</p><p><strong>Conclusion: </strong>This paper set out to describe CRIS communications and transparency activities. We believe the material covered will be of interest to other providers of routinely collected data for research.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 1","pages":"2945"},"PeriodicalIF":1.6,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12076277/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144079829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data resource profile: a guide for constructing school-to-work sequence analysis trajectories using the longitudinal education outcomes (LEO) data. 数据资源概况:使用纵向教育成果(LEO)数据构建学校到工作序列分析轨迹的指南。
IF 1.6 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-03-25 eCollection Date: 2023-01-01 DOI: 10.23889/ijpds.v8i6.2953
Shivani Sickotra

Introduction: Sequence analysis is a powerful methodology for examining longitudinal school-to-work trajectories. Despite its growing use, there is limited guidance on preparing suitable datasets. This resource details the creation of a dataset specifically designed for sequence analysis, capturing yearly education and employment activity states for 556,182 individuals from England's 2010/11 school-leaver cohort.

Methods: The dataset was constructed using the Department for Education's Longitudinal Education Outcomes (LEO) data. SQL was used to extract relevant variables, and data linkage and preprocessing was performed using R. Data processing was tailored to sequence analysis, including reducing the number of activity states and applying a hierarchy to integrate education and employment data.

Results: The resulting dataset spans activities from the first non-compulsory state in 2011/12 until 2018/19, tracking trajectories from ages 16/17 to 23/24. The dataset was designed with the ability to subset school-leavers by their initial Combined Authority residence to aid in regional analysis of school-to-work trajectories. Individual-level socio-demographic characteristics that can be linked to the longitudinal activity histories were also built, alongside longitudinal geographic locations and employment earnings data. Additionally, the limitations of the developed data are discussed.

Conclusion: This resource provides crucial guidance for researchers and practitioners who may require experience preparing input datasets for sequence analysis, addressing the current gap in available resources. By offering step-by-step instructions and shared code, it empowers users to recreate or adapt the dataset for their specific research needs. Its ability to subset by region further supports localised and comparative studies of school-to-work trajectories, making it a valuable tool for advancing existing research. The LEO data can be accessed by application through the Office for National Statistics Secure Research Service.

简介序列分析是研究从学校到工作的纵向轨迹的有力方法。尽管其应用日益广泛,但关于如何准备合适数据集的指导却很有限。本资料详细介绍了如何创建一个专门用于序列分析的数据集,该数据集记录了英格兰 2010/11 年离校学生群体中 556,182 人的年度教育和就业活动状态:该数据集是利用教育部的纵向教育成果(LEO)数据构建的。数据处理是为序列分析量身定制的,包括减少活动状态的数量以及应用层次结构整合教育和就业数据:由此产生的数据集跨越了从 2011/12 年第一个非义务教育状态到 2018/19 年的活动,追踪了从 16/17 岁到 23/24 岁的轨迹。数据集的设计能够根据离校者最初的联合行政区居住地对其进行子集,以帮助对从学校到工作的轨迹进行区域分析。除了纵向地理位置和就业收入数据外,还建立了可与纵向活动历史相联系的个人社会人口特征。此外,还讨论了所开发数据的局限性:本资料为需要为序列分析准备输入数据集经验的研究人员和从业人员提供了重要指导,解决了当前可用资源不足的问题。通过提供分步指导和共享代码,它使用户能够重新创建或调整数据集,以满足其特定的研究需求。它还能按地区进行子集分析,进一步支持对从学校到工作的轨迹进行本地化比较研究,使其成为推进现有研究的重要工具。LEO 数据可通过国家统计局安全研究服务处申请获取。
{"title":"Data resource profile: a guide for constructing school-to-work sequence analysis trajectories using the longitudinal education outcomes (LEO) data.","authors":"Shivani Sickotra","doi":"10.23889/ijpds.v8i6.2953","DOIUrl":"10.23889/ijpds.v8i6.2953","url":null,"abstract":"<p><strong>Introduction: </strong>Sequence analysis is a powerful methodology for examining longitudinal school-to-work trajectories. Despite its growing use, there is limited guidance on preparing suitable datasets. This resource details the creation of a dataset specifically designed for sequence analysis, capturing yearly education and employment activity states for 556,182 individuals from England's 2010/11 school-leaver cohort.</p><p><strong>Methods: </strong>The dataset was constructed using the Department for Education's Longitudinal Education Outcomes (LEO) data. SQL was used to extract relevant variables, and data linkage and preprocessing was performed using R. Data processing was tailored to sequence analysis, including reducing the number of activity states and applying a hierarchy to integrate education and employment data.</p><p><strong>Results: </strong>The resulting dataset spans activities from the first non-compulsory state in 2011/12 until 2018/19, tracking trajectories from ages 16/17 to 23/24. The dataset was designed with the ability to subset school-leavers by their initial Combined Authority residence to aid in regional analysis of school-to-work trajectories. Individual-level socio-demographic characteristics that can be linked to the longitudinal activity histories were also built, alongside longitudinal geographic locations and employment earnings data. Additionally, the limitations of the developed data are discussed.</p><p><strong>Conclusion: </strong>This resource provides crucial guidance for researchers and practitioners who may require experience preparing input datasets for sequence analysis, addressing the current gap in available resources. By offering step-by-step instructions and shared code, it empowers users to recreate or adapt the dataset for their specific research needs. Its ability to subset by region further supports localised and comparative studies of school-to-work trajectories, making it a valuable tool for advancing existing research. The LEO data can be accessed by application through the Office for National Statistics Secure Research Service.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"8 6","pages":"2953"},"PeriodicalIF":1.6,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11935648/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143711548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Population Data Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1