首页 > 最新文献

International Journal for Population Data Science最新文献

英文 中文
Using novel data linkage of biobank data with administrative health data to inform genomic analysis for future precision medicine treatment of congenital heart disease 利用生物库数据与行政卫生数据的新型数据链接,为未来先天性心脏病的精准医学治疗提供基因组分析信息
Pub Date : 2023-11-14 DOI: 10.23889/ijpds.v8i1.2150
Samantha J. Lain, Gillian Blue, Bridget O’Malley, David Winlaw, Gary Sholler, Sally Dunwoodie, Natasha Nassar, None The Congenital Heart Disease Synergy Study group
IntroductionContemporary care of congenital heart disease (CHD) is largely standardised, however there is heterogeneity in post-surgical outcomes that may be explained by genetic variation. Data linkage between a CHD biobank and routinely collected administrative datasets is a novel method to identify outcomes to explore the impact of genetic variation. ObjectiveUse data linkage to identify and validate patient outcomes following surgical treatment for CHD. MethodsData linkage between clinical and biobank data of children born from 2001-2014 that had a procedure for CHD in New South Wales, Australia, with hospital discharge data, education and death data. The children were grouped according to CHD lesion type and age at first cardiac surgery. Children in each `lesion/age at surgery group' were classified into 'favourable' and 'unfavourable' cardiovascular outcome groups based on variables identified in linked administrative data including; total time in intensive care, total length of stay in hospital, and mechanical ventilation time up to 5 years following the date of the first cardiac surgery. A blind medical record audit of 200 randomly chosen children from 'favourable' and 'unfavourable' outcome groups was performed to validate the outcome groups. ResultsOf the 1872 children in the dataset that linked to hospital or death data, 483 were identified with a `favourable' cardiovascular outcome and 484 were identified as having a 'unfavourable' cardiovascular outcome. The medical record audit found concordant outcome groups for 182/192 records (95%) compared to the outcome groups categorized using the linked data. ConclusionsThe linkage of a curated biobank dataset with routinely collected administrative data is a reliable method to identify outcomes to facilitate a large-scale study to examine genetic variance. These genetic hallmarks could be used to identify patients who are at risk of unfavourable cardiovascular outcomes, to inform strategies for prevention and changes in clinical care.
当代先天性心脏病(CHD)的治疗在很大程度上是标准化的,但术后结果存在异质性,这可能与遗传变异有关。冠心病生物库与常规收集的管理数据集之间的数据链接是一种识别结果以探索遗传变异影响的新方法。目的利用数据链接来识别和验证冠心病手术治疗后患者的预后。方法将澳大利亚新南威尔士州2001-2014年出生的接受过冠心病手术的儿童的临床数据与生物银行数据与出院数据、教育数据和死亡数据联系起来。根据患儿冠心病病变类型和首次心脏手术年龄进行分组。每个“病变/手术年龄组”的儿童根据相关管理数据中确定的变量分为“有利”和“不利”心血管结果组,包括;重症监护总时间、住院总时间和首次心脏手术后最长5年的机械通气时间。对从“有利”和“不利”结果组中随机选择的200名儿童进行盲检,以验证结果组。结果在与医院或死亡数据相关的数据集中的1872名儿童中,483名被确定为心血管结果“有利”,484名被确定为心血管结果“不利”。与使用关联数据分类的结果组相比,医疗记录审计发现182/192个记录(95%)的结果组一致。将精心整理的生物库数据集与常规收集的管理数据相关联是一种可靠的方法,可以识别结果,从而促进大规模研究以检查遗传变异。这些遗传标记可用于识别有不良心血管结果风险的患者,为预防策略和临床护理改变提供信息。
{"title":"Using novel data linkage of biobank data with administrative health data to inform genomic analysis for future precision medicine treatment of congenital heart disease","authors":"Samantha J. Lain, Gillian Blue, Bridget O’Malley, David Winlaw, Gary Sholler, Sally Dunwoodie, Natasha Nassar, None The Congenital Heart Disease Synergy Study group","doi":"10.23889/ijpds.v8i1.2150","DOIUrl":"https://doi.org/10.23889/ijpds.v8i1.2150","url":null,"abstract":"IntroductionContemporary care of congenital heart disease (CHD) is largely standardised, however there is heterogeneity in post-surgical outcomes that may be explained by genetic variation. Data linkage between a CHD biobank and routinely collected administrative datasets is a novel method to identify outcomes to explore the impact of genetic variation. ObjectiveUse data linkage to identify and validate patient outcomes following surgical treatment for CHD. MethodsData linkage between clinical and biobank data of children born from 2001-2014 that had a procedure for CHD in New South Wales, Australia, with hospital discharge data, education and death data. The children were grouped according to CHD lesion type and age at first cardiac surgery. Children in each `lesion/age at surgery group' were classified into 'favourable' and 'unfavourable' cardiovascular outcome groups based on variables identified in linked administrative data including; total time in intensive care, total length of stay in hospital, and mechanical ventilation time up to 5 years following the date of the first cardiac surgery. A blind medical record audit of 200 randomly chosen children from 'favourable' and 'unfavourable' outcome groups was performed to validate the outcome groups. ResultsOf the 1872 children in the dataset that linked to hospital or death data, 483 were identified with a `favourable' cardiovascular outcome and 484 were identified as having a 'unfavourable' cardiovascular outcome. The medical record audit found concordant outcome groups for 182/192 records (95%) compared to the outcome groups categorized using the linked data. ConclusionsThe linkage of a curated biobank dataset with routinely collected administrative data is a reliable method to identify outcomes to facilitate a large-scale study to examine genetic variance. These genetic hallmarks could be used to identify patients who are at risk of unfavourable cardiovascular outcomes, to inform strategies for prevention and changes in clinical care.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"58 9","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134902619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Common governance model: a way to avoid data segregation between existing trusted research environment 公共治理模型:一种避免现有可信研究环境之间数据隔离的方法
Pub Date : 2023-11-09 DOI: 10.23889/ijpds.v8i4.2164
Fatemeh Torabi, Chris Orton, Emma Squires, Sharon Heys, Richard Hier, Ronan A. Lyons, Simon Thompson
BackgroundTrusted Research Environments provide a legitimate basis for data access along with a set of technologies to support implementation of the "five-safes" framework for privacy protection. Lack of standard approaches in achieving compliance with the "five-safes" framework results in a diversity of approaches across different TREs. Data access and analysis across multiple TREs has a range of benefits including improved precision of analysis due to larger sample sizes and broader availability of out-of-sample records, particularly in the study of rare conditions. Knowledge of governance approaches used across UK-TREs is limited. ObjectiveTo document key governance features in major UK-TRE contributing to UK wide analysis and to identify elements that would directly facilitate multi TRE collaborations and federated analysis in future. MethodWe summarised three main characteristics across 15 major UK-based TREs: 1) data access environment; 2) data access requests and disclosure control procedures; and 3) governance models. We undertook case studies of collaborative analyses conducted in more than one TRE. We identified an array of TREs operating on an equivalent level of governance. We further identify commonly governed TREs with architectural considerations for achieving an equivalent level of information security management system standards to facilitate multi TRE functionality and federated analytics. ResultsAll 15 UK-TREs allow pooling and analysis of aggregated research outputs only when they have passed human-operated disclosure control checks. Data access requests procedures are unique to each TRE. We also observed a variability in disclosure control procedures across various TREs with no or minimal researcher guidance on best practices for file out request procedures. In 2023, six TREs (40.0%) held ISO 20071 accreditation, while 9 TREs (56.2%) participated in four-nation analyses. ConclusionSecure analysis of individual-level data from multiple TREs is possible through existing technical solutions but requires development of a well-established governance framework meeting all stakeholder requirements and addressing public and patient concerns. Formation of a standard model could act as the catalyst for evolution of current TREs governance models to a multi TRE ecosystem within the UK and beyond.
可信的研究环境为数据访问提供了合法的基础,同时还提供了一组技术来支持实施隐私保护的“五个一”框架。在遵守“五个安全”框架方面缺乏标准方法,导致不同的安全区域采用不同的方法。跨多个TREs的数据访问和分析具有一系列好处,包括由于样本量更大而提高分析精度和更广泛的样本外记录可用性,特别是在罕见情况的研究中。英国tres使用的治理方法的知识是有限的。目的记录主要英国数据交换系统的关键治理特征,为英国范围内的分析做出贡献,并确定将直接促进未来多数据交换系统合作和联合分析的要素。方法总结了英国15个主要TREs的三个主要特征:1)数据访问环境;2)数据访问请求和披露控制程序;3)治理模型。我们进行了在多个TRE中进行的协作分析的案例研究。我们确定了一系列在同等治理水平上运行的TREs。我们进一步通过体系结构考虑来确定通常治理的TRE,以实现同等级别的信息安全管理系统标准,从而促进多TRE功能和联合分析。结果所有15个UK-TREs只有在通过人工操作的披露控制检查时才允许汇总和分析汇总的研究成果。查阅资料要求的程序对每个数据交换中心都是独一无二的。我们还观察到,在没有或只有很少的研究人员指导提交请求程序最佳实践的情况下,不同TREs的披露控制程序存在差异。2023年,6家TREs(40.0%)通过了ISO 20071认证,9家TREs(56.2%)参加了四国分析。通过现有的技术解决方案,可以安全地分析来自多个TREs的个人层面数据,但需要开发一个完善的治理框架,满足所有利益相关者的要求,并解决公众和患者的关切。标准模型的形成可以作为催化剂,推动当前TREs治理模式向英国内外的多TREs生态系统发展。
{"title":"Common governance model: a way to avoid data segregation between existing trusted research environment","authors":"Fatemeh Torabi, Chris Orton, Emma Squires, Sharon Heys, Richard Hier, Ronan A. Lyons, Simon Thompson","doi":"10.23889/ijpds.v8i4.2164","DOIUrl":"https://doi.org/10.23889/ijpds.v8i4.2164","url":null,"abstract":"BackgroundTrusted Research Environments provide a legitimate basis for data access along with a set of technologies to support implementation of the \"five-safes\" framework for privacy protection. Lack of standard approaches in achieving compliance with the \"five-safes\" framework results in a diversity of approaches across different TREs. Data access and analysis across multiple TREs has a range of benefits including improved precision of analysis due to larger sample sizes and broader availability of out-of-sample records, particularly in the study of rare conditions. Knowledge of governance approaches used across UK-TREs is limited. ObjectiveTo document key governance features in major UK-TRE contributing to UK wide analysis and to identify elements that would directly facilitate multi TRE collaborations and federated analysis in future. MethodWe summarised three main characteristics across 15 major UK-based TREs: 1) data access environment; 2) data access requests and disclosure control procedures; and 3) governance models. We undertook case studies of collaborative analyses conducted in more than one TRE. We identified an array of TREs operating on an equivalent level of governance. We further identify commonly governed TREs with architectural considerations for achieving an equivalent level of information security management system standards to facilitate multi TRE functionality and federated analytics. ResultsAll 15 UK-TREs allow pooling and analysis of aggregated research outputs only when they have passed human-operated disclosure control checks. Data access requests procedures are unique to each TRE. We also observed a variability in disclosure control procedures across various TREs with no or minimal researcher guidance on best practices for file out request procedures. In 2023, six TREs (40.0%) held ISO 20071 accreditation, while 9 TREs (56.2%) participated in four-nation analyses. ConclusionSecure analysis of individual-level data from multiple TREs is possible through existing technical solutions but requires development of a well-established governance framework meeting all stakeholder requirements and addressing public and patient concerns. Formation of a standard model could act as the catalyst for evolution of current TREs governance models to a multi TRE ecosystem within the UK and beyond.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":" 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135242225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Federated learning for generating synthetic data: a scoping review 用于生成合成数据的联邦学习:范围审查
Pub Date : 2023-10-31 DOI: 10.23889/ijpds.v8i1.2158
Claire Little, Mark Elliot, Richard Allmendinger
IntroductionFederated Learning (FL) is a decentralised approach to training statistical models, where training is performed across multiple clients, producing one global model. Since the training data remains with each local client and is not shared or exchanged with other clients the use of FL may reduce privacy and security risks (compared to methods where multiple data sources are pooled) and can also address data access and heterogeneity problems. Synthetic data is artificially generated data that has the same structure and statistical properties as the original but that does not contain any of the original data records, therefore minimising disclosure risk. Using FL to produce synthetic data (which we refer to as "federated synthesis") has the potential to combine data from multiple clients without compromising privacy, allowing access to data that may otherwise be inaccessible in its raw format. ObjectivesThe objective was to review current research and practices for using FL to generate synthetic data and determine the extent to which research has been undertaken, the methods and evaluation practices used, and any research gaps. MethodsA scoping review was conducted to systematically map and describe the published literature on the use of FL to generate synthetic data. Relevant studies were identified through online databases and the findings are described, grouped, and summarised. Information extracted included article characteristics, documenting the type of data that is synthesised, the model architecture and the methods (if any) used to evaluate utility and privacy risk. ResultsA total of 69 articles were included in the scoping review; all were published between 2018 and 2023 with two thirds (46) in 2022. 30% (21) were focussed on synthetic data generation as the main model output (with 6 of these generating tabular data), whereas 59% (41) focussed on data augmentation. Of the 21 performing federated synthesis, all used deep learning methods (predominantly Generative Adversarial Networks) to generate the synthetic data. ConclusionsFederated synthesis is in its early days but shows promise as a method that can construct a global synthetic dataset without sharing any of the local client data. As a field in its infancy there are areas to explore in terms of the privacy risk associated with the various methods proposed, and more generally in how we measure those risks.
联邦学习(FL)是一种训练统计模型的分散方法,其中跨多个客户端执行训练,生成一个全局模型。由于训练数据保留在每个本地客户端,不与其他客户端共享或交换,因此使用FL可以减少隐私和安全风险(与多个数据源池的方法相比),还可以解决数据访问和异构问题。合成数据是人工生成的数据,具有与原始数据相同的结构和统计属性,但不包含任何原始数据记录,因此将披露风险降至最低。使用FL生成合成数据(我们称之为“联邦合成”)有可能在不损害隐私的情况下组合来自多个客户机的数据,从而允许访问以原始格式无法访问的数据。目的回顾目前使用FL生成合成数据的研究和实践,并确定已开展的研究程度、使用的方法和评估实践以及任何研究空白。方法对已发表的有关FL应用的文献进行系统的检索和描述,生成综合数据。通过在线数据库确定相关研究,并对研究结果进行描述、分组和总结。提取的信息包括文章特征、合成数据类型的文档、模型架构和用于评估效用和隐私风险的方法(如果有的话)。结果共纳入69篇文献;全部出版于2018年至2023年之间,其中三分之二(46本)出版于2022年。30%(21)专注于合成数据生成作为主要模型输出(其中6个生成表格数据),而59%(41)专注于数据增强。在21个执行联邦合成的系统中,所有系统都使用深度学习方法(主要是生成对抗网络)来生成合成数据。联邦合成还处于早期阶段,但作为一种可以构建全局合成数据集而不共享任何本地客户端数据的方法,它显示出了前景。作为一个处于起步阶段的领域,在与所提出的各种方法相关的隐私风险方面,以及我们如何衡量这些风险方面,还有很多领域需要探索。
{"title":"Federated learning for generating synthetic data: a scoping review","authors":"Claire Little, Mark Elliot, Richard Allmendinger","doi":"10.23889/ijpds.v8i1.2158","DOIUrl":"https://doi.org/10.23889/ijpds.v8i1.2158","url":null,"abstract":"IntroductionFederated Learning (FL) is a decentralised approach to training statistical models, where training is performed across multiple clients, producing one global model. Since the training data remains with each local client and is not shared or exchanged with other clients the use of FL may reduce privacy and security risks (compared to methods where multiple data sources are pooled) and can also address data access and heterogeneity problems. Synthetic data is artificially generated data that has the same structure and statistical properties as the original but that does not contain any of the original data records, therefore minimising disclosure risk. Using FL to produce synthetic data (which we refer to as \"federated synthesis\") has the potential to combine data from multiple clients without compromising privacy, allowing access to data that may otherwise be inaccessible in its raw format. ObjectivesThe objective was to review current research and practices for using FL to generate synthetic data and determine the extent to which research has been undertaken, the methods and evaluation practices used, and any research gaps. MethodsA scoping review was conducted to systematically map and describe the published literature on the use of FL to generate synthetic data. Relevant studies were identified through online databases and the findings are described, grouped, and summarised. Information extracted included article characteristics, documenting the type of data that is synthesised, the model architecture and the methods (if any) used to evaluate utility and privacy risk. ResultsA total of 69 articles were included in the scoping review; all were published between 2018 and 2023 with two thirds (46) in 2022. 30% (21) were focussed on synthetic data generation as the main model output (with 6 of these generating tabular data), whereas 59% (41) focussed on data augmentation. Of the 21 performing federated synthesis, all used deep learning methods (predominantly Generative Adversarial Networks) to generate the synthetic data. ConclusionsFederated synthesis is in its early days but shows promise as a method that can construct a global synthetic dataset without sharing any of the local client data. As a field in its infancy there are areas to explore in terms of the privacy risk associated with the various methods proposed, and more generally in how we measure those risks.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"342 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135868555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Health Data Governance for Research Use in Alberta 艾伯塔省用于研究的卫生数据治理
Pub Date : 2023-10-26 DOI: 10.23889/ijpds.v8i4.2160
Namneet Sandhu, Sarah Whittle, Danielle Southern, Bing Li, Erik Youngson, Jeffery Bakal, Christie Mcleod, Lexi Hilderman, Tyler Williamson, Cheligeer Cheligeer, Robin Walker, Padma Kaul, Hude Quan, Catherine Eastwood
Alberta has rich clinical and health services data held under the custodianship of Alberta Health and Alberta Health Services (AHS), which is not only used for clinical and administrative purposes but also disease surveillance and epidemiological research. Alberta is the largest province in Canada with a single payer centralised health system, AHS, and a consolidated data and analytics team supporting researchers across the province. This paper describes Alberta's data custodians, data governance mechanisms, and streamlined processes followed for research data access. AHS has created a centralised data repository from multiple sources, including practitioner claims data, hospital discharge data, and medications dispensed, available for research use through the provincial Data and Research Services (DRS) team. The DRS team is integrated within AHS to support researchers across the province with their data extraction and linkage requests. Furthermore, streamlined processes have been established, including: 1) ethics approval from a research ethics board, 2) any necessary operational approvals from AHS, and 3) a tripartite legal agreement dictating terms and conditions for data use, disclosure, and retention. This allows researchers to gain timely access to data. To meet the evolving and ever-expanding big-data needs, the University of Calgary, in partnership with AHS, has built high-performance computing (HPC) infrastructure to facilitate storage and processing of large datasets. When releasing data to researchers, the analytics team ensures that Alberta's Health Information Act's guiding principles are followed. The principal investigator also ensures data retention and disposition are according to the plan specified in ethics and per the terms set out by funding agencies. Even though there are disparities and variations in the data protection laws across the different provinces in Canada, the streamlined processes for research data access in Alberta are highly efficient.
艾伯塔省有丰富的临床和卫生服务数据,这些数据由艾伯塔省卫生和卫生服务局保管,不仅用于临床和行政目的,还用于疾病监测和流行病学研究。艾伯塔省是加拿大最大的省份,拥有单一付款人集中医疗系统AHS,以及支持全省研究人员的统一数据和分析团队。本文描述了艾伯塔省的数据保管人、数据治理机制以及研究数据访问所遵循的精简流程。AHS从多个来源创建了一个集中的数据存储库,包括医生索赔数据、出院数据和分配的药物,可通过省数据和研究服务(DRS)团队进行研究使用。DRS团队被整合到AHS中,以支持全省的研究人员进行数据提取和链接请求。此外,还建立了简化的流程,包括:1)研究伦理委员会的伦理批准,2)美国AHS的任何必要操作批准,以及3)规定数据使用、披露和保留条款和条件的三方法律协议。这使得研究人员能够及时获取数据。为了满足不断发展和不断扩大的大数据需求,卡尔加里大学与AHS合作,建立了高性能计算(HPC)基础设施,以促进大型数据集的存储和处理。在向研究人员发布数据时,分析团队确保遵守艾伯塔省健康信息法案的指导原则。首席研究员还要确保数据的保留和处理符合伦理规定的计划和资助机构规定的条款。尽管加拿大不同省份的数据保护法存在差异和差异,但阿尔伯塔省研究数据访问的精简流程效率很高。
{"title":"Health Data Governance for Research Use in Alberta","authors":"Namneet Sandhu, Sarah Whittle, Danielle Southern, Bing Li, Erik Youngson, Jeffery Bakal, Christie Mcleod, Lexi Hilderman, Tyler Williamson, Cheligeer Cheligeer, Robin Walker, Padma Kaul, Hude Quan, Catherine Eastwood","doi":"10.23889/ijpds.v8i4.2160","DOIUrl":"https://doi.org/10.23889/ijpds.v8i4.2160","url":null,"abstract":"Alberta has rich clinical and health services data held under the custodianship of Alberta Health and Alberta Health Services (AHS), which is not only used for clinical and administrative purposes but also disease surveillance and epidemiological research. Alberta is the largest province in Canada with a single payer centralised health system, AHS, and a consolidated data and analytics team supporting researchers across the province. This paper describes Alberta's data custodians, data governance mechanisms, and streamlined processes followed for research data access. AHS has created a centralised data repository from multiple sources, including practitioner claims data, hospital discharge data, and medications dispensed, available for research use through the provincial Data and Research Services (DRS) team. The DRS team is integrated within AHS to support researchers across the province with their data extraction and linkage requests. Furthermore, streamlined processes have been established, including: 1) ethics approval from a research ethics board, 2) any necessary operational approvals from AHS, and 3) a tripartite legal agreement dictating terms and conditions for data use, disclosure, and retention. This allows researchers to gain timely access to data. To meet the evolving and ever-expanding big-data needs, the University of Calgary, in partnership with AHS, has built high-performance computing (HPC) infrastructure to facilitate storage and processing of large datasets. When releasing data to researchers, the analytics team ensures that Alberta's Health Information Act's guiding principles are followed. The principal investigator also ensures data retention and disposition are according to the plan specified in ethics and per the terms set out by funding agencies. Even though there are disparities and variations in the data protection laws across the different provinces in Canada, the streamlined processes for research data access in Alberta are highly efficient.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"47 12","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136381580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Establishment of a birth-to-education cohort of 1 million Palestinian refugees using electronic medical records and electronic education records 利用电子医疗记录和电子教育记录建立100万巴勒斯坦难民从出生到受教育的队列
Pub Date : 2023-10-24 DOI: 10.23889/ijpds.v8i1.2156
Zeina Jamaluddine, Akihiro Seita, Ghada Ballout, Husam Al-Fudoli, Gloria Paolucci, Shatha Albaik, Rami Ibrahim, Miho Sato, Hala Ghattas, Oona Campbell
IntroductionBy linking datasets, electronic records can be used to build large birth-cohorts, enabling researchers to cost-effectively answer questions relevant to populations over the life-course. Currently, around 5.8 million Palestinian refugees live in five settings: Jordan, Lebanon, Syria, West Bank, and Gaza Strip. The United Nations Relief and Works Agency for Palestine Refugees in the Near East (UNRWA) provides them with free primary health and elementary-school services. It maintains electronic records to do so. We aimed to establish a birth cohort of Palestinian refugees born between 1st January 2010 and 31st December 2020 living in five settings by linking mother obstetric records with child health and education records and to describe some of the cohort characteristics. In future, we plan to assess effects of size-at-birth on growth, health and educational attainment, among other questions. MethodsWe extracted all available data from 140 health centres and 702 schools across five settings, i.e. all UNRWA service users. Creating the cohort involved examining IDs and other data, preparing data, de-duplicating records, and identifying live-births, linking the mothers' and children's data using different deterministic linking algorithms, and understanding reasons for non-linkage. ResultsWe established a birth cohort of Palestinian refugees using electronic records of 972,743 live births. We found high levels of linkage to health records overall (83%), which improved over time (from 73% to 86%), and variations in linkage rates by setting: these averaged 93% in Gaza, 89% in Lebanon, 75% in Jordan, 73% in West Bank and 68% in Syria. Of the 423,580 children age-eligible to go to school, 47% went to UNRWA schools and comprised of 197,479 children with both health and education records, and 2,447 children with only education records. In addition to year and setting, other factors associated with non-linkage included mortality and having a non-refugee mother. Misclassification errors were minimal. ConclusionThis linked open birth-cohort is unique for refugees and the Arab region and forms the basis for many future studies, including to elucidate pathways for improved health and education in this vulnerable, understudied population. Our characterization of the cohort leads us to recommend using different sub-sets of the cohort depending on the research question and analytic purposes.
通过连接数据集,电子记录可用于建立大型出生队列,使研究人员能够经济有效地回答与整个生命过程中人口相关的问题。目前,约有580万巴勒斯坦难民生活在五个地区:约旦、黎巴嫩、叙利亚、西岸和加沙地带。联合国近东巴勒斯坦难民救济和工程处(近东救济工程处)为他们提供免费的初级保健和小学教育服务。为此,它保留了电子记录。我们的目标是建立2010年1月1日至2020年12月31日期间在五种环境中出生的巴勒斯坦难民的出生队列,将产妇产科记录与儿童健康和教育记录联系起来,并描述队列的一些特征。未来,我们计划评估出生时体型对成长、健康和受教育程度等问题的影响。方法我们从五个环境中的140个保健中心和702所学校提取了所有可用数据,即近东救济工程处的所有服务用户。创建队列涉及检查id和其他数据,准备数据,删除重复记录,识别活产,使用不同的确定性链接算法链接母亲和儿童的数据,以及理解非链接的原因。结果利用92743例活产的电子记录建立了巴勒斯坦难民出生队列。我们发现总体上与健康记录的关联度很高(83%),随着时间的推移而提高(从73%到86%),并且不同地区的关联度存在差异:加沙的平均关联度为93%,黎巴嫩为89%,约旦为75%,西岸为73%,叙利亚为68%。在符合入学年龄条件的423 580名儿童中,47%就读于近东救济工程处的学校,其中既有健康记录又有教育记录的儿童197 479名,只有教育记录的儿童2 447名。除了年份和环境外,与无联系有关的其他因素包括死亡率和母亲不是难民。分类错误最小。这种相互关联的开放式出生队列对难民和阿拉伯地区来说是独特的,并构成了许多未来研究的基础,包括阐明改善这一脆弱、研究不足的人群的健康和教育的途径。我们对队列的描述使我们建议根据研究问题和分析目的使用不同的队列子集。
{"title":"Establishment of a birth-to-education cohort of 1 million Palestinian refugees using electronic medical records and electronic education records","authors":"Zeina Jamaluddine, Akihiro Seita, Ghada Ballout, Husam Al-Fudoli, Gloria Paolucci, Shatha Albaik, Rami Ibrahim, Miho Sato, Hala Ghattas, Oona Campbell","doi":"10.23889/ijpds.v8i1.2156","DOIUrl":"https://doi.org/10.23889/ijpds.v8i1.2156","url":null,"abstract":"IntroductionBy linking datasets, electronic records can be used to build large birth-cohorts, enabling researchers to cost-effectively answer questions relevant to populations over the life-course. Currently, around 5.8 million Palestinian refugees live in five settings: Jordan, Lebanon, Syria, West Bank, and Gaza Strip. The United Nations Relief and Works Agency for Palestine Refugees in the Near East (UNRWA) provides them with free primary health and elementary-school services. It maintains electronic records to do so. We aimed to establish a birth cohort of Palestinian refugees born between 1st January 2010 and 31st December 2020 living in five settings by linking mother obstetric records with child health and education records and to describe some of the cohort characteristics. In future, we plan to assess effects of size-at-birth on growth, health and educational attainment, among other questions. MethodsWe extracted all available data from 140 health centres and 702 schools across five settings, i.e. all UNRWA service users. Creating the cohort involved examining IDs and other data, preparing data, de-duplicating records, and identifying live-births, linking the mothers' and children's data using different deterministic linking algorithms, and understanding reasons for non-linkage. ResultsWe established a birth cohort of Palestinian refugees using electronic records of 972,743 live births. We found high levels of linkage to health records overall (83%), which improved over time (from 73% to 86%), and variations in linkage rates by setting: these averaged 93% in Gaza, 89% in Lebanon, 75% in Jordan, 73% in West Bank and 68% in Syria. Of the 423,580 children age-eligible to go to school, 47% went to UNRWA schools and comprised of 197,479 children with both health and education records, and 2,447 children with only education records. In addition to year and setting, other factors associated with non-linkage included mortality and having a non-refugee mother. Misclassification errors were minimal. ConclusionThis linked open birth-cohort is unique for refugees and the Arab region and forms the basis for many future studies, including to elucidate pathways for improved health and education in this vulnerable, understudied population. Our characterization of the cohort leads us to recommend using different sub-sets of the cohort depending on the research question and analytic purposes.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135274259","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Resource Profile: COVid VAXines Effects on the Aged (COVVAXAGE) 数据来源简介:COVid疫苗对老年人的影响(COVVAXAGE)
Pub Date : 2023-10-18 DOI: 10.23889/ijpds.v8i6.2170
Kaleen Hayes, Daniel Harris, Andrew Zullo, Djeneba Audrey Djibo, Renae L. Smith-Ray, Michael S. Taitel, Tanya G. Singh, Cheryl McMahill-Walraven, Preeti Chachlani, Katherine Wen, Ellen P. McCarthy, Stefan Gravenstein, Sean McCurdy, Kristina E. Baird, Daniel Moran, Derek Fenson, Yalin Deng, Vincent Mor
BackgroundTo improve the assessment of COVID-19 vaccine use, safety, and effectiveness in older adults and persons with complex multimorbidity, the COVid VAXines Effects on the Aged (COVVAXAGE) database was established by linking CVS Health and Walgreens pharmacy customers to Medicare claims. MethodsWe deterministically linked CVS Health and Walgreens customers who had a pharmacy dispensation/encounter paid for by Medicare to Medicare enrollment and claims records. Linked data include U.S. Medicare claims, Medicare enrollment files, and community pharmacy records. The data currently span 01/01/2016 to 08/31/2022. "Research-ready" files were created, with weekly indicators for vaccinations, censoring, death, enrollment, demographics, and comorbidities. Data are updated quarterly. ResultsAs of November 2022, records for 27,086,723 CVS Health and 23,510,025 Walgreens unique customer IDs were identified for potential linkage. Approximately 91% of customers were matched to a Medicare beneficiary ID (95% for those aged 65 years or older). In the final linked cohort, there were 38,250,873 unique beneficiaries representing ~60% of the Medicare population. Among those alive and enrolled in Medicare as of January 1, 2020 (n = 33,721,568; average age = 73 years, 74% White, 51% Medicare Fee-for-Service, and 11% dual-eligible for Medicaid), the average follow-up time was 130 weeks. The cohort contains 16,021,055 beneficiaries with evidence a first COVID-19 vaccine dose. Data are stored on the secure Medicare & Medicaid Resource Information Center Health & Aging Data Enclave. Data accessInvestigators with funded or in-progress funding applications to the National Institute on Aging who are interested in learning more about the database should contact Dr Vincent Mor [Vincent_mor@brown.edu] and Dr Kaleen Hayes [kaley_hayes@brown.edu]. A data dictionary can be provided under reasonable request. ConclusionsThe COVVAXAGE cohort is a large and diverse cohort that can be used for the ongoing evaluation of COVID-19 vaccine use and other research questions relevant to the Medicare population.
背景:为了更好地评估COVid -19疫苗在老年人和复杂多重疾病患者中的使用、安全性和有效性,通过将CVS Health和Walgreens药房的客户与医疗保险索赔联系起来,建立了COVid -19疫苗对老年人的影响(COVVAXAGE)数据库。方法我们确定地将CVS Health和Walgreens的客户与医疗保险登记和索赔记录联系起来,这些客户由医疗保险支付药房配药/就诊费用。关联数据包括美国医疗保险索赔、医疗保险登记文件和社区药房记录。目前的数据跨度为2016年1月1日至2022年8月31日。创建了“研究就绪”文件,每周都有疫苗接种、审查、死亡、登记、人口统计和合并症的指标。数据每季度更新一次。结果截至2022年11月,已识别出27,086,723 CVS Health和23,510,025 Walgreens唯一客户id的记录,以进行潜在的链接。大约91%的客户与医疗保险受益人身份证相匹配(95%的客户年龄在65岁或以上)。在最后的相关队列中,有38,250,873名独特的受益人,约占医疗保险人口的60%。在截至2020年1月1日活着并参加医疗保险的人中(n = 33,721,568;平均年龄= 73岁,74%白人,51%医疗保险服务收费,11%双重资格医疗补助),平均随访时间为130周。该队列包括16021055名受益人,有证据表明他们首次接种了COVID-19疫苗。数据存储在安全的医疗保险和;医疗补助资源信息中心老化的数据Enclave。数据访问已向国家老龄研究所申请资助或正在申请资助的研究人员,如有兴趣了解更多有关该数据库的信息,请联系Vincent Mor博士[Vincent_mor@brown.edu]和Kaleen Hayes博士[kaley_hayes@brown.edu]。如有合理要求,可提供数据字典。COVVAXAGE队列是一个庞大且多样化的队列,可用于持续评估COVID-19疫苗使用情况以及与Medicare人群相关的其他研究问题。
{"title":"Data Resource Profile: COVid VAXines Effects on the Aged (COVVAXAGE)","authors":"Kaleen Hayes, Daniel Harris, Andrew Zullo, Djeneba Audrey Djibo, Renae L. Smith-Ray, Michael S. Taitel, Tanya G. Singh, Cheryl McMahill-Walraven, Preeti Chachlani, Katherine Wen, Ellen P. McCarthy, Stefan Gravenstein, Sean McCurdy, Kristina E. Baird, Daniel Moran, Derek Fenson, Yalin Deng, Vincent Mor","doi":"10.23889/ijpds.v8i6.2170","DOIUrl":"https://doi.org/10.23889/ijpds.v8i6.2170","url":null,"abstract":"BackgroundTo improve the assessment of COVID-19 vaccine use, safety, and effectiveness in older adults and persons with complex multimorbidity, the COVid VAXines Effects on the Aged (COVVAXAGE) database was established by linking CVS Health and Walgreens pharmacy customers to Medicare claims. MethodsWe deterministically linked CVS Health and Walgreens customers who had a pharmacy dispensation/encounter paid for by Medicare to Medicare enrollment and claims records. Linked data include U.S. Medicare claims, Medicare enrollment files, and community pharmacy records. The data currently span 01/01/2016 to 08/31/2022. \"Research-ready\" files were created, with weekly indicators for vaccinations, censoring, death, enrollment, demographics, and comorbidities. Data are updated quarterly. ResultsAs of November 2022, records for 27,086,723 CVS Health and 23,510,025 Walgreens unique customer IDs were identified for potential linkage. Approximately 91% of customers were matched to a Medicare beneficiary ID (95% for those aged 65 years or older). In the final linked cohort, there were 38,250,873 unique beneficiaries representing ~60% of the Medicare population. Among those alive and enrolled in Medicare as of January 1, 2020 (n = 33,721,568; average age = 73 years, 74% White, 51% Medicare Fee-for-Service, and 11% dual-eligible for Medicaid), the average follow-up time was 130 weeks. The cohort contains 16,021,055 beneficiaries with evidence a first COVID-19 vaccine dose. Data are stored on the secure Medicare & Medicaid Resource Information Center Health & Aging Data Enclave. Data accessInvestigators with funded or in-progress funding applications to the National Institute on Aging who are interested in learning more about the database should contact Dr Vincent Mor [Vincent_mor@brown.edu] and Dr Kaleen Hayes [kaley_hayes@brown.edu]. A data dictionary can be provided under reasonable request. ConclusionsThe COVVAXAGE cohort is a large and diverse cohort that can be used for the ongoing evaluation of COVID-19 vaccine use and other research questions relevant to the Medicare population.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135884320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Orthopedic and ophthalmology surgical service projection modelling in Manitoba: Research approach for a data linkage study 马尼托巴省整形外科和眼科手术服务投影模型:数据链接研究的研究方法
Pub Date : 2023-10-16 DOI: 10.23889/ijpds.v8i1.2123
Alan Katz, Hannah Owczar, Carole Taylor, John-Micheal Bowes, Ruth-Ann Soodeen
BackgroundThe healthcare system in Manitoba, Canada has faced long wait times for many surgical procedures and investigations, including orthopedic and ophthalmology surgeries. Wait times for surgical procedures is considered a significant barrier to accessing healthcare in Canada and can have negative health outcomes for patients. We developed models to forecast anticipated surgical procedure demands up to 2027. This paper explores the opportunities and challenges of using administrative data to describe forecasts of surgical service delivery. MethodsThis study used whole population linked administrative health data to predict future orthopedic and ophthalmology surgical procedure demands up to 2027. Procedure codes (CCI) from hospital discharge abstracts and medical claims data were used in the modelling. A Seasonal Autoregressive Integrated Moving Average model provided the best fit to the data from April 1, 2004 to March 31, 2020. ResultsInitial analyses of only hospital-based procedures excluded a significant portion of provider workload, namely those services provided in clinics. We identified 500,732 orthopedic procedures completed between April 1, 2004 and March 31, 2020 (349,171 procedures identified from hospital discharge abstracts and 151,561 procedures from medical claims). Procedure volumes for these services are expected to rise 17.7% from 2020 (36,542) to 2027 (43,011), including the forecasted 43.9% increase in clinic-based procedures. Of the 660,127 ophthalmology procedures completed between April 1, 2004 and March 31, 2020, 230,717 procedures were identified from hospital discharge abstracts and 429,410 from medical claims. Models forecasted a 27.7% increase from 2020 (69,598) to 2027 (88,893) with most procedures being performed in clinics. ConclusionResearchers should consider including multiple datasets to add information that may have been missing from the presumed data source in their research approach. Confirming the completeness of the data is critical in modelling accurate predictions. Forecast modelling techniques have evolved but still require validation.
加拿大马尼托巴省的医疗保健系统面临着许多外科手术和调查的长时间等待,包括骨科和眼科手术。在加拿大,等待外科手术的时间被认为是获得医疗保健的一个重大障碍,可能对患者的健康产生负面影响。我们开发了预测到2027年预期外科手术需求的模型。本文探讨了使用管理数据来描述外科服务交付预测的机遇和挑战。方法本研究使用与全人群相关的行政健康数据来预测到2027年的未来骨科和眼科手术需求。在建模中使用了医院出院摘要和医疗索赔数据中的程序代码(CCI)。对2004年4月1日至2020年3月31日的数据,采用季节性自回归综合移动平均模型拟合效果最好。结果:仅以医院为基础的程序的初步分析排除了提供者工作量的很大一部分,即诊所提供的服务。我们确定了2004年4月1日至2020年3月31日期间完成的500,732例骨科手术(从出院摘要中确定了349,171例手术,从医疗索赔中确定了151,561例手术)。从2020年(36,542)到2027年(43,011),这些服务的程序量预计将增长17.7%,其中包括预测的基于临床的程序增长43.9%。在2004年4月1日至2020年3月31日期间完成的660,127例眼科手术中,有230,717例来自出院摘要,429,410例来自医疗索赔。模型预测从2020年(69,598)到2027年(88,893)增加27.7%,大多数手术在诊所进行。研究人员应考虑纳入多个数据集,以在其研究方法中添加可能从假定数据源中缺失的信息。确认数据的完整性对于建立准确的预测模型至关重要。预测建模技术已经发展,但仍需要验证。
{"title":"Orthopedic and ophthalmology surgical service projection modelling in Manitoba: Research approach for a data linkage study","authors":"Alan Katz, Hannah Owczar, Carole Taylor, John-Micheal Bowes, Ruth-Ann Soodeen","doi":"10.23889/ijpds.v8i1.2123","DOIUrl":"https://doi.org/10.23889/ijpds.v8i1.2123","url":null,"abstract":"BackgroundThe healthcare system in Manitoba, Canada has faced long wait times for many surgical procedures and investigations, including orthopedic and ophthalmology surgeries. Wait times for surgical procedures is considered a significant barrier to accessing healthcare in Canada and can have negative health outcomes for patients. We developed models to forecast anticipated surgical procedure demands up to 2027. This paper explores the opportunities and challenges of using administrative data to describe forecasts of surgical service delivery. MethodsThis study used whole population linked administrative health data to predict future orthopedic and ophthalmology surgical procedure demands up to 2027. Procedure codes (CCI) from hospital discharge abstracts and medical claims data were used in the modelling. A Seasonal Autoregressive Integrated Moving Average model provided the best fit to the data from April 1, 2004 to March 31, 2020. ResultsInitial analyses of only hospital-based procedures excluded a significant portion of provider workload, namely those services provided in clinics. We identified 500,732 orthopedic procedures completed between April 1, 2004 and March 31, 2020 (349,171 procedures identified from hospital discharge abstracts and 151,561 procedures from medical claims). Procedure volumes for these services are expected to rise 17.7% from 2020 (36,542) to 2027 (43,011), including the forecasted 43.9% increase in clinic-based procedures. Of the 660,127 ophthalmology procedures completed between April 1, 2004 and March 31, 2020, 230,717 procedures were identified from hospital discharge abstracts and 429,410 from medical claims. Models forecasted a 27.7% increase from 2020 (69,598) to 2027 (88,893) with most procedures being performed in clinics. ConclusionResearchers should consider including multiple datasets to add information that may have been missing from the presumed data source in their research approach. Confirming the completeness of the data is critical in modelling accurate predictions. Forecast modelling techniques have evolved but still require validation.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136113011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Resource Profile: The Hospital Electronic Prescribing and Medicines Administration (HEPMA) National Data Collection in Scotland 数据资源简介:苏格兰医院电子处方和药品管理局(HEPMA)国家数据收集
Pub Date : 2023-10-11 DOI: 10.23889/ijpds.v8i6.2182
Tanja Mueller, Euan Proud, Amanj Kurdi, Lynne Jarvis, Kat Reid, Stuart McTaggart, Marion Bennie
IntroductionTo support both electronic prescribing and documentation of medicines administration in secondary care, hospitals in Scotland are currently implementing the Hospital Electronic Prescribing and Medicines Administration (HEPMA) software. Driven by the COVID-19 pandemic, agreements have been put in place to centrally collate data stemming from the operational HEPMA system. The aim was to develop a national data resource based on records created in secondary care, in line with pre-existing collections of data from primary care. MethodsHEPMA is a live clinical system and updated on a continuous basis. Data is automatically extracted from local systems at least weekly and, in most cases, on a nightly basis, and integrated into the national HEPMA dataset. Subsequently, the data are subject to quality checks including data consistency and completeness. Records contain a unique patient identified (Community Health Index number), enabling linkage to other routinely collected data including primary care prescriptions, hospital admission episodes, and death records. ResultsThe HEPMA data resource captures and compiles information on all medicines prescribed within the ward/hospital covered by the system; this includes medicine name, formulation, strength, dose, route, and frequency of administration, and dates and times of prescribing. In addition, the HEPMA dataset also captures information on medicines administration, including dates and time of administration. Data is available from January 2019 onwards and held by Public Health Scotland. ConclusionThe national HEPMA data resource supports cross-sectional/point-prevalence studies including drug utilisation studies, and also offers scope to conduct longitudinal studies, e.g., cohort and case-control studies. With the possibility to link to other relevant datasets, additional areas of interest may include health policy evaluations and health economics studies. Access to data is subject to approval; researchers need to contact the electronic Data Research and Innovation Service (eDRIS) in the first instance.
为了支持二级医疗机构的电子处方和药品管理文档,苏格兰的医院目前正在实施医院电子处方和药品管理(HEPMA)软件。在2019冠状病毒病大流行的推动下,各方达成协议,集中整理来自正在运行的HEPMA系统的数据。其目的是根据二级保健中创建的记录开发一个国家数据资源,并与初级保健中已有的数据集保持一致。方法shepma是一个实时的临床系统,并持续更新。数据至少每周(大多数情况下是每晚)从本地系统中自动提取,并集成到国家HEPMA数据集中。然后对数据进行质量检查,包括数据的一致性和完整性。记录包含一个唯一的患者标识(社区卫生指数编号),从而能够与其他常规收集的数据(包括初级保健处方、住院事件和死亡记录)相关联。结果:HEPMA数据资源捕获并汇编了系统覆盖的病房/医院内所有处方药物的信息;这包括药物名称,配方,强度,剂量,途径,给药频率,以及开药日期和时间。此外,HEPMA数据集还捕获药物给药信息,包括给药日期和时间。数据从2019年1月起提供,由苏格兰公共卫生部保管。国家HEPMA数据资源支持包括药物利用研究在内的横断面/点流行研究,也为纵向研究提供了空间,例如队列研究和病例对照研究。由于有可能与其他相关数据集联系,其他感兴趣的领域可能包括卫生政策评价和卫生经济学研究。查阅资料须经批准;研究人员需要首先联系电子数据研究与创新服务(eDRIS)。
{"title":"Data Resource Profile: The Hospital Electronic Prescribing and Medicines Administration (HEPMA) National Data Collection in Scotland","authors":"Tanja Mueller, Euan Proud, Amanj Kurdi, Lynne Jarvis, Kat Reid, Stuart McTaggart, Marion Bennie","doi":"10.23889/ijpds.v8i6.2182","DOIUrl":"https://doi.org/10.23889/ijpds.v8i6.2182","url":null,"abstract":"IntroductionTo support both electronic prescribing and documentation of medicines administration in secondary care, hospitals in Scotland are currently implementing the Hospital Electronic Prescribing and Medicines Administration (HEPMA) software. Driven by the COVID-19 pandemic, agreements have been put in place to centrally collate data stemming from the operational HEPMA system. The aim was to develop a national data resource based on records created in secondary care, in line with pre-existing collections of data from primary care. MethodsHEPMA is a live clinical system and updated on a continuous basis. Data is automatically extracted from local systems at least weekly and, in most cases, on a nightly basis, and integrated into the national HEPMA dataset. Subsequently, the data are subject to quality checks including data consistency and completeness. Records contain a unique patient identified (Community Health Index number), enabling linkage to other routinely collected data including primary care prescriptions, hospital admission episodes, and death records. ResultsThe HEPMA data resource captures and compiles information on all medicines prescribed within the ward/hospital covered by the system; this includes medicine name, formulation, strength, dose, route, and frequency of administration, and dates and times of prescribing. In addition, the HEPMA dataset also captures information on medicines administration, including dates and time of administration. Data is available from January 2019 onwards and held by Public Health Scotland. ConclusionThe national HEPMA data resource supports cross-sectional/point-prevalence studies including drug utilisation studies, and also offers scope to conduct longitudinal studies, e.g., cohort and case-control studies. With the possibility to link to other relevant datasets, additional areas of interest may include health policy evaluations and health economics studies. Access to data is subject to approval; researchers need to contact the electronic Data Research and Innovation Service (eDRIS) in the first instance.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"253 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136211198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards a standardised cross-sectoral data access agreement template for research: a core set of principles for data access within trusted research environments 为研究建立标准化的跨部门数据访问协议模板:在可信的研究环境中数据访问的一套核心原则
Pub Date : 2023-10-09 DOI: 10.23889/ijpds.v8i4.2169
Rachel Brophy, Ester Bellavia, Maeve Groot Bluemink, Katharine Evans, Munisa Hashimi, Yemi Macaulay, Edel McNamara, Allison Noble, Paola Quattroni, Amanda Rudczenko, Andrew D Morris, Cassie Smith, Andy Boyd
IntroductionTrusted Research Environments (TREs) are secure computing environments that provide access to data for approved researchers to use in studies that can save and improve lives. TREs rely on Data Access Agreements (DAAs) to bind researchers and their organisations to the terms and conditions of accessing the infrastructure and data use. However, DAAs can be overly lengthy, complex, and can contain outdated terms from historical data sharing agreements for physical exchange of data. This is often cited as a cause of significant delays to legal review and research projects starting. ObjectivesThe aim was to develop a standardised DAA optimised for data science in TREs across the UK and framed around the `Five Safes framework' for trustworthy data use. The DAA is underpinned by principles of data access in TREs, the development of which is described in this paper. MethodsThe Pan-UK Data Governance Steering Group of the UK Health Data Research Alliance led the development of a core set of data access principles. This was informed by a benchmarking exercise of DAAs used by established TREs and consultation with public members and stakeholders. ResultsWe have defined a core set of principles for TRE data access that can be mapped to a common set of DAA terms for UK-based TREs. Flexibility will be ensured by including terms specific to TREs or specific data/data owners in customisable annexes. Public views obtained through public involvement and engagement (PIE) activities are also reported. ConclusionsThese principles provide the foundation for a standardised UK TRE DAA template, designed to support the growing ecosystem of TREs. By providing a familiar structure and terms, this template aims to build trust among data owners and the UK public and to provide clarity to researchers on their obligations to protect the data. Widespread adoption is intended to accelerate health data research by enabling faster approval of projects, ultimately enabling more timely and effective research.
可信研究环境(TREs)是一种安全的计算环境,为获得批准的研究人员提供数据访问,用于可以拯救和改善生命的研究。TREs依靠数据访问协议(DAAs)来约束研究人员及其组织访问基础设施和数据使用的条款和条件。然而,daa可能过于冗长、复杂,并且可能包含用于物理数据交换的历史数据共享协议中的过时术语。这通常被认为是法律审查和研究项目启动严重延误的原因。目的是开发一个标准化的DAA,优化整个英国的TREs数据科学,并围绕“五个安全框架”构建可靠的数据使用。DAA以TREs中的数据访问原则为基础,本文对其发展进行了描述。方法英国健康数据研究联盟的泛英国数据治理指导小组领导制定了一套核心数据访问原则。这是通过对已建立的技术服务中心使用的daa进行基准测试,并与公众成员和利益相关者进行磋商得出的结论。结果我们已经定义了一组核心原则,用于TRE数据访问,这些原则可以映射到一组通用的DAA术语,用于基于英国的TRE。通过在可定制的附件中包括特定于TREs或特定数据/数据所有者的条款,将确保灵活性。此外,亦报告透过公众参与和参与活动所取得的公众意见。这些原则为标准化的英国TRE DAA模板提供了基础,旨在支持不断增长的TREs生态系统。通过提供一个熟悉的结构和术语,该模板旨在在数据所有者和英国公众之间建立信任,并向研究人员明确他们保护数据的义务。广泛采用该方法的目的是加快项目审批速度,从而加速健康数据研究,最终实现更及时、更有效的研究。
{"title":"Towards a standardised cross-sectoral data access agreement template for research: a core set of principles for data access within trusted research environments","authors":"Rachel Brophy, Ester Bellavia, Maeve Groot Bluemink, Katharine Evans, Munisa Hashimi, Yemi Macaulay, Edel McNamara, Allison Noble, Paola Quattroni, Amanda Rudczenko, Andrew D Morris, Cassie Smith, Andy Boyd","doi":"10.23889/ijpds.v8i4.2169","DOIUrl":"https://doi.org/10.23889/ijpds.v8i4.2169","url":null,"abstract":"IntroductionTrusted Research Environments (TREs) are secure computing environments that provide access to data for approved researchers to use in studies that can save and improve lives. TREs rely on Data Access Agreements (DAAs) to bind researchers and their organisations to the terms and conditions of accessing the infrastructure and data use. However, DAAs can be overly lengthy, complex, and can contain outdated terms from historical data sharing agreements for physical exchange of data. This is often cited as a cause of significant delays to legal review and research projects starting. ObjectivesThe aim was to develop a standardised DAA optimised for data science in TREs across the UK and framed around the `Five Safes framework' for trustworthy data use. The DAA is underpinned by principles of data access in TREs, the development of which is described in this paper. MethodsThe Pan-UK Data Governance Steering Group of the UK Health Data Research Alliance led the development of a core set of data access principles. This was informed by a benchmarking exercise of DAAs used by established TREs and consultation with public members and stakeholders. ResultsWe have defined a core set of principles for TRE data access that can be mapped to a common set of DAA terms for UK-based TREs. Flexibility will be ensured by including terms specific to TREs or specific data/data owners in customisable annexes. Public views obtained through public involvement and engagement (PIE) activities are also reported. ConclusionsThese principles provide the foundation for a standardised UK TRE DAA template, designed to support the growing ecosystem of TREs. By providing a familiar structure and terms, this template aims to build trust among data owners and the UK public and to provide clarity to researchers on their obligations to protect the data. Widespread adoption is intended to accelerate health data research by enabling faster approval of projects, ultimately enabling more timely and effective research.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135141696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Four Questions to Guide Decision-Making for Data Sharing and Integration 指导数据共享和集成决策的四个问题
Pub Date : 2023-10-04 DOI: 10.23889/ijpds.v8i4.2159
Amy Hawn Nelson, Sharon Zanti
IntroductionThis paper presents a Four Question Framework to guide data integration partners in building a strong governance and legal foundation to support ethical data use. ObjectivesWhile this framework was developed based on work in the United States that routinely integrates public data, it is meant to be a simple, digestible tool that can be adapted to any context. MethodsThe framework was developed through a series of public deliberation workgroups and 15 years of field experience working with a diversity of data integration efforts across the United States. ResultsThe Four Questions - Is this legal? Is this ethical? Is this a good idea? How do we know (and who decides)? - should be considered within an established data governance framework and alongside core partners to determine whether and how to move forward when building an Integrated Data System (IDS) and also at each stage of a specific data project. We discuss these questions in depth, with a particular focus on the role of governance in establishing legal and ethical data use. In addition, we provide example data governance structures from two IDS sites and hypothetical scenarios that illustrate key considerations for the Four Question Framework. ConclusionsA robust governance process is essential for determining whether data sharing and integration is legal, ethical, and a good idea within the local context. This process is iterative and as relational as it is technical, which means authentic collaboration across partners should be prioritized at each stage of a data use project. The Four Questions serve as a guide for determining whether to undertake data sharing and integration and should be regularly revisited throughout the life of a project. Highlights Strong data governance has five qualities: it is purpose-, value-, and principle-driven; strategically located; collaborative; iterative; and transparent. Through a series of public deliberation workgroups and 15 years of field experience, we developed a Four Question Framework to determine whether and how to move forward with building an IDS and at each stage of a data sharing and integration project. The Four Questions—Is this legal? Is this ethical? Is this a good idea? How do we know (and who decides)? —should be carefully considered within established data governance processes and among core partners.
本文提出了一个四问题框架,以指导数据集成合作伙伴建立强大的治理和法律基础,以支持道德数据使用。虽然这个框架是基于美国常规整合公共数据的工作而开发的,但它的目的是成为一个简单、易于理解的工具,可以适应任何环境。该框架是通过一系列公共审议工作组和15年在美国各地开展的各种数据集成工作的现场经验开发的。四个问题——这合法吗?这合乎道德吗?这是个好主意吗?我们怎么知道(谁决定)?-应在既定的数据治理框架内进行考虑,并与核心合作伙伴一起决定在构建集成数据系统(IDS)时以及在特定数据项目的每个阶段是否以及如何推进。我们将深入讨论这些问题,特别关注治理在建立合法和道德数据使用中的作用。此外,我们还提供了来自两个IDS站点的示例数据治理结构和假设场景,说明了四问题框架的关键考虑因素。结论健全的治理流程对于确定数据共享和整合是否合法、合乎道德以及在当地情况下是否是个好主意至关重要。这个过程是迭代的,与技术相关,这意味着在数据使用项目的每个阶段都应该优先考虑合作伙伴之间的真正协作。这四个问题可以作为确定是否进行数据共享和集成的指南,并应在项目的整个生命周期中定期重新审视。强大的数据治理有五个特点:目的驱动、价值驱动和原则驱动;战略定位;协作;迭代;和透明。通过一系列公共审议工作组和15年的实地经验,我们制定了一个四问题框架,以确定是否以及如何在数据共享和集成项目的每个阶段推进建立IDS。四个问题——这合法吗?这合乎道德吗?这是个好主意吗?我们怎么知道(谁决定)?-应在已建立的数据治理流程和核心合作伙伴之间仔细考虑。
{"title":"Four Questions to Guide Decision-Making for Data Sharing and Integration","authors":"Amy Hawn Nelson, Sharon Zanti","doi":"10.23889/ijpds.v8i4.2159","DOIUrl":"https://doi.org/10.23889/ijpds.v8i4.2159","url":null,"abstract":"IntroductionThis paper presents a Four Question Framework to guide data integration partners in building a strong governance and legal foundation to support ethical data use. ObjectivesWhile this framework was developed based on work in the United States that routinely integrates public data, it is meant to be a simple, digestible tool that can be adapted to any context. MethodsThe framework was developed through a series of public deliberation workgroups and 15 years of field experience working with a diversity of data integration efforts across the United States. ResultsThe Four Questions - Is this legal? Is this ethical? Is this a good idea? How do we know (and who decides)? - should be considered within an established data governance framework and alongside core partners to determine whether and how to move forward when building an Integrated Data System (IDS) and also at each stage of a specific data project. We discuss these questions in depth, with a particular focus on the role of governance in establishing legal and ethical data use. In addition, we provide example data governance structures from two IDS sites and hypothetical scenarios that illustrate key considerations for the Four Question Framework. ConclusionsA robust governance process is essential for determining whether data sharing and integration is legal, ethical, and a good idea within the local context. This process is iterative and as relational as it is technical, which means authentic collaboration across partners should be prioritized at each stage of a data use project. The Four Questions serve as a guide for determining whether to undertake data sharing and integration and should be regularly revisited throughout the life of a project. Highlights Strong data governance has five qualities: it is purpose-, value-, and principle-driven; strategically located; collaborative; iterative; and transparent. Through a series of public deliberation workgroups and 15 years of field experience, we developed a Four Question Framework to determine whether and how to move forward with building an IDS and at each stage of a data sharing and integration project. The Four Questions—Is this legal? Is this ethical? Is this a good idea? How do we know (and who decides)? —should be carefully considered within established data governance processes and among core partners.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"111 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135590831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal for Population Data Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1