首页 > 最新文献

International Journal of Population Data Science最新文献

英文 中文
Building the Iowa Data Drive: a participatory approach to developing early childhood indicators for state and local policymaking. 建立爱荷华州数据驱动:为州和地方政策制定制定幼儿指标的参与式方法。
IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-10-23 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i3.2969
Heather Rouse, Sharon Zanti, Hannah Kim, Cassandra Dorius, Todd Abraham, Giorgi Chighladze

Introduction: Public service leaders face increasing challenges using data effectively due to program silos, limited resources, and the increasing complexity of data. To address these challenges, Iowa's Integrated Data System for Decision-Making (I2D2) partnered with state and local leaders in early childhood to curate key indicators and develop population-level data tools and training to promote policy and practice improvements.

Methods: We relied on a mixed-methods, participatory approach to understand early childhood data and reporting requirements and how state and local leaders leverage data to meet these requirements and inform decisions. We conducted a Data Landscape Overview consisting of interviews, surveys, document review, and meetings with state and local leaders. Public deliberation facilitated iterative feedback and collective decision-making through stakeholder discussions.

Results: Our participatory approach resulted in three actions to improve data collection and use within Iowa's early childhood system: curating a set of early childhood indicators; developing training and strategic planning tools for effective data use; and building the Iowa Data Drive (IDD), an interactive data portal for accessing key early childhood indicators and population-level insights.

Conclusions: A robust IDS can promote systems change when grounded in strong partnerships, phased implementation, and a commitment to clear communication. By centering local voices and fostering trust, we developed indicators and tools that support data-informed decisions and improved services for young children and their families.

引言:由于项目孤岛、有限的资源和日益复杂的数据,公共服务领导者在有效利用数据方面面临越来越多的挑战。为了应对这些挑战,爱荷华州的决策综合数据系统(I2D2)与州和地方儿童早期领导者合作,制定关键指标,开发人口层面的数据工具和培训,以促进政策和实践的改进。方法:我们采用混合方法、参与式方法来了解幼儿数据和报告要求,以及州和地方领导人如何利用数据来满足这些要求并为决策提供信息。我们进行了一项数据全景概述,包括访谈、调查、文件审查以及与州和地方领导人的会议。公众审议通过利益相关者的讨论促进了迭代反馈和集体决策。结果:我们的参与式方法导致了三项行动,以改善爱荷华州早期儿童系统的数据收集和使用:策划一套早期儿童指标;为有效使用数据开发培训和战略规划工具;建立爱荷华州数据驱动(IDD),这是一个交互式数据门户网站,用于获取关键的幼儿指标和人口层面的见解。结论:在强有力的伙伴关系、分阶段实施和明确沟通的承诺的基础上,强大的IDS可以促进系统变革。通过集中地方声音和培养信任,我们制定了指标和工具,支持基于数据的决策,并改善了对幼儿及其家庭的服务。
{"title":"Building the Iowa Data Drive: a participatory approach to developing early childhood indicators for state and local policymaking.","authors":"Heather Rouse, Sharon Zanti, Hannah Kim, Cassandra Dorius, Todd Abraham, Giorgi Chighladze","doi":"10.23889/ijpds.v10i3.2969","DOIUrl":"10.23889/ijpds.v10i3.2969","url":null,"abstract":"<p><strong>Introduction: </strong>Public service leaders face increasing challenges using data effectively due to program silos, limited resources, and the increasing complexity of data. To address these challenges, Iowa's Integrated Data System for Decision-Making (I2D2) partnered with state and local leaders in early childhood to curate key indicators and develop population-level data tools and training to promote policy and practice improvements.</p><p><strong>Methods: </strong>We relied on a mixed-methods, participatory approach to understand early childhood data and reporting requirements and how state and local leaders leverage data to meet these requirements and inform decisions. We conducted a Data Landscape Overview consisting of interviews, surveys, document review, and meetings with state and local leaders. Public deliberation facilitated iterative feedback and collective decision-making through stakeholder discussions.</p><p><strong>Results: </strong>Our participatory approach resulted in three actions to improve data collection and use within Iowa's early childhood system: curating a set of early childhood indicators; developing training and strategic planning tools for effective data use; and building the Iowa Data Drive (IDD), an interactive data portal for accessing key early childhood indicators and population-level insights.</p><p><strong>Conclusions: </strong>A robust IDS can promote systems change when grounded in strong partnerships, phased implementation, and a commitment to clear communication. By centering local voices and fostering trust, we developed indicators and tools that support data-informed decisions and improved services for young children and their families.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 3","pages":"2969"},"PeriodicalIF":2.2,"publicationDate":"2025-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12626109/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145557843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Access to services for mental ill-health and substance use among people released from prison in Scotland (RELEASE): Retrospective observational cohort study protocol. 苏格兰监狱释放人员获得精神疾病和药物使用服务的情况(RELEASE):回顾性观察队列研究协议。
IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-10-16 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i1.2971
Richard Kjellgren, Jan Savinc, Nadine Dougall, Amanj Kurdi, Alastair Leyland, Emily Tweed, Jim Watson, Kate Hunt, Catriona Connell

Introduction: Mental health and substance use (MH/SU) problems are highly prevalent among the prison population. However, early and preventative post-imprisonment care appears to be insufficient to meet the MH/SU needs of people released. This is demonstrated by elevated rates of MH/SU-related emergency care and deaths attributable to alcohol, drugs and suicide. Studies examining post-imprisonment healthcare contacts across community, outpatient, inpatient and emergency services for MH/SU are required to address this issue. This protocol paper describes the outcome of data linkage and details our plans for data cleaning and analysis.

Methods: The RELEASE study will follow a retrospective observational cohort design. This is the first study using national individual-level linked administrative health and prison data from Scotland. We report the results of creating the cohort, and outline proposed methods for data preparation and analysis. Within the cohort, the exposed group comprises everyone released from prison in 2015, and the unexposed group consists of a random sample of the general population matched (1:5 ratio) on age, sex, postcode and postcode-derived index of multiple deprivation, and with no prison exposure in the preceding 5 years. Health data (community prescribing, outpatient visits, specialist substance use, psychiatric inpatient, general inpatient, out-of-hours general practice, 24-hour National Health Service [NHS] helpline, ambulance, and emergency services), deaths data, and prison data (admissions, releases, demographic data) were linked to the cohort using unique identifiers. Service contacts associated with MH/SU will be quantified and compared across the two groups using regression modelling, controlling for potential confounding variables, reimprisonment and deaths.

Conclusion: RELEASE is a comprehensive study with potential to inform post-imprisonment MH/SU service delivery, whilst the dataset holds significant potential for exploring other health conditions and outcomes. This research will allow for an unprecedented understanding of post-imprisonment service use patterns in Scotland, and RELEASE will make a significant public health contribution given the overrepresentation of people released in costly emergency care contact and death rates.

导言:精神健康和药物使用问题在监狱人口中非常普遍。然而,监禁后的早期和预防性护理似乎不足以满足出狱人员的保健/支助需求。与MH/ su相关的紧急护理以及因酒精、毒品和自杀而死亡的比率上升就证明了这一点。为解决这一问题,需要研究监狱/州立医院的社区、门诊、住院和急诊服务部门的监禁后医疗保健接触情况。这份协议文件描述了数据链接的结果,并详细介绍了我们的数据清理和分析计划。方法:RELEASE研究采用回顾性观察队列设计。这是第一次使用苏格兰国家个人层面的行政卫生和监狱数据进行研究。我们报告了创建队列的结果,并概述了数据准备和分析的建议方法。在队列中,暴露组由2015年出狱的所有人组成,未暴露组由年龄、性别、邮政编码和邮政编码衍生的多重剥夺指数匹配(1:5比例)的普通人群随机抽样组成,并且在过去5年内没有监狱暴露。使用唯一标识符将健康数据(社区处方、门诊就诊、专科药物使用、精神科住院患者、普通住院患者、非工作时间一般执业、24小时国民保健服务热线、救护车和紧急服务)、死亡数据和监狱数据(入院、释放、人口统计数据)与队列联系起来。将使用回归模型对与MH/SU相关的服务接触进行量化,并在两组之间进行比较,控制潜在的混淆变量、再监禁和死亡。结论:RELEASE是一项综合性研究,有可能为监禁后的MH/SU服务提供提供信息,同时该数据集具有探索其他健康状况和结果的巨大潜力。这项研究将使人们对苏格兰监禁后服务的使用模式有前所未有的了解,鉴于在昂贵的紧急护理接触和死亡率中被释放的人比例过高,RELEASE将对公共卫生做出重大贡献。
{"title":"Access to services for mental ill-health and substance use among people released from prison in Scotland (RELEASE): Retrospective observational cohort study protocol.","authors":"Richard Kjellgren, Jan Savinc, Nadine Dougall, Amanj Kurdi, Alastair Leyland, Emily Tweed, Jim Watson, Kate Hunt, Catriona Connell","doi":"10.23889/ijpds.v10i1.2971","DOIUrl":"10.23889/ijpds.v10i1.2971","url":null,"abstract":"<p><strong>Introduction: </strong>Mental health and substance use (MH/SU) problems are highly prevalent among the prison population. However, early and preventative post-imprisonment care appears to be insufficient to meet the MH/SU needs of people released. This is demonstrated by elevated rates of MH/SU-related emergency care and deaths attributable to alcohol, drugs and suicide. Studies examining post-imprisonment healthcare contacts across community, outpatient, inpatient and emergency services for MH/SU are required to address this issue. This protocol paper describes the outcome of data linkage and details our plans for data cleaning and analysis.</p><p><strong>Methods: </strong>The RELEASE study will follow a retrospective observational cohort design. This is the first study using national individual-level linked administrative health and prison data from Scotland. We report the results of creating the cohort, and outline proposed methods for data preparation and analysis. Within the cohort, the exposed group comprises everyone released from prison in 2015, and the unexposed group consists of a random sample of the general population matched (1:5 ratio) on age, sex, postcode and postcode-derived index of multiple deprivation, and with no prison exposure in the preceding 5 years. Health data (community prescribing, outpatient visits, specialist substance use, psychiatric inpatient, general inpatient, out-of-hours general practice, 24-hour National Health Service [NHS] helpline, ambulance, and emergency services), deaths data, and prison data (admissions, releases, demographic data) were linked to the cohort using unique identifiers. Service contacts associated with MH/SU will be quantified and compared across the two groups using regression modelling, controlling for potential confounding variables, reimprisonment and deaths.</p><p><strong>Conclusion: </strong>RELEASE is a comprehensive study with potential to inform post-imprisonment MH/SU service delivery, whilst the dataset holds significant potential for exploring other health conditions and outcomes. This research will allow for an unprecedented understanding of post-imprisonment service use patterns in Scotland, and RELEASE will make a significant public health contribution given the overrepresentation of people released in costly emergency care contact and death rates.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 1","pages":"2971"},"PeriodicalIF":2.2,"publicationDate":"2025-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12530171/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145330145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A review of synthetic data terminology for privacy preserving use cases. 对保护隐私用例的合成数据术语的回顾。
IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-10-15 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i2.2967
Lora Frayling, Shah Suraj Bharat, Elizabeth Pattinson, Joshua Stock, Fiona Lugg-Widger, Emma Gordon, Emily Oliver

Synthetic data is emerging as a key area of development for supporting research that involves secure forms of administrative and health data, both in the United Kingdom and globally. In practice, key challenges in the generation and adoption of synthetic data are closely tied to the need for agreed and consistent terminology for describing it. The absence of standardised language hinders the setting of quality standards, establishment of governance and guidelines and effective sharing of knowledge and best practices. This has implications for research that uses synthetic healthcare and administrative data, particularly when such data are generated from protected personal data. This commentary paper reviews existing literature on synthetic data to explore how key terms are currently defined in practice, with a focus on privacy-preserving use cases. Our analysis reveals that terms describing properties of synthetic data are often lacking and inconsistent, largely due to the breadth of synthetic data types, contexts and use cases. Context-specific terminology with nuanced meanings complicates efforts for the development of universally agreed definitions, particularly for privacy-preserving synthetic data that captures characteristics from protected data sources. To address this, we propose broad definitions for key terms including synthetic data, utility, utility measure and fidelity. We conclude by offering a set of recommendations emphasising the need for consensus on terminology and encouraging clearer descriptions in future literature that specify both the intended use of the data and the measures used to describe it.

在联合王国和全球范围内,综合数据正在成为支持涉及安全形式的行政和卫生数据的研究的一个关键发展领域。实际上,合成数据产生和采用方面的主要挑战与需要商定和一致的术语来描述合成数据密切相关。标准化语言的缺乏阻碍了质量标准的制定、治理和指导方针的建立以及知识和最佳做法的有效分享。这对使用综合医疗保健和管理数据的研究有影响,特别是当这些数据是从受保护的个人数据生成的。这篇评论文章回顾了关于合成数据的现有文献,以探索当前在实践中如何定义关键术语,重点是保护隐私的用例。我们的分析表明,描述合成数据属性的术语经常缺乏且不一致,这主要是由于合成数据类型、上下文和用例的广度。具有微妙含义的特定于上下文的术语使开发普遍同意的定义的工作变得复杂,特别是对于从受保护的数据源捕获特征的保留隐私的合成数据。为了解决这个问题,我们提出了对关键术语的广泛定义,包括合成数据、效用、效用度量和保真度。最后,我们提供了一组建议,强调需要在术语上达成共识,并鼓励在未来的文献中更清晰地描述数据的预期用途和用于描述数据的测量方法。
{"title":"A review of synthetic data terminology for privacy preserving use cases.","authors":"Lora Frayling, Shah Suraj Bharat, Elizabeth Pattinson, Joshua Stock, Fiona Lugg-Widger, Emma Gordon, Emily Oliver","doi":"10.23889/ijpds.v10i2.2967","DOIUrl":"10.23889/ijpds.v10i2.2967","url":null,"abstract":"<p><p>Synthetic data is emerging as a key area of development for supporting research that involves secure forms of administrative and health data, both in the United Kingdom and globally. In practice, key challenges in the generation and adoption of synthetic data are closely tied to the need for agreed and consistent terminology for describing it. The absence of standardised language hinders the setting of quality standards, establishment of governance and guidelines and effective sharing of knowledge and best practices. This has implications for research that uses synthetic healthcare and administrative data, particularly when such data are generated from protected personal data. This commentary paper reviews existing literature on synthetic data to explore how key terms are currently defined in practice, with a focus on privacy-preserving use cases. Our analysis reveals that terms describing properties of synthetic data are often lacking and inconsistent, largely due to the breadth of synthetic data types, contexts and use cases. Context-specific terminology with nuanced meanings complicates efforts for the development of universally agreed definitions, particularly for privacy-preserving synthetic data that captures characteristics from protected data sources. To address this, we propose broad definitions for key terms including <i>synthetic data</i>, <i>utility</i>, <i>utility measure</i> and <i>fidelity</i>. We conclude by offering a set of recommendations emphasising the need for consensus on terminology and encouraging clearer descriptions in future literature that specify both the intended use of the data and the measures used to describe it.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 2","pages":"2967"},"PeriodicalIF":2.2,"publicationDate":"2025-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12622363/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145551373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data dictionaries: essential tools for the ethical and transparent use of integrated data. 数据字典:合乎道德和透明地使用综合数据的基本工具。
IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-10-13 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i2.2956
Rebecca S Pepe, Kristen Coe

Data transparency lays the groundwork for the ethical use of administrative data. This is particularly true for linked administrative data within integrated data systems (IDS). Data dictionaries, resources that maintain the metadata of the information housed in an IDS, offer a tool to ensure transparency throughout the data life cycle. The FAIR Principles, which assert that data be Findable, Accessible, Interoperable, and Reusable provide a useful framework by which to measure the effectiveness of data dictionaries in the IDS context. This paper uses the FAIR Principles to discuss the ways in which data dictionaries serve as tools in the ethical and transparent use of integrated data as well as the challenges that remain. Linked administrative data is a valuable source of information for programmatic and academic research. Data dictionaries facilitate the ethical handling of this sensitive information and maintain a commitment to transparency in data inquiry and research.

数据透明度为合乎道德地使用行政数据奠定了基础。对于集成数据系统(IDS)中的链接管理数据尤其如此。数据字典是维护IDS中包含的信息元数据的资源,它提供了一种工具来确保整个数据生命周期的透明性。FAIR原则断言数据是可查找的、可访问的、可互操作的和可重用的,它提供了一个有用的框架,通过这个框架可以衡量IDS上下文中数据字典的有效性。本文使用公平原则来讨论数据字典作为整合数据的道德和透明使用工具的方式以及仍然存在的挑战。关联管理数据是规划和学术研究的宝贵信息来源。数据字典有助于道德地处理这些敏感信息,并保持对数据查询和研究透明度的承诺。
{"title":"Data dictionaries: essential tools for the ethical and transparent use of integrated data.","authors":"Rebecca S Pepe, Kristen Coe","doi":"10.23889/ijpds.v10i2.2956","DOIUrl":"10.23889/ijpds.v10i2.2956","url":null,"abstract":"<p><p>Data transparency lays the groundwork for the ethical use of administrative data. This is particularly true for linked administrative data within integrated data systems (IDS). Data dictionaries, resources that maintain the metadata of the information housed in an IDS, offer a tool to ensure transparency throughout the data life cycle. The FAIR Principles, which assert that data be Findable, Accessible, Interoperable, and Reusable provide a useful framework by which to measure the effectiveness of data dictionaries in the IDS context. This paper uses the FAIR Principles to discuss the ways in which data dictionaries serve as tools in the ethical and transparent use of integrated data as well as the challenges that remain. Linked administrative data is a valuable source of information for programmatic and academic research. Data dictionaries facilitate the ethical handling of this sensitive information and maintain a commitment to transparency in data inquiry and research.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 2","pages":"2956"},"PeriodicalIF":2.2,"publicationDate":"2025-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12517314/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145293723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using graph theory to flexibly construct patient journeys in linked healthcare data. 利用图论在关联的医疗数据中灵活构建患者旅程。
IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-10-09 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i1.2371
Ian Powell, Zhisheng Sa, Branislav Igic, Maria Alfaro-Ramirez, Rachel Farber, Michael Nelson

Introduction: Studies of epidemiology and health system use that use linked admitted patient data benefit from understanding the patient journey, particularly when it spans multiple records within or across multiple datasets.

Objectives: To develop a flexible method for grouping together administrative admitted patient records into periods of hospital care that follow patients from admission to discharge.

Methods: We describe a flexible and generalisable graph theoretic algorithm for grouping patient records into periods of hospital care. The algorithm can account for a variety of complex hospitalisation patterns involving multiple transfers and overlapping records. An R package, journeyer, that implements this algorithm, is included in the Supplementary Material.

Results: This algorithm was applied to the New South Wales Admitted Patient Data Collection, finding 21,405,451 periods of hospital care from 22,794,746 hospital records. The parameters and decisions required for this algorithm were assessed and found appropriate for this dataset, but we offer some advice for generalisation to other datasets.

Conclusions: Our method assists in preparing data for epidemiological research in New South Wales and can be generalised to inpatient data in other jurisdictions. The method can be extended to include ambulance and emergency department data.

引言:使用关联入院患者数据的流行病学和卫生系统使用研究受益于了解患者旅程,特别是当它跨越多个数据集中或跨多个数据集的多个记录时。目的:开发一种灵活的方法,将行政住院患者记录分组到医院护理期间,跟踪患者从入院到出院。方法:我们描述了一种灵活且可推广的图论算法,用于将患者记录分组到医院护理期间。该算法可以解释各种复杂的住院模式,包括多次转移和重叠的记录。一个R包,旅行者,实现这个算法,包括在补充材料。结果:该算法应用于新南威尔士州住院患者数据收集,从22,794,746份医院记录中找到21,405,451个医院护理期。对该算法所需的参数和决策进行了评估,并发现适合该数据集,但我们为推广到其他数据集提供了一些建议。结论:我们的方法有助于为新南威尔士州的流行病学研究准备数据,并可推广到其他司法管辖区的住院患者数据。该方法可以扩展到包括救护车和急诊科的数据。
{"title":"Using graph theory to flexibly construct patient journeys in linked healthcare data.","authors":"Ian Powell, Zhisheng Sa, Branislav Igic, Maria Alfaro-Ramirez, Rachel Farber, Michael Nelson","doi":"10.23889/ijpds.v10i1.2371","DOIUrl":"10.23889/ijpds.v10i1.2371","url":null,"abstract":"<p><strong>Introduction: </strong>Studies of epidemiology and health system use that use linked admitted patient data benefit from understanding the patient journey, particularly when it spans multiple records within or across multiple datasets.</p><p><strong>Objectives: </strong>To develop a flexible method for grouping together administrative admitted patient records into periods of hospital care that follow patients from admission to discharge.</p><p><strong>Methods: </strong>We describe a flexible and generalisable graph theoretic algorithm for grouping patient records into periods of hospital care. The algorithm can account for a variety of complex hospitalisation patterns involving multiple transfers and overlapping records. An R package, <i>journeyer</i>, that implements this algorithm, is included in the Supplementary Material.</p><p><strong>Results: </strong>This algorithm was applied to the New South Wales Admitted Patient Data Collection, finding 21,405,451 periods of hospital care from 22,794,746 hospital records. The parameters and decisions required for this algorithm were assessed and found appropriate for this dataset, but we offer some advice for generalisation to other datasets.</p><p><strong>Conclusions: </strong>Our method assists in preparing data for epidemiological research in New South Wales and can be generalised to inpatient data in other jurisdictions. The method can be extended to include ambulance and emergency department data.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 1","pages":"2371"},"PeriodicalIF":2.2,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12510292/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145281383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using linked Census ancestry data to examine all-cause mortality by ethnicity in Australia. 使用相关的人口普查祖先数据来检查澳大利亚种族的全因死亡率。
IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-09-11 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i1.2476
Fiona Stanaway, Lin Zhu, Bree McDonald, Jioji Ravulo, Michelle Dickson, Natasha Nassar, Mei Ling Yap, Louisa Jorm, Sarah Aitken, Leonard Kritharides, Andrew Wilson, Fiona M Blyth, Carmen Huckel Schneider, Saman Khalatbari Soltani, Benjumin Hsu, Liz Allen

Introduction: Ethnicity in Australia's non-Indigenous population is not collected routinely in health data but the proxy of ancestry is collected in the Census.

Objectives: We aimed to develop an approach to using ancestry data to examine health inequalities by ethnicity in Australia's non-Indigenous population. We then applied this to the example of all-cause mortality.

Methods: We established an expert and community panel to inform our approach to categorising ancestry data. This included shifting those identifying as 'Australian' or 'New Zealander' from the Oceanian to the European continental category; prioritising ethnic minority identities over national identities in those with two ancestries; and examining outcomes using the smallest ethnicity categories possible. We examined how results compared to existing approaches based on country of birth or ancestry (without our modifications) in the detection of mortality inequalities using 2016 Australian Census data linked to death registrations for 2016-2021 in 20.3 million people.

Results: We found important differences in mortality inequalities observed in Māori and Pasifika populations in Australia based on the method used. Ancestry data was able to demonstrate significantly higher mortality that was not observed when using country of birth in Māori females (747 vs 507 per 100,000 person-years), Melanesian and Papuan males (1684 vs 617 per 100,000 person-years) and Polynesian males and females (928 vs 724 in males and 693 vs 569 per 100,000 person-years in females). The size of the inequalities observed was larger using our expert and community informed approach compared to existing approaches (e.g. Polynesian males 928 vs 853 per 100,000 person-years).

Conclusions: We demonstrated an approach to using ancestry data from the Australian Census that improved identification of mortality inequalities in Māori and Pasifika ethnic groups. Inequalities were either hidden or underestimated when country of birth or the standard approach to ancestry data was used.

导言:澳大利亚非土著人口的种族在健康数据中没有常规收集,但在人口普查中收集了祖先的代理。目的:我们旨在开发一种方法,利用祖先数据来检查澳大利亚非土著人口中按种族划分的健康不平等。然后我们把这个应用到全因死亡率的例子中。方法:我们建立了一个专家和社区小组来告知我们对祖先数据进行分类的方法。这包括将那些自认为是“澳大利亚人”或“新西兰人”的人从大洋洲人转移到欧洲大陆人;优先考虑具有双重血统的少数民族身份而不是民族身份;并使用尽可能小的种族类别来检查结果。我们使用2016年澳大利亚人口普查数据(与2016-2021年2030万人的死亡登记相关),研究了在检测死亡率不平等方面,将结果与基于出生国家或祖先的现有方法(未经修改)进行比较的结果。结果:根据使用的方法,我们发现在澳大利亚Māori和Pasifika人群中观察到的死亡率不平等存在重要差异。祖先数据显示,在使用出生国时,Māori女性(747对507 / 100000人年)、美拉尼西亚和巴布亚男性(1684对617 / 100000人年)和波利尼西亚男性和女性(928对724,693对569 / 100000人年)的死亡率明显更高,这一点在使用出生国时没有观察到。与现有方法相比,使用我们的专家和社区知情方法观察到的不平等规模更大(例如,波利尼西亚男性928 vs 853 / 100,000人年)。结论:我们展示了一种使用来自澳大利亚人口普查的祖先数据的方法,该方法改进了Māori和Pasifika族裔群体死亡率不平等的识别。当使用出生国或祖先数据的标准方法时,不平等要么被隐藏,要么被低估。
{"title":"Using linked Census ancestry data to examine all-cause mortality by ethnicity in Australia.","authors":"Fiona Stanaway, Lin Zhu, Bree McDonald, Jioji Ravulo, Michelle Dickson, Natasha Nassar, Mei Ling Yap, Louisa Jorm, Sarah Aitken, Leonard Kritharides, Andrew Wilson, Fiona M Blyth, Carmen Huckel Schneider, Saman Khalatbari Soltani, Benjumin Hsu, Liz Allen","doi":"10.23889/ijpds.v10i1.2476","DOIUrl":"10.23889/ijpds.v10i1.2476","url":null,"abstract":"<p><strong>Introduction: </strong>Ethnicity in Australia's non-Indigenous population is not collected routinely in health data but the proxy of ancestry is collected in the Census.</p><p><strong>Objectives: </strong>We aimed to develop an approach to using ancestry data to examine health inequalities by ethnicity in Australia's non-Indigenous population. We then applied this to the example of all-cause mortality.</p><p><strong>Methods: </strong>We established an expert and community panel to inform our approach to categorising ancestry data. This included shifting those identifying as 'Australian' or 'New Zealander' from the Oceanian to the European continental category; prioritising ethnic minority identities over national identities in those with two ancestries; and examining outcomes using the smallest ethnicity categories possible. We examined how results compared to existing approaches based on country of birth or ancestry (without our modifications) in the detection of mortality inequalities using 2016 Australian Census data linked to death registrations for 2016-2021 in 20.3 million people.</p><p><strong>Results: </strong>We found important differences in mortality inequalities observed in Māori and Pasifika populations in Australia based on the method used. Ancestry data was able to demonstrate significantly higher mortality that was not observed when using country of birth in Māori females (747 vs 507 per 100,000 person-years), Melanesian and Papuan males (1684 vs 617 per 100,000 person-years) and Polynesian males and females (928 vs 724 in males and 693 vs 569 per 100,000 person-years in females). The size of the inequalities observed was larger using our expert and community informed approach compared to existing approaches (e.g. Polynesian males 928 vs 853 per 100,000 person-years).</p><p><strong>Conclusions: </strong>We demonstrated an approach to using ancestry data from the Australian Census that improved identification of mortality inequalities in Māori and Pasifika ethnic groups. Inequalities were either hidden or underestimated when country of birth or the standard approach to ancestry data was used.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 1","pages":"2476"},"PeriodicalIF":2.2,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12581141/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145446126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The nature and extent of the literature on linked reproductive health datasets in the UK: a scoping review. 联合王国相关生殖健康数据集文献的性质和范围:范围审查。
IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-09-09 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i1.2989
Jennifer Hall, Lois Harvey-Pescott, Sum Yue Jessica Ko, Rose Stevens, Neha Pathak, Ifra Ali, Geraldine Barrett, Jenny Shand, Kelly Dickson

Introduction: Data linkage methodologies are increasingly being utilised across research, but there is currently no evidence on the extent and nature of studies that have used linked reproductive health data. The objective of this scoping review is to identify UK studies that use reproductive health data linkage, to improve our understanding of how data linkage could be used for policy, practice, and research in reproductive health.

Methods: We conducted a scoping review using a systematic search in five databases: MEDLINE, EMBASE, CINAHL, MIDIRS, and PSYCINFO to identify literature published in English between January 2000 - April 2024. Following duplication removal, piloting, and screening of titles/abstracts, screening of full texts was conducted. Publications using reproductive health data linkage among UK participants of reproductive age were included. Data was extracted from included articles to capture details relating to study characteristics and what, how, and why data was linked.

Findings: Of the 7,291 identified studies, 272 studies were included in the review. Most studies using data linkage answered questions around reproductive cancer and maternal and child health, whilst only a few studies focused on abortion, contraception, menopause, and preconception health. Several nationally agreed reproductive health indicators did not appear in any included study. Information on sample sociodemographic characteristics, such as ethnicity and deprivation, was often unreported, limiting the identification of health inequalities. Many different datasets were linked (n = 155) with routine health data sources, such as hospital episode statistics (HES), being the most frequently linked.

Interpretation: There is a growing body of research using linked UK reproductive health data, with gaps in which reproductive health domains are covered and which sample characteristics are reported. Further efforts to create a comprehensive, linked reproductive health data resource with robust linkage methods would enable us to fill data gaps, examine inequalities, and explore reproductive health trajectories.

Funding: National Institute for Health and Care Research (NIHR) Policy Research Unit in Reproductive Health.

导言:数据联系方法越来越多地在研究中使用,但目前没有证据表明使用联系生殖健康数据的研究的程度和性质。本次范围审查的目的是确定使用生殖健康数据联系的联合王国研究,以提高我们对如何将数据联系用于生殖健康方面的政策、实践和研究的理解。方法:我们对MEDLINE、EMBASE、CINAHL、MIDIRS和PSYCINFO五个数据库进行了系统检索,以确定2000年1月至2024年4月间发表的英文文献。在删除重复、试点和筛选标题/摘要之后,对全文进行了筛选。纳入了在英国育龄参与者中使用生殖健康数据联系的出版物。从纳入的文章中提取数据,以捕获与研究特征以及数据链接的内容、方式和原因有关的细节。结果:在7291项确定的研究中,有272项研究纳入了本综述。大多数使用数据链接的研究回答了有关生殖癌和孕产妇和儿童健康的问题,而只有少数研究关注堕胎、避孕、更年期和孕前健康。一些国家商定的生殖健康指标没有出现在任何纳入的研究中。关于抽样社会人口特征(如种族和贫困)的信息往往没有报告,限制了对保健不平等现象的查明。许多不同的数据集被链接(n = 155),常规健康数据源,如医院发作统计(HES),是最常链接的。解释:越来越多的研究机构使用联合王国相关的生殖健康数据,但在涵盖生殖健康领域和报告样本特征方面存在差距。通过强有力的联系方法,进一步努力建立一个全面的、相互联系的生殖健康数据资源,将使我们能够填补数据空白,审查不平等现象,探索生殖健康的发展轨迹。资助:国家卫生和保健研究所(NIHR)生殖健康政策研究室。
{"title":"The nature and extent of the literature on linked reproductive health datasets in the UK: a scoping review.","authors":"Jennifer Hall, Lois Harvey-Pescott, Sum Yue Jessica Ko, Rose Stevens, Neha Pathak, Ifra Ali, Geraldine Barrett, Jenny Shand, Kelly Dickson","doi":"10.23889/ijpds.v10i1.2989","DOIUrl":"10.23889/ijpds.v10i1.2989","url":null,"abstract":"<p><strong>Introduction: </strong>Data linkage methodologies are increasingly being utilised across research, but there is currently no evidence on the extent and nature of studies that have used linked reproductive health data. The objective of this scoping review is to identify UK studies that use reproductive health data linkage, to improve our understanding of how data linkage could be used for policy, practice, and research in reproductive health.</p><p><strong>Methods: </strong>We conducted a scoping review using a systematic search in five databases: MEDLINE, EMBASE, CINAHL, MIDIRS, and PSYCINFO to identify literature published in English between January 2000 - April 2024. Following duplication removal, piloting, and screening of titles/abstracts, screening of full texts was conducted. Publications using reproductive health data linkage among UK participants of reproductive age were included. Data was extracted from included articles to capture details relating to study characteristics and what, how, and why data was linked.</p><p><strong>Findings: </strong>Of the 7,291 identified studies, 272 studies were included in the review. Most studies using data linkage answered questions around reproductive cancer and maternal and child health, whilst only a few studies focused on abortion, contraception, menopause, and preconception health. Several nationally agreed reproductive health indicators did not appear in any included study. Information on sample sociodemographic characteristics, such as ethnicity and deprivation, was often unreported, limiting the identification of health inequalities. Many different datasets were linked (n = 155) with routine health data sources, such as hospital episode statistics (HES), being the most frequently linked.</p><p><strong>Interpretation: </strong>There is a growing body of research using linked UK reproductive health data, with gaps in which reproductive health domains are covered and which sample characteristics are reported. Further efforts to create a comprehensive, linked reproductive health data resource with robust linkage methods would enable us to fill data gaps, examine inequalities, and explore reproductive health trajectories.</p><p><strong>Funding: </strong>National Institute for Health and Care Research (NIHR) Policy Research Unit in Reproductive Health.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 1","pages":"2989"},"PeriodicalIF":2.2,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12668254/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145662041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Considerations for selecting and implementing comorbidity indices when using secondary data sources: a guide for health researchers. 使用二手数据源时选择和实施合并症指数的考虑:卫生研究人员指南。
IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-09-02 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i3.2973
Boglarka Soos, Tyler Williamson, Kerry McBrien, Samuel Wiebe, Marcello Tonelli, Danielle A Southern, Cathy A Eastwood, Bing Li, Hude Quan, Paul Ronksley

Comorbidity measures, such as the Charlson Comorbidity Index, are commonly used in risk adjustment models to account for variability in disease burden. This narrative synthesis describes and critiques available comorbidity indices and offers implementation guidance to researchers based on a critical review of existing literature. First, common comorbidity measures are described. Instruments derived using case definitions, grouping of International Classification of Diseases (ICD) codes, and mapping of dispensed medications to chronic conditions are presented. Comorbidity indices that combine diagnostic and medication data are also introduced. No single option consistently outperforms the rest. Next, important considerations when applying a comorbidity index are described. It is crucial to respect temporality and exclude health events that arise after the study index date. Researchers must also weigh the interpretability of using a weighted sum against the flexibility of using a large set of binary variables. When modelling long-term outcomes, there are benefits to applying a one-year look-back window and augmenting data via linkage. For short-term outcomes, certain chronic conditions may exhibit a protective association; however, not all indices capture these relationships. Implementation of these findings will improve the interpretability of comorbidity measures and the quality of future studies.

共病指标,如查理森共病指数,通常用于风险调整模型,以解释疾病负担的可变性。这种叙事综合描述和批评现有的共病指数,并提供实施指导,以现有文献的批判性审查为基础的研究人员。首先,描述了常见的共病措施。使用病例定义、国际疾病分类(ICD)代码分组和分配的药物映射到慢性病的工具被提出。还介绍了结合诊断和药物数据的合并症指标。没有任何一种选择总是优于其他选择。接下来,描述了应用合并症指数时的重要考虑因素。至关重要的是要尊重暂时性,并排除在研究索引日期之后出现的健康事件。研究人员还必须权衡使用加权和的可解释性和使用大量二元变量的灵活性。在对长期结果进行建模时,应用一年回顾窗口并通过链接增加数据是有好处的。就短期结果而言,某些慢性疾病可能表现出保护性关联;然而,并非所有指数都能捕捉到这些关系。这些发现的实施将提高合并症测量的可解释性和未来研究的质量。
{"title":"Considerations for selecting and implementing comorbidity indices when using secondary data sources: a guide for health researchers.","authors":"Boglarka Soos, Tyler Williamson, Kerry McBrien, Samuel Wiebe, Marcello Tonelli, Danielle A Southern, Cathy A Eastwood, Bing Li, Hude Quan, Paul Ronksley","doi":"10.23889/ijpds.v10i3.2973","DOIUrl":"10.23889/ijpds.v10i3.2973","url":null,"abstract":"<p><p>Comorbidity measures, such as the Charlson Comorbidity Index, are commonly used in risk adjustment models to account for variability in disease burden. This narrative synthesis describes and critiques available comorbidity indices and offers implementation guidance to researchers based on a critical review of existing literature. First, common comorbidity measures are described. Instruments derived using case definitions, grouping of International Classification of Diseases (ICD) codes, and mapping of dispensed medications to chronic conditions are presented. Comorbidity indices that combine diagnostic and medication data are also introduced. No single option consistently outperforms the rest. Next, important considerations when applying a comorbidity index are described. It is crucial to respect temporality and exclude health events that arise after the study index date. Researchers must also weigh the interpretability of using a weighted sum against the flexibility of using a large set of binary variables. When modelling long-term outcomes, there are benefits to applying a one-year look-back window and augmenting data via linkage. For short-term outcomes, certain chronic conditions may exhibit a protective association; however, not all indices capture these relationships. Implementation of these findings will improve the interpretability of comorbidity measures and the quality of future studies.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"6 1","pages":"2973"},"PeriodicalIF":2.2,"publicationDate":"2025-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12659803/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145649521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Placing conditions on sharing general practice data for research: Recommendations from two community juries. 为研究共享一般实践数据设定条件:来自两个社区陪审团的建议。
IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-08-18 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i1.2469
Annette Joy Braunack-Mayer, Heidi Green, Lucy Carolan, Belinda Fabrianesi, Carolan Adams, Felicity Flack, Anthony Brown, Kate Miller, Carrie Hayter, Joel Rhee, Alberto Nettel-Aguirre, Justin Bielby, Matthew Wright-Simon

Objective: There is increasing demand for access to general practice health records for secondary purposes, including research. However, the extent to which the public supports such use is unclear. We sought to explore informed Australians' perspectives on conditions under which the use of general practice data for research would be acceptable.

Methods: We conducted two community juries in July and August 2023 with 20 participants, selected for diversity, in each jury. Jurors worked for 36 hours, in a combination of online and face-to-face sessions, over 6 days. They listened to expert presentations, discussed, and challenged experts, deliberated, and developed their own recommendations.

Results: Both juries, in principle, supported sharing general practice data for research purposes. They made 24 (Sydney jury) and 19 (Melbourne jury) recommendations related to consent, information provision, public benefit, data security, governance and costs.

Conclusions: The outcomes of the deliberative process suggest that an informed group of Australian citizens are willing to share general practice data for research provided strict conditions are met.

Implications for public health: Adopting the recommendations from the juries will require a range of policy and regulatory responses including legislative changes.

目的:为次要目的(包括研究)获取全科医疗记录的需求越来越大。然而,公众在多大程度上支持这种使用尚不清楚。我们试图探索知情的澳大利亚人对在什么情况下可以接受使用全科实践数据进行研究的观点。方法:我们于2023年7月和8月进行了两次社区评委会,每个评委会中有20名参与者。在6天的时间里,陪审员们工作了36个小时,结合了在线和面对面的会议。他们听取了专家的报告,进行了讨论,并对专家提出了质疑,进行了审议,并提出了自己的建议。结果:原则上,两个陪审团都支持为研究目的共享一般实践数据。他们提出了24项(悉尼陪审团)和19项(墨尔本陪审团)建议,涉及同意、信息提供、公共利益、数据安全、治理和成本。结论:审议过程的结果表明,在满足严格条件的情况下,一群知情的澳大利亚公民愿意为研究分享一般实践数据。对公众健康的影响:采纳陪审团的建议将需要一系列政策和监管反应,包括立法改革。
{"title":"Placing conditions on sharing general practice data for research: Recommendations from two community juries.","authors":"Annette Joy Braunack-Mayer, Heidi Green, Lucy Carolan, Belinda Fabrianesi, Carolan Adams, Felicity Flack, Anthony Brown, Kate Miller, Carrie Hayter, Joel Rhee, Alberto Nettel-Aguirre, Justin Bielby, Matthew Wright-Simon","doi":"10.23889/ijpds.v10i1.2469","DOIUrl":"10.23889/ijpds.v10i1.2469","url":null,"abstract":"<p><strong>Objective: </strong>There is increasing demand for access to general practice health records for secondary purposes, including research. However, the extent to which the public supports such use is unclear. We sought to explore informed Australians' perspectives on conditions under which the use of general practice data for research would be acceptable.</p><p><strong>Methods: </strong>We conducted two community juries in July and August 2023 with 20 participants, selected for diversity, in each jury. Jurors worked for 36 hours, in a combination of online and face-to-face sessions, over 6 days. They listened to expert presentations, discussed, and challenged experts, deliberated, and developed their own recommendations.</p><p><strong>Results: </strong>Both juries, in principle, supported sharing general practice data for research purposes. They made 24 (Sydney jury) and 19 (Melbourne jury) recommendations related to consent, information provision, public benefit, data security, governance and costs.</p><p><strong>Conclusions: </strong>The outcomes of the deliberative process suggest that an informed group of Australian citizens are willing to share general practice data for research provided strict conditions are met.</p><p><strong>Implications for public health: </strong>Adopting the recommendations from the juries will require a range of policy and regulatory responses including legislative changes.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 1","pages":"2469"},"PeriodicalIF":2.2,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12581136/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145446107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving transparency in data access processes: Developing best practice standards and promoting system-wide change through a competitive funding call. 提高数据获取过程的透明度:制定最佳做法标准,并通过竞争性筹资呼吁促进全系统变革。
IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-07-22 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i2.2949
Yemi Macaulay, Ester Bellavia, Rachel Brophy, Angela Coulter, Ben Glampson, Bethany Gilbert, Alan Holcroft, Erik Mayer, Edel McNamara, Katherine O'Sullivan, Paola Quattroni, David Seymour, Yvonne Silove, Doreen Tembo, Andrew D Morris, Cassie Smith, Andy Boyd

Introduction: Transparency in the use of data for research benefits the public and researchers by fostering trust and enabling efficient data sharing. Public support for access to their data for research depends on robust data security, the absence of conflicting interests, and a clear demonstration of public benefit, all of which must be evident through transparent practices. A lack of clarity in data access processes can delay research, highlighting the need for clear and streamlined approval procedures. To maintain what is often referred to as a 'social license to operate', organisations must meet and uphold societal expectations, with transparency being a key dimension of that responsibility.

Objective: To develop and foster adoption of a set of transparency standards for the data science community, supporting trustworthy and streamlined data use for health and socio-economic research and planning.

Methods: A multi-stakeholder deliberation was undertaken, informed by two reviews of existing data access procedures across participating organisations. Stakeholders included healthcare and research organisations, data custodians, regulators, industry representatives, academic experts, and members of the public.

Results: The review and deliberation identified missed opportunities to inform and involve the public in data access procedures, along with inconsistencies in data access processes and supporting materials across the organisations. In response, we developed the Transparency Standards, comprising 28 recommended actions grouped into four themes: provision of clear data access guidance; clear website navigation designed to meet the needs of public and research users; regular review and iterative improvement of processes; and reporting of data access outcomes and information security findings. A targeted funding call facilitated the adoption of standards in 19 organisations, resulting in reusable transparency materials and transferable knowledge to support wider implementation.

Conclusion: The Transparency Standards support data custodians in strengthening openness and accountability in data access processes, helping to build public trust while simplifying procedures for researchers. Their broad adoption demonstrates a shared commitment to the ethical use of data. However, varying levels of implementation point to the need for continued investment to sustain progress and respond to public and researcher expectations.

引言:研究数据使用的透明度通过培养信任和实现有效的数据共享,使公众和研究人员受益。公众对获取其数据进行研究的支持取决于强大的数据安全性,没有利益冲突,以及明确的公共利益证明,所有这些都必须通过透明的实践来证明。数据访问过程缺乏明确性可能会延迟研究,从而突出了明确和简化审批程序的必要性。为了保持通常所说的“社会经营许可证”,组织必须满足并维护社会期望,而透明度是该责任的一个关键方面。目标:为数据科学界制定和促进采用一套透明度标准,支持在卫生和社会经济研究和规划中可靠和精简地使用数据。方法:通过对参与组织现有数据访问程序的两次审查,进行了多方利益相关者审议。利益相关者包括医疗保健和研究组织、数据保管人、监管机构、行业代表、学术专家和公众。结果:审查和审议确定了错失的让公众参与数据访问程序的机会,以及各组织在数据访问过程和支持材料方面的不一致。为此,我们制定了《透明度标准》,其中包括28项建议行动,分为四个主题:提供明确的数据访问指导;清晰的网站导航设计,以满足公众和研究用户的需求;定期检讨及迭代改进流程;报告数据访问结果和信息安全发现。有针对性的资助呼吁促进了19个组织采用标准,产生了可重复使用的透明材料和可转移的知识,以支持更广泛的实施。结论:透明度标准支持数据保管人加强数据获取过程的开放性和问责制,有助于建立公众信任,同时简化研究人员的程序。它们的广泛采用表明了对数据道德使用的共同承诺。然而,不同程度的实施表明,需要继续投资,以维持进展,并回应公众和研究人员的期望。
{"title":"Improving transparency in data access processes: Developing best practice standards and promoting system-wide change through a competitive funding call.","authors":"Yemi Macaulay, Ester Bellavia, Rachel Brophy, Angela Coulter, Ben Glampson, Bethany Gilbert, Alan Holcroft, Erik Mayer, Edel McNamara, Katherine O'Sullivan, Paola Quattroni, David Seymour, Yvonne Silove, Doreen Tembo, Andrew D Morris, Cassie Smith, Andy Boyd","doi":"10.23889/ijpds.v10i2.2949","DOIUrl":"10.23889/ijpds.v10i2.2949","url":null,"abstract":"<p><strong>Introduction: </strong>Transparency in the use of data for research benefits the public and researchers by fostering trust and enabling efficient data sharing. Public support for access to their data for research depends on robust data security, the absence of conflicting interests, and a clear demonstration of public benefit, all of which must be evident through transparent practices. A lack of clarity in data access processes can delay research, highlighting the need for clear and streamlined approval procedures. To maintain what is often referred to as a 'social license to operate', organisations must meet and uphold societal expectations, with transparency being a key dimension of that responsibility.</p><p><strong>Objective: </strong>To develop and foster adoption of a set of transparency standards for the data science community, supporting trustworthy and streamlined data use for health and socio-economic research and planning.</p><p><strong>Methods: </strong>A multi-stakeholder deliberation was undertaken, informed by two reviews of existing data access procedures across participating organisations. Stakeholders included healthcare and research organisations, data custodians, regulators, industry representatives, academic experts, and members of the public.</p><p><strong>Results: </strong>The review and deliberation identified missed opportunities to inform and involve the public in data access procedures, along with inconsistencies in data access processes and supporting materials across the organisations. In response, we developed the Transparency Standards, comprising 28 recommended actions grouped into four themes: provision of clear data access guidance; clear website navigation designed to meet the needs of public and research users; regular review and iterative improvement of processes; and reporting of data access outcomes and information security findings. A targeted funding call facilitated the adoption of standards in 19 organisations, resulting in reusable transparency materials and transferable knowledge to support wider implementation.</p><p><strong>Conclusion: </strong>The Transparency Standards support data custodians in strengthening openness and accountability in data access processes, helping to build public trust while simplifying procedures for researchers. Their broad adoption demonstrates a shared commitment to the ethical use of data. However, varying levels of implementation point to the need for continued investment to sustain progress and respond to public and researcher expectations.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 2","pages":"2949"},"PeriodicalIF":2.2,"publicationDate":"2025-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12581233/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145446140","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Population Data Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1