International Journal of Population Data Science最新文献_第2页

Global Mind Project data in the United States: A comparison with national statistics. 美国的全球思维项目数据：与国家统计数据的比较。

IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES

International Journal of Population Data Science

Pub Date : 2026-02-16 eCollection Date: 2026-01-01 DOI: 10.23889/ijpds.v11i1.3148

Joseph Taylor, Oleksii Sukhoi, Jennifer Jane Newson, Tara C Thiagarajan

Background: The rapid growth of internet and mobile technologies has opened up new, low-cost methods for large-scale population surveys. The Global Mind Project (GMP) is one such survey that uses quota-based online strategies that dynamically target respondents by age, sex, and location. However, how well this method aligns with national population statistics remains unclear.Objective: To evaluate how well GMP data collected through online recruitment aligns demographically with United States (US) benchmarks from traditional probability-based surveys, including the American Community Survey (ACS), Household Pulse Survey (HPS), and American Trends Panel (ATP).Methods: We analysed 114,721 GMP responses collected in the US between 2020 and 2024. Participants were recruited via Facebook and Google AdSense using broad interest-based keywords and stratified demographic targeting. GMP data were time- and question-matched with ACS, HPS, and ATP data to compare trends in educational attainment, marital status, mental health treatment, and number of close friends.Results: Demographic patterns in GMP data typically aligned with national statistics within a 5-7% margin. Educational attainment by age was similar to ACS data, except among 65+, where GMP consistently showed a 5% and 10% higher rate of High School and Bachelor's completion, respectively. GMP and ACS matched near-perfectly for Divorced and Widowed marital status by age while 'Not married' in the GMP was 6-10% higher compared to 'Never married' individuals in the ACS and, conversely, lower in the Married group. GMP aggregate mental health treatment estimates were within ±1% of HPS values for three of the four years studied, although age-specific differences ranged from 5-8%. Compared to ATP, those reporting two or fewer friends were 15% higher in the GMP. These differences reflect differences in sampling methodology but also imperfect matches of categories and differing non-response bias arising from mode of survey.Conclusions: GMP data demonstrate that with dynamic targeting and quota-based sampling, online recruitment methods can produce data that align well with traditional national surveys. This data, therefore, offers real-time, inclusive and cost-efficient population-level monitoring of mental health and social trends, with potential for use in public health research and policy.Highlights: The Global Mind Project (GMP) uses quota-based dynamic online ad targeting (Q-DOAT) via Meta and Google Ads to recruit large-scale populations.Analysis of 114,721 US responses (2020-2024) showed GMP demographic trends aligned within 5-7% of national statistics from ACS, HPS, and ATP.Slight differences were observed, including 5-10% higher representation of single individuals, those with fewer close friends, and those seeking mental health treatment.GMP data demonstrate

背景：互联网和移动技术的快速发展为大规模人口调查开辟了新的、低成本的方法。全球思维项目（GMP）就是这样一项调查，它使用基于配额的在线策略，根据年龄、性别和地点动态地定位受访者。然而，这种方法与国家人口统计数据的一致性仍不清楚。目的：评估通过在线招聘收集的GMP数据在人口学上与美国（US）传统基于概率的调查(包括美国社区调查（ACS）、家庭脉搏调查（HPS）和美国趋势小组（ATP）的基准相一致的程度。方法：我们分析了2020年至2024年间在美国收集的114,721份GMP回复。参与者通过Facebook和谷歌AdSense招募，使用广泛的基于兴趣的关键词和分层的人口目标。GMP数据在时间和问题上与ACS、HPS和ATP数据相匹配，以比较受教育程度、婚姻状况、心理健康治疗和亲密朋友数量的趋势。结果：GMP数据中的人口统计模式通常与国家统计数据在5-7%的范围内保持一致。按年龄划分的受教育程度与ACS数据相似，除了65岁以上，GMP一致显示高中和学士学位完成率分别高出5%和10%。按年龄划分，GMP组和ACS组的离婚和丧偶婚姻状况几乎完全匹配，而GMP组中“未婚”的比例比ACS组中“从未结婚”的比例高6-10%，相反，已婚组的比例较低。在研究的四年中，有三年的GMP总体心理健康治疗估计值在HPS值的±1%以内，尽管年龄特异性差异在5-8%之间。与ATP相比，那些有两个或更少朋友的人的GMP要高15%。这些差异反映了抽样方法的差异，但也反映了类别的不完美匹配和调查方式引起的不同的非反应偏差。结论：GMP数据表明，通过动态目标和基于配额的抽样，在线招聘方法可以产生与传统国家调查相一致的数据。因此，这些数据提供了对心理健康和社会趋势的实时、包容和具有成本效益的人口监测，有可能用于公共卫生研究和政策。亮点：Global Mind Project （GMP）通过Meta和谷歌Ads使用基于配额的动态在线广告定位（Q-DOAT）来招募大规模人群。对114,721份美国回复（2020-2024）的分析显示，GMP人口统计趋势与ACS、HPS和ATP的国家统计数据的5-7%一致。观察到轻微的差异，包括单身人士、亲密朋友较少的人和寻求心理健康治疗的人的比例高出5-10%。GMP数据表明，在线招聘与后分层相结合，可以产生符合国家人口和社会趋势的数据。该研究支持GMP作为可扩展的、接近实时的人口健康监测平台的效用。

{"title":"Global Mind Project data in the United States: A comparison with national statistics.","authors":"Joseph Taylor, Oleksii Sukhoi, Jennifer Jane Newson, Tara C Thiagarajan","doi":"10.23889/ijpds.v11i1.3148","DOIUrl":"10.23889/ijpds.v11i1.3148","url":null,"abstract":"Background: The rapid growth of internet and mobile technologies has opened up new, low-cost methods for large-scale population surveys. The Global Mind Project (GMP) is one such survey that uses quota-based online strategies that dynamically target respondents by age, sex, and location. However, how well this method aligns with national population statistics remains unclear.Objective: To evaluate how well GMP data collected through online recruitment aligns demographically with United States (US) benchmarks from traditional probability-based surveys, including the American Community Survey (ACS), Household Pulse Survey (HPS), and American Trends Panel (ATP).Methods: We analysed 114,721 GMP responses collected in the US between 2020 and 2024. Participants were recruited via Facebook and Google AdSense using broad interest-based keywords and stratified demographic targeting. GMP data were time- and question-matched with ACS, HPS, and ATP data to compare trends in educational attainment, marital status, mental health treatment, and number of close friends.Results: Demographic patterns in GMP data typically aligned with national statistics within a 5-7% margin. Educational attainment by age was similar to ACS data, except among 65+, where GMP consistently showed a 5% and 10% higher rate of High School and Bachelor's completion, respectively. GMP and ACS matched near-perfectly for Divorced and Widowed marital status by age while 'Not married' in the GMP was 6-10% higher compared to 'Never married' individuals in the ACS and, conversely, lower in the Married group. GMP aggregate mental health treatment estimates were within ±1% of HPS values for three of the four years studied, although age-specific differences ranged from 5-8%. Compared to ATP, those reporting two or fewer friends were 15% higher in the GMP. These differences reflect differences in sampling methodology but also imperfect matches of categories and differing non-response bias arising from mode of survey.Conclusions: GMP data demonstrate that with dynamic targeting and quota-based sampling, online recruitment methods can produce data that align well with traditional national surveys. This data, therefore, offers real-time, inclusive and cost-efficient population-level monitoring of mental health and social trends, with potential for use in public health research and policy.Highlights: The Global Mind Project (GMP) uses quota-based dynamic online ad targeting (Q-DOAT) via Meta and Google Ads to recruit large-scale populations.Analysis of 114,721 US responses (2020-2024) showed GMP demographic trends aligned within 5-7% of national statistics from ACS, HPS, and ATP.Slight differences were observed, including 5-10% higher representation of single individuals, those with fewer close friends, and those seeking mental health treatment.GMP data demonstrate","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"11 1","pages":"3148"},"PeriodicalIF":2.2,"publicationDate":"2026-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC13001805/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147499980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Evaluation of special educational needs and disability provision in English primary schools using administrative health and education data in the ECHILD database. 利用欧洲儿童权利中心数据库中的行政保健和教育数据评估英国小学的特殊教育需要和残疾情况。

IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES

International Journal of Population Data Science

Pub Date : 2026-02-12 eCollection Date: 2026-01-01 DOI: 10.23889/ijpds.v11i2.3331

Ruth Gilbert, Jenny Saxton, Ayana Cant, Kate Lewis, Vincent Nguyen, Hayley Gains, Ania Zylbersztejn, Joachim Tan, Laura Gimeno, Jacob Matthews, Isaac Winterburn, Jugnoo Rahi, Johnny Downs, Stuart Logan, Will Farr, Lorraine Dearden, Katie Harron, Tamsin Ford, Bianca L De Stavola

Introduction: Schools worldwide balance whole-class teaching with additional provision for children with special educational needs or disability (SEND). Robust evidence on equity and effectiveness of SEND provision is essential to address growing demand and rising costs globally.

Objectives: To synthesise findings from the Health Outcomes for young People throughout Education (HOPE) evaluation of variation in SEND provision and its impact on health and education outcomes in English primary schools. We integrated findings from 14 sub-studies using administrative data in the Education and Child Health Insights from Linked Data (ECHILD) database and 10 mixed methods sub-studies.

Methods: Analyses of ECHILD data followed children from birth to age 11 years. We examined how variation in SEND provision was associated with health conditions, and school, social and organisational factors. Using target trial emulation, we estimated the impact of SEND provision on hospital admissions, school absences and attainment. We surveyed and interviewed young people, parents, and professionals and reviewed information about services to understand SEND processes and contexts.

Results: Of 3.8 million children born 2004 to 2013, 30% had SEND provision recorded by age 11. Health conditions were only partially associated with SEND provision, which was also related to male gender, social disadvantage, low attainment and type of school. SEND provision modestly reduced rates of unauthorised absences in subgroups of children but showed no measurable benefit on hospital admissions or school attainment. Mixed methods studies highlighted benefits of early, responsive support, challenges posed by limited capacity, harms caused by delayed or inadequate provision, and need for parent advocacy to access SEND provision.

Discussion: Weak evidence of benefits of SEND provision in causal analyses likely reflects unmeasured confounding, lack of measures of provision received and insensitive outcomes in ECHILD data. SEND policies need robust evidence from analyses across jurisdictions using administrative data, enhanced with better measures, experimental methods and contextual evaluation.

简介：世界各地的学校都在平衡整班教学和为有特殊教育需要或残疾的儿童提供额外的服务（SEND）。对于解决全球不断增长的需求和不断上升的成本问题，提供公平和有效的有力证据至关重要。目的：综合英国小学SEND提供变化及其对健康和教育结果影响的青少年教育健康结果（HOPE）评估结果。我们整合了14个子研究的结果，这些子研究使用了关联数据（ECHILD）数据库中的教育和儿童健康洞察管理数据和10个混合方法子研究。方法：对儿童从出生至11岁的ECHILD资料进行分析。我们研究了SEND提供的变化与健康状况、学校、社会和组织因素之间的关系。通过目标试验模拟，我们估计了SEND提供对住院率、缺勤率和学业成就的影响。我们对年轻人、家长和专业人士进行了调查和访谈，并审查了有关服务的信息，以了解SEND的流程和背景。结果：在2004年至2013年出生的380万名儿童中，30%的儿童在11岁前有SEND规定。健康状况仅部分与SEND提供有关，这还与男性性别、社会劣势、成绩低和学校类型有关。SEND的提供适度地降低了儿童亚组中未经授权的缺勤率，但在住院率或学业成绩方面没有显示出可衡量的好处。混合方法研究强调了早期响应性支持的好处，有限能力带来的挑战，延迟或不充分提供造成的危害，以及家长倡导获得SEND提供的必要性。讨论：因果分析中关于SEND提供益处的证据不足，可能反映了未测量的混杂因素，缺乏对收到的提供的测量和ECHILD数据中不敏感的结果。SEND政策需要来自跨司法管辖区使用行政数据进行分析的有力证据，并通过更好的措施、实验方法和情境评估加以加强。

{"title":"Evaluation of special educational needs and disability provision in English primary schools using administrative health and education data in the ECHILD database.","authors":"Ruth Gilbert, Jenny Saxton, Ayana Cant, Kate Lewis, Vincent Nguyen, Hayley Gains, Ania Zylbersztejn, Joachim Tan, Laura Gimeno, Jacob Matthews, Isaac Winterburn, Jugnoo Rahi, Johnny Downs, Stuart Logan, Will Farr, Lorraine Dearden, Katie Harron, Tamsin Ford, Bianca L De Stavola","doi":"10.23889/ijpds.v11i2.3331","DOIUrl":"https://doi.org/10.23889/ijpds.v11i2.3331","url":null,"abstract":"Introduction: Schools worldwide balance whole-class teaching with additional provision for children with special educational needs or disability (SEND). Robust evidence on equity and effectiveness of SEND provision is essential to address growing demand and rising costs globally.Objectives: To synthesise findings from the Health Outcomes for young People throughout Education (HOPE) evaluation of variation in SEND provision and its impact on health and education outcomes in English primary schools. We integrated findings from 14 sub-studies using administrative data in the Education and Child Health Insights from Linked Data (ECHILD) database and 10 mixed methods sub-studies.Methods: Analyses of ECHILD data followed children from birth to age 11 years. We examined how variation in SEND provision was associated with health conditions, and school, social and organisational factors. Using target trial emulation, we estimated the impact of SEND provision on hospital admissions, school absences and attainment. We surveyed and interviewed young people, parents, and professionals and reviewed information about services to understand SEND processes and contexts.Results: Of 3.8 million children born 2004 to 2013, 30% had SEND provision recorded by age 11. Health conditions were only partially associated with SEND provision, which was also related to male gender, social disadvantage, low attainment and type of school. SEND provision modestly reduced rates of unauthorised absences in subgroups of children but showed no measurable benefit on hospital admissions or school attainment. Mixed methods studies highlighted benefits of early, responsive support, challenges posed by limited capacity, harms caused by delayed or inadequate provision, and need for parent advocacy to access SEND provision.Discussion: Weak evidence of benefits of SEND provision in causal analyses likely reflects unmeasured confounding, lack of measures of provision received and insensitive outcomes in ECHILD data. SEND policies need robust evidence from analyses across jurisdictions using administrative data, enhanced with better measures, experimental methods and contextual evaluation.","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"11 2","pages":"3331"},"PeriodicalIF":2.2,"publicationDate":"2026-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12998597/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147487784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Genetic risk score informed re-evaluation of phenotype quality control to maximise power in epidemiological studies: application to lung function. 遗传风险评分告知表型质量控制的重新评估，以最大限度地发挥流行病学研究的力量：应用于肺功能。

IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES

International Journal of Population Data Science

Pub Date : 2026-02-09 eCollection Date: 2026-01-01 DOI: 10.23889/ijpds.v11i1.2997

Jing Chen, Nick Shrine, Abril G Izquierdo, Anna Guyatt, Alex Williams, Henry Völzke, Stephanie London, Ian P Hall, Frank Dudbridge, Louise V Wain, Martin D Tobin, Catherine John

Introduction: Clinical guidelines may reduce statistical power in epidemiological studies by discarding informative measures. Epidemiological studies of lung function may discard one-third to one-half of participants due to spirometry measures deemed "low quality" using criteria adapted from clinical practice.

Objectives: To optimise the signal-to-noise ratio in epidemiological studies of lung function, we aimed to develop a data-driven method to refine spirometry quality control (QC) criteria.

Methods: We proposed a genetic risk score (GRS) informed strategy to categorise spirometer blows by quality criteria. GRS was built using SNPs associated with lung function traits in non-UK Biobank cohorts. In the UK Biobank, we applied a step-wise testing of the GRS association across groups of spirometry blows stratified by acceptability flags to rank the blow quality. We reassessed QC criteria by comparing the genetic associations under different acceptability flags and repeatability thresholds to determine the trade-off between sample size and measurement error.

Results: We found that including blows previously excluded by strict QC criteria would maximise the statistical power for genome-wide association study and retain acceptable precision in the UK Biobank. This approach allowed the inclusion of 29% more participants compared to the strictest clinical guidelines and demonstrated genetic signals could be identified earlier.

Conclusions: Our GRS-based method offers an important framework to challenge prevailing practices that exclude informative measures and limit power in epidemiological studies.

导言：临床指南可能会通过丢弃信息性措施来降低流行病学研究的统计能力。肺功能的流行病学研究可能会丢弃三分之一到一半的参与者，因为肺活量测定被认为是“低质量”的，使用的标准是根据临床实践改编的。目的：为了优化肺功能流行病学研究中的信噪比，我们旨在开发一种数据驱动的方法来完善肺量测定质量控制（QC）标准。方法：我们提出了一种遗传风险评分（GRS）知情策略，根据质量标准对肺活量计打击进行分类。GRS是使用非英国生物库队列中与肺功能特征相关的snp构建的。在UK Biobank中，我们应用了GRS关联的逐步测试，通过可接受标志分层呼吸量测定打击组，以对打击质量进行排名。我们通过比较不同可接受标志和可重复性阈值下的遗传关联来重新评估QC标准，以确定样本量和测量误差之间的权衡。结果：我们发现，包括以前被严格的质量控制标准排除的打击将最大限度地提高全基因组关联研究的统计能力，并在英国生物银行中保持可接受的精度。与最严格的临床指南相比，这种方法允许多纳入29%的参与者，并证明可以更早地识别遗传信号。结论：我们基于gis的方法提供了一个重要的框架，以挑战流行病学研究中排除信息测量和限制力量的流行做法。

{"title":"Genetic risk score informed re-evaluation of phenotype quality control to maximise power in epidemiological studies: application to lung function.","authors":"Jing Chen, Nick Shrine, Abril G Izquierdo, Anna Guyatt, Alex Williams, Henry Völzke, Stephanie London, Ian P Hall, Frank Dudbridge, Louise V Wain, Martin D Tobin, Catherine John","doi":"10.23889/ijpds.v11i1.2997","DOIUrl":"10.23889/ijpds.v11i1.2997","url":null,"abstract":"Introduction: Clinical guidelines may reduce statistical power in epidemiological studies by discarding informative measures. Epidemiological studies of lung function may discard one-third to one-half of participants due to spirometry measures deemed \"low quality\" using criteria adapted from clinical practice.Objectives: To optimise the signal-to-noise ratio in epidemiological studies of lung function, we aimed to develop a data-driven method to refine spirometry quality control (QC) criteria.Methods: We proposed a genetic risk score (GRS) informed strategy to categorise spirometer blows by quality criteria. GRS was built using SNPs associated with lung function traits in non-UK Biobank cohorts. In the UK Biobank, we applied a step-wise testing of the GRS association across groups of spirometry blows stratified by acceptability flags to rank the blow quality. We reassessed QC criteria by comparing the genetic associations under different acceptability flags and repeatability thresholds to determine the trade-off between sample size and measurement error.Results: We found that including blows previously excluded by strict QC criteria would maximise the statistical power for genome-wide association study and retain acceptable precision in the UK Biobank. This approach allowed the inclusion of 29% more participants compared to the strictest clinical guidelines and demonstrated genetic signals could be identified earlier.Conclusions: Our GRS-based method offers an important framework to challenge prevailing practices that exclude informative measures and limit power in epidemiological studies.","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"11 1","pages":"2997"},"PeriodicalIF":2.2,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12893191/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146182977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Cleaning and validating longitudinal maternal and child postcode histories from a national healthcare registry for environmental health research in London, UK. 清理和验证来自英国伦敦环境卫生研究的国家卫生保健登记处的纵向母婴邮编历史。

IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES

International Journal of Population Data Science

Pub Date : 2026-02-05 eCollection Date: 2026-01-01 DOI: 10.23889/ijpds.v11i1.2990

Pia Hardelid, Glory Atilola, Amal Rammah, Bianca De Stavola, Tom Clemens, Steve Cunningham, Chris Dibben, Samantha Hajna, Alison Macfarlane, Ai Milojevic, Jonathon Taylor, Linda Wijlaars

Aim: To create longitudinal postcode history datasets that allocate mothers to one postcode for each week of pregnancy and children to one postcode for each week of infancy for a study of air pollution and respiratory infections in infants.

Datasets: We used linked birth registrations and NHS birth notifications for all children born in London between 2010 and 2014, which constituted the spine for the Air Pollution, housing and respiratory tract Infections in Children: National Birth Cohort Study (PICNIC) study. The birth data were linked by NHS England to the Personal Demographics Service (PDS) in order to derive maternal and child postcode histories for each week of pregnancy and infancy.

Challenges: While the research team had extensive experience working with administrative data, including birth registrations and notifications, the postcode history data was a new resource and lacked meta-data, papers or reports from previous users. A substantial number of records were missing a move-in date, or both a move-in date and postcode, adding complexities when ascertaining an address history for study participants. Further, we encountered instances of incorrectly recorded postcodes and implausible numbers of postcodes recorded in a week.

Lessons learned: One half of children in this London-based cohort moved during infancy, and one third of their mothers moved during pregnancy. This highlights the importance of taking into account changes in residential address in studies examining the association between environmental exposures and health outcomes. Cleaned and validated longitudinal national address records are crucial for environmental health studies. However, they are also resource intensive, with implications for researchers and research funders.

目的：创建纵向邮编历史数据集，将母亲在怀孕的每一周分配到一个邮编，儿童在婴儿的每一周分配到一个邮编，用于研究婴儿的空气污染和呼吸道感染。数据集：我们对2010年至2014年间在伦敦出生的所有儿童使用了关联的出生登记和NHS出生通知，这构成了儿童空气污染、住房和呼吸道感染：国家出生队列研究（PICNIC）研究的核心。出生数据由英国国家医疗服务体系（NHS England）与个人人口统计服务（PDS）联系起来，以便获得母亲和儿童在怀孕和婴儿期每周的邮政编码历史。挑战：虽然研究团队在处理行政数据（包括出生登记和通知）方面拥有丰富的经验，但邮政编码历史数据是一种新资源，缺乏来自以前用户的元数据、论文或报告。大量的记录缺少入住日期，或者既没有入住日期又没有邮政编码，这增加了确定研究参与者的地址历史的复杂性。此外，我们还遇到了错误记录邮政编码的情况，以及一周内记录的邮政编码数量令人难以置信。经验教训：在伦敦的这个队列中，有一半的孩子在婴儿期搬家，三分之一的母亲在怀孕期间搬家。这突出了在研究环境暴露与健康结果之间的关系时考虑到居住地址变化的重要性。清洁和验证的纵向国家地址记录对环境卫生研究至关重要。然而，它们也是资源密集型的，对研究人员和研究资助者有影响。

{"title":"Cleaning and validating longitudinal maternal and child postcode histories from a national healthcare registry for environmental health research in London, UK.","authors":"Pia Hardelid, Glory Atilola, Amal Rammah, Bianca De Stavola, Tom Clemens, Steve Cunningham, Chris Dibben, Samantha Hajna, Alison Macfarlane, Ai Milojevic, Jonathon Taylor, Linda Wijlaars","doi":"10.23889/ijpds.v11i1.2990","DOIUrl":"10.23889/ijpds.v11i1.2990","url":null,"abstract":"Aim: To create longitudinal postcode history datasets that allocate mothers to one postcode for each week of pregnancy and children to one postcode for each week of infancy for a study of air pollution and respiratory infections in infants.Datasets: We used linked birth registrations and NHS birth notifications for all children born in London between 2010 and 2014, which constituted the spine for the Air Pollution, housing and respiratory tract Infections in Children: National Birth Cohort Study (PICNIC) study. The birth data were linked by NHS England to the Personal Demographics Service (PDS) in order to derive maternal and child postcode histories for each week of pregnancy and infancy.Challenges: While the research team had extensive experience working with administrative data, including birth registrations and notifications, the postcode history data was a new resource and lacked meta-data, papers or reports from previous users. A substantial number of records were missing a move-in date, or both a move-in date and postcode, adding complexities when ascertaining an address history for study participants. Further, we encountered instances of incorrectly recorded postcodes and implausible numbers of postcodes recorded in a week.Lessons learned: One half of children in this London-based cohort moved during infancy, and one third of their mothers moved during pregnancy. This highlights the importance of taking into account changes in residential address in studies examining the association between environmental exposures and health outcomes. Cleaned and validated longitudinal national address records are crucial for environmental health studies. However, they are also resource intensive, with implications for researchers and research funders.","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"11 1","pages":"2990"},"PeriodicalIF":2.2,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12887630/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146167283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Data resource profile: The Life and health After Childhood cancEr (LACE) project. 数据来源简介：儿童癌症后的生活与健康（LACE）项目。

IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES

International Journal of Population Data Science

Pub Date : 2026-02-02 eCollection Date: 2023-01-01 DOI: 10.23889/ijpds.v8i6.2988

Heather J Baldwin, Sarah Pont, Anne Currell, Laura Newey, Danny R Youlden, Natalie Bradford, Peter D Baade, Joanne F Aitken, Jason D Pole, Natasha Nassar

Introduction: In Australia, around 85% of children survive childhood cancer. Yet, up to 80% of survivors experience subsequent adverse health conditions called late effects, largely attributed to cancer treatment. The LACE study is a population-based linked data resource that aims to facilitate the investigation of childhood cancer and its treatment and the impact on late effects for childhood cancer survivors.

Methods: The study links the Australian Childhood Cancer Registry to administrative cross-jurisdictional health and education data to enable ongoing follow-up of outcomes for childhood cancer survivors. The study population includes all Australian children aged less than 15 years, diagnosed with cancer 1983-2021, and comparison groups comprising siblings of childhood cancer patients and a random sample of children from the general population frequency matched by age, sex and residential location to cases.

Results: To date, the case cohort includes 25,226 children diagnosed with cancer, with longest follow-up to the age of 53 years. The most commonly diagnosed childhood cancers were leukaemia and related cancers (n=8182, 32.4%), followed by central nervous system and related cancers (n=5850, 23.2%), and lymphomas and reticuloendothelial neoplasms (n=2568, 10.2%). Overall, 16,314 (64.7%) children underwent chemotherapy, 5555 (22.0%) received radiotherapy and 7300 (28.9%) had surgical treatment for their cancer, with immunotherapy use reported for 641 (2.5%), hormonal therapy for 4549 (18.0%) and ancillary therapies for 2581 (10.2%). A total of 19,321 (76.6%) cases were alive at the end of the study.

Conclusion: This new comprehensive national data linkage resource represents a valuable asset that will facilitate research to identify the risk of late effects and effective follow-up care to inform counselling patients and their families, as well as guidelines, models of care and personalised follow-up care plans. Further, it will enable identification of inequities in healthcare access and outcomes across population sub-groups.

简介：在澳大利亚，大约85%的儿童在儿童癌症中幸存下来。然而，高达80%的幸存者随后经历了称为晚期效应的不良健康状况，主要归因于癌症治疗。LACE研究是一项基于人群的关联数据资源，旨在促进儿童癌症及其治疗的调查，以及对儿童癌症幸存者的后期影响的影响。方法：该研究将澳大利亚儿童癌症登记处与行政跨司法管辖区的健康和教育数据联系起来，以便对儿童癌症幸存者的结果进行持续随访。研究人群包括所有年龄在15岁以下、1983-2021年被诊断患有癌症的澳大利亚儿童，对照组包括儿童癌症患者的兄弟姐妹，以及根据年龄、性别和居住地点与病例匹配的普通人群中随机抽取的儿童样本。结果：迄今为止，病例队列包括25,226名诊断为癌症的儿童，最长随访时间为53岁。最常见的儿童癌症是白血病及相关癌症（n=8182, 32.4%），其次是中枢神经系统及相关癌症（n=5850, 23.2%），淋巴瘤和网状内皮肿瘤（n=2568, 10.2%）。总体而言，16314名（64.7%）儿童接受了化疗，5555名（22.0%）接受了放疗，7300名（28.9%）接受了手术治疗，其中免疫治疗641名（2.5%），激素治疗4549名（18.0%），辅助治疗2581名（10.2%）。研究结束时，共有19321例（76.6%）患者存活。结论：这一新的综合性国家数据链接资源是一项宝贵的资产，它将促进研究，以确定晚期影响的风险和有效的后续护理，为咨询患者及其家属提供信息，以及指导方针、护理模式和个性化后续护理计划。此外，它还将有助于查明各个人口分组在获得医疗保健和取得成果方面的不公平现象。

{"title":"Data resource profile: The Life and health After Childhood cancEr (LACE) project.","authors":"Heather J Baldwin, Sarah Pont, Anne Currell, Laura Newey, Danny R Youlden, Natalie Bradford, Peter D Baade, Joanne F Aitken, Jason D Pole, Natasha Nassar","doi":"10.23889/ijpds.v8i6.2988","DOIUrl":"10.23889/ijpds.v8i6.2988","url":null,"abstract":"Introduction: In Australia, around 85% of children survive childhood cancer. Yet, up to 80% of survivors experience subsequent adverse health conditions called late effects, largely attributed to cancer treatment. The LACE study is a population-based linked data resource that aims to facilitate the investigation of childhood cancer and its treatment and the impact on late effects for childhood cancer survivors.Methods: The study links the Australian Childhood Cancer Registry to administrative cross-jurisdictional health and education data to enable ongoing follow-up of outcomes for childhood cancer survivors. The study population includes all Australian children aged less than 15 years, diagnosed with cancer 1983-2021, and comparison groups comprising siblings of childhood cancer patients and a random sample of children from the general population frequency matched by age, sex and residential location to cases.Results: To date, the case cohort includes 25,226 children diagnosed with cancer, with longest follow-up to the age of 53 years. The most commonly diagnosed childhood cancers were leukaemia and related cancers (n=8182, 32.4%), followed by central nervous system and related cancers (n=5850, 23.2%), and lymphomas and reticuloendothelial neoplasms (n=2568, 10.2%). Overall, 16,314 (64.7%) children underwent chemotherapy, 5555 (22.0%) received radiotherapy and 7300 (28.9%) had surgical treatment for their cancer, with immunotherapy use reported for 641 (2.5%), hormonal therapy for 4549 (18.0%) and ancillary therapies for 2581 (10.2%). A total of 19,321 (76.6%) cases were alive at the end of the study.Conclusion: This new comprehensive national data linkage resource represents a valuable asset that will facilitate research to identify the risk of late effects and effective follow-up care to inform counselling patients and their families, as well as guidelines, models of care and personalised follow-up care plans. Further, it will enable identification of inequities in healthcare access and outcomes across population sub-groups.","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"8 6","pages":"2988"},"PeriodicalIF":2.2,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12887628/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146167278","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Identifying local priorities for research with linked routine data: an online workshop method. 利用关联的常规数据确定当地的研究重点：一种在线研讨会方法。

IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES

International Journal of Population Data Science

Pub Date : 2026-01-29 eCollection Date: 2026-01-01 DOI: 10.23889/ijpds.v11i1.3045

Hollie Henderson, Sally Bridges, Maria Bryant, Kayley Ciesla, Kate Pickett

Introduction: Priority setting with patients, public and professionals is essential for research utilising routinely collected data, as this ensures data are being used in the public interest. However, it is challenging to identify research priorities that are relevant to a wide range of local stakeholders and can be addressed with routinely collected data.

Objectives: To describe and present the results of a priority setting exercise aiming to identify research priorities for Born in Bradford for All (BiB4All), a routine data linkage cohort of mothers and babies born in Bradford, a city in the north of England.

Methods: We developed a two-hour online workshop to engage a range of stakeholders across Bradford, including parents, early years practitioners, commissioners, and service providers. The workshop method combined elements of existing priority setting approaches to ensure priorities were identified in an inclusive, timely and deliberative way, and supported stakeholders to develop their understanding of using linked routine data for research.

Results: The workshop identified seventeen important and urgent research priorities around child and maternal health for research with locally linked routine data. Key topic areas included maternal and infant mental health, the long-term impact of the Covid-19 pandemic on maternal and child health outcomes, inequalities in access to services, and infant feeding experiences.

Conclusions: The identified research priorities have been shared widely amongst interested networks and have shaped the BiB4All research agenda, demonstrating the feasibility of the stakeholder engagement method. They also have important implications for policy and practice. For policy, they provide an understanding of the key issues faced by local communities, which can steer policy priorities and investment in evidence generation. For practice, involvement in the workshop has generated a greater understanding of how local service data can be used for research and to inform improvements to service delivery.

为患者、公众和专业人员设置优先级对于利用常规收集的数据进行研究至关重要，因为这确保了数据被用于公共利益。然而，确定与广泛的地方利益相关者相关的研究重点，并可以通过常规收集的数据来解决，这是一项挑战。目的：描述和呈现优先级设置练习的结果，旨在为出生在布拉德福德的所有人（BiB4All）确定研究优先级，这是一个常规的数据链接队列，出生在英格兰北部城市布拉德福德的母亲和婴儿。方法：我们开发了一个两小时的在线研讨会，让布拉德福德的一系列利益相关者参与进来，包括家长、早期从业人员、专员和服务提供商。研讨会方法结合了现有优先事项确定方法的要素，以确保以包容、及时和审慎的方式确定优先事项，并支持利益攸关方发展他们对使用相关常规数据进行研究的理解。结果：讲习班确定了围绕儿童和孕产妇保健的17项重要和紧迫的研究优先事项，以便利用与当地相关的常规数据进行研究。主要议题领域包括孕产妇和婴儿心理健康、Covid-19大流行对孕产妇和儿童健康结果的长期影响、获得服务的不平等以及婴儿喂养经验。结论：确定的研究重点已在感兴趣的网络中广泛共享，并形成了BiB4All研究议程，证明了利益相关者参与方法的可行性。它们对政策和实践也有重要影响。就政策而言，它们提供了对当地社区面临的关键问题的理解，这可以指导政策重点和证据生成方面的投资。在实践中，参与讲习班使人们对如何将当地服务数据用于研究和为改进服务提供提供信息有了更深入的了解。

{"title":"Identifying local priorities for research with linked routine data: an online workshop method.","authors":"Hollie Henderson, Sally Bridges, Maria Bryant, Kayley Ciesla, Kate Pickett","doi":"10.23889/ijpds.v11i1.3045","DOIUrl":"10.23889/ijpds.v11i1.3045","url":null,"abstract":"Introduction: Priority setting with patients, public and professionals is essential for research utilising routinely collected data, as this ensures data are being used in the public interest. However, it is challenging to identify research priorities that are relevant to a wide range of local stakeholders and can be addressed with routinely collected data.Objectives: To describe and present the results of a priority setting exercise aiming to identify research priorities for Born in Bradford for All (BiB4All), a routine data linkage cohort of mothers and babies born in Bradford, a city in the north of England.Methods: We developed a two-hour online workshop to engage a range of stakeholders across Bradford, including parents, early years practitioners, commissioners, and service providers. The workshop method combined elements of existing priority setting approaches to ensure priorities were identified in an inclusive, timely and deliberative way, and supported stakeholders to develop their understanding of using linked routine data for research.Results: The workshop identified seventeen important and urgent research priorities around child and maternal health for research with locally linked routine data. Key topic areas included maternal and infant mental health, the long-term impact of the Covid-19 pandemic on maternal and child health outcomes, inequalities in access to services, and infant feeding experiences.Conclusions: The identified research priorities have been shared widely amongst interested networks and have shaped the BiB4All research agenda, demonstrating the feasibility of the stakeholder engagement method. They also have important implications for policy and practice. For policy, they provide an understanding of the key issues faced by local communities, which can steer policy priorities and investment in evidence generation. For practice, involvement in the workshop has generated a greater understanding of how local service data can be used for research and to inform improvements to service delivery.","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"11 1","pages":"3045"},"PeriodicalIF":2.2,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12880541/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146143891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Leveraging census data to design and implement an area-based deprivation index to assess health inequalities in Ecuador. 利用人口普查数据设计和实施基于地区的剥夺指数，以评估厄瓜多尔的健康不平等。

IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES

International Journal of Population Data Science

Pub Date : 2026-01-26 eCollection Date: 2025-01-01 DOI: 10.23889/ijpds.v10i3.2970

Diego Andrade Ortiz, Ruth Dundas, Jonathan R Olsen, Irina Chis Ster

Introduction: Deprivation measures have been used in research to assess within-country health inequalities globally. Most of these indices are created using data from national census, given their availability and nationwide coverage.

Objectives: This study aims to create a census-based deprivation index in Ecuador, the Ecuadorian Deprivation Index (EDI), that reflects the country specific context using national census data for four geographical units (census sector, parish, canton and province). It will be compared to two traditional small area indices (Townsend and Carstairs) to assess the most appropriate and context specific index for Ecuador. Finally, the performance of the three indices will be assessed by examining the association and extent of inequalities with teenage pregnancy as this has been shown to be socially patterned in other countries.

Methods: This study uses the 2010 Ecuadorian census and follows the stages and recommendations for developing small-area deprivation indices. The Townsend and Carstairs are firstly replicated. For the EDI, Principal Component Analysis is used to select the most appropriate indicators. Summary measures for higher-level geographical areas were developed following the techniques used in the English Index of Multiple Deprivation. Inequalities in teenage pregnancy is measured using the Slope index of inequality and the Relative index of inequality.

Results: The three indices exhibit a good match in urban areas and can describe pattern of inequalities in teenage pregnancy. However, the EDI Index captures rural deprivation more appropriately and that includes the Coast and Amazon geographical regions.

Conclusions: Traditional deprivation measures may not adequately identify deprivation in Ecuador, given the country's unique specific contextual factors. The wider scope of the EDI will inform policy-makers towards developing tailored programs to alleviate deprivation and health inequalities in Ecuador.

导言：剥夺措施已被用于评估全球国家内部卫生不平等的研究。考虑到这些指数的可用性和覆盖范围，大多数指数都是根据国家人口普查的数据创建的。目的：本研究旨在创建厄瓜多尔基于人口普查的剥夺指数，即厄瓜多尔剥夺指数（EDI），该指数利用四个地理单位（人口普查部门、教区、州和省）的全国人口普查数据反映该国的具体情况。它将与两个传统的小区域指数（Townsend和Carstairs）进行比较，以评估最适合厄瓜多尔的具体情况的指数。最后，将通过检查不平等与少女怀孕的关系和程度来评估这三个指数的表现，因为这已被证明是其他国家的社会模式。方法：采用2010年厄瓜多尔人口普查数据，对小区域剥夺指数的发展阶段和建议进行研究。Townsend和Carstairs首先被复制。对于EDI，主成分分析用于选择最合适的指标。根据英语多重剥夺指数中使用的技术，制定了高级地理区域的总结措施。少女怀孕的不平等是用不平等的斜率指数和不平等的相对指数来衡量的。结果：3个指标在城市地区具有较好的匹配性，能较好地描述青少年怀孕的不平等格局。然而，EDI指数更恰当地反映了农村的贫困情况，其中包括沿海和亚马逊地区。结论：考虑到厄瓜多尔独特的具体背景因素，传统的剥夺措施可能无法充分识别厄瓜多尔的剥夺情况。更广泛的EDI范围将为决策者提供信息，以便制定有针对性的方案，以减轻厄瓜多尔的贫困和卫生不平等。

{"title":"Leveraging census data to design and implement an area-based deprivation index to assess health inequalities in Ecuador.","authors":"Diego Andrade Ortiz, Ruth Dundas, Jonathan R Olsen, Irina Chis Ster","doi":"10.23889/ijpds.v10i3.2970","DOIUrl":"10.23889/ijpds.v10i3.2970","url":null,"abstract":"Introduction: Deprivation measures have been used in research to assess within-country health inequalities globally. Most of these indices are created using data from national census, given their availability and nationwide coverage.Objectives: This study aims to create a census-based deprivation index in Ecuador, the Ecuadorian Deprivation Index (EDI), that reflects the country specific context using national census data for four geographical units (census sector, parish, canton and province). It will be compared to two traditional small area indices (Townsend and Carstairs) to assess the most appropriate and context specific index for Ecuador. Finally, the performance of the three indices will be assessed by examining the association and extent of inequalities with teenage pregnancy as this has been shown to be socially patterned in other countries.Methods: This study uses the 2010 Ecuadorian census and follows the stages and recommendations for developing small-area deprivation indices. The Townsend and Carstairs are firstly replicated. For the EDI, Principal Component Analysis is used to select the most appropriate indicators. Summary measures for higher-level geographical areas were developed following the techniques used in the English Index of Multiple Deprivation. Inequalities in teenage pregnancy is measured using the Slope index of inequality and the Relative index of inequality.Results: The three indices exhibit a good match in urban areas and can describe pattern of inequalities in teenage pregnancy. However, the EDI Index captures rural deprivation more appropriately and that includes the Coast and Amazon geographical regions.Conclusions: Traditional deprivation measures may not adequately identify deprivation in Ecuador, given the country's unique specific contextual factors. The wider scope of the EDI will inform policy-makers towards developing tailored programs to alleviate deprivation and health inequalities in Ecuador.","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"10 3","pages":"2970"},"PeriodicalIF":2.2,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12887634/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146167321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Case study: Developing resources to facilitate public conversations about the use of linked address-based data for research. 案例研究：开发资源，促进关于使用基于链接地址的数据进行研究的公开对话。

IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES

International Journal of Population Data Science

Pub Date : 2026-01-22 eCollection Date: 2026-01-01 DOI: 10.23889/ijpds.v11i1.2998

Nicola Firman, Carol Dezateux, Claire Newman, Harriet Baird, Rich Fry, Gill Harper, Lucy Griffit, Chris Dibben, Alison Robert, Alison Thomson

Introduction: Unique Property Reference Numbers (UPRNs) provide every addressable location in the United Kingdom (UK) with an identifier up to 12-digits in length, which are persistently unique, and are a mandated standard across the public sector in the UK. This standardisation means they are suited to be pseudonymised for data linkage for research, innovation and public benefit. While there have been many consultations exploring public trust in, and attitudes to, using patient data for research, none have explicitly considered their use for address-based linkage using UPRNs.

Objectives: Our overarching aim is to build public trust in the uses of address-based data at household level. We set out to develop and test materials to facilitate conversations about the use of address-based data linkage at the household-level. In this case study, we describe the development of information materials and an initial dialogue to inform future public deliberation.

Methods: In collaboration with designers and researchers, we generated a prototype website and shared this with experienced public advisory groups. Feedback from these groups informed development of a suite of resources, including slides and a facilitator's script to guide workshop discussions. These were supplemented by interactive, tactile tools designed to promote understanding of key concepts, and to encourage participants to ask questions relevant to their interests and concerns. We hosted two workshops with residents in a multi-ethnic, disadvantaged inner city locality to test and refine these materials.

Results: Dialogue with residents emphasised the importance of accessibility, including clear descriptions of technical jargon, and the effectiveness of using less text-heavy materials and more interactive formats, particularly for participants for whom English is not their first language. Visual representations of people included in workshop materials need to reflect diversity in age, gender, ethnicity, and mobility to ensure resources are relatable. Adapting the approach to delivering information - whether through digital or physical formats - proved crucial in engaging with participants and meeting their diverse needs.

Conclusions: We have created and tested with different public groups a toolkit to support conversations with academic and public audiences about research using address-linked patient data. The toolkit has been disseminated and made freely available for use by the research community.

简介：唯一属性参考号码（uprn）为英国（UK）的每个可寻址位置提供长达12位的标识符，该标识符保持唯一，并且是英国公共部门的强制标准。这种标准化意味着它们适合假名化，用于研究、创新和公共利益的数据链接。虽然已经有许多咨询探讨公众对使用患者数据进行研究的信任和态度，但没有人明确考虑使用upn进行基于地址的链接。目标：我们的首要目标是建立公众对在家庭一级使用基于地址的数据的信任。我们开始开发和测试材料，以促进在家庭一级使用基于地址的数据链接的对话。在这个案例研究中，我们描述了信息材料的发展和初步对话，为未来的公众审议提供信息。方法：与设计师和研究人员合作，我们制作了一个原型网站，并与经验丰富的公共咨询团体分享。来自这些小组的反馈通知了一套资源的开发，包括幻灯片和引导者的脚本，以指导研讨会讨论。此外，这些工具还配有互动式触觉工具，旨在促进对关键概念的理解，并鼓励参与者提出与他们的兴趣和关注相关的问题。我们举办了两次研讨会，在一个多民族的，弱势的内城地区的居民来测试和完善这些材料。结果：与居民的对话强调了可访问性的重要性，包括对技术术语的清晰描述，以及使用较少文本的材料和更多互动形式的有效性，特别是对于英语不是第一语言的参与者。讲习班材料中包含的人物的视觉表现需要反映年龄、性别、种族和流动性的多样性，以确保资源的相关性。事实证明，调整传递信息的方法——无论是通过数字形式还是实体形式——对于吸引参与者并满足他们的不同需求至关重要。结论：我们创建了一个工具包，并在不同的公共团体中进行了测试，以支持与学术界和公众就使用地址链接的患者数据进行研究的对话。该工具包已被分发并免费提供给研究界使用。

{"title":"Case study: Developing resources to facilitate public conversations about the use of linked address-based data for research.","authors":"Nicola Firman, Carol Dezateux, Claire Newman, Harriet Baird, Rich Fry, Gill Harper, Lucy Griffit, Chris Dibben, Alison Robert, Alison Thomson","doi":"10.23889/ijpds.v11i1.2998","DOIUrl":"10.23889/ijpds.v11i1.2998","url":null,"abstract":"Introduction: Unique Property Reference Numbers (UPRNs) provide every addressable location in the United Kingdom (UK) with an identifier up to 12-digits in length, which are persistently unique, and are a mandated standard across the public sector in the UK. This standardisation means they are suited to be pseudonymised for data linkage for research, innovation and public benefit. While there have been many consultations exploring public trust in, and attitudes to, using patient data for research, none have explicitly considered their use for address-based linkage using UPRNs.Objectives: Our overarching aim is to build public trust in the uses of address-based data at household level. We set out to develop and test materials to facilitate conversations about the use of address-based data linkage at the household-level. In this case study, we describe the development of information materials and an initial dialogue to inform future public deliberation.Methods: In collaboration with designers and researchers, we generated a prototype website and shared this with experienced public advisory groups. Feedback from these groups informed development of a suite of resources, including slides and a facilitator's script to guide workshop discussions. These were supplemented by interactive, tactile tools designed to promote understanding of key concepts, and to encourage participants to ask questions relevant to their interests and concerns. We hosted two workshops with residents in a multi-ethnic, disadvantaged inner city locality to test and refine these materials.Results: Dialogue with residents emphasised the importance of accessibility, including clear descriptions of technical jargon, and the effectiveness of using less text-heavy materials and more interactive formats, particularly for participants for whom English is not their first language. Visual representations of people included in workshop materials need to reflect diversity in age, gender, ethnicity, and mobility to ensure resources are relatable. Adapting the approach to delivering information - whether through digital or physical formats - proved crucial in engaging with participants and meeting their diverse needs.Conclusions: We have created and tested with different public groups a toolkit to support conversations with academic and public audiences about research using address-linked patient data. The toolkit has been disseminated and made freely available for use by the research community.","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"11 1","pages":"2998"},"PeriodicalIF":2.2,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12884232/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146158559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Linking primary care data from clinical practice research datalink to secondary care and other health-related patient data: update and implications. 将来自临床实践研究数据链的初级保健数据与二级保健和其他与健康相关的患者数据联系起来：更新和影响。

IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES

International Journal of Population Data Science

Pub Date : 2026-01-19 eCollection Date: 2026-01-01 DOI: 10.23889/ijpds.v11i1.3069

Justin Edward Chan, Rhys Barnett, Susan Hodgson, Prinal Chohan, Giulia Mantovani, Jennifer Campbell

Introduction: Being able to accurately link primary and secondary healthcare records is invaluable for public health research. The Clinical Practice Research Datalink (CPRD) collects and curates primary care electronic health records from UK GP practices. These data are linked to secondary health data by National Health Service (NHS) England. As of 2020, NHS England introduced the Master Person Service (MPS) method to link data at the person-level. The method was first applied to CPRD data in the November 2024 linked data release.

Objectives: This paper provides an overview of the MPS linkage method and its impact on linked CPRD data.

Methods: The MPS linkage method searches each set of personal identifiers against records within the Personal Demographics Service and the MPS record bucket. Successful matches are assigned a patient identifier 'Person_ID', which is used to link records between datasets. The number of successfully linked CPRD patients was compared between the MPS and the previous linkage method. The impact of the change in linkage eligibility definition was also examined.

Results: There are 7.9 million (CPRD GOLD) and 34.2 million (CPRD Aurum) patient records in the December 2024 primary care builds that are of research quality and were successfully linked to a Person_ID. Compared to the previous linkage method, the proportion of patient records who were defined as eligible to be linked to Hospital Episode Statistics Admitted Patient Care (HES APC) and had data in HES APC increased from 75.7% to 81.0% in CPRD GOLD and from 72.1% to 79.0% in CPRD Aurum.

Conclusion: The new linkage eligibility definition is superior to the previous definition, resulting in greater ability to define appropriate denominator populations and to differentiate why some patients do not have linked data. The MPS linkage method offers the potential for CPRD to investigate individuals with duplicate records and practice mergers.

引言：能够准确地将初级和二级医疗记录联系起来，对于公共卫生研究是非常宝贵的。临床实践研究数据链（CPRD）收集和策划初级保健电子健康记录从英国全科医生的做法。这些数据与英国国民保健服务（NHS）的二级保健数据相关联。截至2020年，英国国家医疗服务体系引入了主个人服务（MPS）方法来链接个人层面的数据。该方法在2024年11月的关联数据发布中首次应用于CPRD数据。目的：本文概述了MPS链接方法及其对链接CPRD数据的影响。方法：MPS链接方法根据个人人口统计服务和MPS记录桶中的记录搜索每组个人标识符。成功匹配的将被分配一个患者标识符“Person_ID”，用于在数据集之间链接记录。将MPS与之前的连接方法成功连接的CPRD患者数量进行比较。对连锁资格定义变化的影响也进行了研究。结果：在2024年12月的初级保健构建中，有790万（CPRD GOLD）和3420万（CPRD Aurum）患者记录具有研究质量，并成功链接到Person_ID。与之前的关联方法相比，在CPRD GOLD中，定义为符合医院事件统计住院患者护理（HES APC）并具有HES APC数据的患者记录的比例从75.7%增加到81.0%，在CPRD Aurum中从72.1%增加到79.0%。结论：新的连锁资格定义优于以前的定义，从而更有能力定义适当的分母人群，并区分为什么有些患者没有关联数据。MPS链接方法为CPRD提供了调查具有重复记录和执业合并的个人的潜力。

{"title":"Linking primary care data from clinical practice research datalink to secondary care and other health-related patient data: update and implications.","authors":"Justin Edward Chan, Rhys Barnett, Susan Hodgson, Prinal Chohan, Giulia Mantovani, Jennifer Campbell","doi":"10.23889/ijpds.v11i1.3069","DOIUrl":"10.23889/ijpds.v11i1.3069","url":null,"abstract":"Introduction: Being able to accurately link primary and secondary healthcare records is invaluable for public health research. The Clinical Practice Research Datalink (CPRD) collects and curates primary care electronic health records from UK GP practices. These data are linked to secondary health data by National Health Service (NHS) England. As of 2020, NHS England introduced the Master Person Service (MPS) method to link data at the person-level. The method was first applied to CPRD data in the November 2024 linked data release.Objectives: This paper provides an overview of the MPS linkage method and its impact on linked CPRD data.Methods: The MPS linkage method searches each set of personal identifiers against records within the Personal Demographics Service and the MPS record bucket. Successful matches are assigned a patient identifier 'Person_ID', which is used to link records between datasets. The number of successfully linked CPRD patients was compared between the MPS and the previous linkage method. The impact of the change in linkage eligibility definition was also examined.Results: There are 7.9 million (CPRD GOLD) and 34.2 million (CPRD Aurum) patient records in the December 2024 primary care builds that are of research quality and were successfully linked to a Person_ID. Compared to the previous linkage method, the proportion of patient records who were defined as eligible to be linked to Hospital Episode Statistics Admitted Patient Care (HES APC) and had data in HES APC increased from 75.7% to 81.0% in CPRD GOLD and from 72.1% to 79.0% in CPRD Aurum.Conclusion: The new linkage eligibility definition is superior to the previous definition, resulting in greater ability to define appropriate denominator populations and to differentiate why some patients do not have linked data. The MPS linkage method offers the potential for CPRD to investigate individuals with duplicate records and practice mergers.","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"11 1","pages":"3069"},"PeriodicalIF":2.2,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12884226/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146158646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Data resource profile: the Scottish Combined Medicines Dataset (SCoMeD). 数据资源简介：苏格兰联合药物数据集（SCoMeD）。

IF 2.2 Q3 HEALTH CARE SCIENCES & SERVICES

International Journal of Population Data Science

Pub Date : 2026-01-15 eCollection Date: 2023-01-01 DOI: 10.23889/ijpds.v8i6.3006

Tanja Mueller, Lynne Jarvis, Victoria Stark, Morven Millar, Amy Hynd, Elaine Pauline, Amanj Kurdi, Laura Stobo, Stuart McTaggart, Marion Bennie

Introduction: Prescribing data has been collected electronically in Scotland for many years; however, data are collated in individual, non-overlapping datasets based on the origin of the prescription (e.g., primary or secondary care). The vision was to create a unified view of all prescribing data to provide a longitudinal dataset of medicines use for patients treated by the National Health Services (NHS) Scotland, irrespective of where or how that care was provided.

Methods: The Scottish Combined Medicines Dataset (SCoMeD) is, in essence, a data virtualisation tool collating information from three previously available prescribing datasets: the Prescribing Information System (PIS); the Hospital Electronic Prescribing and Medicines Administration (HEPMA) national dataset; and the Homecare Medicines (HCM) dataset. This allows the creation of study cohorts (patient groups of interest) that meet specified criteria across all prescribing settings and facilitates the retrieval of the prescribing history for individuals pre-identified from other datasets. Records contain a unique patient identifier (Community Health Index number) which is used to identify patients for inclusion in the dataset and also enables linkage to other routinely collected data, including hospital admission episodes and death records.

Results: SCoMeD contains details on the patient (age, sex, geographical information) and on the medication prescribed. Medication-related information includes what was received and when; strength and dose information are also available. The earliest date of data availability depends on the source (PIS, 01/2010; HEPMA, 07/2022; HCM, 01/2019). Data is held by Public Health Scotland.

Conclusion: SCoMeD facilitates a range of different studies, including cross-sectional/point-prevalence studies and drug utilisation studies as well as longitudinal studies, e.g., cohort and case-control studies. With the possibility to link to other relevant datasets, additional areas of interest may include health policy evaluations and health economics studies. Access to data is subject to approval; researchers need to contact the electronic Data Research and Innovation Service in the first instance.

简介：处方数据在苏格兰已经电子化收集多年；然而，数据是根据处方来源（例如，初级或二级保健）在单独的、不重叠的数据集中进行整理的。其愿景是创建所有处方数据的统一视图，为苏格兰国民保健服务（NHS）治疗的患者提供药物使用的纵向数据集，而不管在哪里或如何提供这种护理。方法：苏格兰联合药物数据集（scoed）本质上是一个数据虚拟化工具，从三个以前可用的处方数据集整理信息：处方信息系统（PIS）；医院电子处方和药品管理局（HEPMA）国家数据集；以及家庭护理药物（HCM）数据集。这允许在所有处方设置中创建符合指定标准的研究队列（感兴趣的患者组），并促进从其他数据集中预先识别的个体的处方历史的检索。记录包含一个唯一的患者标识符（社区卫生指数号），用于识别要纳入数据集的患者，还可以与其他常规收集的数据（包括住院事件和死亡记录）建立联系。结果：SCoMeD包含患者的详细信息（年龄、性别、地理信息）和处方药物。与药物相关的信息包括收到的药物和时间；强度和剂量信息也可用。可获得数据的最早日期取决于来源（PIS， 2010年1月；HEPMA， 2022年7月；HCM， 2019年1月）。数据由苏格兰公共卫生部保管。结论：scoed促进了一系列不同的研究，包括横断面/点流行研究和药物利用研究以及纵向研究，例如队列和病例对照研究。由于有可能与其他相关数据集联系，其他感兴趣的领域可能包括卫生政策评价和卫生经济学研究。查阅资料须经批准；研究人员需要首先联系电子数据研究与创新服务中心。

{"title":"Data resource profile: the Scottish Combined Medicines Dataset (SCoMeD).","authors":"Tanja Mueller, Lynne Jarvis, Victoria Stark, Morven Millar, Amy Hynd, Elaine Pauline, Amanj Kurdi, Laura Stobo, Stuart McTaggart, Marion Bennie","doi":"10.23889/ijpds.v8i6.3006","DOIUrl":"10.23889/ijpds.v8i6.3006","url":null,"abstract":"Introduction: Prescribing data has been collected electronically in Scotland for many years; however, data are collated in individual, non-overlapping datasets based on the origin of the prescription (e.g., primary or secondary care). The vision was to create a unified view of all prescribing data to provide a longitudinal dataset of medicines use for patients treated by the National Health Services (NHS) Scotland, irrespective of where or how that care was provided.Methods: The Scottish Combined Medicines Dataset (SCoMeD) is, in essence, a data virtualisation tool collating information from three previously available prescribing datasets: the Prescribing Information System (PIS); the Hospital Electronic Prescribing and Medicines Administration (HEPMA) national dataset; and the Homecare Medicines (HCM) dataset. This allows the creation of study cohorts (patient groups of interest) that meet specified criteria across all prescribing settings and facilitates the retrieval of the prescribing history for individuals pre-identified from other datasets. Records contain a unique patient identifier (Community Health Index number) which is used to identify patients for inclusion in the dataset and also enables linkage to other routinely collected data, including hospital admission episodes and death records.Results: SCoMeD contains details on the patient (age, sex, geographical information) and on the medication prescribed. Medication-related information includes what was received and when; strength and dose information are also available. The earliest date of data availability depends on the source (PIS, 01/2010; HEPMA, 07/2022; HCM, 01/2019). Data is held by Public Health Scotland.Conclusion: SCoMeD facilitates a range of different studies, including cross-sectional/point-prevalence studies and drug utilisation studies as well as longitudinal studies, e.g., cohort and case-control studies. With the possibility to link to other relevant datasets, additional areas of interest may include health policy evaluations and health economics studies. Access to data is subject to approval; researchers need to contact the electronic Data Research and Innovation Service in the first instance.","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"8 6","pages":"3006"},"PeriodicalIF":2.2,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12809194/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145999245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0