Pub Date : 2023-02-02eCollection Date: 2023-01-01DOI: 10.23889/ijpds.v8i1.1843
Jeanne Sinclair, Scott Davies, Magdalena Janus
Introduction: Longitudinal data that tracks student achievement over many years are crucial for understanding children's learning and for guiding effective policies and interventions. Despite being Canada's most populous province, Ontario lacks such large-scale and longitudinal data on student learning. Linking datasets across cohorts requires rigorous linkage protocols, flexible handling of complex cohort structures, methods to validate linked datasets, and viable organizational partnerships. We linked administrative data on early child development and educational achievement and merged two datasets on characteristics of students' neighborhoods and schools. We developed a linkage protocol and validated how the resulting database could be generalized to Ontario's student population.
Methods and analysis: Two main individual-level data sources were linked: 1) the Early Development Instrument (EDI), a school readiness assessment of all Ontario public school kindergartners that is administered in three-year cycles, and 2) Ontario's Educational Quality and Assessment Office's (EQAO) math and reading assessments in grades 3, 6, 9, and 10. To compensate for their lack of a common personal identification number, a deterministic linkage process was developed using several administrative variables. A school-level and a neighborhood-level dataset were also later linked. We examined differences between unlinked and linked cases across several variables.
Results and implications: We successfully linked 50% of the EDI's 374,239 cases, 86,778 of which contained all five datapoints, creating a database tracking achievement for multiple cohorts from kindergarten through grade 10, with covariates for their development, demographics, affect, neighborhoods, and schools. Analyses revealed only negligible differences between linked and unlinked cases across several demographic measures, while small differences were detected across a neighborhood socioeconomic index and some measures of child development. In conclusion, we recommend the filling of key voids in sustainable research capacity by creating representative data through linkage protocols and data verification.
{"title":"Student achievement trajectories in Ontario: Creating and validating a province-wide, multi-cohort and longitudinal database.","authors":"Jeanne Sinclair, Scott Davies, Magdalena Janus","doi":"10.23889/ijpds.v8i1.1843","DOIUrl":"10.23889/ijpds.v8i1.1843","url":null,"abstract":"<p><strong>Introduction: </strong>Longitudinal data that tracks student achievement over many years are crucial for understanding children's learning and for guiding effective policies and interventions. Despite being Canada's most populous province, Ontario lacks such large-scale and longitudinal data on student learning. Linking datasets across cohorts requires rigorous linkage protocols, flexible handling of complex cohort structures, methods to validate linked datasets, and viable organizational partnerships. We linked administrative data on early child development and educational achievement and merged two datasets on characteristics of students' neighborhoods and schools. We developed a linkage protocol and validated how the resulting database could be generalized to Ontario's student population.</p><p><strong>Methods and analysis: </strong>Two main individual-level data sources were linked: 1) the Early Development Instrument (EDI), a school readiness assessment of all Ontario public school kindergartners that is administered in three-year cycles, and 2) Ontario's Educational Quality and Assessment Office's (EQAO) math and reading assessments in grades 3, 6, 9, and 10. To compensate for their lack of a common personal identification number, a deterministic linkage process was developed using several administrative variables. A school-level and a neighborhood-level dataset were also later linked. We examined differences between unlinked and linked cases across several variables.</p><p><strong>Results and implications: </strong>We successfully linked 50% of the EDI's 374,239 cases, 86,778 of which contained all five datapoints, creating a database tracking achievement for multiple cohorts from kindergarten through grade 10, with covariates for their development, demographics, affect, neighborhoods, and schools. Analyses revealed only negligible differences between linked and unlinked cases across several demographic measures, while small differences were detected across a neighborhood socioeconomic index and some measures of child development. In conclusion, we recommend the filling of key voids in sustainable research capacity by creating representative data through linkage protocols and data verification.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"8 1","pages":"1843"},"PeriodicalIF":1.6,"publicationDate":"2023-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/9b/ba/ijpds-08-1843.PMC10450363.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10111635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-01-11eCollection Date: 2022-01-01DOI: 10.23889/ijpds.v7i4.1764
J Trent Alexander, Katie R Genadek
This article describes the linkage methods that will be used in the Decennial Census Digitization and Linkage project (DCDL), which is completing the final four decades of a longitudinal census infrastructure covering the past 170 years of United States history. DCDL is digitizing and creating linkages between nearly a billion records across the 1960 through 1990 U.S. censuses, as well as to already-linked records from the censuses of 1940, 2000, 2010, and 2020. Our main goals in this article are to (1) describe the development of the DCDL and the protocol we will follow to build the linkages between the census files, (2) outline the techniques we will use to evaluate the quality of the links, and (3) show how the assignment and evaluation of these linkages leverages the joint use of routinely collected administrative data and non-routine survey data.
{"title":"Using administrative records to support the linkage of census data: protocol for building a longitudinal infrastructure of U.S. census records.","authors":"J Trent Alexander, Katie R Genadek","doi":"10.23889/ijpds.v7i4.1764","DOIUrl":"10.23889/ijpds.v7i4.1764","url":null,"abstract":"<p><p>This article describes the linkage methods that will be used in the Decennial Census Digitization and Linkage project (DCDL), which is completing the final four decades of a longitudinal census infrastructure covering the past 170 years of United States history. DCDL is digitizing and creating linkages between nearly a billion records across the 1960 through 1990 U.S. censuses, as well as to already-linked records from the censuses of 1940, 2000, 2010, and 2020. Our main goals in this article are to (1) describe the development of the DCDL and the protocol we will follow to build the linkages between the census files, (2) outline the techniques we will use to evaluate the quality of the links, and (3) show how the assignment and evaluation of these linkages leverages the joint use of routinely collected administrative data and non-routine survey data.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"7 4","pages":"1764"},"PeriodicalIF":1.6,"publicationDate":"2023-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/9c/d6/ijpds-07-1764.PMC9869857.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9200879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-01-01DOI: 10.23889/ijpds.v8i1.2115
Peter Christen, Rainer Schnell
Databases covering all individuals of a population are increasingly used for research and decision-making. The massive size of such databases is often mistaken as a guarantee for valid inferences. However, population data have characteristics that make them challenging to use. Various assumptions on population coverage and data quality are commonly made, including how such data were captured and what types of processing have been applied to them. Furthermore, the full potential of population data can often only be unlocked when such data are linked to other databases. Record linkage often implies subtle technical problems, which are easily missed. We discuss a diverse range of myths and misconceptions relevant for anybody capturing, processing, linking, or analysing population data. Remarkably, many of these myths and misconceptions are due to the social nature of data collections and are therefore missed by purely technical accounts of data processing. Many are also not well documented in scientific publications. We conclude with a set of recommendations for using population data.
{"title":"Thirty-three myths and misconceptions about population data: from data capture and processing to linkage.","authors":"Peter Christen, Rainer Schnell","doi":"10.23889/ijpds.v8i1.2115","DOIUrl":"https://doi.org/10.23889/ijpds.v8i1.2115","url":null,"abstract":"<p><p>Databases covering all individuals of a population are increasingly used for research and decision-making. The massive size of such databases is often mistaken as a guarantee for valid inferences. However, population data have characteristics that make them challenging to use. Various assumptions on population coverage and data quality are commonly made, including how such data were captured and what types of processing have been applied to them. Furthermore, the full potential of population data can often only be unlocked when such data are linked to other databases. Record linkage often implies subtle technical problems, which are easily missed. We discuss a diverse range of myths and misconceptions relevant for anybody capturing, processing, linking, or analysing population data. Remarkably, many of these myths and misconceptions are due to the social nature of data collections and are therefore missed by purely technical accounts of data processing. Many are also not well documented in scientific publications. We conclude with a set of recommendations for using population data.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"8 1","pages":"2115"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/b0/03/ijpds-08-2115.PMC10454001.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10503962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-01-01DOI: 10.23889/ijpds.v8i1.2152
Jinette Comeau, Li Wang, Laura Duncan, Jordan Edwards, Katholiki Georgiades, Kelly K Anderson, Piotr Wilk, Tammy Lau
<p><strong>Introduction: </strong>Knowledge of the sociodemographic, behavioural, and clinical characteristics of children visiting emergency departments (EDs) for mental health or substance use concerns in Ontario, Canada is lacking.</p><p><strong>Objectives: </strong>Using data from a population-based survey linked at the individual level to administrative health data, this study leverages a provincially representative sample and quasi-experimental design to strengthen inferences regarding the extent to which children's sociodemographic, behavioural, and clinical characteristics are associated with the risk of a mental health or substance use related ED visit.</p><p><strong>Methods: </strong>9,301 children aged 4-17 years participating in the 2014 Ontario Child Health Study were linked retrospectively (6 months) and prospectively (12 months) with administrative health data on ED visits from the National Ambulatory Care Reporting System. Modified Poisson regression was used to examine correlates of mental health and substance use related ED visits among children aged 4-17 years over a 12-month period following their survey completion date, adjusting for ED visits in the 6 months prior to their survey completion date. Subgroup analyses of youths aged 14-17 years who independently completed survey content related to peer victimisation, substance use, and suicidality were also conducted.</p><p><strong>Results: </strong>Among children aged 4-17 years, older age, parental immigrant status, internalising problems, and perceived need for professional help were statistically significant correlates that increased the risk of a mental health or substance use related ED visit; low-income and suicidal ideation with attempt were statistically significant only among youths aged 14-17 years.</p><p><strong>Conclusions: </strong>Knowledge of the sociodemographic, behavioural, and clinical characteristics of children visiting EDs for mental health and substance use related concerns is required to better understand patient needs to coordinate effective emergency mental health care that optimises child outcomes, and to inform the development and targeting of upstream interventions that have the potential to prevent avoidable ED visits.</p><p><strong>Highlights: </strong>Growing rates of child mental health and substance use related ED visits have been observed internationally.A population-based survey linked at the individual level to administrative health data was used to examine the extent to which children's sociodemographic, behavioural, and clinical characteristics are associated with the risk of a mental health or substance use related ED visit in Ontario, Canada.Older age, low-income, parental immigrant status, perceived need for professional help, internalising problems, and suicidality increase the risk of an ED visit.Knowledge of the characteristics of children visiting EDs can be used to coordinate effective emergency mental health care that optimises ch
{"title":"Correlates of child mental health and substance use related emergency department visits in Ontario: A linked population survey and administrative health data study.","authors":"Jinette Comeau, Li Wang, Laura Duncan, Jordan Edwards, Katholiki Georgiades, Kelly K Anderson, Piotr Wilk, Tammy Lau","doi":"10.23889/ijpds.v8i1.2152","DOIUrl":"https://doi.org/10.23889/ijpds.v8i1.2152","url":null,"abstract":"<p><strong>Introduction: </strong>Knowledge of the sociodemographic, behavioural, and clinical characteristics of children visiting emergency departments (EDs) for mental health or substance use concerns in Ontario, Canada is lacking.</p><p><strong>Objectives: </strong>Using data from a population-based survey linked at the individual level to administrative health data, this study leverages a provincially representative sample and quasi-experimental design to strengthen inferences regarding the extent to which children's sociodemographic, behavioural, and clinical characteristics are associated with the risk of a mental health or substance use related ED visit.</p><p><strong>Methods: </strong>9,301 children aged 4-17 years participating in the 2014 Ontario Child Health Study were linked retrospectively (6 months) and prospectively (12 months) with administrative health data on ED visits from the National Ambulatory Care Reporting System. Modified Poisson regression was used to examine correlates of mental health and substance use related ED visits among children aged 4-17 years over a 12-month period following their survey completion date, adjusting for ED visits in the 6 months prior to their survey completion date. Subgroup analyses of youths aged 14-17 years who independently completed survey content related to peer victimisation, substance use, and suicidality were also conducted.</p><p><strong>Results: </strong>Among children aged 4-17 years, older age, parental immigrant status, internalising problems, and perceived need for professional help were statistically significant correlates that increased the risk of a mental health or substance use related ED visit; low-income and suicidal ideation with attempt were statistically significant only among youths aged 14-17 years.</p><p><strong>Conclusions: </strong>Knowledge of the sociodemographic, behavioural, and clinical characteristics of children visiting EDs for mental health and substance use related concerns is required to better understand patient needs to coordinate effective emergency mental health care that optimises child outcomes, and to inform the development and targeting of upstream interventions that have the potential to prevent avoidable ED visits.</p><p><strong>Highlights: </strong>Growing rates of child mental health and substance use related ED visits have been observed internationally.A population-based survey linked at the individual level to administrative health data was used to examine the extent to which children's sociodemographic, behavioural, and clinical characteristics are associated with the risk of a mental health or substance use related ED visit in Ontario, Canada.Older age, low-income, parental immigrant status, perceived need for professional help, internalising problems, and suicidality increase the risk of an ED visit.Knowledge of the characteristics of children visiting EDs can be used to coordinate effective emergency mental health care that optimises ch","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"8 1","pages":"2152"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/82/40/ijpds-08-2152.PMC10476702.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10522940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-01-01DOI: 10.23889/ijpds.v8i1.1751
Branislav Igic, Rachel Farber, Maria Alfaro-Ramirez, Michael A Nelson, Lee K Taylor
Introduction: The patient journey for residents of New South Wales (NSW) Australia with ST-elevation myocardial infarction (STEMI) often involves transfer between hospitals and these can include stays in hospitals in other jurisdictions.
Objective: To estimate the change in enumeration of STEMI hospitalisations and time to subsequent cardiac procedures for NSW residents using cross-jurisdictional linkage of administrative health data.
Methods: Records for NSW residents aged 20 years and over admitted to hospitals in NSW and four adjacent jurisdictions (Australian Capital Territory, Queensland, South Australia, and Victoria) between 1 July 2013 and 30 June 2018 with a principal diagnosis of STEMI were linked with records of the Australian Government Medicare Benefits Schedule (MBS). The number of STEMI hospitalisations, and rates of angiography, percutaneous coronary intervention and coronary artery bypass graft were compared for residents of different local health districts within NSW with and without inclusion of cross-jurisdictional data.
Results: Inclusion of cross-jurisdictional hospital and MBS data increased the enumeration of STEMI hospitalisations for NSW residents by 8% (from 15,420 to 16,659) and procedure rates from 85.6% to 88.2%. For NSW residents who lived adjacent to a jurisdictional border, hospitalisation counts increased by up to 210% and procedure rates by up to 70 percentage points.
Conclusions: Cross-jurisdictional linked hospital data is essential to understand patient journeys of NSW residents who live in border areas and to evaluate adherence to treatment guidelines for STEMI. MBS data are useful where hospital data are not available and for procedures that may be conducted in out-patient settings.
{"title":"The impact of cross-jurisdictional patient flows on ascertainment of hospitalisations and cardiac procedures for ST-segment-elevation myocardial infarction in an Australian population.","authors":"Branislav Igic, Rachel Farber, Maria Alfaro-Ramirez, Michael A Nelson, Lee K Taylor","doi":"10.23889/ijpds.v8i1.1751","DOIUrl":"https://doi.org/10.23889/ijpds.v8i1.1751","url":null,"abstract":"<p><strong>Introduction: </strong>The patient journey for residents of New South Wales (NSW) Australia with ST-elevation myocardial infarction (STEMI) often involves transfer between hospitals and these can include stays in hospitals in other jurisdictions.</p><p><strong>Objective: </strong>To estimate the change in enumeration of STEMI hospitalisations and time to subsequent cardiac procedures for NSW residents using cross-jurisdictional linkage of administrative health data.</p><p><strong>Methods: </strong>Records for NSW residents aged 20 years and over admitted to hospitals in NSW and four adjacent jurisdictions (Australian Capital Territory, Queensland, South Australia, and Victoria) between 1 July 2013 and 30 June 2018 with a principal diagnosis of STEMI were linked with records of the Australian Government Medicare Benefits Schedule (MBS). The number of STEMI hospitalisations, and rates of angiography, percutaneous coronary intervention and coronary artery bypass graft were compared for residents of different local health districts within NSW with and without inclusion of cross-jurisdictional data.</p><p><strong>Results: </strong>Inclusion of cross-jurisdictional hospital and MBS data increased the enumeration of STEMI hospitalisations for NSW residents by 8% (from 15,420 to 16,659) and procedure rates from 85.6% to 88.2%. For NSW residents who lived adjacent to a jurisdictional border, hospitalisation counts increased by up to 210% and procedure rates by up to 70 percentage points.</p><p><strong>Conclusions: </strong>Cross-jurisdictional linked hospital data is essential to understand patient journeys of NSW residents who live in border areas and to evaluate adherence to treatment guidelines for STEMI. MBS data are useful where hospital data are not available and for procedures that may be conducted in out-patient settings.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"8 1","pages":"1751"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/c9/ee/ijpds-08-1751.PMC10450362.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10133624","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-01-01DOI: 10.23889/ijpds.v8i1.2130
Piotr Teodorowski, Sarah E Rodgers, Kate Fleming, Naheed Tahir, Saiqa Ahmed, Lucy Frith
Introduction: Involving public contributors helps researchers to ensure that public views are taken into consideration when designing and planning research, so that it is person-centred and relevant to the public. This paper will consider public involvement in big data research. Inclusion of different communities is needed to ensure everyone's voice is heard. However, there remains limited evidence on how to improve the involvement of seldom-heard communities in big data research.
Objectives: This study aims to understand how South Asians and Polish communities in the UK can be encouraged to participate in public involvement initiatives in big data research.
Methods: Forty interviews were conducted with Polish (n=20) and South Asian (n=20) participants on Zoom. The participants were living in the United Kingdom and had not previously been involved as public contributors. Transcribed interviews were analysed using reflexive thematic analysis.
Results: We identified eight themes. The 'happy to reuse data' theme sets the scene by exploring our participants' views towards big data research and under what circumstances they thought that data could be used. The remaining themes were mapped under the capability-opportunity-motivation-behaviour (COM-B) model, as developed by Michie and colleagues. This allowed us to discuss multiple factors that could influence people's willingness to become public contributors.
Conclusions: Our study is the first to explore how to improve the involvement and engagement of seldom-heard communities in big data research using the COM-B model. The results have the potential to support researchers who want to identify what can influence members of the public to be involved. By using the COM-B model, it is possible to determine what measures could be implemented to better engage these communities.
{"title":"Exploring how to improve the involvement of Polish and South Asian communities around big data research. A qualitative study using COM-B model.","authors":"Piotr Teodorowski, Sarah E Rodgers, Kate Fleming, Naheed Tahir, Saiqa Ahmed, Lucy Frith","doi":"10.23889/ijpds.v8i1.2130","DOIUrl":"https://doi.org/10.23889/ijpds.v8i1.2130","url":null,"abstract":"<p><strong>Introduction: </strong>Involving public contributors helps researchers to ensure that public views are taken into consideration when designing and planning research, so that it is person-centred and relevant to the public. This paper will consider public involvement in big data research. Inclusion of different communities is needed to ensure everyone's voice is heard. However, there remains limited evidence on how to improve the involvement of seldom-heard communities in big data research.</p><p><strong>Objectives: </strong>This study aims to understand how South Asians and Polish communities in the UK can be encouraged to participate in public involvement initiatives in big data research.</p><p><strong>Methods: </strong>Forty interviews were conducted with Polish (n=20) and South Asian (n=20) participants on Zoom. The participants were living in the United Kingdom and had not previously been involved as public contributors. Transcribed interviews were analysed using reflexive thematic analysis.</p><p><strong>Results: </strong>We identified eight themes. The 'happy to reuse data' theme sets the scene by exploring our participants' views towards big data research and under what circumstances they thought that data could be used. The remaining themes were mapped under the capability-opportunity-motivation-behaviour (COM-B) model, as developed by Michie and colleagues. This allowed us to discuss multiple factors that could influence people's willingness to become public contributors.</p><p><strong>Conclusions: </strong>Our study is the first to explore how to improve the involvement and engagement of seldom-heard communities in big data research using the COM-B model. The results have the potential to support researchers who want to identify what can influence members of the public to be involved. By using the COM-B model, it is possible to determine what measures could be implemented to better engage these communities.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"8 1","pages":"2130"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/cd/72/ijpds-08-2130.PMC10476635.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10225320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-01-01DOI: 10.23889/ijpds.v8i1.2118
Sarah Ahmed, Allan Pollack, Alys Havard, Sallie-Anne Pearson, Kendal Chidwick
Introduction: Understanding the level of recording of acute serious events in general practice electronic health records (EHRs) is critical for making decisions about the suitability of general practice datasets to address research questions and requirements for linking general practice EHRs with other datasets.
Objectives: To examine data source agreement of five serious acute events (myocardial infarction, stroke, venous thromboembolism (VTE), pancreatitis and suicide) recorded in general practice EHRs compared with hospital, emergency department (ED) and mortality data.
Methods: Data from 61 general practices routinely contributing data to the MedicineInsight database was linked with New South Wales administrative hospital, ED and mortality data. The study population comprised patients with at least three clinical encounters at participating general practices between 2019 and 2020 and at least one record in hospital, ED or mortality data between 2010 and 2020. Agreement was assessed between MedicineInsight diagnostic algorithms for the five events of interest and coded diagnoses in the administrative data. Dates of concordant events were compared.
Results: The study included 274,420 general practice patients with at least one record in the administrative data between 2010 and 2020. Across the five acute events, specificity and NPV were excellent (>98%) but sensitivity (13%-51%) and PPV (30%-75%) were low. Sensitivity and PPV were highest for VTE (50.9%) and acute pancreatitis (75.2%), respectively. The majority (roughly 70-80%) of true positive cases were recorded in the EHR within 30 days of administrative records.
Conclusion: Large proportions of events identified from administrative data were not detected by diagnostic algorithms applied to general practice EHRs within the specific time period. EHR data extraction and study design only partly explain the low sensitivities/PPVs. Our findings support the use of Australian general practice EHRs linked to hospital, ED and mortality data for robust research on the selected serious acute conditions.
{"title":"Agreement of acute serious events recorded across datasets using linked Australian general practice, hospital, emergency department and death data: implications for research and surveillance.","authors":"Sarah Ahmed, Allan Pollack, Alys Havard, Sallie-Anne Pearson, Kendal Chidwick","doi":"10.23889/ijpds.v8i1.2118","DOIUrl":"https://doi.org/10.23889/ijpds.v8i1.2118","url":null,"abstract":"<p><strong>Introduction: </strong>Understanding the level of recording of acute serious events in general practice electronic health records (EHRs) is critical for making decisions about the suitability of general practice datasets to address research questions and requirements for linking general practice EHRs with other datasets.</p><p><strong>Objectives: </strong>To examine data source agreement of five serious acute events (myocardial infarction, stroke, venous thromboembolism (VTE), pancreatitis and suicide) recorded in general practice EHRs compared with hospital, emergency department (ED) and mortality data.</p><p><strong>Methods: </strong>Data from 61 general practices routinely contributing data to the MedicineInsight database was linked with New South Wales administrative hospital, ED and mortality data. The study population comprised patients with at least three clinical encounters at participating general practices between 2019 and 2020 and at least one record in hospital, ED or mortality data between 2010 and 2020. Agreement was assessed between MedicineInsight diagnostic algorithms for the five events of interest and coded diagnoses in the administrative data. Dates of concordant events were compared.</p><p><strong>Results: </strong>The study included 274,420 general practice patients with at least one record in the administrative data between 2010 and 2020. Across the five acute events, specificity and NPV were excellent (>98%) but sensitivity (13%-51%) and PPV (30%-75%) were low. Sensitivity and PPV were highest for VTE (50.9%) and acute pancreatitis (75.2%), respectively. The majority (roughly 70-80%) of true positive cases were recorded in the EHR within 30 days of administrative records.</p><p><strong>Conclusion: </strong>Large proportions of events identified from administrative data were not detected by diagnostic algorithms applied to general practice EHRs within the specific time period. EHR data extraction and study design only partly explain the low sensitivities/PPVs. Our findings support the use of Australian general practice EHRs linked to hospital, ED and mortality data for robust research on the selected serious acute conditions.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"6 1","pages":"2118"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/71/ab/ijpds-08-2118.PMC10454002.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10481861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Abstract Introduction Data unavailability poses multiple challenges in many health fields, especially within ethnic subgroups in Canada, who may be hesitant to share their health data with researchers. Since health information availability is controlled by the participant, it is important to understand the willingness to share health information by an ethnic population to increase data availability within ethnocultural communities. Methods We employed a qualitative descriptive approach to better understand willingness to share health information by South Asian participants and operated through a lens that considered the cultural and sociodemographic aspect of ethnocultural communities. A total of 22 in-depth interviews were conducted between March and July 2020. Results The results of this study show that health researchers should aim to develop a mutually beneficial information-sharing partnership with communities, with an emphasis on the ethnocultural and socio-ecological aspects of health within populations. Conclusion The findings support the need for culturally sensitive and respectful engagement with the community, ethically sound research practices that make participants feel comfortable in sharing their information, and an easy sharing process to share health information feasibly.
{"title":"Color coded health data: factors related to willingness to share health information in South Asian community members in Canada.","authors":"Iffat Naeem, Meriem Aroua, Nashit Chowdhury, Vineet Saini, Hude Quan, Tanvir C Turin","doi":"10.23889/ijpds.v8i1.2134","DOIUrl":"https://doi.org/10.23889/ijpds.v8i1.2134","url":null,"abstract":"Abstract Introduction Data unavailability poses multiple challenges in many health fields, especially within ethnic subgroups in Canada, who may be hesitant to share their health data with researchers. Since health information availability is controlled by the participant, it is important to understand the willingness to share health information by an ethnic population to increase data availability within ethnocultural communities. Methods We employed a qualitative descriptive approach to better understand willingness to share health information by South Asian participants and operated through a lens that considered the cultural and sociodemographic aspect of ethnocultural communities. A total of 22 in-depth interviews were conducted between March and July 2020. Results The results of this study show that health researchers should aim to develop a mutually beneficial information-sharing partnership with communities, with an emphasis on the ethnocultural and socio-ecological aspects of health within populations. Conclusion The findings support the need for culturally sensitive and respectful engagement with the community, ethically sound research practices that make participants feel comfortable in sharing their information, and an easy sharing process to share health information feasibly.","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"8 1","pages":"2134"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/48/38/ijpds-08-2134.PMC10476700.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10225322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-01-01DOI: 10.23889/ijpds.v8i1.2116
Lindsay A Pearce, Rohan Borschmann, Jesse T Young, Stuart A Kinner
The use of administrative health data for research, monitoring, and quality improvement has proliferated in recent decades, leading to improvements in health across many disease areas and across the life course. However, not all populations are equally visible in administrative health data, and those that are less visible may be excluded from the benefits of associated research. Socially excluded populations - including the homeless, people with substance dependence, people involved in sex work, migrants or asylum seekers, and people with a history of incarceration - are typically characterised by health inequity. Yet people who experience social exclusion are often invisible within routinely collected administrative health data because information on their markers of social exclusion are not routinely recorded by healthcare providers. These circumstances make it difficult to understand the often complex health needs of socially excluded populations, evaluate and improve the quality of health services that they interact with, provide more accessible and appropriate health services, and develop effective and integrated responses to reduce health inequity. In this commentary we discuss how linking data from multiple sectors with administrative health data, often called cross-sectoral data linkage, is a key method for systematically identifying socially excluded populations in administrative health data and addressing other issues related to data quality and representativeness. We discuss how cross-sectoral data linkage can improve the representation of socially excluded populations in research, monitoring, and quality improvement initiatives, which can in turn inform coordinated responses across multiple sectors of service delivery. Finally, we articulate key challenges and potential solutions for advancing the use of cross-sectoral data linkage to improve the health of socially excluded populations, using international examples.
{"title":"Advancing cross-sectoral data linkage to understand and address the health impacts of social exclusion: Challenges and potential solutions.","authors":"Lindsay A Pearce, Rohan Borschmann, Jesse T Young, Stuart A Kinner","doi":"10.23889/ijpds.v8i1.2116","DOIUrl":"https://doi.org/10.23889/ijpds.v8i1.2116","url":null,"abstract":"<p><p>The use of administrative health data for research, monitoring, and quality improvement has proliferated in recent decades, leading to improvements in health across many disease areas and across the life course. However, not all populations are equally visible in administrative health data, and those that are less visible may be excluded from the benefits of associated research. Socially excluded populations - including the homeless, people with substance dependence, people involved in sex work, migrants or asylum seekers, and people with a history of incarceration - are typically characterised by health inequity. Yet people who experience social exclusion are often invisible within routinely collected administrative health data because information on their markers of social exclusion are not routinely recorded by healthcare providers. These circumstances make it difficult to understand the often complex health needs of socially excluded populations, evaluate and improve the quality of health services that they interact with, provide more accessible and appropriate health services, and develop effective and integrated responses to reduce health inequity. In this commentary we discuss how linking data from multiple sectors with administrative health data, often called <i>cross-sectoral data linkage</i>, is a key method for systematically identifying socially excluded populations in administrative health data and addressing other issues related to data quality and representativeness. We discuss how cross-sectoral data linkage can improve the representation of socially excluded populations in research, monitoring, and quality improvement initiatives, which can in turn inform coordinated responses across multiple sectors of service delivery. Finally, we articulate key challenges and potential solutions for advancing the use of cross-sectoral data linkage to improve the health of socially excluded populations, using international examples.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"8 1","pages":"2116"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/4f/35/ijpds-08-2116.PMC10476462.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10170162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-01-01DOI: 10.23889/ijpds.v8i1.1901
Jamie Danemayer, Sophie Mitra, Cathy Holloway, Shereen Hussein
Functional limitations become more prevalent as populations age, emphasising an increasingly urgent need for assistive technology (AT). Critical to meeting this need trajectory is understanding AT access in older ages. Yet few publications examine this from a longitudinal perspective. This review aims to identify and collate what data exist globally, seeking all population-based cohorts and repeated cross-sectional surveys through the Maelstrom Research Catalogue (searched May 10, 2022) and the Disability Data Report (published 2022), respectively. Datasets incorporating functional limitations modules and question(s) dedicated to AT, with a wave of data collection since 2009, were included. Of 81 cohorts and 202 surveys identified, 47 and 62 meet inclusion criteria, respectively. Over 40% of cohorts were drawn from high-income countries which have already experienced significant population ageing. Cohorts often exclude participants based on pre-existing support needs. For surveys, Africa is the most represented region (40%). Globally, 73% of waves were conducted since 2016. 'Use' is the most collected AT access indicator (69% of cohorts and 85% of surveys). Glasses (78%) and hearing aids (77%) are the most represented AT. While gaps in data coverage and representation are significant, collating existing datasets highlights current opportunities for analyses and methods for improving data collection across the sector.
{"title":"Assistive technology access in longitudinal datasets: a global review.","authors":"Jamie Danemayer, Sophie Mitra, Cathy Holloway, Shereen Hussein","doi":"10.23889/ijpds.v8i1.1901","DOIUrl":"https://doi.org/10.23889/ijpds.v8i1.1901","url":null,"abstract":"<p><p>Functional limitations become more prevalent as populations age, emphasising an increasingly urgent need for assistive technology (AT). Critical to meeting this need trajectory is understanding AT access in older ages. Yet few publications examine this from a longitudinal perspective. This review aims to identify and collate what data exist globally, seeking all population-based cohorts and repeated cross-sectional surveys through the Maelstrom Research Catalogue (searched May 10, 2022) and the Disability Data Report (published 2022), respectively. Datasets incorporating functional limitations modules and question(s) dedicated to AT, with a wave of data collection since 2009, were included. Of 81 cohorts and 202 surveys identified, 47 and 62 meet inclusion criteria, respectively. Over 40% of cohorts were drawn from high-income countries which have already experienced significant population ageing. Cohorts often exclude participants based on pre-existing support needs. For surveys, Africa is the most represented region (40%). Globally, 73% of waves were conducted since 2016. 'Use' is the most collected AT access indicator (69% of cohorts and 85% of surveys). Glasses (78%) and hearing aids (77%) are the most represented AT. While gaps in data coverage and representation are significant, collating existing datasets highlights current opportunities for analyses and methods for improving data collection across the sector.</p>","PeriodicalId":36483,"journal":{"name":"International Journal of Population Data Science","volume":"8 1","pages":"1901"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/b9/13/ijpds-08-1901.PMC10448602.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10517348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}