Introduction: The patient journey for residents of New South Wales (NSW) Australia with ST-elevation myocardial infarction (STEMI) often involves transfer between hospitals and these can include stays in hospitals in other jurisdictions.
Objective: To estimate the change in enumeration of STEMI hospitalisations and time to subsequent cardiac procedures for NSW residents using cross-jurisdictional linkage of administrative health data.
Methods: Records for NSW residents aged 20 years and over admitted to hospitals in NSW and four adjacent jurisdictions (Australian Capital Territory, Queensland, South Australia, and Victoria) between 1 July 2013 and 30 June 2018 with a principal diagnosis of STEMI were linked with records of the Australian Government Medicare Benefits Schedule (MBS). The number of STEMI hospitalisations, and rates of angiography, percutaneous coronary intervention and coronary artery bypass graft were compared for residents of different local health districts within NSW with and without inclusion of cross-jurisdictional data.
Results: Inclusion of cross-jurisdictional hospital and MBS data increased the enumeration of STEMI hospitalisations for NSW residents by 8% (from 15,420 to 16,659) and procedure rates from 85.6% to 88.2%. For NSW residents who lived adjacent to a jurisdictional border, hospitalisation counts increased by up to 210% and procedure rates by up to 70 percentage points.
Conclusions: Cross-jurisdictional linked hospital data is essential to understand patient journeys of NSW residents who live in border areas and to evaluate adherence to treatment guidelines for STEMI. MBS data are useful where hospital data are not available and for procedures that may be conducted in out-patient settings.
Introduction: Involving public contributors helps researchers to ensure that public views are taken into consideration when designing and planning research, so that it is person-centred and relevant to the public. This paper will consider public involvement in big data research. Inclusion of different communities is needed to ensure everyone's voice is heard. However, there remains limited evidence on how to improve the involvement of seldom-heard communities in big data research.
Objectives: This study aims to understand how South Asians and Polish communities in the UK can be encouraged to participate in public involvement initiatives in big data research.
Methods: Forty interviews were conducted with Polish (n=20) and South Asian (n=20) participants on Zoom. The participants were living in the United Kingdom and had not previously been involved as public contributors. Transcribed interviews were analysed using reflexive thematic analysis.
Results: We identified eight themes. The 'happy to reuse data' theme sets the scene by exploring our participants' views towards big data research and under what circumstances they thought that data could be used. The remaining themes were mapped under the capability-opportunity-motivation-behaviour (COM-B) model, as developed by Michie and colleagues. This allowed us to discuss multiple factors that could influence people's willingness to become public contributors.
Conclusions: Our study is the first to explore how to improve the involvement and engagement of seldom-heard communities in big data research using the COM-B model. The results have the potential to support researchers who want to identify what can influence members of the public to be involved. By using the COM-B model, it is possible to determine what measures could be implemented to better engage these communities.
Introduction: Understanding the level of recording of acute serious events in general practice electronic health records (EHRs) is critical for making decisions about the suitability of general practice datasets to address research questions and requirements for linking general practice EHRs with other datasets.
Objectives: To examine data source agreement of five serious acute events (myocardial infarction, stroke, venous thromboembolism (VTE), pancreatitis and suicide) recorded in general practice EHRs compared with hospital, emergency department (ED) and mortality data.
Methods: Data from 61 general practices routinely contributing data to the MedicineInsight database was linked with New South Wales administrative hospital, ED and mortality data. The study population comprised patients with at least three clinical encounters at participating general practices between 2019 and 2020 and at least one record in hospital, ED or mortality data between 2010 and 2020. Agreement was assessed between MedicineInsight diagnostic algorithms for the five events of interest and coded diagnoses in the administrative data. Dates of concordant events were compared.
Results: The study included 274,420 general practice patients with at least one record in the administrative data between 2010 and 2020. Across the five acute events, specificity and NPV were excellent (>98%) but sensitivity (13%-51%) and PPV (30%-75%) were low. Sensitivity and PPV were highest for VTE (50.9%) and acute pancreatitis (75.2%), respectively. The majority (roughly 70-80%) of true positive cases were recorded in the EHR within 30 days of administrative records.
Conclusion: Large proportions of events identified from administrative data were not detected by diagnostic algorithms applied to general practice EHRs within the specific time period. EHR data extraction and study design only partly explain the low sensitivities/PPVs. Our findings support the use of Australian general practice EHRs linked to hospital, ED and mortality data for robust research on the selected serious acute conditions.
The use of administrative health data for research, monitoring, and quality improvement has proliferated in recent decades, leading to improvements in health across many disease areas and across the life course. However, not all populations are equally visible in administrative health data, and those that are less visible may be excluded from the benefits of associated research. Socially excluded populations - including the homeless, people with substance dependence, people involved in sex work, migrants or asylum seekers, and people with a history of incarceration - are typically characterised by health inequity. Yet people who experience social exclusion are often invisible within routinely collected administrative health data because information on their markers of social exclusion are not routinely recorded by healthcare providers. These circumstances make it difficult to understand the often complex health needs of socially excluded populations, evaluate and improve the quality of health services that they interact with, provide more accessible and appropriate health services, and develop effective and integrated responses to reduce health inequity. In this commentary we discuss how linking data from multiple sectors with administrative health data, often called cross-sectoral data linkage, is a key method for systematically identifying socially excluded populations in administrative health data and addressing other issues related to data quality and representativeness. We discuss how cross-sectoral data linkage can improve the representation of socially excluded populations in research, monitoring, and quality improvement initiatives, which can in turn inform coordinated responses across multiple sectors of service delivery. Finally, we articulate key challenges and potential solutions for advancing the use of cross-sectoral data linkage to improve the health of socially excluded populations, using international examples.
Functional limitations become more prevalent as populations age, emphasising an increasingly urgent need for assistive technology (AT). Critical to meeting this need trajectory is understanding AT access in older ages. Yet few publications examine this from a longitudinal perspective. This review aims to identify and collate what data exist globally, seeking all population-based cohorts and repeated cross-sectional surveys through the Maelstrom Research Catalogue (searched May 10, 2022) and the Disability Data Report (published 2022), respectively. Datasets incorporating functional limitations modules and question(s) dedicated to AT, with a wave of data collection since 2009, were included. Of 81 cohorts and 202 surveys identified, 47 and 62 meet inclusion criteria, respectively. Over 40% of cohorts were drawn from high-income countries which have already experienced significant population ageing. Cohorts often exclude participants based on pre-existing support needs. For surveys, Africa is the most represented region (40%). Globally, 73% of waves were conducted since 2016. 'Use' is the most collected AT access indicator (69% of cohorts and 85% of surveys). Glasses (78%) and hearing aids (77%) are the most represented AT. While gaps in data coverage and representation are significant, collating existing datasets highlights current opportunities for analyses and methods for improving data collection across the sector.
Background: Approximately thirty thousand people in Scotland are diagnosed with cancer annually, of whom a third live less than one year. The timing, nature and value of hospital-based healthcare for patients with advanced cancer are not well understood. The study's aim was to describe the timing and nature of hospital-based healthcare use and associated costs in the last year of life for patients with a cancer diagnosis.
Methods: We undertook a Scottish population-wide administrative data linkage study of hospital-based healthcare use for individuals with a cancer diagnosis, who died aged 60 and over between 2012 and 2017. Hospital admissions and length of stay (LOS), as well as the number and nature of outpatient and day case appointments were analysed. Generalised linear models were used to adjust costs for age, gender, socioeconomic deprivation status, rural-urban (RU) status and comorbidity.
Results: The study included 85,732 decedents with a cancer diagnosis. For 64,553 (75.3%) of them, cancer was the primary cause of death. Mean age at death was 80.01 (SD 8.15) years. The mean number of inpatient stays in the last year of life was 5.88 (SD 5.68), with a mean LOS of 7 days. Admission rates rose sharply in the last month of life. One year adjusted and unadjusted costs decreased with increasing age. A higher comorbidity burden was associated with higher costs. Major cost differences were present between cancer types.
Conclusions: People in Scotland in their last year of life with cancer are high users of secondary care. Hospitalisation accounts for a high proportion of costs, particularly in the last month of life. Further research is needed to examine triggers for hospitalisations and to identify influenceable reasons for unwarranted variation in hospital use among different cancer cohorts.
Introduction: Digitalisation of Electronic Health Record (EHR) data has created unique opportunities for research. However, these data are routinely collected for operational purposes and so are not curated to the standard required for research. Harnessing such routine data at large scale allows efficient and long-term epidemiological and health services research.
Objectives: To describe the establishment a linked EHR derived data platform in the National Centre for Healthy Ageing, Melbourne, Australia, aimed at enabling research targeting national health priority areas in ageing.
Methods: Our approach incorporated: data validation, curation and warehousing to ensure quality and completeness; end-user engagement and consensus on the platform content; implementation of an artificial intelligence (AI) pipeline for extraction of text-based data items; early consumer involvement; and implementation of routine collection of patient reported outcome measures, in a multisite public health service.
Results: Data for a cohort of >800,000 patients collected over a 10-year period have been curated within the platform's research data warehouse. So far 117 items have been identified as suitable for inclusion, from 11 research relevant datasets held within the health service EHR systems. Data access, extraction and release processes, guided by the Five Safes Framework, are being tested through project use-cases. A natural language processing (NLP) pipeline has been implemented and a framework for the routine collection and incorporation of patient reported outcome measures developed.
Conclusions: We highlight the importance of establishing comprehensive processes for the foundations of a data platform utilising routine data not collected for research purposes. These robust foundations will facilitate future expansion through linkages to other datasets for the efficient and cost-effective study of health related to ageing at a large scale.