Network Analysis and Machine Learning for Signal Detection and Prioritization Using Electronic Healthcare Records and Administrative Databases: A Proof of Concept in Drug-Induced Acute Myocardial Infarction.
Maria Antonietta Barbieri, Andrea Abate, Olivér M Balogh, Mátyás Pétervári, Péter Ferdinandy, Bence Ágg, Vera Battini, Marianna Cocco, Andrea Rossi, Carla Carnovale, Manuela Casula, Edoardo Spina, Maurizio Sessa
{"title":"Network Analysis and Machine Learning for Signal Detection and Prioritization Using Electronic Healthcare Records and Administrative Databases: A Proof of Concept in Drug-Induced Acute Myocardial Infarction.","authors":"Maria Antonietta Barbieri, Andrea Abate, Olivér M Balogh, Mátyás Pétervári, Péter Ferdinandy, Bence Ágg, Vera Battini, Marianna Cocco, Andrea Rossi, Carla Carnovale, Manuela Casula, Edoardo Spina, Maurizio Sessa","doi":"10.1007/s40264-025-01515-y","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Safety signals for potential drug-induced adverse events (AEs) typically emerge from multiple data sources, primarily spontaneous reporting systems, despite known limitations. Increasingly, real-world data from sources such as electronic health records (EHRs) and administrative databases are leveraged for signal detection. Although network analysis has shown promise in mapping relationships between clinical attributes for signal detection in spontaneous reporting system databases, its application in real-world data from EHRs and administrative databases remains limited.</p><p><strong>Objective: </strong>This study aimed to evaluate the performance of network analysis in detecting safety signals within Italian administrative databases, using drug-induced acute myocardial infarction (AMI) as a proof of concept.</p><p><strong>Methods: </strong>We employed a case-crossover design to explore the association between drug exposure and AMI using the Healthcare Administrative Database of Mantova, Italy, from 2014 to 2018. Patients with their first AMI hospitalization were identified after a 365-day washout period to exclude prior hospitalizations. We constructed a network to analyse the relationships between prescribed drugs and diagnoses, represented as nodes, with undirected edges illustrating their interactions. For each patient with AMI, we identified all diagnoses and drugs recorded or redeemed within 365 days of the first AMI episode and generated various drug-diagnosis, drug-drug, and diagnosis-diagnosis pairs. We calculated the frequency of these pairs, and three types of edge weights quantified the strength of connections. We identified outlier drug-AMI pairs using a predictive score (F) based on frequency (C) and full edge weights (W<sub>F</sub>), with validation for known AMI associations. We prioritized signals using the F score, C of AMI, and W<sub>F</sub>, analysed through k-means clustering to identify patterns in the data.</p><p><strong>Results: </strong>From 2014 to 2018, a total of 3918 patients had an AMI, with 4686 AMI diagnoses. Of those, 2866 had prescriptions in the previous year, totalling 498,591 prescriptions. A network analysis identified 2968 unique nodes, revealing 529,935 diagnosis-diagnosis connections, 235,380 drug-diagnosis connections, and 102,831 drug-drug connections. The median number of connections (C) was 404 (Q1-Q3: 194-671) for drug nodes and 380 (Q1-Q3: 216-664) for diagnosis nodes. The median W<sub>F</sub> was 11.8 (Q1-Q3: 9-14), and the median F score across pairs was 0.1 (Q1-Q3: 0.1-0.3). A total of 249 potential safety signals were detected, with 63.4% aligning with known AEs. Among the remaining signals, 80 were prioritized, and five emerged as the highest priority: terazosin, tamsulosin, allopurinol, esomeprazole, and omeprazole.</p><p><strong>Conclusions: </strong>Overall, our novel method demonstrates that network analysis is a valuable tool for signal detection and prioritization in drug-induced AEs based on EHRs and administrative databases.</p>","PeriodicalId":11382,"journal":{"name":"Drug Safety","volume":" ","pages":""},"PeriodicalIF":4.0000,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Drug Safety","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s40264-025-01515-y","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Safety signals for potential drug-induced adverse events (AEs) typically emerge from multiple data sources, primarily spontaneous reporting systems, despite known limitations. Increasingly, real-world data from sources such as electronic health records (EHRs) and administrative databases are leveraged for signal detection. Although network analysis has shown promise in mapping relationships between clinical attributes for signal detection in spontaneous reporting system databases, its application in real-world data from EHRs and administrative databases remains limited.
Objective: This study aimed to evaluate the performance of network analysis in detecting safety signals within Italian administrative databases, using drug-induced acute myocardial infarction (AMI) as a proof of concept.
Methods: We employed a case-crossover design to explore the association between drug exposure and AMI using the Healthcare Administrative Database of Mantova, Italy, from 2014 to 2018. Patients with their first AMI hospitalization were identified after a 365-day washout period to exclude prior hospitalizations. We constructed a network to analyse the relationships between prescribed drugs and diagnoses, represented as nodes, with undirected edges illustrating their interactions. For each patient with AMI, we identified all diagnoses and drugs recorded or redeemed within 365 days of the first AMI episode and generated various drug-diagnosis, drug-drug, and diagnosis-diagnosis pairs. We calculated the frequency of these pairs, and three types of edge weights quantified the strength of connections. We identified outlier drug-AMI pairs using a predictive score (F) based on frequency (C) and full edge weights (WF), with validation for known AMI associations. We prioritized signals using the F score, C of AMI, and WF, analysed through k-means clustering to identify patterns in the data.
Results: From 2014 to 2018, a total of 3918 patients had an AMI, with 4686 AMI diagnoses. Of those, 2866 had prescriptions in the previous year, totalling 498,591 prescriptions. A network analysis identified 2968 unique nodes, revealing 529,935 diagnosis-diagnosis connections, 235,380 drug-diagnosis connections, and 102,831 drug-drug connections. The median number of connections (C) was 404 (Q1-Q3: 194-671) for drug nodes and 380 (Q1-Q3: 216-664) for diagnosis nodes. The median WF was 11.8 (Q1-Q3: 9-14), and the median F score across pairs was 0.1 (Q1-Q3: 0.1-0.3). A total of 249 potential safety signals were detected, with 63.4% aligning with known AEs. Among the remaining signals, 80 were prioritized, and five emerged as the highest priority: terazosin, tamsulosin, allopurinol, esomeprazole, and omeprazole.
Conclusions: Overall, our novel method demonstrates that network analysis is a valuable tool for signal detection and prioritization in drug-induced AEs based on EHRs and administrative databases.
期刊介绍:
Drug Safety is the official journal of the International Society of Pharmacovigilance. The journal includes:
Overviews of contentious or emerging issues.
Comprehensive narrative reviews that provide an authoritative source of information on epidemiology, clinical features, prevention and management of adverse effects of individual drugs and drug classes.
In-depth benefit-risk assessment of adverse effect and efficacy data for a drug in a defined therapeutic area.
Systematic reviews (with or without meta-analyses) that collate empirical evidence to answer a specific research question, using explicit, systematic methods as outlined by the PRISMA statement.
Original research articles reporting the results of well-designed studies in disciplines such as pharmacoepidemiology, pharmacovigilance, pharmacology and toxicology, and pharmacogenomics.
Editorials and commentaries on topical issues.
Additional digital features (including animated abstracts, video abstracts, slide decks, audio slides, instructional videos, infographics, podcasts and animations) can be published with articles; these are designed to increase the visibility, readership and educational value of the journal’s content. In addition, articles published in Drug Safety Drugs may be accompanied by plain language summaries to assist readers who have some knowledge of, but not in-depth expertise in, the area to understand important medical advances.