Weiru Han, Robert Morris, Kun Bu, Tianrui Zhu, Hong Huang, Feng Cheng
{"title":"Analysis of literature-derived duplicate records in the FDA Adverse Event Reporting System (FAERS) database.","authors":"Weiru Han, Robert Morris, Kun Bu, Tianrui Zhu, Hong Huang, Feng Cheng","doi":"10.1139/cjpp-2024-0078","DOIUrl":null,"url":null,"abstract":"<p><p>The FDA Adverse Event Reporting System (FAERS) is a large-scale repository of reports concerning adverse drug events (ADEs). The same published clinical study or report may be reviewed by multiple companies or healthcare professionals and reported separately to the FDA, leading to a significant presence of duplicate reports in FAERS. These duplicate records can result in the identification of false associations between a given drug and an ADE. In this study, we first assessed the consistency of drug and ADE information in FAERS reports from Alzheimer's disease patients. Our findings showed greater congruence in drug-related information compared to ADE-related information, likely due to the greater heterogeneity and variety of terms or phrases used to describe ADEs. We then demonstrated that text comparison methods are effective in identifying duplicate records based on literature citations, testing 10 different comparison functions for their overall efficacy. Token-based methods (such as COSINE, QGRAM, and JACCARD), edit-based approaches (including OSA, LV, and DL), and sequence-based techniques like LCS have proven highly effective in accurately detecting identical publications within free text, demonstrating both high sensitivity and specificity. These results offer valuable insights for identifying duplicate FAERS reports and improving the reliability of detected associations between drugs and ADEs.</p>","PeriodicalId":9520,"journal":{"name":"Canadian journal of physiology and pharmacology","volume":" ","pages":"56-69"},"PeriodicalIF":1.7000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Canadian journal of physiology and pharmacology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1139/cjpp-2024-0078","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/4 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0
Abstract
The FDA Adverse Event Reporting System (FAERS) is a large-scale repository of reports concerning adverse drug events (ADEs). The same published clinical study or report may be reviewed by multiple companies or healthcare professionals and reported separately to the FDA, leading to a significant presence of duplicate reports in FAERS. These duplicate records can result in the identification of false associations between a given drug and an ADE. In this study, we first assessed the consistency of drug and ADE information in FAERS reports from Alzheimer's disease patients. Our findings showed greater congruence in drug-related information compared to ADE-related information, likely due to the greater heterogeneity and variety of terms or phrases used to describe ADEs. We then demonstrated that text comparison methods are effective in identifying duplicate records based on literature citations, testing 10 different comparison functions for their overall efficacy. Token-based methods (such as COSINE, QGRAM, and JACCARD), edit-based approaches (including OSA, LV, and DL), and sequence-based techniques like LCS have proven highly effective in accurately detecting identical publications within free text, demonstrating both high sensitivity and specificity. These results offer valuable insights for identifying duplicate FAERS reports and improving the reliability of detected associations between drugs and ADEs.
期刊介绍:
Published since 1929, the Canadian Journal of Physiology and Pharmacology is a monthly journal that reports current research in all aspects of physiology, nutrition, pharmacology, and toxicology, contributed by recognized experts and scientists. It publishes symposium reviews and award lectures and occasionally dedicates entire issues or portions of issues to subjects of special interest to its international readership. The journal periodically publishes a “Made In Canada” special section that features invited review articles from internationally recognized scientists who have received some of their training in Canada.