{"title":"Estimating Disease Prevalence in Administrative Data.","authors":"Jacek A Kopec","doi":"10.25011/cim.v45i2.38100","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Disease prevalence estimates from population-based administrative databases are often biased due to measurement (misclassification) errors. The purpose of this article is to review the methodology for estimating disease prevalence in administrative data, with a focus on bias correction.</p><p><strong>Source: </strong>Several approaches to bias correction in administrative data were reviewed and application of these methods was demonstrated using an example from the literature: physician claims and hospitalization data were employed to estimate diabetes prevalence in Ontario, Canada.</p><p><strong>Findings: </strong>Misclassification bias in prevalence estimates from administrative data can be reduced by developing and selecting an optimal algorithm for case identification, applying a bias correction formula, or using statistical modelling. An algorithm for which sensitivity equals positive predictive value provides an unbiased estimate of prevalence. Bias reduction methods generally require information about the measurement properties of the algorithm, such as sensitivity, specificity, or predictive value. These properties depend on disease type, prevalence, algorithm definition (including the observation window), and may vary by population and time. Prevalence estimates can be improved by applying multivariable disease prediction models.</p><p><strong>Conclusion: </strong>Frequency of a positive case identification algorithm in administrative data is generally not equivalent to disease prevalence. Although prevalence estimates can be corrected for bias using known measurement properties of the algorithm, these properties may be difficult to estimate accurately; therefore, disease prevalence estimates based on administrative data must be treated with caution.</p>","PeriodicalId":50683,"journal":{"name":"Clinical and Investigative Medicine","volume":null,"pages":null},"PeriodicalIF":1.2000,"publicationDate":"2022-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical and Investigative Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.25011/cim.v45i2.38100","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 1
Abstract
Purpose: Disease prevalence estimates from population-based administrative databases are often biased due to measurement (misclassification) errors. The purpose of this article is to review the methodology for estimating disease prevalence in administrative data, with a focus on bias correction.
Source: Several approaches to bias correction in administrative data were reviewed and application of these methods was demonstrated using an example from the literature: physician claims and hospitalization data were employed to estimate diabetes prevalence in Ontario, Canada.
Findings: Misclassification bias in prevalence estimates from administrative data can be reduced by developing and selecting an optimal algorithm for case identification, applying a bias correction formula, or using statistical modelling. An algorithm for which sensitivity equals positive predictive value provides an unbiased estimate of prevalence. Bias reduction methods generally require information about the measurement properties of the algorithm, such as sensitivity, specificity, or predictive value. These properties depend on disease type, prevalence, algorithm definition (including the observation window), and may vary by population and time. Prevalence estimates can be improved by applying multivariable disease prediction models.
Conclusion: Frequency of a positive case identification algorithm in administrative data is generally not equivalent to disease prevalence. Although prevalence estimates can be corrected for bias using known measurement properties of the algorithm, these properties may be difficult to estimate accurately; therefore, disease prevalence estimates based on administrative data must be treated with caution.
期刊介绍:
Clinical and Investigative Medicine (CIM), publishes original work in the field of Clinical Investigation. Original work includes clinical or laboratory investigations and clinical reports. Reviews include information for Continuing Medical Education (CME), narrative review articles, systematic reviews, and meta-analyses.