Alexandrea J Anderson, Lisa Lix, Carla Loeppky, Paul Van Caeseele, John A Queenan, Alyson L Mahar
{"title":"利用马尼托巴省的行政数据验证识别人体免疫缺陷病毒病例的算法。","authors":"Alexandrea J Anderson, Lisa Lix, Carla Loeppky, Paul Van Caeseele, John A Queenan, Alyson L Mahar","doi":"10.17269/s41997-024-00951-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Population-based administrative data are valuable for describing human immunodeficiency virus (HIV) cases, and their health status and outcomes. Our objective was to validate algorithms consisting of physician visits, hospitalizations, and antiretroviral prescriptions against positive confirmatory HIV laboratory tests to identify individuals living with HIV.</p><p><strong>Methods: </strong>The primary validation cohort consisted of adult Manitoban residents with at least 3 years of health coverage between 2007 and 2018. Positive confirmatory HIV tests from the provincial laboratory were the reference standard. We evaluated 15 algorithms requiring 2 or 3 years of administrative data (hospital, physician, and prescription records) to ascertain cases. Seven measures of accuracy were estimated: sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), Youden's J, kappa, and area under the receiver operating characteristic curve (AUC) and their 95% confidence intervals. Validity was estimated for pregnant females.</p><p><strong>Results: </strong>The primary validation cohort included 966,507 individuals, of whom 1452 (0.2%) were HIV cases. Algorithm sensitivity ranged from 82.8% to 97.5%. PPV ranged from 51.8% to 97.8%. Youden's J ranged from 0.83 to 0.97. Kappa ranged from 0.68 to 0.93. AUC ranged from 0.91 to 0.99.</p><p><strong>Conclusion: </strong>Researchers have a range of algorithms to ascertain HIV cases in administrative data; selection of an appropriate algorithm depends on the user goal. To maximize performance to distinguish HIV cases and non-cases while minimizing data requirements, an algorithm based on three or more physician visits in 2 years is recommended. Further validation in other provinces and territories will assess the generalizability of these findings.</p>","PeriodicalId":51407,"journal":{"name":"Canadian Journal of Public Health-Revue Canadienne De Sante Publique","volume":" ","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Validation of algorithms to identify human immunodeficiency virus cases using administrative data in Manitoba.\",\"authors\":\"Alexandrea J Anderson, Lisa Lix, Carla Loeppky, Paul Van Caeseele, John A Queenan, Alyson L Mahar\",\"doi\":\"10.17269/s41997-024-00951-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>Population-based administrative data are valuable for describing human immunodeficiency virus (HIV) cases, and their health status and outcomes. Our objective was to validate algorithms consisting of physician visits, hospitalizations, and antiretroviral prescriptions against positive confirmatory HIV laboratory tests to identify individuals living with HIV.</p><p><strong>Methods: </strong>The primary validation cohort consisted of adult Manitoban residents with at least 3 years of health coverage between 2007 and 2018. Positive confirmatory HIV tests from the provincial laboratory were the reference standard. We evaluated 15 algorithms requiring 2 or 3 years of administrative data (hospital, physician, and prescription records) to ascertain cases. Seven measures of accuracy were estimated: sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), Youden's J, kappa, and area under the receiver operating characteristic curve (AUC) and their 95% confidence intervals. Validity was estimated for pregnant females.</p><p><strong>Results: </strong>The primary validation cohort included 966,507 individuals, of whom 1452 (0.2%) were HIV cases. Algorithm sensitivity ranged from 82.8% to 97.5%. PPV ranged from 51.8% to 97.8%. Youden's J ranged from 0.83 to 0.97. Kappa ranged from 0.68 to 0.93. AUC ranged from 0.91 to 0.99.</p><p><strong>Conclusion: </strong>Researchers have a range of algorithms to ascertain HIV cases in administrative data; selection of an appropriate algorithm depends on the user goal. To maximize performance to distinguish HIV cases and non-cases while minimizing data requirements, an algorithm based on three or more physician visits in 2 years is recommended. Further validation in other provinces and territories will assess the generalizability of these findings.</p>\",\"PeriodicalId\":51407,\"journal\":{\"name\":\"Canadian Journal of Public Health-Revue Canadienne De Sante Publique\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2024-11-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Canadian Journal of Public Health-Revue Canadienne De Sante Publique\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.17269/s41997-024-00951-0\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Canadian Journal of Public Health-Revue Canadienne De Sante Publique","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.17269/s41997-024-00951-0","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0
摘要
目的:基于人口的管理数据对于描述人类免疫缺陷病毒(HIV)病例及其健康状况和结果非常有价值。我们的目标是验证由医生就诊、住院治疗和抗逆转录病毒处方组成的算法与阳性 HIV 确证实验室检测结果的对比,以识别 HIV 感染者:主要验证队列由 2007 年至 2018 年期间至少有 3 年医疗保险的马尼托巴州成年居民组成。省级实验室的阳性确证 HIV 检测结果为参考标准。我们评估了 15 种算法,这些算法需要 2 年或 3 年的管理数据(医院、医生和处方记录)来确定病例。我们估算了七项准确性指标:灵敏度、特异性、阳性预测值(PPV)、阴性预测值(NPV)、Youden's J、kappa、接收者工作特征曲线下面积(AUC)及其 95% 置信区间。对怀孕女性的有效性进行了估计:主要验证队列包括 966 507 人,其中 1452 人(0.2%)为 HIV 感染病例。算法灵敏度在 82.8% 到 97.5% 之间。PPV从51.8%到97.8%不等。Youden's J 介于 0.83 到 0.97 之间。Kappa 为 0.68 至 0.93。AUC 在 0.91 到 0.99 之间:研究人员有一系列算法来确定行政数据中的 HIV 病例;选择合适的算法取决于用户的目标。为了最大限度地提高区分艾滋病病例和非病例的性能,同时最大限度地减少数据需求,建议采用基于两年内三次或三次以上医生就诊的算法。在其他省份和地区的进一步验证将评估这些发现的普遍性。
Validation of algorithms to identify human immunodeficiency virus cases using administrative data in Manitoba.
Objective: Population-based administrative data are valuable for describing human immunodeficiency virus (HIV) cases, and their health status and outcomes. Our objective was to validate algorithms consisting of physician visits, hospitalizations, and antiretroviral prescriptions against positive confirmatory HIV laboratory tests to identify individuals living with HIV.
Methods: The primary validation cohort consisted of adult Manitoban residents with at least 3 years of health coverage between 2007 and 2018. Positive confirmatory HIV tests from the provincial laboratory were the reference standard. We evaluated 15 algorithms requiring 2 or 3 years of administrative data (hospital, physician, and prescription records) to ascertain cases. Seven measures of accuracy were estimated: sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), Youden's J, kappa, and area under the receiver operating characteristic curve (AUC) and their 95% confidence intervals. Validity was estimated for pregnant females.
Results: The primary validation cohort included 966,507 individuals, of whom 1452 (0.2%) were HIV cases. Algorithm sensitivity ranged from 82.8% to 97.5%. PPV ranged from 51.8% to 97.8%. Youden's J ranged from 0.83 to 0.97. Kappa ranged from 0.68 to 0.93. AUC ranged from 0.91 to 0.99.
Conclusion: Researchers have a range of algorithms to ascertain HIV cases in administrative data; selection of an appropriate algorithm depends on the user goal. To maximize performance to distinguish HIV cases and non-cases while minimizing data requirements, an algorithm based on three or more physician visits in 2 years is recommended. Further validation in other provinces and territories will assess the generalizability of these findings.
期刊介绍:
The Canadian Journal of Public Health is dedicated to fostering excellence in public health research, scholarship, policy and practice. The aim of the Journal is to advance public health research and practice in Canada and around the world, thus contributing to the improvement of the health of populations and the reduction of health inequalities.
CJPH publishes original research and scholarly articles submitted in either English or French that are relevant to population and public health.
CJPH is an independent, peer-reviewed journal owned by the Canadian Public Health Association and published by Springer.
Énoncé de mission
La Revue canadienne de santé publique se consacre à promouvoir l’excellence dans la recherche, les travaux d’érudition, les politiques et les pratiques de santé publique. Son but est de faire progresser la recherche et les pratiques de santé publique au Canada et dans le monde, contribuant ainsi à l’amélioration de la santé des populations et à la réduction des inégalités de santé.
La RCSP publie des articles savants et des travaux inédits, soumis en anglais ou en français, qui sont d’intérêt pour la santé publique et des populations.
La RCSP est une revue indépendante avec comité de lecture, propriété de l’Association canadienne de santé publique et publiée par Springer.