Rupa Makadia, Azza Shoaibi, Gowtham A Rao, Anna Ostropolets, Peter R Rijnbeek, Erica A Voss, Talita Duarte-Salles, Juan Manuel Ramírez-Anguita, Miguel A Mayer, Filip Maljković, Spiros Denaxas, Fredrik Nyberg, Vaclav Papez, Anthony G Sena, Thamir M Alshammari, Lana Y H Lai, Kevin Haynes, Marc A Suchard, George Hripcsak, Patrick B Ryan
{"title":"评估全球数据网络中不同表型定义对发病率的影响。","authors":"Rupa Makadia, Azza Shoaibi, Gowtham A Rao, Anna Ostropolets, Peter R Rijnbeek, Erica A Voss, Talita Duarte-Salles, Juan Manuel Ramírez-Anguita, Miguel A Mayer, Filip Maljković, Spiros Denaxas, Fredrik Nyberg, Vaclav Papez, Anthony G Sena, Thamir M Alshammari, Lana Y H Lai, Kevin Haynes, Marc A Suchard, George Hripcsak, Patrick B Ryan","doi":"10.1093/jamiaopen/ooad096","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Developing accurate phenotype definitions is critical in obtaining reliable and reproducible background rates in safety research. This study aims to illustrate the differences in background incidence rates by comparing definitions for a given outcome.</p><p><strong>Materials and methods: </strong>We used 16 data sources to systematically generate and evaluate outcomes for 13 adverse events and their overall background rates. We examined the effect of different modifications (inpatient setting, standardization of code set, and code set changes) to the computable phenotype on background incidence rates.</p><p><strong>Results: </strong>Rate ratios (RRs) of the incidence rates from each computable phenotype definition varied across outcomes, with inpatient restriction showing the highest variation from 1 to 11.93. Standardization of code set RRs ranges from 1 to 1.64, and code set changes range from 1 to 2.52.</p><p><strong>Discussion: </strong>The modification that has the highest impact is requiring inpatient place of service, leading to at least a 2-fold higher incidence rate in the base definition. Standardization showed almost no change when using source code variations. The strength of the effect in the inpatient restriction is highly dependent on the outcome. Changing definitions from broad to narrow showed the most variability by age/gender/database across phenotypes and less than a 2-fold increase in rate compared to the base definition.</p><p><strong>Conclusion: </strong>Characterization of outcomes across a network of databases yields insights into sensitivity and specificity trade-offs when definitions are altered. Outcomes should be thoroughly evaluated prior to use for background rates for their plausibility for use across a global network.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"6 4","pages":"ooad096"},"PeriodicalIF":2.5000,"publicationDate":"2023-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10662662/pdf/","citationCount":"0","resultStr":"{\"title\":\"Evaluating the impact of alternative phenotype definitions on incidence rates across a global data network.\",\"authors\":\"Rupa Makadia, Azza Shoaibi, Gowtham A Rao, Anna Ostropolets, Peter R Rijnbeek, Erica A Voss, Talita Duarte-Salles, Juan Manuel Ramírez-Anguita, Miguel A Mayer, Filip Maljković, Spiros Denaxas, Fredrik Nyberg, Vaclav Papez, Anthony G Sena, Thamir M Alshammari, Lana Y H Lai, Kevin Haynes, Marc A Suchard, George Hripcsak, Patrick B Ryan\",\"doi\":\"10.1093/jamiaopen/ooad096\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>Developing accurate phenotype definitions is critical in obtaining reliable and reproducible background rates in safety research. This study aims to illustrate the differences in background incidence rates by comparing definitions for a given outcome.</p><p><strong>Materials and methods: </strong>We used 16 data sources to systematically generate and evaluate outcomes for 13 adverse events and their overall background rates. We examined the effect of different modifications (inpatient setting, standardization of code set, and code set changes) to the computable phenotype on background incidence rates.</p><p><strong>Results: </strong>Rate ratios (RRs) of the incidence rates from each computable phenotype definition varied across outcomes, with inpatient restriction showing the highest variation from 1 to 11.93. Standardization of code set RRs ranges from 1 to 1.64, and code set changes range from 1 to 2.52.</p><p><strong>Discussion: </strong>The modification that has the highest impact is requiring inpatient place of service, leading to at least a 2-fold higher incidence rate in the base definition. Standardization showed almost no change when using source code variations. The strength of the effect in the inpatient restriction is highly dependent on the outcome. Changing definitions from broad to narrow showed the most variability by age/gender/database across phenotypes and less than a 2-fold increase in rate compared to the base definition.</p><p><strong>Conclusion: </strong>Characterization of outcomes across a network of databases yields insights into sensitivity and specificity trade-offs when definitions are altered. Outcomes should be thoroughly evaluated prior to use for background rates for their plausibility for use across a global network.</p>\",\"PeriodicalId\":36278,\"journal\":{\"name\":\"JAMIA Open\",\"volume\":\"6 4\",\"pages\":\"ooad096\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2023-11-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10662662/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JAMIA Open\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/jamiaopen/ooad096\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2023/12/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JAMIA Open","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jamiaopen/ooad096","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/12/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
Evaluating the impact of alternative phenotype definitions on incidence rates across a global data network.
Objective: Developing accurate phenotype definitions is critical in obtaining reliable and reproducible background rates in safety research. This study aims to illustrate the differences in background incidence rates by comparing definitions for a given outcome.
Materials and methods: We used 16 data sources to systematically generate and evaluate outcomes for 13 adverse events and their overall background rates. We examined the effect of different modifications (inpatient setting, standardization of code set, and code set changes) to the computable phenotype on background incidence rates.
Results: Rate ratios (RRs) of the incidence rates from each computable phenotype definition varied across outcomes, with inpatient restriction showing the highest variation from 1 to 11.93. Standardization of code set RRs ranges from 1 to 1.64, and code set changes range from 1 to 2.52.
Discussion: The modification that has the highest impact is requiring inpatient place of service, leading to at least a 2-fold higher incidence rate in the base definition. Standardization showed almost no change when using source code variations. The strength of the effect in the inpatient restriction is highly dependent on the outcome. Changing definitions from broad to narrow showed the most variability by age/gender/database across phenotypes and less than a 2-fold increase in rate compared to the base definition.
Conclusion: Characterization of outcomes across a network of databases yields insights into sensitivity and specificity trade-offs when definitions are altered. Outcomes should be thoroughly evaluated prior to use for background rates for their plausibility for use across a global network.