Pub Date : 2021-06-01Epub Date: 2020-10-20DOI: 10.1007/s10742-020-00222-8
Ryan D Ross, Xu Shi, Megan E V Caram, Pheobe A Tsao, Paul Lin, Amy Bohnert, Min Zhang, Bhramar Mukherjee
Medical insurance claims are becoming increasingly common data sources to answer a variety of questions in biomedical research. Although comprehensive in terms of longitudinal characterization of disease development and progression for a potentially large number of patients, population-based inference using these datasets require thoughtful modifications to sample selection and analytic strategies relative to other types of studies. Along with complex selection bias and missing data issues, claims-based studies are purely observational, which limits effective understanding and characterization of the treatment differences between groups being compared. All these issues contribute to a crisis in reproducibility and replication of comparative findings using medical claims. This paper offers practical guidance to the analytical process, demonstrates methods for estimating causal treatment effects with propensity score methods for several types of outcomes common to such studies, such as binary, count, time to event and longitudinally-varying measures, and also aims to increase transparency and reproducibility of reporting of results from these investigations. We provide an online version of the paper with readily implementable code for the entire analysis pipeline to serve as a guided tutorial for practitioners. The online version can be accessed at https://rydaro.github.io/. The analytic pipeline is illustrated using a sub-cohort of patients with advanced prostate cancer from the large Clinformatics TM Data Mart Database (OptumInsight, Eden Prairie, Minnesota), consisting of 73 million distinct private payer insurees from 2001-2016.
医疗保险索赔正日益成为回答生物医学研究中各种问题的常见数据来源。尽管这些数据集对潜在的大量患者的疾病发展和进展进行了全面的纵向表征,但与其他类型的研究相比,基于人群的推断需要对样本选择和分析策略进行深思熟虑的修改。伴随着复杂的选择偏差和缺失数据问题,基于声明的研究纯粹是观察性的,这限制了对被比较组之间治疗差异的有效理解和表征。所有这些问题都造成了利用医疗索赔对比较结果进行再现和复制的危机。本文为分析过程提供了实用指导,展示了使用倾向评分方法估计因果治疗效果的方法,这些方法用于此类研究常见的几种结果类型,如二进制、计数、事件时间和纵向变化测量,并且还旨在提高这些调查结果报告的透明度和可重复性。我们提供论文的在线版本,其中包含整个分析管道的易于实现的代码,以作为实践者的指导教程。在线版本可访问https://rydaro.github.io/。该分析流程使用来自大型Clinformatics TM数据集市数据库(OptumInsight, Eden Prairie, Minnesota)的晚期前列腺癌患者亚队列进行说明,其中包括2001-2016年7300万不同的私人付款人保险。
{"title":"Veridical Causal Inference using Propensity Score Methods for Comparative Effectiveness Research with Medical Claims.","authors":"Ryan D Ross, Xu Shi, Megan E V Caram, Pheobe A Tsao, Paul Lin, Amy Bohnert, Min Zhang, Bhramar Mukherjee","doi":"10.1007/s10742-020-00222-8","DOIUrl":"https://doi.org/10.1007/s10742-020-00222-8","url":null,"abstract":"<p><p>Medical insurance claims are becoming increasingly common data sources to answer a variety of questions in biomedical research. Although comprehensive in terms of longitudinal characterization of disease development and progression for a potentially large number of patients, population-based inference using these datasets require thoughtful modifications to sample selection and analytic strategies relative to other types of studies. Along with complex selection bias and missing data issues, claims-based studies are purely observational, which limits effective understanding and characterization of the treatment differences between groups being compared. All these issues contribute to a crisis in reproducibility and replication of comparative findings using medical claims. This paper offers practical guidance to the analytical process, demonstrates methods for estimating causal treatment effects with propensity score methods for several types of outcomes common to such studies, such as binary, count, time to event and longitudinally-varying measures, and also aims to increase transparency and reproducibility of reporting of results from these investigations. We provide an online version of the paper with readily implementable code for the entire analysis pipeline to serve as a guided tutorial for practitioners. The online version can be accessed at https://rydaro.github.io/. The analytic pipeline is illustrated using a sub-cohort of patients with advanced prostate cancer from the large Clinformatics TM Data Mart Database (OptumInsight, Eden Prairie, Minnesota), consisting of 73 million distinct private payer insurees from 2001-2016.</p>","PeriodicalId":45600,"journal":{"name":"Health Services and Outcomes Research Methodology","volume":"21 2","pages":"206-228"},"PeriodicalIF":1.5,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s10742-020-00222-8","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39022715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-06-01Epub Date: 2020-11-12DOI: 10.1007/s10742-020-00229-1
Sasmira Matta, Joanne W Hsu, Theodore J Iwashyna, Micah Y Baum, Kenneth M Langa, Lauren Hersch Nicholas
Marital status is recognized as an important social determinant of health, income, and social support, but is rarely available in administrative data. We assessed the feasibility of using exact address data and zip code history to identify cohabiting couples using the 2018 Medicare Vital Status file and ZIP codes in the 2011-2014 Master Beneficiary Summary Files. Medicare beneficiaries meeting our algorithm displayed characteristics consistent with assortative mating and resembled known married couples in the Health and Retirement Study linked to Medicare claims. Address information represents a promising strategy for identifying cohabiting couples in administrative data including healthcare claims and other data types.
{"title":"Identifying Cohabiting Couples in Administrative Data: Evidence from Medicare Address Data.","authors":"Sasmira Matta, Joanne W Hsu, Theodore J Iwashyna, Micah Y Baum, Kenneth M Langa, Lauren Hersch Nicholas","doi":"10.1007/s10742-020-00229-1","DOIUrl":"https://doi.org/10.1007/s10742-020-00229-1","url":null,"abstract":"<p><p>Marital status is recognized as an important social determinant of health, income, and social support, but is rarely available in administrative data. We assessed the feasibility of using exact address data and zip code history to identify cohabiting couples using the 2018 Medicare Vital Status file and ZIP codes in the 2011-2014 Master Beneficiary Summary Files. Medicare beneficiaries meeting our algorithm displayed characteristics consistent with assortative mating and resembled known married couples in the Health and Retirement Study linked to Medicare claims. Address information represents a promising strategy for identifying cohabiting couples in administrative data including healthcare claims and other data types.</p>","PeriodicalId":45600,"journal":{"name":"Health Services and Outcomes Research Methodology","volume":"21 2","pages":"238-247"},"PeriodicalIF":1.5,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s10742-020-00229-1","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39249437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-05-07DOI: 10.1007/s10742-021-00249-5
Samuel Justice, Daniel K. Sewell, A. Miller, J. Simmering, P. Polgreen
{"title":"Inferring patient transfer networks between healthcare facilities","authors":"Samuel Justice, Daniel K. Sewell, A. Miller, J. Simmering, P. Polgreen","doi":"10.1007/s10742-021-00249-5","DOIUrl":"https://doi.org/10.1007/s10742-021-00249-5","url":null,"abstract":"","PeriodicalId":45600,"journal":{"name":"Health Services and Outcomes Research Methodology","volume":"12 1","pages":"1-15"},"PeriodicalIF":1.5,"publicationDate":"2021-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77853442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-04-21DOI: 10.1007/s10742-022-00283-x
O. Bernstein, Brian G. Vegetabile, C. R. Salazar, J. Grill, D. Gillen
{"title":"Adjustment for biased sampling using NHANES derived propensity weights","authors":"O. Bernstein, Brian G. Vegetabile, C. R. Salazar, J. Grill, D. Gillen","doi":"10.1007/s10742-022-00283-x","DOIUrl":"https://doi.org/10.1007/s10742-022-00283-x","url":null,"abstract":"","PeriodicalId":45600,"journal":{"name":"Health Services and Outcomes Research Methodology","volume":"76 1","pages":"21 - 44"},"PeriodicalIF":1.5,"publicationDate":"2021-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88372449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-04-18DOI: 10.1007/s10742-021-00247-7
Brant Morefield, L. Tomai
{"title":"Distinguishing frontloading: an examination of medicare home health claims","authors":"Brant Morefield, L. Tomai","doi":"10.1007/s10742-021-00247-7","DOIUrl":"https://doi.org/10.1007/s10742-021-00247-7","url":null,"abstract":"","PeriodicalId":45600,"journal":{"name":"Health Services and Outcomes Research Methodology","volume":"5 1","pages":"477 - 485"},"PeriodicalIF":1.5,"publicationDate":"2021-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88383588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-03-09DOI: 10.1007/s10742-021-00245-9
Katelyn Cordell, H. Rao, John Lyons
{"title":"Authentic assessments: a method to detect anomalies in assessment response patterns via neural network","authors":"Katelyn Cordell, H. Rao, John Lyons","doi":"10.1007/s10742-021-00245-9","DOIUrl":"https://doi.org/10.1007/s10742-021-00245-9","url":null,"abstract":"","PeriodicalId":45600,"journal":{"name":"Health Services and Outcomes Research Methodology","volume":"35 1","pages":"439 - 458"},"PeriodicalIF":1.5,"publicationDate":"2021-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78030617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-03-01Epub Date: 2021-01-04DOI: 10.1007/s10742-020-00230-8
Guanqing Chen, Valerie A Lewis, Daniel Gottlieb, A James O'Malley
First introduced in early 2000s, the accountable care organization (ACO) is designed to lower health care costs while improving quality of care and has become one of the most important coordinated care technologies in the United States. In this research, we use the Medicare fee-for-service claims data from 2009-2014 to estimate the heterogeneous effects of Medicare ACO programs on hospital admissions across hospital referral regions (HRRs) and provider groups. To conduct our analysis, a model for a difference-in-difference (DID) study is embellished in multiple ways to account for intricacies and complexity with the data not able to be accounted for using existing models. Of particular note, we propose a Gaussian mixture model to account for the inability to observe the practice group affiliation of physicians if the organization they worked for did not become an ACO, which is needed to ensure appropriate partitioning of variation across the different units. The results suggest that the ACO programs reduced the rate of readmission to hospital, that the ACO program may have reduced heterogeneity in readmission rates, and that the effect of joining an ACO varied considerably across medical groups.
{"title":"Estimating Heterogeneous Effects of a Policy Intervention across Organizations when Organization Affiliation is Missing for the Control Group: Application to the Evaluation of Accountable Care Organizations.","authors":"Guanqing Chen, Valerie A Lewis, Daniel Gottlieb, A James O'Malley","doi":"10.1007/s10742-020-00230-8","DOIUrl":"https://doi.org/10.1007/s10742-020-00230-8","url":null,"abstract":"<p><p>First introduced in early 2000s, the accountable care organization (ACO) is designed to lower health care costs while improving quality of care and has become one of the most important coordinated care technologies in the United States. In this research, we use the Medicare fee-for-service claims data from 2009-2014 to estimate the heterogeneous effects of Medicare ACO programs on hospital admissions across hospital referral regions (HRRs) and provider groups. To conduct our analysis, a model for a difference-in-difference (DID) study is embellished in multiple ways to account for intricacies and complexity with the data not able to be accounted for using existing models. Of particular note, we propose a Gaussian mixture model to account for the inability to observe the practice group affiliation of physicians if the organization they worked for did not become an ACO, which is needed to ensure appropriate partitioning of variation across the different units. The results suggest that the ACO programs reduced the rate of readmission to hospital, that the ACO program may have reduced heterogeneity in readmission rates, and that the effect of joining an ACO varied considerably across medical groups.</p>","PeriodicalId":45600,"journal":{"name":"Health Services and Outcomes Research Methodology","volume":"21 1","pages":"54-68"},"PeriodicalIF":1.5,"publicationDate":"2021-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s10742-020-00230-8","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25426299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-03-01Epub Date: 2020-11-05DOI: 10.1007/s10742-020-00227-3
David Kline, Yixuan Ji, Staci Hepler
Opioid misuse is a significant public health issue and a national epidemic with a high prevalence of associated morbidity and mortality. The epidemic is particularly severe in Ohio which has some of the highest overdose rates in the country. It is important to understand spatial and temporal trends of the opioid epidemic to learn more about areas that are most affected and to inform potential community interventions and resource allocation. We propose a multivariate spatio-temporal model to leverage existing surveillance measures, opioid-associated deaths and treatment admissions, to learn about the underlying epidemic for counties in Ohio. We do this using a temporally varying spatial factor that synthesizes information from both counts to estimate common underlying risk which we interpret as the burden of the epidemic. We demonstrate the use of this model with county-level data from 2007-2018 in Ohio. Through our model estimates, we identify counties with above and below average burden and examine how those regions have shifted over time given overall statewide trends. Specifically, we highlight the sustained above average burden of the opioid epidemic on southern Ohio throughout the 12 years examined.
{"title":"A multivariate spatio-temporal model of the opioid epidemic in Ohio: A factor model approach.","authors":"David Kline, Yixuan Ji, Staci Hepler","doi":"10.1007/s10742-020-00227-3","DOIUrl":"https://doi.org/10.1007/s10742-020-00227-3","url":null,"abstract":"<p><p>Opioid misuse is a significant public health issue and a national epidemic with a high prevalence of associated morbidity and mortality. The epidemic is particularly severe in Ohio which has some of the highest overdose rates in the country. It is important to understand spatial and temporal trends of the opioid epidemic to learn more about areas that are most affected and to inform potential community interventions and resource allocation. We propose a multivariate spatio-temporal model to leverage existing surveillance measures, opioid-associated deaths and treatment admissions, to learn about the underlying epidemic for counties in Ohio. We do this using a temporally varying spatial factor that synthesizes information from both counts to estimate common underlying risk which we interpret as the burden of the epidemic. We demonstrate the use of this model with county-level data from 2007-2018 in Ohio. Through our model estimates, we identify counties with above and below average burden and examine how those regions have shifted over time given overall statewide trends. Specifically, we highlight the sustained above average burden of the opioid epidemic on southern Ohio throughout the 12 years examined.</p>","PeriodicalId":45600,"journal":{"name":"Health Services and Outcomes Research Methodology","volume":"21 ","pages":"42-53"},"PeriodicalIF":1.5,"publicationDate":"2021-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s10742-020-00227-3","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39219285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-03-01Epub Date: 2021-02-13DOI: 10.1007/s10742-020-00236-2
Brian G Vegetabile, Beth Ann Griffin, Donna L Coffman, Matthew Cefalu, Michael W Robbins, Daniel F McCaffrey
Weighted estimators are commonly used for estimating exposure effects in observational settings to establish causal relations. These estimators have a long history of development when the exposure of interest is binary and where the weights are typically functions of an estimated propensity score. Recent developments in optimization-based estimators for constructing weights in binary exposure settings, such as those based on entropy balancing, have shown more promise in estimating treatment effects than those methods that focus on the direct estimation of the propensity score using likelihood-based methods. This paper explores recent developments of entropy balancing methods to continuous exposure settings and the estimation of population dose-response curves using nonparametric estimation combined with entropy balancing weights, focusing on factors that would be important to applied researchers in medical or health services research. The methods developed here are applied to data from a study assessing the effect of non-randomized components of an evidence-based substance use treatment program on emotional and substance use clinical outcomes.
{"title":"Nonparametric Estimation of Population Average Dose-Response Curves using Entropy Balancing Weights for Continuous Exposures.","authors":"Brian G Vegetabile, Beth Ann Griffin, Donna L Coffman, Matthew Cefalu, Michael W Robbins, Daniel F McCaffrey","doi":"10.1007/s10742-020-00236-2","DOIUrl":"https://doi.org/10.1007/s10742-020-00236-2","url":null,"abstract":"<p><p>Weighted estimators are commonly used for estimating exposure effects in observational settings to establish causal relations. These estimators have a long history of development when the exposure of interest is binary and where the weights are typically functions of an estimated propensity score. Recent developments in optimization-based estimators for constructing weights in binary exposure settings, such as those based on entropy balancing, have shown more promise in estimating treatment effects than those methods that focus on the direct estimation of the propensity score using likelihood-based methods. This paper explores recent developments of entropy balancing methods to continuous exposure settings and the estimation of population dose-response curves using nonparametric estimation combined with entropy balancing weights, focusing on factors that would be important to applied researchers in medical or health services research. The methods developed here are applied to data from a study assessing the effect of non-randomized components of an evidence-based substance use treatment program on emotional and substance use clinical outcomes.</p>","PeriodicalId":45600,"journal":{"name":"Health Services and Outcomes Research Methodology","volume":"21 1","pages":"69-110"},"PeriodicalIF":1.5,"publicationDate":"2021-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s10742-020-00236-2","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39385609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-02-26DOI: 10.1007/s10742-021-00244-w
S. Lindner, K. McConnell
{"title":"Heterogeneous treatment effects and bias in the analysis of the stepped wedge design","authors":"S. Lindner, K. McConnell","doi":"10.1007/s10742-021-00244-w","DOIUrl":"https://doi.org/10.1007/s10742-021-00244-w","url":null,"abstract":"","PeriodicalId":45600,"journal":{"name":"Health Services and Outcomes Research Methodology","volume":"31 1","pages":"419 - 438"},"PeriodicalIF":1.5,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s10742-021-00244-w","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72398102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}