Pub Date : 2024-11-15DOI: 10.1177/09622802241281035
Muhammad Qasim, Kristofer Månsson, Narayanaswamy Balakrishnan
Valid instrumental variables (IVs) must not directly impact the outcome variable and must also be uncorrelated with nonmeasured variables. However, in practice, IVs are likely to be invalid. The existing methods can lead to large bias relative to standard errors in situations with many weak and invalid instruments. In this paper, we derive a LASSO procedure for the k-class IV estimation methods in the linear IV model. In addition, we propose the jackknife IV method by using LASSO to address the problem of many weak invalid instruments in the case of heteroscedastic data. The proposed methods are robust for estimating causal effects in the presence of many invalid and valid instruments, with theoretical assurances of their execution. In addition, two-step numerical algorithms are developed for the estimation of causal effects. The performance of the proposed estimators is demonstrated via Monte Carlo simulations as well as an empirical application. We use Mendelian randomization as an application, wherein we estimate the causal effect of body mass index on the health-related quality of life index using single nucleotide polymorphisms as instruments for body mass index.
有效的工具变量(IV)必须不直接影响结果变量,而且必须与非测量变量不相关。然而,在实践中,IV 很可能是无效的。在存在许多弱工具和无效工具的情况下,现有方法可能会导致相对于标准误差的较大偏差。本文推导了线性 IV 模型中 k 类 IV 估计方法的 LASSO 程序。此外,我们还利用 LASSO 提出了 jackknife IV 方法,以解决异方差数据中许多弱无效工具的问题。所提出的方法在存在许多无效和有效工具的情况下都能稳健地估计因果效应,并从理论上保证了这些方法的执行。此外,还开发了用于估计因果效应的两步数字算法。我们通过蒙特卡罗模拟和经验应用证明了所提出的估计方法的性能。我们将孟德尔随机化作为一个应用,使用单核苷酸多态性作为体重指数的工具来估计体重指数对健康相关生活质量指数的因果效应。
{"title":"LASSO-type instrumental variable selection methods with an application to Mendelian randomization.","authors":"Muhammad Qasim, Kristofer Månsson, Narayanaswamy Balakrishnan","doi":"10.1177/09622802241281035","DOIUrl":"10.1177/09622802241281035","url":null,"abstract":"<p><p>Valid instrumental variables (IVs) must not directly impact the outcome variable and must also be uncorrelated with nonmeasured variables. However, in practice, IVs are likely to be invalid. The existing methods can lead to large bias relative to standard errors in situations with many weak and invalid instruments. In this paper, we derive a LASSO procedure for the <i>k</i>-class IV estimation methods in the linear IV model. In addition, we propose the jackknife IV method by using LASSO to address the problem of many weak invalid instruments in the case of heteroscedastic data. The proposed methods are robust for estimating causal effects in the presence of many invalid and valid instruments, with theoretical assurances of their execution. In addition, two-step numerical algorithms are developed for the estimation of causal effects. The performance of the proposed estimators is demonstrated via Monte Carlo simulations as well as an empirical application. We use Mendelian randomization as an application, wherein we estimate the causal effect of body mass index on the health-related quality of life index using single nucleotide polymorphisms as instruments for body mass index.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"9622802241281035"},"PeriodicalIF":1.6,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142628529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-05DOI: 10.1177/09622802241293783
Jules Antoine Pereira Macedo, Bruno Giraudeau, Escient Collaborators
In cluster randomized trials (CRTs) with a binary outcome, intervention effects are usually reported as odds ratios, but the CONSORT statement advocates reporting both a relative and an absolute intervention effect. With a simulation study, we assessed several methods to estimate a risk difference (RD) in the framework of a CRT with adjustment on both individual- and cluster-level covariates. We considered both a conditional approach (with the generalized linear mixed model [GLMM]) and a marginal approach (with the generalized estimating equation [GEE]). For both approaches, we considered the Gaussian, binomial, and Poisson distributions. When considering the binomial or Poisson distribution, we used the g-computation method to estimate the RD. Convergence problems were observed with the GEE approach, especially with low intra-cluster coefficient correlation values, small number of clusters, small mean cluster size, high number of covariates, and prevalences close to 0. All methods reported no bias. The Gaussian distribution with both approaches and binomial and Poisson distributions with the GEE approach had satisfactory results in estimating the standard error. Results for type I error and coverage rates were better with the GEE than GLMM approach. We recommend using the Gaussian distribution because of its ease of use (the RD is estimated in one step only). The GEE approach should be preferred and replaced with the GLMM approach in cases of convergence problems.
{"title":"Estimating an adjusted risk difference in a cluster randomized trial with individual-level analyses.","authors":"Jules Antoine Pereira Macedo, Bruno Giraudeau, Escient Collaborators","doi":"10.1177/09622802241293783","DOIUrl":"https://doi.org/10.1177/09622802241293783","url":null,"abstract":"<p><p>In cluster randomized trials (CRTs) with a binary outcome, intervention effects are usually reported as odds ratios, but the CONSORT statement advocates reporting both a relative and an absolute intervention effect. With a simulation study, we assessed several methods to estimate a risk difference (RD) in the framework of a CRT with adjustment on both individual- and cluster-level covariates. We considered both a conditional approach (with the generalized linear mixed model [GLMM]) and a marginal approach (with the generalized estimating equation [GEE]). For both approaches, we considered the Gaussian, binomial, and Poisson distributions. When considering the binomial or Poisson distribution, we used the g-computation method to estimate the RD. Convergence problems were observed with the GEE approach, especially with low intra-cluster coefficient correlation values, small number of clusters, small mean cluster size, high number of covariates, and prevalences close to 0. All methods reported no bias. The Gaussian distribution with both approaches and binomial and Poisson distributions with the GEE approach had satisfactory results in estimating the standard error. Results for type I error and coverage rates were better with the GEE than GLMM approach. We recommend using the Gaussian distribution because of its ease of use (the RD is estimated in one step only). The GEE approach should be preferred and replaced with the GLMM approach in cases of convergence problems.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"9622802241293783"},"PeriodicalIF":1.6,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142584356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-01Epub Date: 2024-10-07DOI: 10.1177/09622802241280782
Seungjae Lee, Ji Hoon Park, Woojoo Lee
The difference in restricted mean survival time has been increasingly used as an alternative measure to the hazard ratio in survival analysis. Although some statistical methods have been developed for estimating the difference in restricted mean survival time adjusted for measured confounders in observational studies, the impact of unmeasured confounding on the estimate has rarely been assessed. We develop a novel sensitivity analysis for the estimate of the difference in restricted mean survival time with respect to unmeasured confounding. After formulating the sensitivity analysis problem as an optimization problem, we explain how to obtain the sensitivity range of the difference in restricted mean survival time efficiently and assess its uncertainty using the percentile bootstrap confidence interval. Analytic results are provided for some important survival settings. Simulation studies show that the proposed methods perform well in various settings. We illustrate the proposed sensitivity analysis method by analyzing data from the German Breast Cancer Study Group study.
{"title":"Sensitivity analysis for unmeasured confounding in estimating the difference in restricted mean survival time.","authors":"Seungjae Lee, Ji Hoon Park, Woojoo Lee","doi":"10.1177/09622802241280782","DOIUrl":"10.1177/09622802241280782","url":null,"abstract":"<p><p>The difference in restricted mean survival time has been increasingly used as an alternative measure to the hazard ratio in survival analysis. Although some statistical methods have been developed for estimating the difference in restricted mean survival time adjusted for measured confounders in observational studies, the impact of unmeasured confounding on the estimate has rarely been assessed. We develop a novel sensitivity analysis for the estimate of the difference in restricted mean survival time with respect to unmeasured confounding. After formulating the sensitivity analysis problem as an optimization problem, we explain how to obtain the sensitivity range of the difference in restricted mean survival time efficiently and assess its uncertainty using the percentile bootstrap confidence interval. Analytic results are provided for some important survival settings. Simulation studies show that the proposed methods perform well in various settings. We illustrate the proposed sensitivity analysis method by analyzing data from the German Breast Cancer Study Group study.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1979-1992"},"PeriodicalIF":1.6,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142381666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-01Epub Date: 2024-10-23DOI: 10.1177/09622802241283882
Yuxi Zhu, Guy Brock, Lang Li
Multistate transition models (MSTMs) are valuable tools depicting disease progression. However, due to the complexity of MSTMs, larger sample size and longer follow-up time in real-world data, the computation of statistical estimation and inference for MSTMs becomes challenging. A bounded Taylor series in Newton-Raphson procedure is proposed which leverages the uniformization technique to derive maximum likelihood estimates and corresponding covariance matrix. The proposed method, namely uniformization Taylor-bounded Newton-Raphson, is validated in three simulation studies, which demonstrate the accuracy in parameter estimation, the efficiency in computation time and robustness in terms of different situations. This method is also illustrated using a large electronic medical record data related to statin-induced side effects and discontinuation.
{"title":"Uniformization and bounded Taylor series in Newton-Raphson method improves computational performance for a multistate transition model estimation and inference.","authors":"Yuxi Zhu, Guy Brock, Lang Li","doi":"10.1177/09622802241283882","DOIUrl":"10.1177/09622802241283882","url":null,"abstract":"<p><p>Multistate transition models (MSTMs) are valuable tools depicting disease progression. However, due to the complexity of MSTMs, larger sample size and longer follow-up time in real-world data, the computation of statistical estimation and inference for MSTMs becomes challenging. A bounded Taylor series in Newton-Raphson procedure is proposed which leverages the uniformization technique to derive maximum likelihood estimates and corresponding covariance matrix. The proposed method, namely uniformization Taylor-bounded Newton-Raphson, is validated in three simulation studies, which demonstrate the accuracy in parameter estimation, the efficiency in computation time and robustness in terms of different situations. This method is also illustrated using a large electronic medical record data related to statin-induced side effects and discontinuation.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1901-1919"},"PeriodicalIF":1.6,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142508326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Joint modeling of longitudinal and survival data is increasingly used in biomedical studies. However, existing joint models are not applicable to model the longitudinal ordinal responses with non-ignorable missing values caused by the occurrence of events in a multi-state process. In this article, we introduce a joint model for longitudinal ordinal measurements and multi-state data. Our proposed joint model consists of two sub-models: a proportional odds sub-model for longitudinal ordinal measurements and a multi-state sub-model with transition-specific proportional hazards for times of transitions between different health states, both linked by shared random effects. The model parameters were estimated employing the maximum likelihood method for a piecewise constant baseline hazard function. The proposed joint model is evaluated in a simulation study and, as an illustration, it is fitted to real data from people with human immunodeficiency virus.
{"title":"Joint modelling of longitudinal ordinal and multi-state data.","authors":"Behnaz Alafchi, Leili Tapak, Hossein Mahjub, Elaheh Talebi Ghane, Ghodratollah Roshanaei","doi":"10.1177/09622802241281013","DOIUrl":"10.1177/09622802241281013","url":null,"abstract":"<p><p>Joint modeling of longitudinal and survival data is increasingly used in biomedical studies. However, existing joint models are not applicable to model the longitudinal ordinal responses with non-ignorable missing values caused by the occurrence of events in a multi-state process. In this article, we introduce a joint model for longitudinal ordinal measurements and multi-state data. Our proposed joint model consists of two sub-models: a proportional odds sub-model for longitudinal ordinal measurements and a multi-state sub-model with transition-specific proportional hazards for times of transitions between different health states, both linked by shared random effects. The model parameters were estimated employing the maximum likelihood method for a piecewise constant baseline hazard function. The proposed joint model is evaluated in a simulation study and, as an illustration, it is fitted to real data from people with human immunodeficiency virus.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1939-1951"},"PeriodicalIF":1.6,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142584357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-01Epub Date: 2024-10-21DOI: 10.1177/09622802241281959
Sida Chen, Danilo Alvares, Christopher Jackson, Tom Marshall, Krish Nirantharakumar, Sylvia Richardson, Catherine L Saunders, Jessica K Barrett
Multistate models provide a useful framework for modelling complex event history data in clinical settings and have recently been extended to the joint modelling framework to appropriately handle endogenous longitudinal covariates, such as repeatedly measured biomarkers, which are informative about health status and disease progression. However, the practical application of such joint models faces considerable computational challenges. Motivated by a longitudinal multimorbidity analysis of large-scale UK health records, we introduce novel Bayesian inference approaches for these models that are capable of handling complex multistate processes and large datasets with straightforward implementation. These approaches decompose the original estimation task into smaller inference blocks, leveraging parallel computing and facilitating flexible model specification and comparison. Using extensive simulation studies, we show that the proposed approaches achieve satisfactory estimation accuracy, with notable gains in computational efficiency compared to the standard Bayesian estimation strategy. We illustrate our approaches by analysing the coevolution of routinely measured systolic blood pressure and the progression of three important chronic conditions, using a large dataset from the Clinical Practice Research Datalink Aurum database. Our analysis reveals distinct and previously lesser-known association structures between systolic blood pressure and different disease transitions.
{"title":"Bayesian blockwise inference for joint models of longitudinal and multistate data with application to longitudinal multimorbidity analysis.","authors":"Sida Chen, Danilo Alvares, Christopher Jackson, Tom Marshall, Krish Nirantharakumar, Sylvia Richardson, Catherine L Saunders, Jessica K Barrett","doi":"10.1177/09622802241281959","DOIUrl":"10.1177/09622802241281959","url":null,"abstract":"<p><p>Multistate models provide a useful framework for modelling complex event history data in clinical settings and have recently been extended to the joint modelling framework to appropriately handle endogenous longitudinal covariates, such as repeatedly measured biomarkers, which are informative about health status and disease progression. However, the practical application of such joint models faces considerable computational challenges. Motivated by a longitudinal multimorbidity analysis of large-scale UK health records, we introduce novel Bayesian inference approaches for these models that are capable of handling complex multistate processes and large datasets with straightforward implementation. These approaches decompose the original estimation task into smaller inference blocks, leveraging parallel computing and facilitating flexible model specification and comparison. Using extensive simulation studies, we show that the proposed approaches achieve satisfactory estimation accuracy, with notable gains in computational efficiency compared to the standard Bayesian estimation strategy. We illustrate our approaches by analysing the coevolution of routinely measured systolic blood pressure and the progression of three important chronic conditions, using a large dataset from the Clinical Practice Research Datalink Aurum database. Our analysis reveals distinct and previously lesser-known association structures between systolic blood pressure and different disease transitions.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"2027-2042"},"PeriodicalIF":1.6,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11577689/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142475113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-01Epub Date: 2024-10-30DOI: 10.1177/09622802241288350
Joydeep Basu, Nicholas Parsons, Tim Friede, Nigel Stallard
Cancellation or delay of non-essential medical interventions, limitation of face-to-face assessments or outpatient attendance due to lockdown restrictions, illness or fear of hospital or healthcare centre visits, and halting of research to allow diversion of healthcare resources to focus on the pandemic led to the interruption of many clinical trials during the severe acute respiratory syndrome-coronavirus-2 (SARS-CoV-2) pandemic. Appropriate analysis approaches are now required for these interrupted trials. In trials with long follow-up and longitudinal outcomes, data may be available on early outcomes for many patients for whom final, primary outcome data were not observed. A natural question is then how these early data can best be used in the trial analysis. Although recommendations are available from regulators, funders, and methodologists, there is a lack of a review of recent work addressing this problem. This article reports a review of recent methods that can be used in the setting of the analysis of interrupted clinical trials with longitudinal outcomes with monotone missingness. A search for methodological papers published during the period 2020-2023 identified 43 relevant publications. We categorised these articles under the four broad themes of missing value imputation, modelling and covariate adjustment, simulation and estimands. Although motivated by the interruption due to SARS-CoV-2 and the resulting disease, the papers reviewed and methods discussed are also relevant to clinical trials interrupted for other reasons, with follow-up discontinued.
{"title":"Statistical methods for clinical trials interrupted by the severe acute respiratory syndrome-coronavirus-2 (SARS-CoV-2) pandemic: A review.","authors":"Joydeep Basu, Nicholas Parsons, Tim Friede, Nigel Stallard","doi":"10.1177/09622802241288350","DOIUrl":"10.1177/09622802241288350","url":null,"abstract":"<p><p>Cancellation or delay of non-essential medical interventions, limitation of face-to-face assessments or outpatient attendance due to lockdown restrictions, illness or fear of hospital or healthcare centre visits, and halting of research to allow diversion of healthcare resources to focus on the pandemic led to the interruption of many clinical trials during the severe acute respiratory syndrome-coronavirus-2 (SARS-CoV-2) pandemic. Appropriate analysis approaches are now required for these interrupted trials. In trials with long follow-up and longitudinal outcomes, data may be available on early outcomes for many patients for whom final, primary outcome data were not observed. A natural question is then how these early data can best be used in the trial analysis. Although recommendations are available from regulators, funders, and methodologists, there is a lack of a review of recent work addressing this problem. This article reports a review of recent methods that can be used in the setting of the analysis of interrupted clinical trials with longitudinal outcomes with monotone missingness. A search for methodological papers published during the period 2020-2023 identified 43 relevant publications. We categorised these articles under the four broad themes of missing value imputation, modelling and covariate adjustment, simulation and estimands. Although motivated by the interruption due to SARS-CoV-2 and the resulting disease, the papers reviewed and methods discussed are also relevant to clinical trials interrupted for other reasons, with follow-up discontinued.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"2131-2143"},"PeriodicalIF":1.6,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11577686/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142547570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-01Epub Date: 2024-10-23DOI: 10.1177/09622802241275361
Yirui Qian, Stephen J Walters, Richard M Jacques, Laura Flight
Patient-reported outcomes (PROs) that aim to measure patients' subjective attitudes towards their health or health-related conditions in various fields have been increasingly used in randomised controlled trials (RCTs). PRO data is likely to be bounded, discrete, and skewed. Although various statistical methods are available for the analysis of PROs in RCT settings, there is no consensus on what statistical methods are the most appropriate for use. This study aims to use simulation methods to compare the performance (in terms of bias, empirical standard error, coverage of the confidence interval, Type I error, and power) of three different statistical methods, multiple linear regression (MLR), Tobit regression (Tobit), and median regression (Median), to estimate a range of predefined treatment effects for a PRO in a two-arm balanced RCT. We assumed there was an underlying latent continuous outcome that the PRO was measuring, but the actual scores observed were equally spaced and discrete. This study found that MLR was associated with little bias of the estimated treatment effect, small standard errors, and appropriate coverage of the confidence interval under most scenarios. Tobit performed worse than MLR for analysing PROs with a small number of levels, but it had better performance when analysing PROs with more discrete values. Median showed extremely large bias and errors, associated with low power and coverage for most scenarios especially when the number of possible discrete values was small. We recommend MLR as a simple and universal statistical method for the analysis of PROs in RCT settings.
患者报告结果(PROs)旨在衡量患者对其健康或各领域健康相关状况的主观态度,越来越多地被用于随机对照试验(RCTs)中。PRO数据可能是有边界的、离散的和倾斜的。虽然有各种统计方法可用于分析 RCT 环境中的 PROs,但对于什么统计方法最适合使用,目前还没有达成共识。本研究旨在使用模拟方法比较三种不同统计方法(多元线性回归 (MLR)、托比特回归 (Tobit) 和中位回归 (Median))的性能(偏差、经验标准误差、置信区间覆盖率、I 类误差和功率),以估计双臂平衡 RCT 中 PRO 的预定义治疗效果范围。我们假定PRO测量的是潜在的连续结果,但观察到的实际分数是等距和离散的。这项研究发现,在大多数情况下,MLR 与估计治疗效果的偏差小、标准误差小以及置信区间的适当覆盖率有关。在分析具有少量水平的 PRO 时,Tobit 的表现不如 MLR,但在分析具有更多离散值的 PRO 时,Tobit 的表现更好。中位数显示出极大的偏差和误差,在大多数情况下与低功率和低覆盖率有关,尤其是当可能的离散值较少时。我们建议将 MLR 作为一种简单、通用的统计方法,用于 RCT 环境中的 PROs 分析。
{"title":"Comparison of statistical methods for the analysis of patient-reported outcomes in randomised controlled trials: A simulation study.","authors":"Yirui Qian, Stephen J Walters, Richard M Jacques, Laura Flight","doi":"10.1177/09622802241275361","DOIUrl":"10.1177/09622802241275361","url":null,"abstract":"<p><p>Patient-reported outcomes (PROs) that aim to measure patients' subjective attitudes towards their health or health-related conditions in various fields have been increasingly used in randomised controlled trials (RCTs). PRO data is likely to be bounded, discrete, and skewed. Although various statistical methods are available for the analysis of PROs in RCT settings, there is no consensus on what statistical methods are the most appropriate for use. This study aims to use simulation methods to compare the performance (in terms of bias, empirical standard error, coverage of the confidence interval, Type I error, and power) of three different statistical methods, multiple linear regression (MLR), Tobit regression (Tobit), and median regression (Median), to estimate a range of predefined treatment effects for a PRO in a two-arm balanced RCT. We assumed there was an underlying latent continuous outcome that the PRO was measuring, but the actual scores observed were equally spaced and discrete. This study found that MLR was associated with little bias of the estimated treatment effect, small standard errors, and appropriate coverage of the confidence interval under most scenarios. Tobit performed worse than MLR for analysing PROs with a small number of levels, but it had better performance when analysing PROs with more discrete values. Median showed extremely large bias and errors, associated with low power and coverage for most scenarios especially when the number of possible discrete values was small. We recommend MLR as a simple and universal statistical method for the analysis of PROs in RCT settings.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1920-1938"},"PeriodicalIF":1.6,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11577693/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142508324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-01Epub Date: 2024-10-30DOI: 10.1177/09622802241281960
Amir Aamodt Kazemi, Inge Christoffer Olsen
Current instrumental variable methodology focuses mainly on estimating causal effects for a dichotomous or an ordinal treatment variable. Situations with more than two unordered treatments are less explored. The challenge is that assumptions needed to derive point-estimators become increasingly stronger with the number of relevant treatment alternatives. In this article, we aim at deriving causal point-estimators for head-to-head comparisons of the effect of multiple relevant treatments or interventions. We will achieve this with a set of plausible and well-defined rationality assumptions while only considering ordinal instruments. We demonstrate that our methodology provides asymptotically unbiased estimators in the presence of unobserved confounding effects in a simulation study. We then apply the method to compare the effectiveness of five anti-inflammatory drugs in the treatment of rheumatoid arthritis. For this, we use a clinical data set from an observational study in Norway, where price is the primary determinant of the preferred drug and can therefore be considered as an instrument. The developed methodology provides an important addition to the toolbox for causal inference when comparing more than two interventions influenced by an instrumental variable.
{"title":"Instrumental variable analysis with categorical treatment.","authors":"Amir Aamodt Kazemi, Inge Christoffer Olsen","doi":"10.1177/09622802241281960","DOIUrl":"10.1177/09622802241281960","url":null,"abstract":"<p><p>Current instrumental variable methodology focuses mainly on estimating causal effects for a dichotomous or an ordinal treatment variable. Situations with more than two unordered treatments are less explored. The challenge is that assumptions needed to derive point-estimators become increasingly stronger with the number of relevant treatment alternatives. In this article, we aim at deriving causal point-estimators for head-to-head comparisons of the effect of multiple relevant treatments or interventions. We will achieve this with a set of plausible and well-defined rationality assumptions while only considering ordinal instruments. We demonstrate that our methodology provides asymptotically unbiased estimators in the presence of unobserved confounding effects in a simulation study. We then apply the method to compare the effectiveness of five anti-inflammatory drugs in the treatment of rheumatoid arthritis. For this, we use a clinical data set from an observational study in Norway, where price is the primary determinant of the preferred drug and can therefore be considered as an instrument. The developed methodology provides an important addition to the toolbox for causal inference when comparing more than two interventions influenced by an instrumental variable.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"2043-2061"},"PeriodicalIF":1.6,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11577691/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142547569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-01Epub Date: 2024-10-07DOI: 10.1177/09622802241281027
Jin Jin, Liuquan Sun, Huang-Tz Ou, Pei-Fang Su
Recurrent event data, which represent the occurrence of repeated incidences, are common in observational studies. Furthermore, collecting possible spatial correlations in health and environmental data is likely to provide more information for risk prediction. This article proposes a comprehensive proportional intensity model considering spatial random effects for recurrent event data using a Bayesian approach. The spatial information for areal data (where the spatial location is known up to a geographic unit such as a county) and georeferenced data (where the location is exactly observed) is examined. A traditional constant baseline intensity function, as well as a flexible piecewise constant baseline intensity function, are both under consideration. To estimate the parameters, a Markov chain Monte Carlo method with the Metropolis-Hastings algorithm and the adaptive Metropolis algorithm are applied. To assess the performance of model fitting, the deviance information criterion and log pseudo marginal likelihood are proposed. Overall, simulation studies demonstrate that the proposed model is significantly better than models that do not consider spatial effects if spatial correlations exist. Finally, our approach is implemented using a dataset related to the recurrence of cardiovascular diseases, which incorporates spatial information.
{"title":"Analysis of recurrent event data with spatial random effects using a Bayesian approach.","authors":"Jin Jin, Liuquan Sun, Huang-Tz Ou, Pei-Fang Su","doi":"10.1177/09622802241281027","DOIUrl":"10.1177/09622802241281027","url":null,"abstract":"<p><p>Recurrent event data, which represent the occurrence of repeated incidences, are common in observational studies. Furthermore, collecting possible spatial correlations in health and environmental data is likely to provide more information for risk prediction. This article proposes a comprehensive proportional intensity model considering spatial random effects for recurrent event data using a Bayesian approach. The spatial information for areal data (where the spatial location is known up to a geographic unit such as a county) and georeferenced data (where the location is exactly observed) is examined. A traditional constant baseline intensity function, as well as a flexible piecewise constant baseline intensity function, are both under consideration. To estimate the parameters, a Markov chain Monte Carlo method with the Metropolis-Hastings algorithm and the adaptive Metropolis algorithm are applied. To assess the performance of model fitting, the deviance information criterion and log pseudo marginal likelihood are proposed. Overall, simulation studies demonstrate that the proposed model is significantly better than models that do not consider spatial effects if spatial correlations exist. Finally, our approach is implemented using a dataset related to the recurrence of cardiovascular diseases, which incorporates spatial information.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1993-2006"},"PeriodicalIF":1.6,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142381664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}