Statistical Methods in Medical Research最新文献_第5页

Estimating individualized treatment rules by optimizing the adjusted probability of a longer survival. 通过优化调整后的较长存活概率，估算个体化治疗规则。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-09-01 Epub Date: 2024-07-25 DOI: 10.1177/09622802241262525

Qijia He, Shixiao Zhang, Michael L LeBlanc, Ying-Qi Zhao

Individualized treatment rules inform tailored treatment decisions based on the patient's information, where the goal is to optimize clinical benefit for the population. When the clinical outcome of interest is survival time, most of current approaches typically aim to maximize the expected time of survival. We propose a new criterion for constructing Individualized treatment rules that optimize the clinical benefit with survival outcomes, termed as the adjusted probability of a longer survival. This objective captures the likelihood of living longer with being on treatment, compared to the alternative, which provides an alternative and often straightforward interpretation to communicate with clinicians and patients. We view it as an alternative to the survival analysis standard of the hazard ratio and the increasingly used restricted mean survival time. We develop a new method to construct the optimal Individualized treatment rule by maximizing a nonparametric estimator of the adjusted probability of a longer survival for a decision rule. Simulation studies demonstrate the reliability of the proposed method across a range of different scenarios. We further perform data analysis using data collected from a randomized Phase III clinical trial (SWOG S0819).

个体化治疗规则根据患者的信息提供量身定制的治疗决策，其目标是优化人群的临床获益。当关注的临床结果是存活时间时，目前的大多数方法通常以最大化预期存活时间为目标。我们提出了一种新的标准，用于构建个体化治疗规则，优化临床获益与生存结果，即调整后的延长生存概率。这一目标捕捉了与其他方法相比，接受治疗后存活时间更长的可能性，为临床医生和患者提供了另一种直截了当的解释。我们将其视为危险比这一生存分析标准和使用日益广泛的受限平均生存时间的替代方案。我们开发了一种新方法，通过最大化决策规则的调整后较长生存期概率的非参数估计来构建最佳个体化治疗规则。模拟研究证明了所提方法在各种不同情况下的可靠性。我们还利用从随机 III 期临床试验（SWOG S0819）中收集的数据进行了进一步的数据分析。

{"title":"Estimating individualized treatment rules by optimizing the adjusted probability of a longer survival.","authors":"Qijia He, Shixiao Zhang, Michael L LeBlanc, Ying-Qi Zhao","doi":"10.1177/09622802241262525","DOIUrl":"10.1177/09622802241262525","url":null,"abstract":"Individualized treatment rules inform tailored treatment decisions based on the patient's information, where the goal is to optimize clinical benefit for the population. When the clinical outcome of interest is survival time, most of current approaches typically aim to maximize the expected time of survival. We propose a new criterion for constructing Individualized treatment rules that optimize the clinical benefit with survival outcomes, termed as the adjusted probability of a longer survival. This objective captures the likelihood of living longer with being on treatment, compared to the alternative, which provides an alternative and often straightforward interpretation to communicate with clinicians and patients. We view it as an alternative to the survival analysis standard of the hazard ratio and the increasingly used restricted mean survival time. We develop a new method to construct the optimal Individualized treatment rule by maximizing a nonparametric estimator of the adjusted probability of a longer survival for a decision rule. Simulation studies demonstrate the reliability of the proposed method across a range of different scenarios. We further perform data analysis using data collected from a randomized Phase III clinical trial (SWOG S0819).","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1517-1530"},"PeriodicalIF":1.6,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141760990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A useful parametric specification to model epidemiological data: Revival of the Richards' curve. 流行病学数据建模的有用参数规范：理查兹曲线的复兴

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-08-01 DOI: 10.1177/09622802241262522

Marco Mingione, Pierfrancesco Alaimo Di Loro, Antonello Maruotti

A useful parametric specification for the expected value of an epidemiological process is revived, and its statistical and empirical efficacy are explored. The Richards' curve is flexible enough to adapt to several growth phenomena, including recent epidemics and outbreaks. Here, two different estimation methods are described. The first, based on likelihood maximisation, is particularly useful when the outbreak is still ongoing and the main goal is to obtain sufficiently accurate estimates in negligible computational run-time. The second is fully Bayesian and allows for more ambitious modelling attempts such as the inclusion of spatial and temporal dependence, but it requires more data and computational resources. Regardless of the estimation approach, the Richards' specification properly characterises the main features of any growth process (e.g. growth rate, peak phase etc.), leading to a reasonable fit and providing good short- to medium-term predictions. To demonstrate such flexibility, we show different applications using publicly available data on recent epidemics where the data collection processes and transmission patterns are extremely heterogeneous, as well as benchmark datasets widely used in the literature as illustrative.

我们重新提出了流行病学过程预期值的一个有用参数规范，并探讨了其统计和实证功效。理查兹曲线非常灵活，足以适应多种增长现象，包括近期的流行病和疫情爆发。这里介绍两种不同的估算方法。第一种方法基于似然最大化，在疫情仍在持续的情况下特别有用，其主要目标是在可忽略的计算运行时间内获得足够准确的估计值。第二种方法是完全贝叶斯法，允许进行更大胆的建模尝试，如纳入空间和时间依赖性，但需要更多的数据和计算资源。无论采用哪种估计方法，理查兹规范都能正确描述任何增长过程的主要特征（如增长率、峰值阶段等），从而得出合理的拟合结果，并提供良好的中短期预测。为了展示这种灵活性，我们利用公开的近期流行病数据（数据收集过程和传播模式极其不一致）以及文献中广泛使用的基准数据集展示了不同的应用，以资说明。

{"title":"A useful parametric specification to model epidemiological data: Revival of the Richards' curve.","authors":"Marco Mingione, Pierfrancesco Alaimo Di Loro, Antonello Maruotti","doi":"10.1177/09622802241262522","DOIUrl":"https://doi.org/10.1177/09622802241262522","url":null,"abstract":"A useful parametric specification for the expected value of an epidemiological process is revived, and its statistical and empirical efficacy are explored. The Richards' curve is flexible enough to adapt to several growth phenomena, including recent epidemics and outbreaks. Here, two different estimation methods are described. The first, based on likelihood maximisation, is particularly useful when the outbreak is still ongoing and the main goal is to obtain sufficiently accurate estimates in negligible computational run-time. The second is fully Bayesian and allows for more ambitious modelling attempts such as the inclusion of spatial and temporal dependence, but it requires more data and computational resources. Regardless of the estimation approach, the Richards' specification properly characterises the main features of any growth process (e.g. growth rate, peak phase etc.), leading to a reasonable fit and providing good short- to medium-term predictions. To demonstrate such flexibility, we show different applications using publicly available data on recent epidemics where the data collection processes and transmission patterns are extremely heterogeneous, as well as benchmark datasets widely used in the literature as illustrative.","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":"33 8","pages":"1473-1494"},"PeriodicalIF":1.6,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142366594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Measuring the individualization potential of treatment individualization rules: Application to rules built with a new parametric interaction model for parallel-group clinical trials. 测量治疗个体化规则的个体化潜力：应用平行组临床试验的新参数交互模型建立的规则。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-08-01 Epub Date: 2024-08-06 DOI: 10.1177/09622802241259172

Francisco J Diaz

For personalized medicine, we propose a general method of evaluating the potential performance of an individualized treatment rule in future clinical applications with new patients. We focus on rules that choose the most beneficial treatment for the patient out of two active (nonplacebo) treatments, which the clinician will prescribe regularly to the patient after the decision. We develop a measure of the individualization potential (IP) of a rule. The IP compares the expected effectiveness of the rule in a future clinical individualization setting versus the effectiveness of not trying individualization. We illustrate our evaluation method by explaining how to measure the IP of a useful type of individualized rules calculated through a new parametric interaction model of data from parallel-group clinical trials with continuous responses. Our interaction model implies a structural equation model we use to estimate the rule and its IP. We examine the IP both theoretically and with simulations when the estimated individualized rule is put into practice in new patients. Our individualization approach was superior to outcome-weighted machine learning according to simulations. We also show connections with crossover and N-of-1 trials. As a real data application, we estimate a rule for the individualization of treatments for diabetic macular edema and evaluate its IP.

对于个性化医疗，我们提出了一种通用方法，用于评估个性化治疗规则在未来临床应用中对新患者的潜在表现。我们将重点放在从两种有效（非安慰剂）治疗方法中为患者选择最有益治疗方法的规则上，临床医生在做出决定后将定期为患者开具处方。我们开发了一种衡量规则个体化潜力（IP）的方法。IP 将该规则在未来临床个体化设置中的预期效果与不尝试个体化的效果进行比较。我们通过解释如何衡量一种有用的个体化规则的 IP 值来说明我们的评估方法，这种 IP 值是通过一种新的参数交互模型计算出来的，该模型的数据来自具有连续反应的平行组临床试验。我们的交互模型意味着一个结构方程模型，我们用它来估算规则及其 IP。当估计出的个体化规则在新患者身上付诸实践时，我们从理论和模拟两方面对 IP 进行了检验。根据模拟结果，我们的个性化方法优于结果加权机器学习。我们还展示了与交叉试验和 N-of-1 试验之间的联系。在实际数据应用中，我们估算了糖尿病黄斑水肿的个体化治疗规则，并对其IP进行了评估。

{"title":"Measuring the individualization potential of treatment individualization rules: Application to rules built with a new parametric interaction model for parallel-group clinical trials.","authors":"Francisco J Diaz","doi":"10.1177/09622802241259172","DOIUrl":"10.1177/09622802241259172","url":null,"abstract":"For personalized medicine, we propose a general method of evaluating the potential performance of an individualized treatment rule in future clinical applications with new patients. We focus on rules that choose the most beneficial treatment for the patient out of two active (nonplacebo) treatments, which the clinician will prescribe regularly to the patient after the decision. We develop a measure of the individualization potential (IP) of a rule. The IP compares the expected effectiveness of the rule in a future clinical individualization setting versus the effectiveness of not trying individualization. We illustrate our evaluation method by explaining how to measure the IP of a useful type of individualized rules calculated through a new parametric interaction model of data from parallel-group clinical trials with continuous responses. Our interaction model implies a structural equation model we use to estimate the rule and its IP. We examine the IP both theoretically and with simulations when the estimated individualized rule is put into practice in new patients. Our individualization approach was superior to outcome-weighted machine learning according to simulations. We also show connections with crossover and N-of-1 trials. As a real data application, we estimate a rule for the individualization of treatments for diabetic macular edema and evaluate its IP.","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1355-1375"},"PeriodicalIF":1.6,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141894391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Optimal designs using generalized estimating equations in cluster randomized crossover and stepped wedge trials. 在分组随机交叉试验和阶梯楔形试验中使用广义估计方程进行优化设计。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-08-01 Epub Date: 2024-05-30 DOI: 10.1177/09622802241247717

Jingxia Liu, Fan Li

Cluster randomized crossover and stepped wedge cluster randomized trials are two types of longitudinal cluster randomized trials that leverage both the within- and between-cluster comparisons to estimate the treatment effect and are increasingly used in healthcare delivery and implementation science research. While the variance expressions of estimated treatment effect have been previously developed from the method of generalized estimating equations for analyzing cluster randomized crossover trials and stepped wedge cluster randomized trials, little guidance has been provided for optimal designs to ensure maximum efficiency. Here, an optimal design refers to the combination of optimal cluster-period size and optimal number of clusters that provide the smallest variance of the treatment effect estimator or maximum efficiency under a fixed total budget. In this work, we develop optimal designs for multiple-period cluster randomized crossover trials and stepped wedge cluster randomized trials with continuous outcomes, including both closed-cohort and repeated cross-sectional sampling schemes. Local optimal design algorithms are proposed when the correlation parameters in the working correlation structure are known. MaxiMin optimal design algorithms are proposed when the exact values are unavailable, but investigators may specify a range of correlation values. The closed-form formulae of local optimal design and MaxiMin optimal design are derived for multiple-period cluster randomized crossover trials, where the cluster-period size and number of clusters are decimal. The decimal estimates from closed-form formulae can then be used to investigate the performances of integer estimates from local optimal design and MaxiMin optimal design algorithms. One unique contribution from this work, compared to the previous optimal design research, is that we adopt constrained optimization techniques to obtain integer estimates under the MaxiMin optimal design. To assist practical implementation, we also develop four SAS macros to find local optimal designs and MaxiMin optimal designs.

聚类随机交叉试验和阶梯楔形聚类随机试验是纵向聚类随机试验的两种类型，它们利用聚类内和聚类间的比较来估计治疗效果，在医疗保健服务和实施科学研究中的应用越来越广泛。虽然以前已经从广义估计方程的方法中开发出了估计治疗效果的方差表达式，用于分析分组随机交叉试验和阶梯楔形分组随机试验，但很少有人指导如何进行优化设计以确保最大效率。在这里，最优设计是指在固定总预算下，能提供最小治疗效果估计方差或最高效率的最优分组期规模和最优分组数的组合。在这项工作中，我们开发了多期分组随机交叉试验和连续结果的阶梯楔形分组随机试验的最优设计，包括封闭队列和重复横截面抽样方案。在已知工作相关结构中的相关参数时，提出了局部优化设计算法。MaxiMin 优化设计算法是在无法获得精确值，但研究者可以指定相关值范围的情况下提出的。对于多期群组随机交叉试验，群组周期大小和群组数量均为十进制，推导出局部优化设计和 MaxiMin 优化设计的闭式公式。根据闭合形式公式得出的十进制估计值可用于研究局部优化设计和 MaxiMin 优化设计算法的整数估计值的性能。与以往的优化设计研究相比，这项工作的一个独特贡献是，我们采用了约束优化技术来获取 MaxiMin 优化设计下的整数估计值。为了帮助实际应用，我们还开发了四个 SAS 宏，用于查找局部最优设计和 MaxiMin 最佳设计。

{"title":"Optimal designs using generalized estimating equations in cluster randomized crossover and stepped wedge trials.","authors":"Jingxia Liu, Fan Li","doi":"10.1177/09622802241247717","DOIUrl":"10.1177/09622802241247717","url":null,"abstract":"Cluster randomized crossover and stepped wedge cluster randomized trials are two types of longitudinal cluster randomized trials that leverage both the within- and between-cluster comparisons to estimate the treatment effect and are increasingly used in healthcare delivery and implementation science research. While the variance expressions of estimated treatment effect have been previously developed from the method of generalized estimating equations for analyzing cluster randomized crossover trials and stepped wedge cluster randomized trials, little guidance has been provided for optimal designs to ensure maximum efficiency. Here, an optimal design refers to the combination of optimal cluster-period size and optimal number of clusters that provide the smallest variance of the treatment effect estimator or maximum efficiency under a fixed total budget. In this work, we develop optimal designs for multiple-period cluster randomized crossover trials and stepped wedge cluster randomized trials with continuous outcomes, including both closed-cohort and repeated cross-sectional sampling schemes. Local optimal design algorithms are proposed when the correlation parameters in the working correlation structure are known. MaxiMin optimal design algorithms are proposed when the exact values are unavailable, but investigators may specify a range of correlation values. The closed-form formulae of local optimal design and MaxiMin optimal design are derived for multiple-period cluster randomized crossover trials, where the cluster-period size and number of clusters are decimal. The decimal estimates from closed-form formulae can then be used to investigate the performances of integer estimates from local optimal design and MaxiMin optimal design algorithms. One unique contribution from this work, compared to the previous optimal design research, is that we adopt constrained optimization techniques to obtain integer estimates under the MaxiMin optimal design. To assist practical implementation, we also develop four SAS macros to find local optimal designs and MaxiMin optimal designs.","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1299-1330"},"PeriodicalIF":1.6,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141176266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Erratum to "A dose-effect network meta-analysis model with application in antidepressants using restricted cubic splines". 剂量效应网络荟萃分析模型在抗抑郁药物中的应用--使用限制性三次样条曲线》的勘误。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-08-01 Epub Date: 2024-07-23 DOI: 10.1177/09622802241254569

引用次数: 0

Evaluating prognostic biomarkers for survival outcomes subject to informative censoring. 评估受信息普查影响的生存结果的预后生物标志物。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-08-01 Epub Date: 2024-06-06 DOI: 10.1177/09622802241259170

Wei Liu, Danping Liu, Zhiwei Zhang

Prognostic biomarkers for survival outcomes are widely used in clinical research and practice. Such biomarkers are often evaluated using a C-index as well as quantities based on time-dependent receiver operating characteristic curves. Existing methods for their evaluation generally assume that censoring is uninformative in the sense that the censoring time is independent of the failure time with or without conditioning on the biomarker under evaluation. With focus on the C-index and the area under a particular receiver operating characteristic curve, we describe and compare three estimation methods that account for informative censoring based on observed baseline covariates. Two of them are straightforward extensions of existing plug-in and inverse probability weighting methods for uninformative censoring. By appealing to semiparametric theory, we also develop a doubly robust, locally efficient method that is more robust than the plug-in and inverse probability weighting methods and typically more efficient than the inverse probability weighting method. The methods are evaluated and compared in a simulation study, and applied to real data from studies of breast cancer and heart failure.

预示生存结果的生物标志物被广泛应用于临床研究和实践中。此类生物标志物通常使用 C 指数以及基于时间依赖性接收者工作特征曲线的数量进行评估。现有的评估方法通常假定普查是无信息的，即普查时间与评估生物标志物的失败时间无关。以 C 指数和特定接收者工作特征曲线下的面积为重点，我们描述并比较了三种基于观测到的基线协变量考虑信息性剔除的估算方法。其中两种方法是对现有插件和反概率加权方法的直接扩展，用于非信息性删减。通过利用半参数理论，我们还开发了一种双重稳健、局部有效的方法，它比插入式和反概率加权法更稳健，通常比反概率加权法更有效。我们在模拟研究中对这些方法进行了评估和比较，并将其应用于乳腺癌和心力衰竭研究的真实数据中。

{"title":"Evaluating prognostic biomarkers for survival outcomes subject to informative censoring.","authors":"Wei Liu, Danping Liu, Zhiwei Zhang","doi":"10.1177/09622802241259170","DOIUrl":"10.1177/09622802241259170","url":null,"abstract":"Prognostic biomarkers for survival outcomes are widely used in clinical research and practice. Such biomarkers are often evaluated using a C-index as well as quantities based on time-dependent receiver operating characteristic curves. Existing methods for their evaluation generally assume that censoring is uninformative in the sense that the censoring time is independent of the failure time with or without conditioning on the biomarker under evaluation. With focus on the C-index and the area under a particular receiver operating characteristic curve, we describe and compare three estimation methods that account for informative censoring based on observed baseline covariates. Two of them are straightforward extensions of existing plug-in and inverse probability weighting methods for uninformative censoring. By appealing to semiparametric theory, we also develop a doubly robust, locally efficient method that is more robust than the plug-in and inverse probability weighting methods and typically more efficient than the inverse probability weighting method. The methods are evaluated and compared in a simulation study, and applied to real data from studies of breast cancer and heart failure.","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1342-1354"},"PeriodicalIF":1.6,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141262800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A robust regression model for bounded count health data. 有界计数健康数据的稳健回归模型。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-08-01 Epub Date: 2024-06-07 DOI: 10.1177/09622802241259178

Cristian L Bayes, Jorge Luis Bazán, Luis Valdivieso

Bounded count response data arise naturally in health applications. In general, the well-known beta-binomial regression model form the basis for analyzing this data, specially when we have overdispersed data. Little attention, however, has been given to the literature on the possibility of having extreme observations and overdispersed data. We propose in this work an extension of the beta-binomial regression model, named the beta-2-binomial regression model, which provides a rather flexible approach for fitting a regression model with a wide spectrum of bounded count response data sets under the presence of overdispersion, outliers, or excess of extreme observations. This distribution possesses more skewness and kurtosis than the beta-binomial model but preserves the same mean and variance form of the beta-binomial model. Additional properties of the beta-2-binomial distribution are derived including its behavior on the limits of its parametric space. A penalized maximum likelihood approach is considered to estimate parameters of this model and a residual analysis is included to assess departures from model assumptions as well as to detect outlier observations. Simulation studies, considering the robustness to outliers, are presented confirming that the beta-2-binomial regression model is a better robust alternative, in comparison with the binomial and beta-binomial regression models. We also found that the beta-2-binomial regression model outperformed the binomial and beta-binomial regression models in our applications of predicting liver cancer development in mice and the number of inappropriate days a patient spent in a hospital.

有界计数响应数据自然出现在医疗应用中。一般来说，众所周知的贝塔-二叉回归模型是分析这些数据的基础，特别是当我们有过度分散的数据时。然而，关于极端观测数据和过度分散数据的可能性的文献很少受到关注。在本研究中，我们提出了一种贝塔-二叉回归模型的扩展模型，即贝塔-2-二叉回归模型，该模型提供了一种相当灵活的方法，可在存在过度分散、异常值或过多极端观测值的情况下，拟合各种有界计数响应数据集的回归模型。这种分布比 beta-二叉模型具有更大的偏度和峰度，但保持了与 beta-二叉模型相同的均值和方差形式。还推导出了 beta-2 二叉分布的其他特性，包括其在参数空间极限上的行为。考虑采用惩罚性最大似然法估算该模型的参数，并进行残差分析，以评估偏离模型假设的情况以及检测离群观测值。模拟研究考虑了对异常值的稳健性，证实与二项回归模型和贝塔-二项回归模型相比，贝塔-2-二项回归模型是稳健性更好的替代模型。我们还发现，在预测小鼠肝癌发展和病人住院不当天数的应用中，β-2-二叉回归模型的表现优于二叉和β-二叉回归模型。

{"title":"A robust regression model for bounded count health data.","authors":"Cristian L Bayes, Jorge Luis Bazán, Luis Valdivieso","doi":"10.1177/09622802241259178","DOIUrl":"10.1177/09622802241259178","url":null,"abstract":"Bounded count response data arise naturally in health applications. In general, the well-known beta-binomial regression model form the basis for analyzing this data, specially when we have overdispersed data. Little attention, however, has been given to the literature on the possibility of having extreme observations and overdispersed data. We propose in this work an extension of the beta-binomial regression model, named the beta-2-binomial regression model, which provides a rather flexible approach for fitting a regression model with a wide spectrum of bounded count response data sets under the presence of overdispersion, outliers, or excess of extreme observations. This distribution possesses more skewness and kurtosis than the beta-binomial model but preserves the same mean and variance form of the beta-binomial model. Additional properties of the beta-2-binomial distribution are derived including its behavior on the limits of its parametric space. A penalized maximum likelihood approach is considered to estimate parameters of this model and a residual analysis is included to assess departures from model assumptions as well as to detect outlier observations. Simulation studies, considering the robustness to outliers, are presented confirming that the beta-2-binomial regression model is a better robust alternative, in comparison with the binomial and beta-binomial regression models. We also found that the beta-2-binomial regression model outperformed the binomial and beta-binomial regression models in our applications of predicting liver cancer development in mice and the number of inappropriate days a patient spent in a hospital.","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1392-1411"},"PeriodicalIF":1.6,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141284833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Unsupervised Liu-type shrinkage estimators for mixture of regression models. 回归模型混合物的无监督刘型收缩估计器。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-08-01 Epub Date: 2024-08-28 DOI: 10.1177/09622802241259175

Elsayed Ghanem, Armin Hatefi, Hamid Usefi

The mixture of probabilistic regression models is one of the most common techniques to incorporate the information of covariates into learning of the population heterogeneity. Despite its flexibility, unreliable estimates can occur due to multicollinearity among covariates. In this paper, we develop Liu-type shrinkage methods through an unsupervised learning approach to estimate the model coefficients in the presence of multicollinearity. We evaluate the performance of our proposed methods via classification and stochastic versions of the expectation-maximization algorithm. We show using numerical simulations that the proposed methods outperform their Ridge and maximum likelihood counterparts. Finally, we apply our methods to analyze the bone mineral data of women aged 50 and older.

概率混合回归模型是将协变量信息纳入群体异质性学习的最常用技术之一。尽管它具有灵活性，但由于协变量之间的多重共线性，可能会出现不可靠的估计。在本文中，我们通过无监督学习方法开发了刘式收缩方法，以估计存在多重共线性的模型系数。我们通过分类和随机版本的期望最大化算法来评估所提出方法的性能。通过数值模拟，我们发现所提出的方法优于 Ridge 和最大似然法。最后，我们将我们的方法应用于分析 50 岁及以上女性的骨矿物质数据。

引用次数: 0

Point estimation of the 100p percent lethal dose using a novel penalised likelihood approach. 使用新型惩罚似然法对 100p 百分致死剂量进行点估算。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-08-01 Epub Date: 2024-06-12 DOI: 10.1177/09622802241259174

Yilei Ma, Youpeng Su, Peng Wang, Ping Yin

Estimation of the 100p percent lethal dose ( ${LD}_{100 p}$ ) is of great interest to pharmacologists for assessing the toxicity of certain compounds. However, most existing literature focuses on the interval estimation of ${LD}_{100 p}$ and little attention has been paid to its point estimation. Currently, the most commonly used method for estimating the ${LD}_{100 p}$ is the maximum likelihood estimator (MLE), which can be represented as a ratio estimator, with the denominator being the slope estimated from the logistic regression model. However, the MLE can be seriously biased when the sample size is small, a common nature in such studies, or when the dose-response curve is relatively flat (i.e. the slope approaches zero). In this study, we address these issues by developing a novel penalised maximum likelihood estimator (PMLE) that can prevent the denominator of the ratio from being close to zero. Similar to the MLE, the PMLE is computationally simple and thus can be conveniently used in practice. Moreover, with a suitable penalty parameter, we show that the PMLE can (a) reduce the bias to the second order with respect to the sample size and (b) avoid extreme estimates. Through simulation studies and real data applications, we show that the PMLE generally outperforms the existing methods in terms of bias and root mean square error.

药理学家在评估某些化合物的毒性时，对 100p 百分致死剂量（LD100p）的估算非常感兴趣。然而，现有文献大多侧重于 LD100p 的区间估算，很少关注其点估算。目前，最常用的 LD100p 估计方法是最大似然估计法（MLE），它可以表示为一个比率估计法，分母是逻辑回归模型估计的斜率。然而，当样本量较小时（这在此类研究中很常见），或当剂量-反应曲线相对平坦（即斜率接近零）时，最大似然估计值可能会出现严重偏差。在本研究中，我们通过开发一种新颖的惩罚最大似然估计器（PMLE）来解决这些问题，该估计器可以防止比值的分母接近零。与 MLE 相似，PMLE 计算简单，因此可以方便地用于实践。此外，通过适当的惩罚参数，我们证明 PMLE 可以：（a）将偏差降低到与样本大小相关的二阶；（b）避免极端估计值。通过模拟研究和实际数据应用，我们表明 PMLE 在偏差和均方根误差方面普遍优于现有方法。

{"title":"Point estimation of the 100p percent lethal dose using a novel penalised likelihood approach.","authors":"Yilei Ma, Youpeng Su, Peng Wang, Ping Yin","doi":"10.1177/09622802241259174","DOIUrl":"10.1177/09622802241259174","url":null,"abstract":"Estimation of the 100p percent lethal dose (<math><msub><mtext>LD</mtext><mrow><mn>100</mn><mi>p</mi></mrow></msub></math>) is of great interest to pharmacologists for assessing the toxicity of certain compounds. However, most existing literature focuses on the interval estimation of <math><msub><mtext>LD</mtext><mrow><mn>100</mn><mi>p</mi></mrow></msub></math> and little attention has been paid to its point estimation. Currently, the most commonly used method for estimating the <math><msub><mtext>LD</mtext><mrow><mn>100</mn><mi>p</mi></mrow></msub></math> is the maximum likelihood estimator (MLE), which can be represented as a ratio estimator, with the denominator being the slope estimated from the logistic regression model. However, the MLE can be seriously biased when the sample size is small, a common nature in such studies, or when the dose-response curve is relatively flat (i.e. the slope approaches zero). In this study, we address these issues by developing a novel penalised maximum likelihood estimator (PMLE) that can prevent the denominator of the ratio from being close to zero. Similar to the MLE, the PMLE is computationally simple and thus can be conveniently used in practice. Moreover, with a suitable penalty parameter, we show that the PMLE can (a) reduce the bias to the second order with respect to the sample size and (b) avoid extreme estimates. Through simulation studies and real data applications, we show that the PMLE generally outperforms the existing methods in terms of bias and root mean square error.","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1331-1341"},"PeriodicalIF":1.6,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141306885","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Analyzing heterogeneity in biomarker discriminative performance through partial time-dependent receiver operating characteristic curve modeling. 通过部分时间依赖性接收器工作特征曲线建模分析生物标记物鉴别性能的异质性。

IF 1.6 3区医学 Q3 HEALTH CARE SCIENCES & SERVICES

Statistical Methods in Medical Research

Pub Date : 2024-08-01 Epub Date: 2024-07-25 DOI: 10.1177/09622802241262521

Xinyang Jiang, Wen Li, Kang Wang, Ruosha Li, Jing Ning

This study investigates the heterogeneity of a biomarker's discriminative performance for predicting subsequent time-to-event outcomes across different patient subgroups. While the area under the curve (AUC) for the time-dependent receiver operating characteristic curve is commonly used to assess biomarker performance, the partial time-dependent AUC (PAUC) provides insights that are often more pertinent for population screening and diagnostic testing. To achieve this objective, we propose a regression model tailored for PAUC and develop two distinct estimation procedures for discrete and continuous covariates, employing a pseudo-partial likelihood method. Simulation studies are conducted to assess the performance of these procedures across various scenarios. We apply our model and inference procedure to the Alzheimer's Disease Neuroimaging Initiative data set to evaluate potential heterogeneities in the discriminative performance of biomarkers for early Alzheimer's disease diagnosis based on patients' characteristics.

本研究调查了生物标记物在不同患者亚群中预测后续时间到事件结果的鉴别性能的异质性。虽然随时间变化的接收者操作特征曲线的曲线下面积（AUC）通常用于评估生物标记物的性能，但部分随时间变化的AUC（PAUC）提供的见解往往与人群筛查和诊断检测更相关。为实现这一目标，我们提出了一个为 PAUC 量身定制的回归模型，并采用伪偏似方法为离散和连续协变量开发了两种不同的估计程序。我们进行了模拟研究，以评估这些程序在各种情况下的性能。我们将模型和推理过程应用于阿尔茨海默病神经影像倡议数据集，以评估基于患者特征的早期阿尔茨海默病诊断生物标记物的鉴别性能的潜在异质性。

{"title":"Analyzing heterogeneity in biomarker discriminative performance through partial time-dependent receiver operating characteristic curve modeling.","authors":"Xinyang Jiang, Wen Li, Kang Wang, Ruosha Li, Jing Ning","doi":"10.1177/09622802241262521","DOIUrl":"10.1177/09622802241262521","url":null,"abstract":"This study investigates the heterogeneity of a biomarker's discriminative performance for predicting subsequent time-to-event outcomes across different patient subgroups. While the area under the curve (AUC) for the time-dependent receiver operating characteristic curve is commonly used to assess biomarker performance, the partial time-dependent AUC (PAUC) provides insights that are often more pertinent for population screening and diagnostic testing. To achieve this objective, we propose a regression model tailored for PAUC and develop two distinct estimation procedures for discrete and continuous covariates, employing a pseudo-partial likelihood method. Simulation studies are conducted to assess the performance of these procedures across various scenarios. We apply our model and inference procedure to the Alzheimer's Disease Neuroimaging Initiative data set to evaluate potential heterogeneities in the discriminative performance of biomarkers for early Alzheimer's disease diagnosis based on patients' characteristics.","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1424-1436"},"PeriodicalIF":1.6,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11449645/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141760988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0