首页 > 最新文献

Biometrical Journal最新文献

英文 中文
Causal inference in the absence of positivity: The role of overlap weights 缺乏正向性的因果推理:重叠权重的作用
IF 1.7 3区 生物学 Q2 Mathematics Pub Date : 2024-06-07 DOI: 10.1002/bimj.202300156
Roland A. Matsouaka, Yunji Zhou

How to analyze data when there is violation of the positivity assumption? Several possible solutions exist in the literature. In this paper, we consider propensity score (PS) methods that are commonly used in observational studies to assess causal treatment effects in the context where the positivity assumption is violated. We focus on and examine four specific alternative solutions to the inverse probability weighting (IPW) trimming and truncation: matching weight (MW), Shannon's entropy weight (EW), overlap weight (OW), and beta weight (BW) estimators.

We first specify their target population, the population of patients for whom clinical equipoise, that is, where we have sufficient PS overlap. Then, we establish the nexus among the different corresponding weights (and estimators); this allows us to highlight the shared properties and theoretical implications of these estimators. Finally, we introduce their augmented estimators that take advantage of estimating both the propensity score and outcome regression models to enhance the treatment effect estimators in terms of bias and efficiency. We also elucidate the role of the OW estimator as the flagship of all these methods that target the overlap population.

Our analytic results demonstrate that OW, MW, and EW are preferable to IPW and some cases of BW when there is a moderate or extreme (stochastic or structural) violation of the positivity assumption. We then evaluate, compare, and confirm the finite-sample performance of the aforementioned estimators via Monte Carlo simulations. Finally, we illustrate these methods using two real-world data examples marked by violations of the positivity assumption.

当违反正向性假设时,如何分析数据?文献中存在几种可能的解决方案。在本文中,我们考虑了倾向得分(PS)方法,这些方法通常用于观察性研究,以评估违反正向性假设情况下的因果治疗效果。我们关注并研究了反概率加权(IPW)修剪和截断的四种具体替代方案:匹配权重(MW)、香农熵权重(EW)、重叠权重(OW)和贝塔权重(BW)估计器。我们首先明确其目标人群,即临床等效的患者人群,也就是我们有足够 PS 重叠的人群。然后,我们在不同的相应权重(和估计器)之间建立联系;这样我们就能突出这些估计器的共同特性和理论意义。最后,我们介绍了它们的增强估计器,这些估计器利用了倾向得分和结果回归模型的估计优势,在偏差和效率方面增强了治疗效果估计器。我们还阐明了 OW 估计器的作用,它是所有这些方法中针对重叠人群的旗舰方法。我们的分析结果表明,当存在中度或极端(随机或结构性)违反正向性假设的情况时,OW、MW 和 EW 比 IPW 和某些情况下的 BW 更优。然后,我们通过蒙特卡罗模拟对上述估计器的有限样本性能进行评估、比较和确认。最后,我们使用两个以违反正向性假设为特征的实际数据示例来说明这些方法。
{"title":"Causal inference in the absence of positivity: The role of overlap weights","authors":"Roland A. Matsouaka,&nbsp;Yunji Zhou","doi":"10.1002/bimj.202300156","DOIUrl":"10.1002/bimj.202300156","url":null,"abstract":"<p>How to analyze data when there is violation of the positivity assumption? Several possible solutions exist in the literature. In this paper, we consider propensity score (PS) methods that are commonly used in observational studies to assess causal treatment effects in the context where the positivity assumption is violated. We focus on and examine four specific alternative solutions to the inverse probability weighting (IPW) trimming and truncation: matching weight (MW), Shannon's entropy weight (EW), overlap weight (OW), and beta weight (BW) estimators.</p><p>We first specify their target population, the population of patients for whom clinical equipoise, that is, where we have sufficient PS overlap. Then, we establish the nexus among the different corresponding weights (and estimators); this allows us to highlight the shared properties and theoretical implications of these estimators. Finally, we introduce their augmented estimators that take advantage of estimating both the propensity score and outcome regression models to enhance the treatment effect estimators in terms of bias and efficiency. We also elucidate the role of the OW estimator as the flagship of all these methods that target the overlap population.</p><p>Our analytic results demonstrate that OW, MW, and EW are preferable to IPW and some cases of BW when there is a moderate or extreme (stochastic or structural) violation of the positivity assumption. We then evaluate, compare, and confirm the finite-sample performance of the aforementioned estimators via Monte Carlo simulations. Finally, we illustrate these methods using two real-world data examples marked by violations of the positivity assumption.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141285482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive predictor-set linear model: An imputation-free method for linear regression prediction on data sets with missing values 自适应预测集线性模型:对有缺失值的数据集进行线性回归预测的免估算方法。
IF 1.7 3区 生物学 Q2 Mathematics Pub Date : 2024-05-30 DOI: 10.1002/bimj.202300090
Benjamin Planterose Jiménez, Manfred Kayser, Athina Vidaki, Amke Caliebe

Linear regression (LR) is vastly used in data analysis for continuous outcomes in biomedicine and epidemiology. Despite its popularity, LR is incompatible with missing data, which frequently occur in health sciences. For parameter estimation, this shortcoming is usually resolved by complete-case analysis or imputation. Both work-arounds, however, are inadequate for prediction, since they either fail to predict on incomplete records or ignore missingness-induced reduction in prediction accuracy and rely on (unrealistic) assumptions about the missing mechanism. Here, we derive adaptive predictor-set linear model (aps-lm), capable of making predictions for incomplete data without the need for imputation. It is derived by using a predictor-selection operation, the Moore–Penrose pseudoinverse, and the reduced QR decomposition. aps-lm is an LR generalization that inherently handles missing values. It is applied on a reference data set, where complete predictors and outcome are available, and yields a set of privacy-preserving parameters. In a second stage, these are shared for making predictions of the outcome on external data sets with missing entries for predictors without imputation. Moreover, aps-lm computes prediction errors that account for the pattern of missing values even under extreme missingness. We benchmark aps-lm in a simulation study. aps-lm showed greater prediction accuracy and reduced bias compared to popular imputation strategies under a wide range of scenarios including variation of sample size, goodness of fit, missing value type, and covariance structure. Finally, as a proof-of-principle, we apply aps-lm in the context of epigenetic aging clocks, linear models that predict a person's biological age from epigenetic data with promising clinical applications.

线性回归(LR)广泛应用于生物医学和流行病学中连续结果的数据分析。尽管线性回归很受欢迎,但它与缺失数据不兼容,而缺失数据在健康科学中经常出现。在参数估计中,这一缺陷通常通过完整案例分析或估算来解决。然而,这两种变通方法都不足以进行预测,因为它们要么无法对不完整的记录进行预测,要么忽略了缺失导致的预测准确性下降,并且依赖于对缺失机制的(不切实际的)假设。在这里,我们推导出了自适应预测集线性模型(aps-lm),它无需估算就能对不完整数据进行预测。它是通过使用预测器选择操作、摩尔-彭罗斯(Moore-Penrose)伪逆和还原 QR 分解得出的。它应用于参考数据集(其中有完整的预测因子和结果),并产生一组保护隐私的参数。在第二阶段,这些参数将被共享,用于对外部数据集的结果进行预测,外部数据集中的预测因子有缺失项,无需估算。此外,即使在极端缺失的情况下,aps-lm 也能计算出考虑到缺失值模式的预测误差。我们在模拟研究中对 aps-lm 进行了基准测试。与流行的估算策略相比,aps-lm 在样本量、拟合度、缺失值类型和协方差结构等多种情况下都显示出更高的预测准确性和更小的偏差。最后,作为原理验证,我们将 aps-lm 应用于表观遗传衰老时钟,这种线性模型可以从表观遗传数据中预测一个人的生物年龄,具有良好的临床应用前景。
{"title":"Adaptive predictor-set linear model: An imputation-free method for linear regression prediction on data sets with missing values","authors":"Benjamin Planterose Jiménez,&nbsp;Manfred Kayser,&nbsp;Athina Vidaki,&nbsp;Amke Caliebe","doi":"10.1002/bimj.202300090","DOIUrl":"10.1002/bimj.202300090","url":null,"abstract":"<p>Linear regression (LR) is vastly used in data analysis for continuous outcomes in biomedicine and epidemiology. Despite its popularity, LR is incompatible with missing data, which frequently occur in health sciences. For parameter estimation, this shortcoming is usually resolved by complete-case analysis or imputation. Both work-arounds, however, are inadequate for prediction, since they either fail to predict on incomplete records or ignore missingness-induced reduction in prediction accuracy and rely on (unrealistic) assumptions about the missing mechanism. Here, we derive adaptive predictor-set linear model (aps-lm), capable of making predictions for incomplete data without the need for imputation. It is derived by using a predictor-selection operation, the Moore–Penrose pseudoinverse, and the reduced QR decomposition. aps-lm is an LR generalization that inherently handles missing values. It is applied on a reference data set, where complete predictors and outcome are available, and yields a set of privacy-preserving parameters. In a second stage, these are shared for making predictions of the outcome on external data sets with missing entries for predictors without imputation. Moreover, aps-lm computes prediction errors that account for the pattern of missing values even under extreme missingness. We benchmark aps-lm in a simulation study. aps-lm showed greater prediction accuracy and reduced bias compared to popular imputation strategies under a wide range of scenarios including variation of sample size, goodness of fit, missing value type, and covariance structure. Finally, as a proof-of-principle, we apply aps-lm in the context of epigenetic aging clocks, linear models that predict a person's biological age from epigenetic data with promising clinical applications.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.202300090","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141177096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bayesian hierarchical hidden Markov model for clustering and gene selection: Application to kidney cancer gene expression data 用于聚类和基因选择的贝叶斯分层隐马尔可夫模型:应用于肾癌基因表达数据。
IF 1.7 3区 生物学 Q2 Mathematics Pub Date : 2024-05-30 DOI: 10.1002/bimj.202300173
Thierry Chekouo, Himadri Mukherjee

We introduce a Bayesian approach for biclustering that accounts for the prior functional dependence between genes using hidden Markov models (HMMs). We utilize biological knowledge gathered from gene ontologies and the hidden Markov structure to capture the potential coexpression of neighboring genes. Our interpretable model-based clustering characterized each cluster of samples by three groups of features: overexpressed, underexpressed, and irrelevant features. The proposed methods have been implemented in an R package and are used to analyze both the simulated data and The Cancer Genome Atlas kidney cancer data.

我们介绍了一种贝叶斯双聚类方法,该方法利用隐马尔可夫模型(HMM)考虑了基因之间的先验功能依赖性。我们利用从基因本体和隐马尔可夫结构中收集的生物知识来捕捉相邻基因的潜在共表达。我们基于可解释模型的聚类方法通过三组特征来表征每个样本集群:过度表达、表达不足和无关特征。提出的方法已在 R 软件包中实现,并用于分析模拟数据和癌症基因组图谱肾癌数据。
{"title":"A Bayesian hierarchical hidden Markov model for clustering and gene selection: Application to kidney cancer gene expression data","authors":"Thierry Chekouo,&nbsp;Himadri Mukherjee","doi":"10.1002/bimj.202300173","DOIUrl":"10.1002/bimj.202300173","url":null,"abstract":"<p>We introduce a Bayesian approach for biclustering that accounts for the prior functional dependence between genes using hidden Markov models (HMMs). We utilize biological knowledge gathered from gene ontologies and the hidden Markov structure to capture the potential coexpression of neighboring genes. Our interpretable model-based clustering characterized each cluster of samples by three groups of features: overexpressed, underexpressed, and irrelevant features. The proposed methods have been implemented in an R package and are used to analyze both the simulated data and The Cancer Genome Atlas kidney cancer data.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.202300173","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141181467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Valid instrumental variable selection method using negative control outcomes and constructing efficient estimator 使用负控制结果的有效工具变量选择法和构建高效估计器。
IF 1.7 3区 生物学 Q2 Mathematics Pub Date : 2024-05-27 DOI: 10.1002/bimj.202300113
Shunichiro Orihara, Atsushi Goto, Masataka Taguri

In observational studies, instrumental variable (IV) methods are commonly applied when there are unmeasured covariates. In Mendelian randomization, constructing an allele score using many single nucleotide polymorphisms is often implemented; however, estimating biased causal effects by including some invalid IVs poses some risks. Invalid IVs are those IV candidates that are associated with unobserved variables. To solve this problem, we developed a novel strategy using negative control outcomes (NCOs) as auxiliary variables. Using NCOs, we are able to select only valid IVs and exclude invalid IVs without knowing which of the instruments are invalid. We also developed a new two-step estimation procedure and proved the semiparametric efficiency of our estimator. The performance of our proposed method was superior to some previous methods through simulations. Subsequently, we applied the proposed method to the UK Biobank dataset. Our results demonstrate that the use of an auxiliary variable, such as an NCO, enables the selection of valid IVs with assumptions different from those used in previous methods.

在观察性研究中,当存在无法测量的协变量时,通常会采用工具变量(IV)方法。在孟德尔随机化中,通常会使用许多单核苷酸多态性来构建等位基因得分;然而,通过包含一些无效的 IV 来估计有偏差的因果效应会带来一些风险。无效的 IV 是指那些与非观测变量相关的 IV 候选者。为了解决这个问题,我们开发了一种新策略,将负控制结果(NCOs)作为辅助变量。利用负控制结果,我们可以只选择有效的 IV,排除无效的 IV,而无需知道哪些工具是无效的。我们还开发了一种新的两步估计程序,并证明了我们的估计器的半参数效率。通过模拟,我们提出的方法的性能优于之前的一些方法。随后,我们将提出的方法应用于英国生物库数据集。我们的结果表明,使用辅助变量(如 NCO)可以选择有效的 IV,其假设条件与之前的方法不同。
{"title":"Valid instrumental variable selection method using negative control outcomes and constructing efficient estimator","authors":"Shunichiro Orihara,&nbsp;Atsushi Goto,&nbsp;Masataka Taguri","doi":"10.1002/bimj.202300113","DOIUrl":"10.1002/bimj.202300113","url":null,"abstract":"<p>In observational studies, instrumental variable (IV) methods are commonly applied when there are unmeasured covariates. In Mendelian randomization, constructing an allele score using many single nucleotide polymorphisms is often implemented; however, estimating biased causal effects by including some invalid IVs poses some risks. Invalid IVs are those IV candidates that are associated with unobserved variables. To solve this problem, we developed a novel strategy using negative control outcomes (NCOs) as auxiliary variables. Using NCOs, we are able to select only valid IVs and exclude invalid IVs without knowing which of the instruments are invalid. We also developed a new two-step estimation procedure and proved the semiparametric efficiency of our estimator. The performance of our proposed method was superior to some previous methods through simulations. Subsequently, we applied the proposed method to the UK Biobank dataset. Our results demonstrate that the use of an auxiliary variable, such as an NCO, enables the selection of valid IVs with assumptions different from those used in previous methods.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141155844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting class switch recombination in B-cells from antibody repertoire data 从抗体库数据预测 B 细胞中的类开关重组
IF 1.7 3区 生物学 Q2 Mathematics Pub Date : 2024-05-24 DOI: 10.1002/bimj.202300171
Lutecia Servius, Davide Pigoli, Joseph Ng, Franca Fraternali

Statistical and machine learning methods have proved useful in many areas of immunology. In this paper, we address for the first time the problem of predicting the occurrence of class switch recombination (CSR) in B-cells, a problem of interest in understanding antibody response under immunological challenges. We propose a framework to analyze antibody repertoire data, based on clonal (CG) group representation in a way that allows us to predict CSR events using CG level features as input. We assess and compare the performance of several predicting models (logistic regression, LASSO logistic regression, random forest, and support vector machine) in carrying out this task. The proposed approach can obtain an unweighted average recall of 71%$71%$ with models based on variable region descriptors and measures of CG diversity during an immune challenge and, most notably, before an immune challenge.

事实证明,统计和机器学习方法在免疫学的许多领域都很有用。在本文中,我们首次解决了预测 B 细胞中类开关重组(CSR)发生的问题,这是了解免疫学挑战下抗体反应的一个重要问题。我们提出了一个基于克隆(CG)组表示法的分析抗体复合物数据的框架,该框架允许我们使用克隆组水平特征作为输入来预测 CSR 事件。我们评估并比较了几种预测模型(逻辑回归、LASSO 逻辑回归、随机森林和支持向量机)在执行这项任务时的性能。在免疫挑战期间,最明显的是在免疫挑战之前,基于可变区域描述符和 CG 多样性测量的模型,所提出的方法可以获得 71% $71%$ 的非加权平均召回率。
{"title":"Predicting class switch recombination in B-cells from antibody repertoire data","authors":"Lutecia Servius,&nbsp;Davide Pigoli,&nbsp;Joseph Ng,&nbsp;Franca Fraternali","doi":"10.1002/bimj.202300171","DOIUrl":"10.1002/bimj.202300171","url":null,"abstract":"<p>Statistical and machine learning methods have proved useful in many areas of immunology. In this paper, we address for the first time the problem of predicting the occurrence of class switch recombination (CSR) in B-cells, a problem of interest in understanding antibody response under immunological challenges. We propose a framework to analyze antibody repertoire data, based on clonal (CG) group representation in a way that allows us to predict CSR events using CG level features as input. We assess and compare the performance of several predicting models (logistic regression, LASSO logistic regression, random forest, and support vector machine) in carrying out this task. The proposed approach can obtain an unweighted average recall of <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mn>71</mn>\u0000 <mo>%</mo>\u0000 </mrow>\u0000 <annotation>$71%$</annotation>\u0000 </semantics></math> with models based on variable region descriptors and measures of CG diversity during an immune challenge and, most notably, before an immune challenge.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.202300171","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141089535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A nonparametric proportional risk model to assess a treatment effect in time-to-event data 在时间到事件数据中评估治疗效果的非参数比例风险模型。
IF 1.7 3区 生物学 Q2 Mathematics Pub Date : 2024-05-24 DOI: 10.1002/bimj.202300147
Lucia Ameis, Oliver Kuss, Annika Hoyer, Kathrin Möllenhoff

Time-to-event analysis often relies on prior parametric assumptions, or, if a semiparametric approach is chosen, Cox's model. This is inherently tied to the assumption of proportional hazards, with the analysis potentially invalidated if this assumption is not fulfilled. In addition, most interpretations focus on the hazard ratio, that is often misinterpreted as the relative risk (RR), the ratio of the cumulative distribution functions. In this paper, we introduce an alternative to current methodology for assessing a treatment effect in a two-group situation, not relying on the proportional hazards assumption but assuming proportional risks. Precisely, we propose a new nonparametric model to directly estimate the RR of two groups to experience an event under the assumption that the risk ratio is constant over time. In addition to this relative measure, our model allows for calculating the number needed to treat as an absolute measure, providing the possibility of an easy and holistic interpretation of the data. We demonstrate the validity of the approach by means of a simulation study and present an application to data from a large randomized controlled trial investigating the effect of dapagliflozin on all-cause mortality.

时间到事件分析通常依赖于先验参数假设,如果选择半参数方法,则依赖于考克斯模型。这与比例危险假设有着内在联系,如果这一假设不成立,分析就可能失效。此外,大多数解释都侧重于危险比,而危险比往往被误解为相对风险(RR),即累积分布函数的比值。在本文中,我们提出了一种替代目前评估两组情况下治疗效果的方法,即不依赖比例危险假设,而是假设比例风险。确切地说,我们提出了一种新的非参数模型,在假设风险比随时间变化不变的情况下,直接估算两组发生事件的 RR。除了这一相对指标外,我们的模型还能计算出治疗所需人数的绝对指标,从而提供了对数据进行简便、全面解释的可能性。我们通过模拟研究证明了该方法的有效性,并介绍了该方法在一项大型随机对照试验数据中的应用,该试验调查了达帕利洛嗪对全因死亡率的影响。
{"title":"A nonparametric proportional risk model to assess a treatment effect in time-to-event data","authors":"Lucia Ameis,&nbsp;Oliver Kuss,&nbsp;Annika Hoyer,&nbsp;Kathrin Möllenhoff","doi":"10.1002/bimj.202300147","DOIUrl":"10.1002/bimj.202300147","url":null,"abstract":"<p>Time-to-event analysis often relies on prior parametric assumptions, or, if a semiparametric approach is chosen, Cox's model. This is inherently tied to the assumption of proportional hazards, with the analysis potentially invalidated if this assumption is not fulfilled. In addition, most interpretations focus on the hazard ratio, that is often misinterpreted as the relative risk (RR), the ratio of the cumulative distribution functions. In this paper, we introduce an alternative to current methodology for assessing a treatment effect in a two-group situation, not relying on the proportional hazards assumption but assuming proportional risks. Precisely, we propose a new nonparametric model to directly estimate the RR of two groups to experience an event under the assumption that the risk ratio is constant over time. In addition to this relative measure, our model allows for calculating the number needed to treat as an absolute measure, providing the possibility of an easy and holistic interpretation of the data. We demonstrate the validity of the approach by means of a simulation study and present an application to data from a large randomized controlled trial investigating the effect of dapagliflozin on all-cause mortality.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.202300147","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141089524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A method for determining groups in cumulative incidence curves in competing risk data 在竞争风险数据的累积发病率曲线中确定组别的方法。
IF 1.7 3区 生物学 Q2 Mathematics Pub Date : 2024-05-22 DOI: 10.1002/bimj.202300084
Marta Sestelo, Luís Meira-Machado, Nora M. Villanueva, Javier Roca-Pardiñas

The cumulative incidence function is the standard method for estimating the marginal probability of a given event in the presence of competing risks. One basic but important goal in the analysis of competing risk data is the comparison of these curves, for which limited literature exists. We proposed a new procedure that lets us not only test the equality of these curves but also group them if they are not equal. The proposed method allows determining the composition of the groups as well as an automatic selection of their number. Simulation studies show the good numerical behavior of the proposed methods for finite sample size. The applicability of the proposed method is illustrated using real data.

累积发生率函数是在存在竞争风险的情况下估算特定事件边际概率的标准方法。分析竞争风险数据的一个基本但重要的目标是比较这些曲线,但这方面的文献有限。我们提出了一种新的程序,不仅可以检验这些曲线是否相等,还可以在不相等的情况下对它们进行分组。所提出的方法可以确定组的组成,并自动选择组的数量。模拟研究表明,在样本数量有限的情况下,所提出的方法具有良好的数值表现。使用真实数据说明了所提方法的适用性。
{"title":"A method for determining groups in cumulative incidence curves in competing risk data","authors":"Marta Sestelo,&nbsp;Luís Meira-Machado,&nbsp;Nora M. Villanueva,&nbsp;Javier Roca-Pardiñas","doi":"10.1002/bimj.202300084","DOIUrl":"10.1002/bimj.202300084","url":null,"abstract":"<p>The cumulative incidence function is the standard method for estimating the marginal probability of a given event in the presence of competing risks. One basic but important goal in the analysis of competing risk data is the comparison of these curves, for which limited literature exists. We proposed a new procedure that lets us not only test the equality of these curves but also group them if they are not equal. The proposed method allows determining the composition of the groups as well as an automatic selection of their number. Simulation studies show the good numerical behavior of the proposed methods for finite sample size. The applicability of the proposed method is illustrated using real data.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.202300084","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141077094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sparse Group Penalties for bi-level variable selection 用于双级变量选择的稀疏组惩罚。
IF 1.7 3区 生物学 Q2 Mathematics Pub Date : 2024-05-15 DOI: 10.1002/bimj.202200334
Gregor Buch, Andreas Schulz, Irene Schmidtmann, Konstantin Strauch, Philipp S. Wild

Many data sets exhibit a natural group structure due to contextual similarities or high correlations of variables, such as lipid markers that are interrelated based on biochemical principles. Knowledge of such groupings can be used through bi-level selection methods to identify relevant feature groups and highlight their predictive members. One of the best known approaches of this kind combines the classical Least Absolute Shrinkage and Selection Operator (LASSO) with the Group LASSO, resulting in the Sparse Group LASSO. We propose the Sparse Group Penalty (SGP) framework, which allows for a flexible combination of different SGL-style shrinkage conditions. Analogous to SGL, we investigated the combination of the Smoothly Clipped Absolute Deviation (SCAD), the Minimax Concave Penalty (MCP) and the Exponential Penalty (EP) with their group versions, resulting in the Sparse Group SCAD, the Sparse Group MCP, and the novel Sparse Group EP (SGE). Those shrinkage operators provide refined control of the effect of group formation on the selection process through a tuning parameter. In simulation studies, SGPs were compared with other bi-level selection methods (Group Bridge, composite MCP, and Group Exponential LASSO) for variable and group selection evaluated with the Matthews correlation coefficient. We demonstrated the advantages of the new SGE in identifying parsimonious models, but also identified scenarios that highlight the limitations of the approach. The performance of the techniques was further investigated in a real-world use case for the selection of regulated lipids in a randomized clinical trial.

许多数据集由于上下文相似性或变量的高度相关性(如基于生化原理相互关联的脂质标记)而呈现出一种自然的分组结构。这种分组知识可通过双级选择方法来识别相关特征组并突出其预测成员。这类方法中最著名的一种是将经典的最小绝对收缩和选择算子(LASSO)与组 LASSO 结合起来,形成稀疏组 LASSO。我们提出了稀疏组惩罚(SGP)框架,它允许灵活组合不同的 SGL 式收缩条件。与 SGL 类似,我们研究了平滑截断绝对偏差(SCAD)、最小值凹惩罚(MCP)和指数惩罚(EP)与它们的组版本的组合,最终得出稀疏组 SCAD、稀疏组 MCP 和新型稀疏组 EP (SGE)。这些收缩算子通过一个调整参数对分组形成对选择过程的影响进行了精细控制。在模拟研究中,我们将 SGP 与其他双层选择方法(群桥、复合 MCP 和群指数 LASSO)进行了比较,并用马修斯相关系数对变量和群选择进行了评估。我们证明了新的 SGE 在确定拟合模型方面的优势,但也发现了一些凸显该方法局限性的情况。在随机临床试验中选择受调控血脂的实际应用案例中,我们进一步研究了这些技术的性能。
{"title":"Sparse Group Penalties for bi-level variable selection","authors":"Gregor Buch,&nbsp;Andreas Schulz,&nbsp;Irene Schmidtmann,&nbsp;Konstantin Strauch,&nbsp;Philipp S. Wild","doi":"10.1002/bimj.202200334","DOIUrl":"10.1002/bimj.202200334","url":null,"abstract":"<p>Many data sets exhibit a natural group structure due to contextual similarities or high correlations of variables, such as lipid markers that are interrelated based on biochemical principles. Knowledge of such groupings can be used through bi-level selection methods to identify relevant feature groups and highlight their predictive members. One of the best known approaches of this kind combines the classical <i>Least Absolute Shrinkage and Selection Operator</i> (LASSO) with the <i>Group LASSO</i>, resulting in the <i>Sparse Group LASSO</i>. We propose the Sparse Group Penalty (SGP) framework, which allows for a flexible combination of different SGL-style shrinkage conditions. Analogous to SGL, we investigated the combination of the <i>Smoothly Clipped Absolute Deviation</i> (SCAD), the <i>Minimax Concave Penalty</i> (MCP) and the <i>Exponential Penalty</i> (EP) with their group versions, resulting in the <i>Sparse Group SCAD</i>, the <i>Sparse Group MCP</i>, and the novel <i>Sparse Group EP</i> (SGE). Those shrinkage operators provide refined control of the effect of group formation on the selection process through a tuning parameter. In simulation studies, SGPs were compared with other bi-level selection methods (Group Bridge, composite MCP, and Group Exponential LASSO) for variable and group selection evaluated with the Matthews correlation coefficient. We demonstrated the advantages of the new SGE in identifying parsimonious models, but also identified scenarios that highlight the limitations of the approach. The performance of the techniques was further investigated in a real-world use case for the selection of regulated lipids in a randomized clinical trial.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.202200334","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140924034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative review of novel model-assisted designs for phase I/II clinical trials 对 I/II 期临床试验的新型模型辅助设计进行比较审查。
IF 1.7 3区 生物学 Q2 Mathematics Pub Date : 2024-05-13 DOI: 10.1002/bimj.202300398
Haolun Shi, Ruitao Lin, Xiaolei Lin

In recent years, both model-based and model-assisted designs have emerged to efficiently determine the optimal biological dose (OBD) in phase I/II trials for immunotherapy and targeted cellular agents. Model-based designs necessitate repeated model fitting and computationally intensive posterior sampling for each dose-assignment decision, limiting their practical application in real trials. On the other hand, model-assisted designs employ simple statistical models and facilitate the precalculation of a decision table for use throughout the trial, eliminating the need for repeated model fitting. Due to their simplicity and transparency, model-assisted designs are often preferred in phase I/II trials. In this paper, we systematically evaluate and compare the operating characteristics of several recent model-assisted phase I/II designs, including TEPI, PRINTE, Joint i3+3, BOIN-ET, STEIN, uTPI, and BOIN12, in addition to the well-known model-based EffTox design, using comprehensive numerical simulations. To ensure an unbiased comparison, we generated 10,000 dosing scenarios using a random scenario generation algorithm for each predetermined OBD location. We thoroughly assess various performance metrics, such as the selection percentages, average patient allocation to OBD, and overdose percentages across the eight designs. Based on these assessments, we offer design recommendations tailored to different objectives, sample sizes, and starting dose locations.

近年来,出现了基于模型的设计和模型辅助设计,用于在免疫疗法和靶向细胞药物的I/II期试验中有效确定最佳生物剂量(OBD)。基于模型的设计需要反复进行模型拟合,每次剂量分配决策都需要进行计算密集的后验取样,这限制了其在实际试验中的应用。另一方面,模型辅助设计采用简单的统计模型,便于预先计算供整个试验使用的决策表,从而无需反复进行模型拟合。由于模型辅助设计既简单又透明,因此在 I/II 期试验中常常受到青睐。在本文中,除了众所周知的基于模型的 EffTox 设计外,我们还通过全面的数值模拟,系统地评估和比较了近期几种模型辅助 I/II 期设计的操作特性,包括 TEPI、PRINTE、Joint i3+3、BOIN-ET、STEIN、uTPI 和 BOIN12。为确保比较无偏见,我们使用随机情景生成算法为每个预定的 OBD 位置生成了 10,000 个剂量情景。我们全面评估了八种设计的各种性能指标,如选择百分比、OBD 患者平均分配率和超剂量百分比。在这些评估的基础上,我们提出了针对不同目标、样本大小和起始剂量位置的设计建议。
{"title":"Comparative review of novel model-assisted designs for phase I/II clinical trials","authors":"Haolun Shi,&nbsp;Ruitao Lin,&nbsp;Xiaolei Lin","doi":"10.1002/bimj.202300398","DOIUrl":"10.1002/bimj.202300398","url":null,"abstract":"<p>In recent years, both model-based and model-assisted designs have emerged to efficiently determine the optimal biological dose (OBD) in phase I/II trials for immunotherapy and targeted cellular agents. Model-based designs necessitate repeated model fitting and computationally intensive posterior sampling for each dose-assignment decision, limiting their practical application in real trials. On the other hand, model-assisted designs employ simple statistical models and facilitate the precalculation of a decision table for use throughout the trial, eliminating the need for repeated model fitting. Due to their simplicity and transparency, model-assisted designs are often preferred in phase I/II trials. In this paper, we systematically evaluate and compare the operating characteristics of several recent model-assisted phase I/II designs, including TEPI, PRINTE, Joint i3+3, BOIN-ET, STEIN, uTPI, and BOIN12, in addition to the well-known model-based EffTox design, using comprehensive numerical simulations. To ensure an unbiased comparison, we generated 10,000 dosing scenarios using a random scenario generation algorithm for each predetermined OBD location. We thoroughly assess various performance metrics, such as the selection percentages, average patient allocation to OBD, and overdose percentages across the eight designs. Based on these assessments, we offer design recommendations tailored to different objectives, sample sizes, and starting dose locations.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140913431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Issue Information: Biometrical Journal 4'24 期刊信息:生物计量学杂志 4'24
IF 1.7 3区 生物学 Q2 Mathematics Pub Date : 2024-05-03 DOI: 10.1002/bimj.202470004
{"title":"Issue Information: Biometrical Journal 4'24","authors":"","doi":"10.1002/bimj.202470004","DOIUrl":"https://doi.org/10.1002/bimj.202470004","url":null,"abstract":"","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.202470004","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140820534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biometrical Journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1