International Journal of Biostatistics最新文献

英文中文

Improving the mixed model for repeated measures to robustly increase precision in randomized trials. 改进重复测量的混合模型，显著提高随机试验的精度。

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2023-11-29 eCollection Date: 2024-11-01 DOI: 10.1515/ijb-2022-0101

Bingkai Wang, Yu Du

In randomized trials, repeated measures of the outcome are routinely collected. The mixed model for repeated measures (MMRM) leverages the information from these repeated outcome measures, and is often used for the primary analysis to estimate the average treatment effect at the primary endpoint. MMRM, however, can suffer from bias and precision loss when it models intermediate outcomes incorrectly, and hence fails to use the post-randomization information harmlessly. This paper proposes an extension of the commonly used MMRM, called IMMRM, that improves the robustness and optimizes the precision gain from covariate adjustment, stratified randomization, and adjustment for intermediate outcome measures. Under regularity conditions and missing completely at random, we prove that the IMMRM estimator for the average treatment effect is robust to arbitrary model misspecification and is asymptotically equal or more precise than the analysis of covariance (ANCOVA) estimator and the MMRM estimator. Under missing at random, IMMRM is less likely to be misspecified than MMRM, and we demonstrate via simulation studies that IMMRM continues to have less bias and smaller variance. Our results are further supported by a re-analysis of a randomized trial for the treatment of diabetes.

在随机试验中，结果的重复测量是常规收集的。重复测量的混合模型(MMRM)利用这些重复结果测量的信息，通常用于主要分析，以估计主要终点的平均治疗效果。然而，当MMRM不正确地模拟中间结果时，它可能会遭受偏差和精度损失，因此不能无害地使用随机化后的信息。本文提出了一种常用的MMRM的扩展，称为IMMRM，它提高了鲁棒性并优化了协变量调整、分层随机化和中间结果测量调整的精度增益。在正则性条件和完全随机缺失条件下，证明了IMMRM估计对任意模型错规范的平均处理效果具有鲁棒性，并且与协方差分析(ANCOVA)估计和MMRM估计渐近相等或更精确。在随机缺失的情况下，IMMRM比MMRM更不容易被错误指定，并且我们通过模拟研究证明IMMRM仍然具有更小的偏差和更小的方差。我们的研究结果得到了一项针对糖尿病治疗的随机试验的再分析的进一步支持。

{"title":"Improving the mixed model for repeated measures to robustly increase precision in randomized trials.","authors":"Bingkai Wang, Yu Du","doi":"10.1515/ijb-2022-0101","DOIUrl":"10.1515/ijb-2022-0101","url":null,"abstract":"In randomized trials, repeated measures of the outcome are routinely collected. The mixed model for repeated measures (MMRM) leverages the information from these repeated outcome measures, and is often used for the primary analysis to estimate the average treatment effect at the primary endpoint. MMRM, however, can suffer from bias and precision loss when it models intermediate outcomes incorrectly, and hence fails to use the post-randomization information harmlessly. This paper proposes an extension of the commonly used MMRM, called IMMRM, that improves the robustness and optimizes the precision gain from covariate adjustment, stratified randomization, and adjustment for intermediate outcome measures. Under regularity conditions and missing completely at random, we prove that the IMMRM estimator for the average treatment effect is robust to arbitrary model misspecification and is asymptotically equal or more precise than the analysis of covariance (ANCOVA) estimator and the MMRM estimator. Under missing at random, IMMRM is less likely to be misspecified than MMRM, and we demonstrate via simulation studies that IMMRM continues to have less bias and smaller variance. Our results are further supported by a re-analysis of a randomized trial for the treatment of diabetes.","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"585-598"},"PeriodicalIF":1.2,"publicationDate":"2023-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138452976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Prediction-based variable selection for component-wise gradient boosting. 基于预测的梯度增强变量选择。

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2023-11-27 eCollection Date: 2024-05-01 DOI: 10.1515/ijb-2023-0052

Sophie Potts, Elisabeth Bergherr, Constantin Reinke, Colin Griesbach

Model-based component-wise gradient boosting is a popular tool for data-driven variable selection. In order to improve its prediction and selection qualities even further, several modifications of the original algorithm have been developed, that mainly focus on different stopping criteria, leaving the actual variable selection mechanism untouched. We investigate different prediction-based mechanisms for the variable selection step in model-based component-wise gradient boosting. These approaches include Akaikes Information Criterion (AIC) as well as a selection rule relying on the component-wise test error computed via cross-validation. We implemented the AIC and cross-validation routines for Generalized Linear Models and evaluated them regarding their variable selection properties and predictive performance. An extensive simulation study revealed improved selection properties whereas the prediction error could be lowered in a real world application with age-standardized COVID-19 incidence rates.

基于模型的组件梯度增强是一种流行的数据驱动变量选择工具。为了进一步提高其预测和选择质量，对原始算法进行了一些修改，主要关注不同的停止准则，而没有改变实际的变量选择机制。我们研究了基于模型的组件梯度增强中变量选择步骤的不同基于预测的机制。这些方法包括赤池氏信息准则(Akaikes Information Criterion, AIC)以及依赖于通过交叉验证计算的组件测试误差的选择规则。我们实现了广义线性模型的AIC和交叉验证例程，并评估了它们的变量选择特性和预测性能。一项广泛的模拟研究揭示了改进的选择特性，而在年龄标准化的COVID-19发病率的现实世界应用中，预测误差可以降低。

引用次数: 0

Bayesian second-order sensitivity of longitudinal inferences to non-ignorability: an application to antidepressant clinical trial data. 纵向推论对不可忽略性的贝叶斯二阶敏感性:抗抑郁药临床试验数据的应用。

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2023-11-27 eCollection Date: 2024-11-01 DOI: 10.1515/ijb-2022-0014

Elahe Momeni Roochi, Samaneh Eftekhari Mahabadi

Incomplete data is a prevalent complication in longitudinal studies due to individuals' drop-out before intended completion time. Currently available methods via commercial software for analyzing incomplete longitudinal data at best rely on the ignorability of the drop-outs. If the underlying missing mechanism was non-ignorable, potential bias arises in the statistical inferences. To remove the bias when the drop-out is non-ignorable, joint complete-data and drop-out models have been proposed which involve computational difficulties and untestable assumptions. Since the critical ignorability assumption is unverifiable based on the observed part of the sample, some local sensitivity indices have been proposed in the literature. Specifically, Eftekhari Mahabadi (Second-order local sensitivity to non-ignorability in Bayesian inferences. Stat Med 2018;59:55-95) proposed a second-order local sensitivity tool for Bayesian analysis of cross-sectional studies and show its better performance for handling bias compared with the first-order ones. In this paper, we aim to extend this index for the Bayesian sensitivity analysis of normal longitudinal studies with drop-outs. The index is driven based on a selection model for the drop-out mechanism and a Bayesian linear mixed-effect complete-data model. The presented formulas are calculated using the posterior estimation and draws from the simpler ignorable model. The method is illustrated via some simulation studies and sensitivity analysis of a real antidepressant clinical trial data. Overall, the numerical analysis showed that when repeated outcomes are subject to missingness, regression coefficient estimates are nearly approximated well by a linear function in the neighbourhood of MAR model, but there are a considerable amount of second-order sensitivity for the error term and random effect variances in Bayesian linear mixed-effect model framework.

在纵向研究中，由于个体在预期完成时间之前退出，数据不完整是一个普遍的并发症。目前可用的通过商业软件分析不完整纵向数据的方法，最多依赖于辍学的可忽略性。如果潜在的缺失机制是不可忽略的，则在统计推断中产生潜在的偏差。为了消除drop-out不可忽略时的偏差，提出了联合完整数据和drop-out模型，该模型涉及计算困难和不可检验的假设。由于临界可忽略性假设无法根据样本的观测部分进行验证，因此文献中提出了一些局部敏感性指标。具体地说，Eftekhari Mahabadi(二阶局部灵敏度对贝叶斯推理的不可忽略性)。Stat Med 2018;59:55-95)提出了一种用于横断面研究贝叶斯分析的二阶局部灵敏度工具，与一阶工具相比，其处理偏倚的性能更好。在本文中，我们的目标是将该指标扩展到具有辍学的正常纵向研究的贝叶斯灵敏度分析。该指标基于退出机制的选择模型和贝叶斯线性混合效应完整数据模型驱动。给出的公式是用后验估计计算的，并从更简单的可忽略模型中得出。通过模拟研究和对真实抗抑郁药物临床试验数据的敏感性分析来说明该方法。总体而言，数值分析表明，当重复结果存在缺失时，回归系数估计可以通过MAR模型邻域的线性函数近似地逼近，但贝叶斯线性混合效应模型框架中误差项和随机效应方差存在相当大的二阶敏感性。

{"title":"Bayesian second-order sensitivity of longitudinal inferences to non-ignorability: an application to antidepressant clinical trial data.","authors":"Elahe Momeni Roochi, Samaneh Eftekhari Mahabadi","doi":"10.1515/ijb-2022-0014","DOIUrl":"10.1515/ijb-2022-0014","url":null,"abstract":"Incomplete data is a prevalent complication in longitudinal studies due to individuals' drop-out before intended completion time. Currently available methods via commercial software for analyzing incomplete longitudinal data at best rely on the ignorability of the drop-outs. If the underlying missing mechanism was non-ignorable, potential bias arises in the statistical inferences. To remove the bias when the drop-out is non-ignorable, joint complete-data and drop-out models have been proposed which involve computational difficulties and untestable assumptions. Since the critical ignorability assumption is unverifiable based on the observed part of the sample, some local sensitivity indices have been proposed in the literature. Specifically, Eftekhari Mahabadi (Second-order local sensitivity to non-ignorability in Bayesian inferences. Stat Med 2018;59:55-95) proposed a second-order local sensitivity tool for Bayesian analysis of cross-sectional studies and show its better performance for handling bias compared with the first-order ones. In this paper, we aim to extend this index for the Bayesian sensitivity analysis of normal longitudinal studies with drop-outs. The index is driven based on a selection model for the drop-out mechanism and a Bayesian linear mixed-effect complete-data model. The presented formulas are calculated using the posterior estimation and draws from the simpler ignorable model. The method is illustrated via some simulation studies and sensitivity analysis of a real antidepressant clinical trial data. Overall, the numerical analysis showed that when repeated outcomes are subject to missingness, regression coefficient estimates are nearly approximated well by a linear function in the neighbourhood of MAR model, but there are a considerable amount of second-order sensitivity for the error term and random effect variances in Bayesian linear mixed-effect model framework.","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"599-629"},"PeriodicalIF":1.2,"publicationDate":"2023-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138441586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Revisiting incidence rates comparison under right censorship. 在正确的审查制度下重温发病率比较。

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2023-11-14 eCollection Date: 2024-11-01 DOI: 10.1515/ijb-2023-0025

Pablo Martínez-Camblor, Susana Díaz-Coto

Data description is the first step for understanding the nature of the problem at hand. Usually, it is a simple task that does not require any particular assumption. However, the interpretation of the used descriptive measures can be a source of confusion and misunderstanding. The incidence rate is the quotient between the number of observed events and the sum of time that the studied population was at risk of having this event (person-time). Despite this apparently simple definition, its interpretation is not free of complexity. In this piece of research, we revisit the incidence rate estimator under right-censorship. We analyze the effect that the censoring time distribution can have on the observed results, and its relevance in the comparison of two or more incidence rates. We propose a solution for limiting the impact that the data collection process can have on the results of the hypothesis testing. We explore the finite-sample behavior of the considered estimators from Monte Carlo simulations. Two examples based on synthetic data illustrate the considered problem. The R code and data used are provided as Supplementary Material.

数据描述是理解手头问题本质的第一步。通常，这是一个简单的任务，不需要任何特定的假设。然而，对所使用的描述性度量的解释可能是混淆和误解的来源。发病率是观察到的事件数与研究人群有发生该事件风险的时间总和(人-时间)之间的商。尽管这个定义看起来很简单，但它的解释并非没有复杂性。在这篇研究中，我们重新审视了权利审查下的发生率估计器。我们分析了审查时间分布对观测结果的影响，以及它在两个或多个发病率比较中的相关性。我们提出了一个解决方案来限制数据收集过程对假设检验结果的影响。我们从蒙特卡洛模拟中探讨了所考虑的估计器的有限样本行为。基于综合数据的两个示例说明了所考虑的问题。R代码和使用的数据作为补充材料提供。

引用次数: 0

Frontmatter 头版头条

4区数学

International Journal of Biostatistics

Pub Date : 2023-11-01 DOI: 10.1515/ijb-2023-frontmatter2

引用次数: 0

Testing for association between ordinal traits and genetic variants in pedigree-structured samples by collapsing and kernel methods. 通过折叠和核方法测试系谱结构样本中序数性状和遗传变异之间的关联。

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2023-09-26 eCollection Date: 2024-11-01 DOI: 10.1515/ijb-2022-0123

Li-Chu Chien

In genome-wide association studies (GWAS), logistic regression is one of the most popular analytics methods for binary traits. Multinomial regression is an extension of binary logistic regression that allows for multiple categories. However, many GWAS methods have been limited application to binary traits. These methods have improperly often been used to account for ordinal traits, which causes inappropriate type I error rates and poor statistical power. Owing to the lack of analysis methods, GWAS of ordinal traits has been known to be problematic and gaining attention. In this paper, we develop a general framework for identifying ordinal traits associated with genetic variants in pedigree-structured samples by collapsing and kernel methods. We use the local odds ratios GEE technology to account for complicated correlation structures between family members and ordered categorical traits. We use the retrospective idea to treat the genetic markers as random variables for calculating genetic correlations among markers. The proposed genetic association method can accommodate ordinal traits and allow for the covariate adjustment. We conduct simulation studies to compare the proposed tests with the existing models for analyzing the ordered categorical data under various configurations. We illustrate application of the proposed tests by simultaneously analyzing a family study and a cross-sectional study from the Genetic Analysis Workshop 19 (GAW19) data.

在全基因组关联研究（GWAS）中，逻辑回归是最流行的二元性状分析方法之一。多项式回归是二元逻辑回归的扩展，允许多个类别。然而，许多GWAS方法在二元性状上的应用受到限制。这些方法经常被不恰当地用于解释有序特征，这导致了不恰当的I型错误率和较差的统计能力。由于缺乏分析方法，序列性状的GWAS一直存在问题，并引起了人们的关注。在本文中，我们开发了一个通用的框架，用于通过折叠和核方法识别谱系结构样本中与遗传变异相关的有序性状。我们使用局部优势比GEE技术来解释家庭成员和有序分类特征之间的复杂相关性结构。我们使用回顾性的思想将遗传标记作为随机变量来计算标记之间的遗传相关性。所提出的遗传关联方法可以适应序数性状，并允许协变量调整。我们进行了模拟研究，将所提出的测试与现有的模型进行比较，以分析各种配置下的有序分类数据。我们通过同时分析遗传分析工作坊19（GAW19）数据的一项家庭研究和一项横断面研究来说明所提出的测试的应用。

{"title":"Testing for association between ordinal traits and genetic variants in pedigree-structured samples by collapsing and kernel methods.","authors":"Li-Chu Chien","doi":"10.1515/ijb-2022-0123","DOIUrl":"10.1515/ijb-2022-0123","url":null,"abstract":"In genome-wide association studies (GWAS), logistic regression is one of the most popular analytics methods for binary traits. Multinomial regression is an extension of binary logistic regression that allows for multiple categories. However, many GWAS methods have been limited application to binary traits. These methods have improperly often been used to account for ordinal traits, which causes inappropriate type I error rates and poor statistical power. Owing to the lack of analysis methods, GWAS of ordinal traits has been known to be problematic and gaining attention. In this paper, we develop a general framework for identifying ordinal traits associated with genetic variants in pedigree-structured samples by collapsing and kernel methods. We use the local odds ratios GEE technology to account for complicated correlation structures between family members and ordered categorical traits. We use the retrospective idea to treat the genetic markers as random variables for calculating genetic correlations among markers. The proposed genetic association method can accommodate ordinal traits and allow for the covariate adjustment. We conduct simulation studies to compare the proposed tests with the existing models for analyzing the ordered categorical data under various configurations. We illustrate application of the proposed tests by simultaneously analyzing a family study and a cross-sectional study from the Genetic Analysis Workshop 19 (GAW19) data.","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"677-690"},"PeriodicalIF":1.2,"publicationDate":"2023-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41177324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Frontmatter 头版头条

4区数学

International Journal of Biostatistics

Pub Date : 2023-05-01 DOI: 10.1515/ijb-2023-frontmatter1

引用次数: 0

Approximate reciprocal relationship between two cause-specific hazard ratios in COVID-19 data with mutually exclusive events 在具有互斥事件的COVID-19数据中，两个病因特异性风险比之间存在近似的倒数关系

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2021-04-27 DOI: 10.1101/2021.04.22.21255955

Wentian Li, S. Cetin, A. Ulgen, M. Cetin, Hakan Şıvgın, Yaning Yang

Abstract COVID-19 survival data presents a special situation where not only the time-to-event period is short, but also the two events or outcome types, death and release from hospital, are mutually exclusive, leading to two cause-specific hazard ratios (csHR d and csHR r ). The eventual mortality/release outcome is also analyzed by logistic regression to obtain odds-ratio (OR). We have the following three empirical observations: (1) The magnitude of OR is an upper limit of the csHR d : |log(OR)| ≥ |log(csHR d )|. This relationship between OR and HR might be understood from the definition of the two quantities; (2) csHR d and csHR r point in opposite directions: log(csHR d ) ⋅ log(csHR r ) < 0; This relation is a direct consequence of the nature of the two events; and (3) there is a tendency for a reciprocal relation between csHR d and csHR r : csHR d ∼ 1/csHR r . Though an approximate reciprocal trend between the two hazard ratios is in indication that the same factor causing faster death also lead to slow recovery by a similar mechanism, and vice versa, a quantitative relation between csHR d and csHR r in this context is not obvious. These results may help future analyses of data from COVID-19 or other similar diseases, in particular if the deceased patients are lacking, whereas surviving patients are abundant.

摘要新冠肺炎生存数据呈现出一种特殊情况，即不仅事件发生时间短，而且死亡和出院这两种事件或结果类型相互排斥，导致两种原因特异性风险比（csHR d和csHR r）。最终的死亡率/释放结果也通过逻辑回归进行分析，以获得比值比（OR）。我们有以下三个经验观察结果：（1）OR的大小是csHR d:|log（OR）|≥|log（csHR d）|的上限。OR和HR之间的这种关系可以从这两个量的定义中理解；（2） csHR d和csHR r指向相反的方向：log（csHR d）－log（csHRr）＜0；这种关系是这两个事件性质的直接结果；和（3）csHR d和csHR r之间存在一种相互关系的趋势：csHR d～1/csHR r。尽管两个危险比之间的近似倒数趋势表明，导致更快死亡的同一因素也会通过类似的机制导致缓慢恢复，反之亦然，但在这种情况下，csHR d和csHR r之间的定量关系并不明显。这些结果可能有助于未来分析新冠肺炎或其他类似疾病的数据，特别是如果死亡患者缺乏，而幸存患者充足。

{"title":"Approximate reciprocal relationship between two cause-specific hazard ratios in COVID-19 data with mutually exclusive events","authors":"Wentian Li, S. Cetin, A. Ulgen, M. Cetin, Hakan Şıvgın, Yaning Yang","doi":"10.1101/2021.04.22.21255955","DOIUrl":"https://doi.org/10.1101/2021.04.22.21255955","url":null,"abstract":"Abstract COVID-19 survival data presents a special situation where not only the time-to-event period is short, but also the two events or outcome types, death and release from hospital, are mutually exclusive, leading to two cause-specific hazard ratios (csHR d and csHR r ). The eventual mortality/release outcome is also analyzed by logistic regression to obtain odds-ratio (OR). We have the following three empirical observations: (1) The magnitude of OR is an upper limit of the csHR d : |log(OR)| ≥ |log(csHR d )|. This relationship between OR and HR might be understood from the definition of the two quantities; (2) csHR d and csHR r point in opposite directions: log(csHR d ) ⋅ log(csHR r ) < 0; This relation is a direct consequence of the nature of the two events; and (3) there is a tendency for a reciprocal relation between csHR d and csHR r : csHR d ∼ 1/csHR r . Though an approximate reciprocal trend between the two hazard ratios is in indication that the same factor causing faster death also lead to slow recovery by a similar mechanism, and vice versa, a quantitative relation between csHR d and csHR r in this context is not obvious. These results may help future analyses of data from COVID-19 or other similar diseases, in particular if the deceased patients are lacking, whereas surviving patients are abundant.","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":"0 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2021-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42193520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Asymptotic properties of the two one-sided t-tests – new insights and the Schuirmann-constant 两个单侧t检验的渐近性质——新见解和Schuirmann常数

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2021-01-08 DOI: 10.1515/IJB-2020-0057

Christian Palmes, Tobias Bluhmki, Benedikt Funke, E. Bluhmki

Abstract The two one-sided t-tests (TOST) method is the most popular statistical equivalence test with many areas of application, i.e., in the pharmaceutical industry. Proper sample size calculation is needed in order to show equivalence with a certain power. Here, the crucial problem of choosing a suitable mean-difference in TOST sample size calculations is addressed. As an alternative concept, it is assumed that the mean-difference follows an a-priori distribution. Special interest is given to the uniform and some centered triangle a-priori distributions. Using a newly developed asymptotical theory a helpful analogy principle is found: every a-priori distribution corresponds to a point mean-difference, which we call its Schuirmann-constant. This constant does not depend on the standard deviation and aims to support the investigator in finding a well-considered mean-difference for proper sample size calculations in complex data situations. In addition to the proposed concept, we demonstrate that well-known sample size approximation formulas in the literature are in fact biased and state their unbiased corrections as well. Moreover, an R package is provided for a right away application of our newly developed concepts.

摘要双单侧t检验(TOST)方法是最常用的统计等价检验方法，在许多领域都有应用，如制药行业。为了在一定的幂次下显示等值，需要适当的样本量计算。在这里，选择一个合适的平均差在TOST样本大小计算的关键问题是解决。作为一种替代概念，假设均值差遵循先验分布。对均匀分布和一些有中心的三角形先验分布特别感兴趣。利用一个新发展的渐近理论，我们发现了一个有用的类比原理:每个先验分布对应于一个点均值差，我们称之为它的舒尔曼常数。这个常数不依赖于标准偏差，旨在支持研究者在复杂的数据情况下找到一个经过深思熟虑的平均差异，以进行适当的样本量计算。除了提出的概念外，我们还证明了文献中众所周知的样本量近似公式实际上是有偏的，并说明了它们的无偏修正。此外，还提供了一个R包，可以立即应用我们新开发的概念。

{"title":"Asymptotic properties of the two one-sided t-tests – new insights and the Schuirmann-constant","authors":"Christian Palmes, Tobias Bluhmki, Benedikt Funke, E. Bluhmki","doi":"10.1515/IJB-2020-0057","DOIUrl":"https://doi.org/10.1515/IJB-2020-0057","url":null,"abstract":"Abstract The two one-sided t-tests (TOST) method is the most popular statistical equivalence test with many areas of application, i.e., in the pharmaceutical industry. Proper sample size calculation is needed in order to show equivalence with a certain power. Here, the crucial problem of choosing a suitable mean-difference in TOST sample size calculations is addressed. As an alternative concept, it is assumed that the mean-difference follows an a-priori distribution. Special interest is given to the uniform and some centered triangle a-priori distributions. Using a newly developed asymptotical theory a helpful analogy principle is found: every a-priori distribution corresponds to a point mean-difference, which we call its Schuirmann-constant. This constant does not depend on the standard deviation and aims to support the investigator in finding a well-considered mean-difference for proper sample size calculations in complex data situations. In addition to the proposed concept, we demonstrate that well-known sample size approximation formulas in the literature are in fact biased and state their unbiased corrections as well. Moreover, an R package is provided for a right away application of our newly developed concepts.","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":"18 1","pages":"19 - 38"},"PeriodicalIF":1.2,"publicationDate":"2021-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/IJB-2020-0057","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46667419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Estimation of semi-Markov multi-state models: a comparison of the sojourn times and transition intensities approaches 半马尔可夫多状态模型的估计:逗留时间和转移强度方法的比较

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2020-05-29 DOI: 10.1515/IJB-2020-0083

A. Asanjarani, B. Liquet, Y. Nazarathy

Abstract Semi-Markov models are widely used for survival analysis and reliability analysis. In general, there are two competing parameterizations and each entails its own interpretation and inference properties. On the one hand, a semi-Markov process can be defined based on the distribution of sojourn times, often via hazard rates, together with transition probabilities of an embedded Markov chain. On the other hand, intensity transition functions may be used, often referred to as the hazard rates of the semi-Markov process. We summarize and contrast these two parameterizations both from a probabilistic and an inference perspective, and we highlight relationships between the two approaches. In general, the intensity transition based approach allows the likelihood to be split into likelihoods of two-state models having fewer parameters, allowing efficient computation and usage of many survival analysis tools. Nevertheless, in certain cases the sojourn time based approach is natural and has been exploited extensively in applications. In contrasting the two approaches and contemporary relevant R packages used for inference, we use two real datasets highlighting the probabilistic and inference properties of each approach. This analysis is accompanied by an R vignette.

摘要半马尔可夫模型广泛用于生存分析和可靠性分析。一般来说，有两个相互竞争的参数化，每个参数化都有自己的解释和推理特性。一方面，半马尔可夫过程可以基于逗留时间的分布来定义，通常通过风险率，以及嵌入马尔可夫链的转移概率。另一方面，可以使用强度转移函数，通常称为半马尔可夫过程的风险率。我们从概率和推理的角度总结和比较了这两种参数化，并强调了两种方法之间的关系。通常，基于强度转换的方法允许将似然性划分为具有较少参数的两状态模型的似然性，从而允许高效计算和使用许多生存分析工具。然而，在某些情况下，基于逗留时间的方法是自然的，并且在应用中得到了广泛的利用。在对比这两种方法和用于推理的当代相关R包时，我们使用了两个真实数据集，突出了每种方法的概率和推理特性。此分析附有一个R小插曲。

{"title":"Estimation of semi-Markov multi-state models: a comparison of the sojourn times and transition intensities approaches","authors":"A. Asanjarani, B. Liquet, Y. Nazarathy","doi":"10.1515/IJB-2020-0083","DOIUrl":"https://doi.org/10.1515/IJB-2020-0083","url":null,"abstract":"Abstract Semi-Markov models are widely used for survival analysis and reliability analysis. In general, there are two competing parameterizations and each entails its own interpretation and inference properties. On the one hand, a semi-Markov process can be defined based on the distribution of sojourn times, often via hazard rates, together with transition probabilities of an embedded Markov chain. On the other hand, intensity transition functions may be used, often referred to as the hazard rates of the semi-Markov process. We summarize and contrast these two parameterizations both from a probabilistic and an inference perspective, and we highlight relationships between the two approaches. In general, the intensity transition based approach allows the likelihood to be split into likelihoods of two-state models having fewer parameters, allowing efficient computation and usage of many survival analysis tools. Nevertheless, in certain cases the sojourn time based approach is natural and has been exploited extensively in applications. In contrasting the two approaches and contemporary relevant R packages used for inference, we use two real datasets highlighting the probabilistic and inference properties of each approach. This analysis is accompanied by an R vignette.","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":"18 1","pages":"243 - 262"},"PeriodicalIF":1.2,"publicationDate":"2020-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/IJB-2020-0083","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43491644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 14

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

International Journal of Biostatistics

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀