International Journal of Biostatistics最新文献

英文中文

A hybrid hazard-based model using two-piece distributions. 使用两件分布的基于危险的混合模型。

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2025-04-30 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2023-0153

Worku Biyadgie Ewnetu, Irène Gijbels, Anneleen Verhasselt

Cox proportional hazards model is widely used to study the relationship between the survival time of an event and covariates. Its primary objective is parameter estimation assuming a constant relative hazard throughout the entire follow-up time. The baseline hazard is thus treated as a nuisance parameter. However, if the interest is to predict possible outcomes like specific quantiles of the distribution (e.g. median survival time), survival and hazard functions, it may be more convenient to use a parametric baseline distribution. Such a parametric model should however be flexible enough to allow for various shapes of e.g. the hazard function. In this paper we propose flexible hazard-based models for right censored data using a large class of two-piece asymmetric baseline distributions. The effect of covariates is characterized through time-scale changes on hazard progression and on the relative hazard ratio; and can take three possible functional forms: parametric, semi-parametric (partly linear) and non-parametric. In the first case, the usual full likelihood estimation method is applied. In the semi-parametric and non-parametric settings a general profile (local) likelihood estimation approach is proposed. An extensive simulation study investigates the finite-sample performances of the proposed method. Its use in data analysis is illustrated in real data examples.

Cox比例风险模型被广泛用于研究事件生存时间与协变量之间的关系。其主要目标是在整个随访时间内假设一个恒定的相对危险度的参数估计。因此，基线危险被视为有害参数。然而，如果兴趣是预测可能的结果，如分布的特定分位数（例如中位生存时间），生存和风险函数，则使用参数基线分布可能更方便。然而，这种参数化模型应该足够灵活，以允许各种形状，例如危险函数。在本文中，我们提出了灵活的基于风险的模型右截尾数据使用大类两件不对称基线分布。协变量的影响表现为时间尺度变化对危险进展和相对危险比的影响；它可以有三种可能的函数形式：参数、半参数（部分线性）和非参数。在第一种情况下，采用通常的全似然估计方法。在半参数和非参数条件下，提出了一种通用的轮廓（局部）似然估计方法。广泛的仿真研究探讨了该方法的有限样本性能。通过实际数据实例说明了它在数据分析中的应用。

{"title":"A hybrid hazard-based model using two-piece distributions.","authors":"Worku Biyadgie Ewnetu, Irène Gijbels, Anneleen Verhasselt","doi":"10.1515/ijb-2023-0153","DOIUrl":"10.1515/ijb-2023-0153","url":null,"abstract":"Cox proportional hazards model is widely used to study the relationship between the survival time of an event and covariates. Its primary objective is parameter estimation assuming a constant relative hazard throughout the entire follow-up time. The baseline hazard is thus treated as a nuisance parameter. However, if the interest is to predict possible outcomes like specific quantiles of the distribution (e.g. median survival time), survival and hazard functions, it may be more convenient to use a parametric baseline distribution. Such a parametric model should however be flexible enough to allow for various shapes of e.g. the hazard function. In this paper we propose flexible hazard-based models for right censored data using a large class of two-piece asymmetric baseline distributions. The effect of covariates is characterized through time-scale changes on hazard progression and on the relative hazard ratio; and can take three possible functional forms: parametric, semi-parametric (partly linear) and non-parametric. In the first case, the usual full likelihood estimation method is applied. In the semi-parametric and non-parametric settings a general profile (local) likelihood estimation approach is proposed. An extensive simulation study investigates the finite-sample performances of the proposed method. Its use in data analysis is illustrated in real data examples.","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"67-95"},"PeriodicalIF":1.2,"publicationDate":"2025-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144038766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Homogeneity test and sample size of response rates for AC ₁ in a stratified evaluation design. 分层评价设计中ac1反应率的同质性检验和样本量。

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2025-04-30 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2024-0080

Jingwei Jia, Yuanbo Liu, Jikai Yang, Zhiming Li

Gwet's first-order agreement coefficient (AC ₁) is widely used to evaluate the consistency between raters. Considering the existence of a certain relationship between the raters, the paper aims to test the equality of response rates and the dependency between two raters of modified AC ₁'s in a stratified design and estimates the sample size for a given significance level. We first establish a probability model and then estimate the unknown parameters. Further, we explore the homogeneity test of these AC ₁'s under the asymptotic method, such as likelihood ratio, score, and Wald-type statistics. In numerical simulation, the performance of statistics is investigated in terms of type I error rates (TIEs) and power while finding a suitable sample size under a given power. The results show that the Wald-type statistic has robust TIEs and satisfactory power and is suitable for large samples (n≥50). Under the same power, the sample size of the Wald-type test is smaller when the number of strata is large. The higher the power, the larger the required sample size. Finally, two real examples are given to illustrate these methods.

Gwet的一阶一致系数（ac1）被广泛用于评价评价者之间的一致性。考虑到评分者之间存在一定的关系，本文的目的是在分层设计中检验反应率的相等性和修正AC 1的两个评分者之间的依赖关系，并估计给定显著性水平下的样本量。首先建立概率模型，然后对未知参数进行估计。进一步，我们探讨了这些AC 1在渐近方法下的同质性检验，如似然比、分数和wald型统计量。在数值模拟中，统计性能是根据I型错误率（TIEs）和功率来研究的，同时在给定功率下找到合适的样本量。结果表明，wald型统计量具有鲁棒性和令人满意的功率，适用于大样本（n≥50）。在相同的功率下，当岩层数较大时，wald型试验的样本量较小。功率越高，所需的样本量越大。最后，给出了两个实例来说明这些方法。

{"title":"Homogeneity test and sample size of response rates for AC 1 in a stratified evaluation design.","authors":"Jingwei Jia, Yuanbo Liu, Jikai Yang, Zhiming Li","doi":"10.1515/ijb-2024-0080","DOIUrl":"10.1515/ijb-2024-0080","url":null,"abstract":"Gwet's first-order agreement coefficient (AC 1) is widely used to evaluate the consistency between raters. Considering the existence of a certain relationship between the raters, the paper aims to test the equality of response rates and the dependency between two raters of modified AC 1's in a stratified design and estimates the sample size for a given significance level. We first establish a probability model and then estimate the unknown parameters. Further, we explore the homogeneity test of these AC 1's under the asymptotic method, such as likelihood ratio, score, and Wald-type statistics. In numerical simulation, the performance of statistics is investigated in terms of type I error rates (TIEs) and power while finding a suitable sample size under a given power. The results show that the Wald-type statistic has robust TIEs and satisfactory power and is suitable for large samples (n≥50). Under the same power, the sample size of the Wald-type test is smaller when the number of strata is large. The higher the power, the larger the required sample size. Finally, two real examples are given to illustrate these methods.","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"17-35"},"PeriodicalIF":1.2,"publicationDate":"2025-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144025779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A review of survival stacking: a method to cast survival regression analysis as a classification problem. 生存叠加：一种将生存回归分析作为分类问题的方法。

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2025-03-28 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2022-0055

Erin Craig, Chenyang Zhong, Robert Tibshirani

While there are many well-developed data science methods for classification and regression, there are relatively few methods for working with right-censored data. Here, we review survival stacking, a method for casting a survival regression analysis problem as a classification problem, thereby allowing the use of general classification methods and software in a survival setting. Inspired by the Cox partial likelihood, survival stacking collects features and outcomes of survival data in a large data frame with a binary outcome. We show that survival stacking with logistic regression is approximately equivalent to the Cox proportional hazards model. We further illustrate survival stacking on real and simulated data. By reframing survival regression problems as classification problems, survival stacking removes the reliance on specialized tools for survival regression, and makes it straightforward for data scientists to use well-known learning algorithms and software for classification in the survival setting. This in turn lowers the barrier for flexible survival modeling.

虽然有很多成熟的分类和回归数据科学方法，但处理右删失数据的方法相对较少。在这里，我们回顾了生存堆叠法，这是一种将生存回归分析问题作为分类问题来处理的方法，从而允许在生存环境中使用一般的分类方法和软件。受 Cox 部分似然法的启发，生存堆叠法在一个具有二元结果的大型数据框架中收集生存数据的特征和结果。我们的研究表明，使用逻辑回归的生存堆积近似等同于 Cox 比例危险模型。我们还在真实数据和模拟数据上进一步说明了生存堆叠。通过将生存回归问题重构为分类问题，生存堆叠消除了对生存回归专用工具的依赖，使数据科学家可以直接使用众所周知的学习算法和软件在生存环境中进行分类。这反过来又降低了灵活建立生存模型的门槛。

{"title":"A review of survival stacking: a method to cast survival regression analysis as a classification problem.","authors":"Erin Craig, Chenyang Zhong, Robert Tibshirani","doi":"10.1515/ijb-2022-0055","DOIUrl":"10.1515/ijb-2022-0055","url":null,"abstract":"While there are many well-developed data science methods for classification and regression, there are relatively few methods for working with right-censored data. Here, we review survival stacking, a method for casting a survival regression analysis problem as a classification problem, thereby allowing the use of general classification methods and software in a survival setting. Inspired by the Cox partial likelihood, survival stacking collects features and outcomes of survival data in a large data frame with a binary outcome. We show that survival stacking with logistic regression is approximately equivalent to the Cox proportional hazards model. We further illustrate survival stacking on real and simulated data. By reframing survival regression problems as classification problems, survival stacking removes the reliance on specialized tools for survival regression, and makes it straightforward for data scientists to use well-known learning algorithms and software for classification in the survival setting. This in turn lowers the barrier for flexible survival modeling.","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"37-51"},"PeriodicalIF":1.2,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12247786/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143732819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A multivariate Bayesian learning approach for improved detection of doping in athletes using urinary steroid profiles. 一种多变量贝叶斯学习方法，用于使用尿类固醇谱改善运动员兴奋剂检测。

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2025-03-28 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2024-0019

Dimitra Eleftheriou, Thomas Piper, Mario Thevis, Tereza Neocleous

Biomarker analysis of athletes' urinary steroid profiles is crucial for the success of anti-doping efforts. Current statistical analysis methods generate personalised limits for each athlete based on univariate modelling of longitudinal biomarker values from the urinary steroid profile. However, simultaneous modelling of multiple biomarkers has the potential to further enhance abnormality detection. In this study, we propose a multivariate Bayesian adaptive model for longitudinal data analysis, which extends the established single-biomarker model in forensic toxicology. The proposed approach employs Markov chain Monte Carlo sampling methods and addresses the scarcity of confirmed abnormal values through a one-class classification algorithm. By adapting decision boundaries as new measurements are obtained, the model provides robust and personalised detection thresholds for each athlete. We tested the proposed approach on a database of 229 athletes, which includes longitudinal steroid profiles containing samples classified as normal, atypical, or confirmed abnormal. Our results demonstrate improved detection performance, highlighting the potential value of a multivariate approach in doping detection.

对运动员尿液类固醇谱进行生物标志物分析是反兴奋剂工作取得成功的关键。目前的统计分析方法是根据尿液类固醇图谱中纵向生物标志物值的单变量建模，为每个运动员生成个性化的限值。然而，对多种生物标志物同时建模有可能进一步提高异常检测水平。在本研究中，我们提出了一种用于纵向数据分析的多变量贝叶斯自适应模型，该模型扩展了法医毒理学中已有的单生物标记物模型。所提出的方法采用马尔可夫链蒙特卡洛抽样方法，并通过单类分类算法解决了证实异常值稀缺的问题。通过在获得新的测量结果时调整决策边界，该模型可为每位运动员提供稳健且个性化的检测阈值。我们在一个包含 229 名运动员的数据库中测试了所提出的方法，该数据库包含纵向类固醇档案，其中的样本被分类为正常、非典型或确认异常。我们的结果证明了检测性能的提高，突出了多元方法在兴奋剂检测中的潜在价值。

{"title":"A multivariate Bayesian learning approach for improved detection of doping in athletes using urinary steroid profiles.","authors":"Dimitra Eleftheriou, Thomas Piper, Mario Thevis, Tereza Neocleous","doi":"10.1515/ijb-2024-0019","DOIUrl":"10.1515/ijb-2024-0019","url":null,"abstract":"Biomarker analysis of athletes' urinary steroid profiles is crucial for the success of anti-doping efforts. Current statistical analysis methods generate personalised limits for each athlete based on univariate modelling of longitudinal biomarker values from the urinary steroid profile. However, simultaneous modelling of multiple biomarkers has the potential to further enhance abnormality detection. In this study, we propose a multivariate Bayesian adaptive model for longitudinal data analysis, which extends the established single-biomarker model in forensic toxicology. The proposed approach employs Markov chain Monte Carlo sampling methods and addresses the scarcity of confirmed abnormal values through a one-class classification algorithm. By adapting decision boundaries as new measurements are obtained, the model provides robust and personalised detection thresholds for each athlete. We tested the proposed approach on a database of 229 athletes, which includes longitudinal steroid profiles containing samples classified as normal, atypical, or confirmed abnormal. Our results demonstrate improved detection performance, highlighting the potential value of a multivariate approach in doping detection.","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"165-181"},"PeriodicalIF":1.2,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143732816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Regression analysis of clustered current status data with informative cluster size under a transformed survival model. 转换生存模型下具有信息聚类大小的聚类现状数据的回归分析。

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2025-03-24 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2023-0130

Yanqin Feng, Shijiao Yin, Jieli Ding

In this paper, we study inference methods for regression analysis of clustered current status data with informative cluster sizes. When the correlated failure times of interest arise from a general class of semiparametric transformation frailty models, we develop a nonparametric maximum likelihood estimation based method for regression analysis and conduct an expectation-maximization algorithm to implement it. The asymptotic properties including consistency and asymptotic normality of the proposed estimators are established. Extensive simulation studies are conducted and indicate that the proposed method works well. The developed approach is applied to analyze a real-life data set from a tumorigenicity study.

本文研究了基于信息聚类大小的聚类现状数据回归分析的推理方法。当相关失效时间来自于一类一般的半参数变换脆弱性模型时，我们开发了一种基于非参数极大似然估计的回归分析方法，并进行了期望最大化算法来实现它。建立了所提估计量的渐近性质，包括相合性和渐近正态性。大量的仿真研究表明，所提出的方法是有效的。所开发的方法被应用于分析来自致瘤性研究的真实数据集。

引用次数: 0

Prognostic adjustment with efficient estimators to unbiasedly leverage historical data in randomized trials. 随机试验中使用有效估计器进行预后调整，以无偏倚地利用历史数据。

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2025-03-11 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2024-0018

Lauren D Liao, Emilie Højbjerre-Frandsen, Alan E Hubbard, Alejandro Schuler

Although randomized controlled trials (RCTs) are a cornerstone of comparative effectiveness, they typically have much smaller sample size than observational studies due to financial and ethical considerations. Therefore there is interest in using plentiful historical data (either observational data or prior trials) to reduce trial sizes. Previous estimators developed for this purpose rely on unrealistic assumptions, without which the added data can bias the treatment effect estimate. Recent work proposed an alternative method (prognostic covariate adjustment) that imposes no additional assumptions and increases efficiency in trial analyses. The idea is to use historical data to learn a prognostic model: a regression of the outcome onto the covariates. The predictions from this model, generated from the RCT subjects' baseline variables, are then used as a covariate in a linear regression analysis of the trial data. In this work, we extend prognostic adjustment to trial analyses with nonparametric efficient estimators, which are more powerful than linear regression. We provide theory that explains why prognostic adjustment improves small-sample point estimation and inference without any possibility of bias. Simulations corroborate the theory: efficient estimators using prognostic adjustment compared to without provides greater power (i.e., smaller standard errors) when the trial is small. Population shifts between historical and trial data attenuate benefits but do not introduce bias. We showcase our estimator using clinical trial data provided by Novo Nordisk A/S that evaluates insulin therapy for individuals with type 2 diabetes.

虽然随机对照试验（rct）是比较有效性的基础，但由于经济和伦理方面的考虑，它们的样本量通常比观察性研究小得多。因此，有兴趣使用大量的历史数据（无论是观察数据还是先前的试验）来减少试验规模。以前为此目的开发的估计依赖于不切实际的假设，没有这些假设，添加的数据可能会使治疗效果估计产生偏差。最近的工作提出了一种替代方法（预后协变量调整），该方法不施加额外的假设并提高了试验分析的效率。这个想法是使用历史数据来学习预测模型：将结果回归到协变量上。该模型的预测由RCT受试者的基线变量生成，然后用作试验数据线性回归分析中的协变量。在这项工作中，我们将预后调整扩展到使用非参数有效估计器的试验分析，它比线性回归更强大。我们提供的理论解释了为什么预测调整改善了小样本点估计和推断，而没有任何偏差的可能性。模拟证实了这一理论：当试验规模较小时，使用预测调整的有效估计值比不使用预测调整的估计值提供更大的功率（即更小的标准误差）。历史数据和试验数据之间的人口转移会减弱获益，但不会引入偏倚。我们使用诺和诺德公司提供的临床试验数据来展示我们的估计器，该数据评估了2型糖尿病患者的胰岛素治疗。

{"title":"Prognostic adjustment with efficient estimators to unbiasedly leverage historical data in randomized trials.","authors":"Lauren D Liao, Emilie Højbjerre-Frandsen, Alan E Hubbard, Alejandro Schuler","doi":"10.1515/ijb-2024-0018","DOIUrl":"10.1515/ijb-2024-0018","url":null,"abstract":"Although randomized controlled trials (RCTs) are a cornerstone of comparative effectiveness, they typically have much smaller sample size than observational studies due to financial and ethical considerations. Therefore there is interest in using plentiful historical data (either observational data or prior trials) to reduce trial sizes. Previous estimators developed for this purpose rely on unrealistic assumptions, without which the added data can bias the treatment effect estimate. Recent work proposed an alternative method (prognostic covariate adjustment) that imposes no additional assumptions and increases efficiency in trial analyses. The idea is to use historical data to learn a prognostic model: a regression of the outcome onto the covariates. The predictions from this model, generated from the RCT subjects' baseline variables, are then used as a covariate in a linear regression analysis of the trial data. In this work, we extend prognostic adjustment to trial analyses with nonparametric efficient estimators, which are more powerful than linear regression. We provide theory that explains why prognostic adjustment improves small-sample point estimation and inference without any possibility of bias. Simulations corroborate the theory: efficient estimators using prognostic adjustment compared to without provides greater power (i.e., smaller standard errors) when the trial is small. Population shifts between historical and trial data attenuate benefits but do not introduce bias. We showcase our estimator using clinical trial data provided by Novo Nordisk A/S that evaluates insulin therapy for individuals with type 2 diabetes.","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"1-15"},"PeriodicalIF":1.2,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12247788/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143598241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Bayesian covariance regression in functional data analysis with applications to functional brain imaging. 贝叶斯协方差回归在功能数据分析中的应用，以及在脑功能成像中的应用。

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2025-02-05 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2023-0029

John Shamshoian, Nicholas Marco, Damla Şentürk, Shafali Jeste, Donatello Telesca

Function on scalar regression models relate functional outcomes to scalar predictors through the conditional mean function. With few and limited exceptions, many functional regression frameworks operate under the assumption that covariate information does not affect patterns of covariation. In this manuscript, we address this disparity by developing a Bayesian functional regression model, providing joint inference for both the conditional mean and covariance functions. Our work hinges on basis expansions of both the functional evaluation domain and covariate space, to define flexible non-parametric forms of dependence. To aid interpretation, we develop novel low-dimensional summaries, which indicate the degree of covariate-dependent heteroskedasticity. The proposed modeling framework is motivated and applied to a case study in functional brain imaging through electroencephalography, aiming to elucidate potential differentiation in the neural development of children with autism spectrum disorder.

标量函数回归模型通过条件均值函数将函数结果与标量预测因子联系起来。除了极少数例外情况，许多函数回归框架都是在协变量信息不会影响协变量模式的假设下运行的。在本手稿中，我们通过建立贝叶斯函数回归模型，为条件均值函数和协方差函数提供联合推断，从而解决了这一差异。我们的工作依赖于功能评估域和协方差空间的基础扩展，以定义灵活的非参数依赖形式。为了帮助解释，我们开发了新颖的低维摘要，用于显示协变量依赖异方差的程度。我们提出了建模框架的动机，并将其应用于通过脑电图进行的脑功能成像案例研究，旨在阐明自闭症谱系障碍儿童神经发育过程中的潜在分化。

引用次数: 0

DsubCox: a fast subsampling algorithm for Cox model with distributed and massive survival data. DsubCox：一种针对分布式海量生存数据的Cox模型的快速子采样算法。

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2025-02-04 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2024-0042

Haixiang Zhang, Yang Li, HaiYing Wang

To ensure privacy protection and alleviate computational burden, we propose a fast subsmaling procedure for the Cox model with massive survival datasets from multi-centered, decentralized sources. The proposed estimator is computed based on optimal subsampling probabilities that we derived and enables transmission of subsample-based summary level statistics between different storage sites with only one round of communication. For inference, the asymptotic properties of the proposed estimator were rigorously established. An extensive simulation study demonstrated that the proposed approach is effective. The methodology was applied to analyze a large dataset from the U.S. airlines.

为了保证隐私保护和减轻计算负担，我们提出了一种基于多中心、分散来源的大量生存数据集的Cox模型的快速子化过程。所提出的估计器是基于我们导出的最优子抽样概率计算的，并且只需要一轮通信就可以在不同的存储站点之间传输基于子抽样的汇总级统计信息。对于推理，严格地建立了所提估计量的渐近性质。大量的仿真研究表明，该方法是有效的。该方法被用于分析来自美国航空公司的大型数据集。

引用次数: 0

Hypothesis testing for detecting outlier evaluators. 检测离群评估员的假设检验。

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2024-11-04 eCollection Date: 2024-11-01 DOI: 10.1515/ijb-2023-0004

Li Xu, David M Zucker, Molin Wang

In epidemiological studies, the measurements of disease outcomes are carried out by different evaluators. In this paper, we propose a two-stage procedure for detecting outlier evaluators. In the first stage, a regression model is fitted to obtain the evaluators' effects. Outlier evaluators have different effects than normal evaluators. In the second stage, stepwise hypothesis tests are performed to detect outlier evaluators. The true positive rate and true negative rate of the proposed procedure are assessed in a simulation study. We apply the proposed method to detect potential outlier audiologists among the audiologists who measured hearing threshold levels of the participants in the Audiology Assessment Arm of the Conservation of Hearing Study, which is an epidemiological study for examining risk factors of hearing loss.

在流行病学研究中，对疾病结果的测量是由不同的评估者进行的。本文提出了一种分两个阶段检测离群评价者的方法。在第一阶段，拟合回归模型以获得评价者的效应。离群评价者与正常评价者的效果不同。在第二阶段，通过逐步假设检验来检测离群评价者。在模拟研究中评估了建议程序的真阳性率和真阴性率。听力保护研究是一项流行病学研究，旨在检查听力损失的风险因素。我们采用所提出的方法，从测量听力保护研究听力评估臂参与者听力阈值水平的听力学家中检测出潜在的离群听力学家。

引用次数: 0

Optimizing personalized treatments for targeted patient populations across multiple domains. 跨领域优化针对目标患者群体的个性化治疗。

IF 1.2 4区数学

International Journal of Biostatistics

Pub Date : 2024-09-26 eCollection Date: 2024-11-01 DOI: 10.1515/ijb-2024-0068

Yuan Chen, Donglin Zeng, Yuanjia Wang

Learning individualized treatment rules (ITRs) for a target patient population with mental disorders is confronted with many challenges. First, the target population may be different from the training population that provided data for learning ITRs. Ignoring differences between the training patient data and the target population can result in sub-optimal treatment strategies for the target population. Second, for mental disorders, a patient's underlying mental state is not observed but can be inferred from measures of high-dimensional combinations of symptomatology. Treatment mechanisms are unknown and can be complex, and thus treatment effect moderation can take complicated forms. To address these challenges, we propose a novel method that connects measurement models, efficient weighting schemes, and flexible neural network architecture through latent variables to tailor treatments for a target population. Patients' underlying mental states are represented by a compact set of latent state variables while preserving interpretability. Weighting schemes are designed based on lower-dimensional latent variables to efficiently balance population differences so that biases in learning the latent structure and treatment effects are mitigated. Extensive simulation studies demonstrated consistent superiority of the proposed method and the weighting approach. Applications to two real-world studies of patients with major depressive disorder have shown a broad utility of the proposed method in improving treatment outcomes in the target population.

针对目标精神障碍患者群体学习个性化治疗规则（ITR）面临着许多挑战。首先，目标人群可能不同于为学习 ITR 提供数据的训练人群。忽略训练患者数据与目标人群之间的差异，可能会导致针对目标人群的治疗策略达不到最佳效果。其次，对于精神障碍而言，患者的基本精神状态无法观察到，但可以通过症状的高维组合测量来推断。治疗机制是未知的，也可能是复杂的，因此治疗效果调节的形式也可能是复杂的。为了应对这些挑战，我们提出了一种新方法，通过潜变量将测量模型、高效加权方案和灵活的神经网络架构联系起来，为目标人群量身定制治疗方案。患者的基本心理状态由一组紧凑的潜在状态变量表示，同时保持可解释性。加权方案的设计基于低维潜在变量，以有效平衡人群差异，从而减轻学习潜在结构和治疗效果的偏差。广泛的模拟研究表明，所提出的方法和加权方法具有一致的优越性。在两项针对重度抑郁症患者的实际研究中的应用表明，所提出的方法在改善目标人群的治疗效果方面具有广泛的实用性。

{"title":"Optimizing personalized treatments for targeted patient populations across multiple domains.","authors":"Yuan Chen, Donglin Zeng, Yuanjia Wang","doi":"10.1515/ijb-2024-0068","DOIUrl":"10.1515/ijb-2024-0068","url":null,"abstract":"Learning individualized treatment rules (ITRs) for a target patient population with mental disorders is confronted with many challenges. First, the target population may be different from the training population that provided data for learning ITRs. Ignoring differences between the training patient data and the target population can result in sub-optimal treatment strategies for the target population. Second, for mental disorders, a patient's underlying mental state is not observed but can be inferred from measures of high-dimensional combinations of symptomatology. Treatment mechanisms are unknown and can be complex, and thus treatment effect moderation can take complicated forms. To address these challenges, we propose a novel method that connects measurement models, efficient weighting schemes, and flexible neural network architecture through latent variables to tailor treatments for a target population. Patients' underlying mental states are represented by a compact set of latent state variables while preserving interpretability. Weighting schemes are designed based on lower-dimensional latent variables to efficiently balance population differences so that biases in learning the latent structure and treatment effects are mitigated. Extensive simulation studies demonstrated consistent superiority of the proposed method and the weighting approach. Applications to two real-world studies of patients with major depressive disorder have shown a broad utility of the proposed method in improving treatment outcomes in the target population.","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"437-453"},"PeriodicalIF":1.2,"publicationDate":"2024-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11661560/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142331579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

International Journal of Biostatistics

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀