首页 > 最新文献

International Journal of Biostatistics最新文献

英文 中文
Risk estimation and boundary detection in Bayesian disease mapping. 贝叶斯疾病制图中的风险估计与边界检测。
IF 1.2 4区 数学 Pub Date : 2025-05-22 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2023-0138
Xueqing Yin, Craig Anderson, Duncan Lee, Gary Napier

Bayesian hierarchical models with a spatially smooth conditional autoregressive prior distribution are commonly used to estimate the spatio-temporal pattern in disease risk from areal unit data. However, most of the modeling approaches do not take possible boundaries of step changes in disease risk between geographically neighbouring areas into consideration, which may lead to oversmoothing of the risk surfaces, prevent the detection of high-risk areas and yield biased estimation of disease risk. In this paper, we propose a two-stage method to jointly estimate the disease risk in small areas over time and detect the locations of boundaries that separate pairs of neighbouring areas exhibiting vastly different risks. In the first stage, we use a graph-based optimisation algorithm to construct a set of candidate neighbourhood matrices that represent a range of possible boundary structures for the disease data. In the second stage, a Bayesian hierarchical spatio-temporal model that takes the boundaries into account is fitted to the data. The performance of the methodology is evidenced by simulation, before being applied to a study of respiratory disease risk in Greater Glasgow, Scotland.

具有空间平滑条件自回归先验分布的贝叶斯层次模型通常用于从面积单位数据估计疾病风险的时空格局。然而,大多数建模方法没有考虑地理相邻区域之间疾病风险阶跃变化的可能边界,这可能导致风险面过于平滑,阻碍高风险区域的检测,并产生疾病风险的偏倚估计。在本文中,我们提出了一种两阶段的方法来共同估计小区域随时间的疾病风险,并检测将具有巨大不同风险的相邻区域分开的边界位置。在第一阶段,我们使用基于图的优化算法来构建一组候选邻域矩阵,这些矩阵代表了疾病数据的一系列可能的边界结构。在第二阶段,将考虑边界的贝叶斯分层时空模型拟合到数据中。在应用于苏格兰大格拉斯哥呼吸系统疾病风险研究之前,通过模拟证明了该方法的性能。
{"title":"Risk estimation and boundary detection in Bayesian disease mapping.","authors":"Xueqing Yin, Craig Anderson, Duncan Lee, Gary Napier","doi":"10.1515/ijb-2023-0138","DOIUrl":"10.1515/ijb-2023-0138","url":null,"abstract":"<p><p>Bayesian hierarchical models with a spatially smooth conditional autoregressive prior distribution are commonly used to estimate the spatio-temporal pattern in disease risk from areal unit data. However, most of the modeling approaches do not take possible boundaries of step changes in disease risk between geographically neighbouring areas into consideration, which may lead to oversmoothing of the risk surfaces, prevent the detection of high-risk areas and yield biased estimation of disease risk. In this paper, we propose a two-stage method to jointly estimate the disease risk in small areas over time and detect the locations of boundaries that separate pairs of neighbouring areas exhibiting vastly different risks. In the first stage, we use a graph-based optimisation algorithm to construct a set of candidate neighbourhood matrices that represent a range of possible boundary structures for the disease data. In the second stage, a Bayesian hierarchical spatio-temporal model that takes the boundaries into account is fitted to the data. The performance of the methodology is evidenced by simulation, before being applied to a study of respiratory disease risk in Greater Glasgow, Scotland.</p>","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"129-150"},"PeriodicalIF":1.2,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144151799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An improved estimator of the logarithmic odds ratio for small sample sizes using a Bayesian approach. 使用贝叶斯方法的小样本量对数比值比的改进估计器。
IF 1.2 4区 数学 Pub Date : 2025-04-30 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2024-0105
Toru Ogura, Takemi Yanagimoto

The logarithmic odds ratio is a well-known method for comparing binary data between two independent groups. Although various existing methods proposed for estimating a logarithmic odds ratio, most methods estimate two proportions in each group independently and then estimate the logarithmic odds ratio using the two estimated proportions. When using a logarithmic odds ratio, researchers are more interested in the logarithmic odds ratio than proportions for each group. Parameter estimations, generally, incur random and systematic errors. These errors in initially estimated parameter may affect later estimated parameter. We propose a Bayesian estimator to directly estimate a logarithmic odds ratio without using proportions for each group. Many existing methods need to estimate two parameters (two proportions in each group) to estimate a logarithmic odds ratio; however, the proposed method only estimates one parameter (logarithmic odds ratio). Therefore, the proposed estimator can be closer to the population's logarithmic odds ratio than existing estimators. Additionally, the validity of the proposed estimator is verified using numerical calculations and applications.

对数优势比是比较两个独立组之间二进制数据的一种众所周知的方法。虽然已有各种方法提出了估计对数优势比,但大多数方法在每组中独立估计两个比例,然后使用这两个估计比例估计对数优势比。当使用对数优势比时,研究人员对对数优势比比对每组的比例更感兴趣。参数估计通常会产生随机和系统误差。初始估计参数的这些误差可能会影响以后的估计参数。我们提出了一个贝叶斯估计器来直接估计对数比值比,而不使用每组的比例。许多现有的方法需要估计两个参数(每组中两个比例)来估计对数优势比;然而,该方法只估计一个参数(对数比值比)。因此,所提出的估计量比现有的估计量更接近总体的对数比值比。此外,通过数值计算和应用验证了所提估计器的有效性。
{"title":"An improved estimator of the logarithmic odds ratio for small sample sizes using a Bayesian approach.","authors":"Toru Ogura, Takemi Yanagimoto","doi":"10.1515/ijb-2024-0105","DOIUrl":"10.1515/ijb-2024-0105","url":null,"abstract":"<p><p>The logarithmic odds ratio is a well-known method for comparing binary data between two independent groups. Although various existing methods proposed for estimating a logarithmic odds ratio, most methods estimate two proportions in each group independently and then estimate the logarithmic odds ratio using the two estimated proportions. When using a logarithmic odds ratio, researchers are more interested in the logarithmic odds ratio than proportions for each group. Parameter estimations, generally, incur random and systematic errors. These errors in initially estimated parameter may affect later estimated parameter. We propose a Bayesian estimator to directly estimate a logarithmic odds ratio without using proportions for each group. Many existing methods need to estimate two parameters (two proportions in each group) to estimate a logarithmic odds ratio; however, the proposed method only estimates one parameter (logarithmic odds ratio). Therefore, the proposed estimator can be closer to the population's logarithmic odds ratio than existing estimators. Additionally, the validity of the proposed estimator is verified using numerical calculations and applications.</p>","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"151-163"},"PeriodicalIF":1.2,"publicationDate":"2025-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144006634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A hybrid hazard-based model using two-piece distributions. 使用两件分布的基于危险的混合模型。
IF 1.2 4区 数学 Pub Date : 2025-04-30 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2023-0153
Worku Biyadgie Ewnetu, Irène Gijbels, Anneleen Verhasselt

Cox proportional hazards model is widely used to study the relationship between the survival time of an event and covariates. Its primary objective is parameter estimation assuming a constant relative hazard throughout the entire follow-up time. The baseline hazard is thus treated as a nuisance parameter. However, if the interest is to predict possible outcomes like specific quantiles of the distribution (e.g. median survival time), survival and hazard functions, it may be more convenient to use a parametric baseline distribution. Such a parametric model should however be flexible enough to allow for various shapes of e.g. the hazard function. In this paper we propose flexible hazard-based models for right censored data using a large class of two-piece asymmetric baseline distributions. The effect of covariates is characterized through time-scale changes on hazard progression and on the relative hazard ratio; and can take three possible functional forms: parametric, semi-parametric (partly linear) and non-parametric. In the first case, the usual full likelihood estimation method is applied. In the semi-parametric and non-parametric settings a general profile (local) likelihood estimation approach is proposed. An extensive simulation study investigates the finite-sample performances of the proposed method. Its use in data analysis is illustrated in real data examples.

Cox比例风险模型被广泛用于研究事件生存时间与协变量之间的关系。其主要目标是在整个随访时间内假设一个恒定的相对危险度的参数估计。因此,基线危险被视为有害参数。然而,如果兴趣是预测可能的结果,如分布的特定分位数(例如中位生存时间),生存和风险函数,则使用参数基线分布可能更方便。然而,这种参数化模型应该足够灵活,以允许各种形状,例如危险函数。在本文中,我们提出了灵活的基于风险的模型右截尾数据使用大类两件不对称基线分布。协变量的影响表现为时间尺度变化对危险进展和相对危险比的影响;它可以有三种可能的函数形式:参数、半参数(部分线性)和非参数。在第一种情况下,采用通常的全似然估计方法。在半参数和非参数条件下,提出了一种通用的轮廓(局部)似然估计方法。广泛的仿真研究探讨了该方法的有限样本性能。通过实际数据实例说明了它在数据分析中的应用。
{"title":"A hybrid hazard-based model using two-piece distributions.","authors":"Worku Biyadgie Ewnetu, Irène Gijbels, Anneleen Verhasselt","doi":"10.1515/ijb-2023-0153","DOIUrl":"10.1515/ijb-2023-0153","url":null,"abstract":"<p><p>Cox proportional hazards model is widely used to study the relationship between the survival time of an event and covariates. Its primary objective is parameter estimation assuming a constant relative hazard throughout the entire follow-up time. The baseline hazard is thus treated as a nuisance parameter. However, if the interest is to predict possible outcomes like specific quantiles of the distribution (e.g. median survival time), survival and hazard functions, it may be more convenient to use a parametric baseline distribution. Such a parametric model should however be flexible enough to allow for various shapes of e.g. the hazard function. In this paper we propose flexible hazard-based models for right censored data using a large class of two-piece asymmetric baseline distributions. The effect of covariates is characterized through time-scale changes on hazard progression and on the relative hazard ratio; and can take three possible functional forms: parametric, semi-parametric (partly linear) and non-parametric. In the first case, the usual full likelihood estimation method is applied. In the semi-parametric and non-parametric settings a general profile (local) likelihood estimation approach is proposed. An extensive simulation study investigates the finite-sample performances of the proposed method. Its use in data analysis is illustrated in real data examples.</p>","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"67-95"},"PeriodicalIF":1.2,"publicationDate":"2025-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144038766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Homogeneity test and sample size of response rates for AC 1 in a stratified evaluation design. 分层评价设计中ac1反应率的同质性检验和样本量。
IF 1.2 4区 数学 Pub Date : 2025-04-30 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2024-0080
Jingwei Jia, Yuanbo Liu, Jikai Yang, Zhiming Li

Gwet's first-order agreement coefficient (AC 1) is widely used to evaluate the consistency between raters. Considering the existence of a certain relationship between the raters, the paper aims to test the equality of response rates and the dependency between two raters of modified AC 1's in a stratified design and estimates the sample size for a given significance level. We first establish a probability model and then estimate the unknown parameters. Further, we explore the homogeneity test of these AC 1's under the asymptotic method, such as likelihood ratio, score, and Wald-type statistics. In numerical simulation, the performance of statistics is investigated in terms of type I error rates (TIEs) and power while finding a suitable sample size under a given power. The results show that the Wald-type statistic has robust TIEs and satisfactory power and is suitable for large samples (n≥50). Under the same power, the sample size of the Wald-type test is smaller when the number of strata is large. The higher the power, the larger the required sample size. Finally, two real examples are given to illustrate these methods.

Gwet的一阶一致系数(ac1)被广泛用于评价评价者之间的一致性。考虑到评分者之间存在一定的关系,本文的目的是在分层设计中检验反应率的相等性和修正AC 1的两个评分者之间的依赖关系,并估计给定显著性水平下的样本量。首先建立概率模型,然后对未知参数进行估计。进一步,我们探讨了这些AC 1在渐近方法下的同质性检验,如似然比、分数和wald型统计量。在数值模拟中,统计性能是根据I型错误率(TIEs)和功率来研究的,同时在给定功率下找到合适的样本量。结果表明,wald型统计量具有鲁棒性和令人满意的功率,适用于大样本(n≥50)。在相同的功率下,当岩层数较大时,wald型试验的样本量较小。功率越高,所需的样本量越大。最后,给出了两个实例来说明这些方法。
{"title":"Homogeneity test and sample size of response rates for <i>AC</i> <sub>1</sub> in a stratified evaluation design.","authors":"Jingwei Jia, Yuanbo Liu, Jikai Yang, Zhiming Li","doi":"10.1515/ijb-2024-0080","DOIUrl":"10.1515/ijb-2024-0080","url":null,"abstract":"<p><p>Gwet's first-order agreement coefficient (<i>AC</i> <sub>1</sub>) is widely used to evaluate the consistency between raters. Considering the existence of a certain relationship between the raters, the paper aims to test the equality of response rates and the dependency between two raters of modified <i>AC</i> <sub>1</sub>'s in a stratified design and estimates the sample size for a given significance level. We first establish a probability model and then estimate the unknown parameters. Further, we explore the homogeneity test of these <i>AC</i> <sub>1</sub>'s under the asymptotic method, such as likelihood ratio, score, and Wald-type statistics. In numerical simulation, the performance of statistics is investigated in terms of type I error rates (TIEs) and power while finding a suitable sample size under a given power. The results show that the Wald-type statistic has robust TIEs and satisfactory power and is suitable for large samples (n≥50). Under the same power, the sample size of the Wald-type test is smaller when the number of strata is large. The higher the power, the larger the required sample size. Finally, two real examples are given to illustrate these methods.</p>","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"17-35"},"PeriodicalIF":1.2,"publicationDate":"2025-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144025779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A review of survival stacking: a method to cast survival regression analysis as a classification problem. 生存叠加:一种将生存回归分析作为分类问题的方法。
IF 1.2 4区 数学 Pub Date : 2025-03-28 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2022-0055
Erin Craig, Chenyang Zhong, Robert Tibshirani

While there are many well-developed data science methods for classification and regression, there are relatively few methods for working with right-censored data. Here, we review survival stacking, a method for casting a survival regression analysis problem as a classification problem, thereby allowing the use of general classification methods and software in a survival setting. Inspired by the Cox partial likelihood, survival stacking collects features and outcomes of survival data in a large data frame with a binary outcome. We show that survival stacking with logistic regression is approximately equivalent to the Cox proportional hazards model. We further illustrate survival stacking on real and simulated data. By reframing survival regression problems as classification problems, survival stacking removes the reliance on specialized tools for survival regression, and makes it straightforward for data scientists to use well-known learning algorithms and software for classification in the survival setting. This in turn lowers the barrier for flexible survival modeling.

虽然有很多成熟的分类和回归数据科学方法,但处理右删失数据的方法相对较少。在这里,我们回顾了生存堆叠法,这是一种将生存回归分析问题作为分类问题来处理的方法,从而允许在生存环境中使用一般的分类方法和软件。受 Cox 部分似然法的启发,生存堆叠法在一个具有二元结果的大型数据框架中收集生存数据的特征和结果。我们的研究表明,使用逻辑回归的生存堆积近似等同于 Cox 比例危险模型。我们还在真实数据和模拟数据上进一步说明了生存堆叠。通过将生存回归问题重构为分类问题,生存堆叠消除了对生存回归专用工具的依赖,使数据科学家可以直接使用众所周知的学习算法和软件在生存环境中进行分类。这反过来又降低了灵活建立生存模型的门槛。
{"title":"A review of survival stacking: a method to cast survival regression analysis as a classification problem.","authors":"Erin Craig, Chenyang Zhong, Robert Tibshirani","doi":"10.1515/ijb-2022-0055","DOIUrl":"10.1515/ijb-2022-0055","url":null,"abstract":"<p><p>While there are many well-developed data science methods for classification and regression, there are relatively few methods for working with right-censored data. Here, we review survival stacking, a method for casting a survival regression analysis problem as a classification problem, thereby allowing the use of general classification methods and software in a survival setting. Inspired by the Cox partial likelihood, survival stacking collects features and outcomes of survival data in a large data frame with a binary outcome. We show that survival stacking with logistic regression is approximately equivalent to the Cox proportional hazards model. We further illustrate survival stacking on real and simulated data. By reframing survival regression problems as classification problems, survival stacking removes the reliance on specialized tools for survival regression, and makes it straightforward for data scientists to use well-known learning algorithms and software for classification in the survival setting. This in turn lowers the barrier for flexible survival modeling.</p>","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"37-51"},"PeriodicalIF":1.2,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12247786/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143732819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A multivariate Bayesian learning approach for improved detection of doping in athletes using urinary steroid profiles. 一种多变量贝叶斯学习方法,用于使用尿类固醇谱改善运动员兴奋剂检测。
IF 1.2 4区 数学 Pub Date : 2025-03-28 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2024-0019
Dimitra Eleftheriou, Thomas Piper, Mario Thevis, Tereza Neocleous

Biomarker analysis of athletes' urinary steroid profiles is crucial for the success of anti-doping efforts. Current statistical analysis methods generate personalised limits for each athlete based on univariate modelling of longitudinal biomarker values from the urinary steroid profile. However, simultaneous modelling of multiple biomarkers has the potential to further enhance abnormality detection. In this study, we propose a multivariate Bayesian adaptive model for longitudinal data analysis, which extends the established single-biomarker model in forensic toxicology. The proposed approach employs Markov chain Monte Carlo sampling methods and addresses the scarcity of confirmed abnormal values through a one-class classification algorithm. By adapting decision boundaries as new measurements are obtained, the model provides robust and personalised detection thresholds for each athlete. We tested the proposed approach on a database of 229 athletes, which includes longitudinal steroid profiles containing samples classified as normal, atypical, or confirmed abnormal. Our results demonstrate improved detection performance, highlighting the potential value of a multivariate approach in doping detection.

对运动员尿液类固醇谱进行生物标志物分析是反兴奋剂工作取得成功的关键。目前的统计分析方法是根据尿液类固醇图谱中纵向生物标志物值的单变量建模,为每个运动员生成个性化的限值。然而,对多种生物标志物同时建模有可能进一步提高异常检测水平。在本研究中,我们提出了一种用于纵向数据分析的多变量贝叶斯自适应模型,该模型扩展了法医毒理学中已有的单生物标记物模型。所提出的方法采用马尔可夫链蒙特卡洛抽样方法,并通过单类分类算法解决了证实异常值稀缺的问题。通过在获得新的测量结果时调整决策边界,该模型可为每位运动员提供稳健且个性化的检测阈值。我们在一个包含 229 名运动员的数据库中测试了所提出的方法,该数据库包含纵向类固醇档案,其中的样本被分类为正常、非典型或确认异常。我们的结果证明了检测性能的提高,突出了多元方法在兴奋剂检测中的潜在价值。
{"title":"A multivariate Bayesian learning approach for improved detection of doping in athletes using urinary steroid profiles.","authors":"Dimitra Eleftheriou, Thomas Piper, Mario Thevis, Tereza Neocleous","doi":"10.1515/ijb-2024-0019","DOIUrl":"10.1515/ijb-2024-0019","url":null,"abstract":"<p><p>Biomarker analysis of athletes' urinary steroid profiles is crucial for the success of anti-doping efforts. Current statistical analysis methods generate personalised limits for each athlete based on univariate modelling of longitudinal biomarker values from the urinary steroid profile. However, simultaneous modelling of multiple biomarkers has the potential to further enhance abnormality detection. In this study, we propose a multivariate Bayesian adaptive model for longitudinal data analysis, which extends the established single-biomarker model in forensic toxicology. The proposed approach employs Markov chain Monte Carlo sampling methods and addresses the scarcity of confirmed abnormal values through a one-class classification algorithm. By adapting decision boundaries as new measurements are obtained, the model provides robust and personalised detection thresholds for each athlete. We tested the proposed approach on a database of 229 athletes, which includes longitudinal steroid profiles containing samples classified as normal, atypical, or confirmed abnormal. Our results demonstrate improved detection performance, highlighting the potential value of a multivariate approach in doping detection.</p>","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"165-181"},"PeriodicalIF":1.2,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143732816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Regression analysis of clustered current status data with informative cluster size under a transformed survival model. 转换生存模型下具有信息聚类大小的聚类现状数据的回归分析。
IF 1.2 4区 数学 Pub Date : 2025-03-24 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2023-0130
Yanqin Feng, Shijiao Yin, Jieli Ding

In this paper, we study inference methods for regression analysis of clustered current status data with informative cluster sizes. When the correlated failure times of interest arise from a general class of semiparametric transformation frailty models, we develop a nonparametric maximum likelihood estimation based method for regression analysis and conduct an expectation-maximization algorithm to implement it. The asymptotic properties including consistency and asymptotic normality of the proposed estimators are established. Extensive simulation studies are conducted and indicate that the proposed method works well. The developed approach is applied to analyze a real-life data set from a tumorigenicity study.

本文研究了基于信息聚类大小的聚类现状数据回归分析的推理方法。当相关失效时间来自于一类一般的半参数变换脆弱性模型时,我们开发了一种基于非参数极大似然估计的回归分析方法,并进行了期望最大化算法来实现它。建立了所提估计量的渐近性质,包括相合性和渐近正态性。大量的仿真研究表明,所提出的方法是有效的。所开发的方法被应用于分析来自致瘤性研究的真实数据集。
{"title":"Regression analysis of clustered current status data with informative cluster size under a transformed survival model.","authors":"Yanqin Feng, Shijiao Yin, Jieli Ding","doi":"10.1515/ijb-2023-0130","DOIUrl":"10.1515/ijb-2023-0130","url":null,"abstract":"<p><p>In this paper, we study inference methods for regression analysis of clustered current status data with informative cluster sizes. When the correlated failure times of interest arise from a general class of semiparametric transformation frailty models, we develop a nonparametric maximum likelihood estimation based method for regression analysis and conduct an expectation-maximization algorithm to implement it. The asymptotic properties including consistency and asymptotic normality of the proposed estimators are established. Extensive simulation studies are conducted and indicate that the proposed method works well. The developed approach is applied to analyze a real-life data set from a tumorigenicity study.</p>","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"97-113"},"PeriodicalIF":1.2,"publicationDate":"2025-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143674819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prognostic adjustment with efficient estimators to unbiasedly leverage historical data in randomized trials. 随机试验中使用有效估计器进行预后调整,以无偏倚地利用历史数据。
IF 1.2 4区 数学 Pub Date : 2025-03-11 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2024-0018
Lauren D Liao, Emilie Højbjerre-Frandsen, Alan E Hubbard, Alejandro Schuler

Although randomized controlled trials (RCTs) are a cornerstone of comparative effectiveness, they typically have much smaller sample size than observational studies due to financial and ethical considerations. Therefore there is interest in using plentiful historical data (either observational data or prior trials) to reduce trial sizes. Previous estimators developed for this purpose rely on unrealistic assumptions, without which the added data can bias the treatment effect estimate. Recent work proposed an alternative method (prognostic covariate adjustment) that imposes no additional assumptions and increases efficiency in trial analyses. The idea is to use historical data to learn a prognostic model: a regression of the outcome onto the covariates. The predictions from this model, generated from the RCT subjects' baseline variables, are then used as a covariate in a linear regression analysis of the trial data. In this work, we extend prognostic adjustment to trial analyses with nonparametric efficient estimators, which are more powerful than linear regression. We provide theory that explains why prognostic adjustment improves small-sample point estimation and inference without any possibility of bias. Simulations corroborate the theory: efficient estimators using prognostic adjustment compared to without provides greater power (i.e., smaller standard errors) when the trial is small. Population shifts between historical and trial data attenuate benefits but do not introduce bias. We showcase our estimator using clinical trial data provided by Novo Nordisk A/S that evaluates insulin therapy for individuals with type 2 diabetes.

虽然随机对照试验(rct)是比较有效性的基础,但由于经济和伦理方面的考虑,它们的样本量通常比观察性研究小得多。因此,有兴趣使用大量的历史数据(无论是观察数据还是先前的试验)来减少试验规模。以前为此目的开发的估计依赖于不切实际的假设,没有这些假设,添加的数据可能会使治疗效果估计产生偏差。最近的工作提出了一种替代方法(预后协变量调整),该方法不施加额外的假设并提高了试验分析的效率。这个想法是使用历史数据来学习预测模型:将结果回归到协变量上。该模型的预测由RCT受试者的基线变量生成,然后用作试验数据线性回归分析中的协变量。在这项工作中,我们将预后调整扩展到使用非参数有效估计器的试验分析,它比线性回归更强大。我们提供的理论解释了为什么预测调整改善了小样本点估计和推断,而没有任何偏差的可能性。模拟证实了这一理论:当试验规模较小时,使用预测调整的有效估计值比不使用预测调整的估计值提供更大的功率(即更小的标准误差)。历史数据和试验数据之间的人口转移会减弱获益,但不会引入偏倚。我们使用诺和诺德公司提供的临床试验数据来展示我们的估计器,该数据评估了2型糖尿病患者的胰岛素治疗。
{"title":"Prognostic adjustment with efficient estimators to unbiasedly leverage historical data in randomized trials.","authors":"Lauren D Liao, Emilie Højbjerre-Frandsen, Alan E Hubbard, Alejandro Schuler","doi":"10.1515/ijb-2024-0018","DOIUrl":"10.1515/ijb-2024-0018","url":null,"abstract":"<p><p>Although randomized controlled trials (RCTs) are a cornerstone of comparative effectiveness, they typically have much smaller sample size than observational studies due to financial and ethical considerations. Therefore there is interest in using plentiful historical data (either observational data or prior trials) to reduce trial sizes. Previous estimators developed for this purpose rely on unrealistic assumptions, without which the added data can bias the treatment effect estimate. Recent work proposed an alternative method (prognostic covariate adjustment) that imposes no additional assumptions and increases efficiency in trial analyses. The idea is to use historical data to learn a prognostic model: a regression of the outcome onto the covariates. The predictions from this model, generated from the RCT subjects' baseline variables, are then used as a covariate in a linear regression analysis of the trial data. In this work, we extend prognostic adjustment to trial analyses with nonparametric efficient estimators, which are more powerful than linear regression. We provide theory that explains why prognostic adjustment improves small-sample point estimation and inference without any possibility of bias. Simulations corroborate the theory: efficient estimators using prognostic adjustment compared to without provides greater power (i.e., smaller standard errors) when the trial is small. Population shifts between historical and trial data attenuate benefits but do not introduce bias. We showcase our estimator using clinical trial data provided by Novo Nordisk A/S that evaluates insulin therapy for individuals with type 2 diabetes.</p>","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"1-15"},"PeriodicalIF":1.2,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12247788/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143598241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian covariance regression in functional data analysis with applications to functional brain imaging. 贝叶斯协方差回归在功能数据分析中的应用,以及在脑功能成像中的应用。
IF 1.2 4区 数学 Pub Date : 2025-02-05 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2023-0029
John Shamshoian, Nicholas Marco, Damla Şentürk, Shafali Jeste, Donatello Telesca

Function on scalar regression models relate functional outcomes to scalar predictors through the conditional mean function. With few and limited exceptions, many functional regression frameworks operate under the assumption that covariate information does not affect patterns of covariation. In this manuscript, we address this disparity by developing a Bayesian functional regression model, providing joint inference for both the conditional mean and covariance functions. Our work hinges on basis expansions of both the functional evaluation domain and covariate space, to define flexible non-parametric forms of dependence. To aid interpretation, we develop novel low-dimensional summaries, which indicate the degree of covariate-dependent heteroskedasticity. The proposed modeling framework is motivated and applied to a case study in functional brain imaging through electroencephalography, aiming to elucidate potential differentiation in the neural development of children with autism spectrum disorder.

标量函数回归模型通过条件均值函数将函数结果与标量预测因子联系起来。除了极少数例外情况,许多函数回归框架都是在协变量信息不会影响协变量模式的假设下运行的。在本手稿中,我们通过建立贝叶斯函数回归模型,为条件均值函数和协方差函数提供联合推断,从而解决了这一差异。我们的工作依赖于功能评估域和协方差空间的基础扩展,以定义灵活的非参数依赖形式。为了帮助解释,我们开发了新颖的低维摘要,用于显示协变量依赖异方差的程度。我们提出了建模框架的动机,并将其应用于通过脑电图进行的脑功能成像案例研究,旨在阐明自闭症谱系障碍儿童神经发育过程中的潜在分化。
{"title":"Bayesian covariance regression in functional data analysis with applications to functional brain imaging.","authors":"John Shamshoian, Nicholas Marco, Damla Şentürk, Shafali Jeste, Donatello Telesca","doi":"10.1515/ijb-2023-0029","DOIUrl":"10.1515/ijb-2023-0029","url":null,"abstract":"<p><p>Function on scalar regression models relate functional outcomes to scalar predictors through the conditional mean function. With few and limited exceptions, many functional regression frameworks operate under the assumption that covariate information does not affect patterns of covariation. In this manuscript, we address this disparity by developing a Bayesian functional regression model, providing joint inference for both the conditional mean and covariance functions. Our work hinges on basis expansions of both the functional evaluation domain and covariate space, to define flexible non-parametric forms of dependence. To aid interpretation, we develop novel low-dimensional summaries, which indicate the degree of covariate-dependent heteroskedasticity. The proposed modeling framework is motivated and applied to a case study in functional brain imaging through electroencephalography, aiming to elucidate potential differentiation in the neural development of children with autism spectrum disorder.</p>","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"115-128"},"PeriodicalIF":1.2,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12247787/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143191163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DsubCox: a fast subsampling algorithm for Cox model with distributed and massive survival data. DsubCox:一种针对分布式海量生存数据的Cox模型的快速子采样算法。
IF 1.2 4区 数学 Pub Date : 2025-02-04 eCollection Date: 2025-05-01 DOI: 10.1515/ijb-2024-0042
Haixiang Zhang, Yang Li, HaiYing Wang

To ensure privacy protection and alleviate computational burden, we propose a fast subsmaling procedure for the Cox model with massive survival datasets from multi-centered, decentralized sources. The proposed estimator is computed based on optimal subsampling probabilities that we derived and enables transmission of subsample-based summary level statistics between different storage sites with only one round of communication. For inference, the asymptotic properties of the proposed estimator were rigorously established. An extensive simulation study demonstrated that the proposed approach is effective. The methodology was applied to analyze a large dataset from the U.S. airlines.

为了保证隐私保护和减轻计算负担,我们提出了一种基于多中心、分散来源的大量生存数据集的Cox模型的快速子化过程。所提出的估计器是基于我们导出的最优子抽样概率计算的,并且只需要一轮通信就可以在不同的存储站点之间传输基于子抽样的汇总级统计信息。对于推理,严格地建立了所提估计量的渐近性质。大量的仿真研究表明,该方法是有效的。该方法被用于分析来自美国航空公司的大型数据集。
{"title":"<b>DsubCox</b>: a fast subsampling algorithm for Cox model with distributed and massive survival data.","authors":"Haixiang Zhang, Yang Li, HaiYing Wang","doi":"10.1515/ijb-2024-0042","DOIUrl":"10.1515/ijb-2024-0042","url":null,"abstract":"<p><p>To ensure privacy protection and alleviate computational burden, we propose a fast subsmaling procedure for the Cox model with massive survival datasets from multi-centered, decentralized sources. The proposed estimator is computed based on optimal subsampling probabilities that we derived and enables transmission of subsample-based summary level statistics between different storage sites with only one round of communication. For inference, the asymptotic properties of the proposed estimator were rigorously established. An extensive simulation study demonstrated that the proposed approach is effective. The methodology was applied to analyze a large dataset from the U.S. airlines.</p>","PeriodicalId":50333,"journal":{"name":"International Journal of Biostatistics","volume":" ","pages":"53-65"},"PeriodicalIF":1.2,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143081371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Biostatistics
全部 Chem. Ecol. ATMOSPHERE-BASEL Clean Technol. Environ. Policy [Sanfujinka chiryo] Obstetrical and gynecological therapy Aust. J. Earth Sci. Adv. Atmos. Sci. 2013 IEEE International Symposium on Hardware-Oriented Security and Trust (HOST) [1993] Proceedings Eighth Annual IEEE Symposium on Logic in Computer Science Environmental Claims Journal Adv. Meteorol. Environ. Chem. INT J MOD PHYS B ENTROPY-SWITZ Asia-Pac. J. Atmos. Sci. 2005 Asian Conference on Sensors and the International Conference on New Techniques in Pharmaceutical and Biomedical Research Energy Storage Int. J. Earth Sci. J. Earth Syst. Sci. 2013 International Conference on Optical MEMS and Nanophotonics (OMN) 2011 IEEE International Conference of Electron Devices and Solid-State Circuits Environ. Technol. Innovation J. Appl. Phys. Engineering Science and Technology, an International Journal Ann. Glaciol. EPL-EUROPHYS LETT J. Geog. Sci. Int. Geol. Rev. J. Hydrol. Environmental Sustainability 2012 International Electron Devices Meeting 2012 IEEE 8th International Conference on E-Science ECOLOGY Aquat. Geochem. Yan Ke Xue Bao (Hong Kong) 2013 IEEE International Solid-State Circuits Conference Digest of Technical Papers Int. J. Astrobiol. ASTROBIOLOGY ACTA CYTOL Exp. Anim. J. Earth Sci. 2013 Fourth International Conference on Computing, Communications and Networking Technologies (ICCCNT) Geosci. Front. J. Phys. Soc. Jpn. ACTA GEOL SIN-ENGL Archaeol. Anthropol. Sci. 2011 VII Southern Conference on Programmable Logic (SPL) ACTA CHIR ORTHOP TR 2009 International Conference on Energy and Environment Technology Environmental Toxicology & Water Quality Resour. Geol. NANOPHOTONICS-BERLIN Org. Geochem. Acta Geophys. Am. J. Sci. AAPG Bull. 2009 12th International Symposium on Design and Diagnostics of Electronic Circuits & Systems Communications Earth & Environment ARCT ANTARCT ALP RES Int. J. Biometeorol. Ecol. Eng. Am. J. Phys. Anthropol. ACTA GEOL POL Environ. Res. Lett. Conserv. Biol. ECOSYSTEMS IZV-PHYS SOLID EART+ J OPT SOC AM A ACTA PETROL SIN J. Atmos. Chem. CRIT REV ENV SCI TEC Contrib. Mineral. Petrol. Carbon Balance Manage. Geobiology APL Photonics Appl. Geochem. Am. Mineral. 2013 Abstracts IEEE International Conference on Plasma Science (ICOPS) Appl. Clay Sci. Annu. Rev. Earth Planet. Sci. Clim. Change Ann. Phys. Acta Geochimica Geochim. Cosmochim. Acta Nucl. Sci. Tech. Ecol. Monogr. Energy Ecol Environ Ecol. Processes Conserv. Genet. Resour. ECOL RESTOR 2012 International Conference on High Voltage Engineering and Application Chin. Phys. B Appl. Phys. Rev. ASTRON ASTROPHYS Clean-Soil Air Water Atmos. Chem. Phys. 2007 IEEE International Test Conference Global Biogeochem. Cycles 2009 International Workshop on Intelligent Systems and Applications ECOTOXICOLOGY Theor. Appl. Climatol.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1