首页 > 最新文献

Journal of Applied Statistics最新文献

英文 中文
A unit-level one-inflated beta model for small area prediction of seat-belt use rates. 用于小区域安全带使用率预测的单位级单膨胀beta模型。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-11-28 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2024.2426016
Zirou Zhou, Emily Berg

We develop a unit-level one-inflated beta model for the purpose of small area estimation. Our specific interest is in estimation of seat-belt use rates for Iowa counties using data from the Iowa Seat-Belt Use Survey. As a result of small county sample sizes, small area estimation methods are needed. We propose frequentist and Bayesian implementations of a unit-level one-inflated beta model. We compare the Bayesian and frequentist predictors to simpler alternatives through simulation. We apply the proposed Bayesian and frequentist procedures to data from the Iowa Seat-Belt Use Survey.

为了小面积估计的目的,我们开发了一个单位级的单膨胀beta模型。我们的具体兴趣是利用爱荷华州安全带使用调查的数据估计爱荷华州各县的安全带使用率。由于县域样本量小,需要采用小面积估算方法。我们提出了一个单位级单膨胀beta模型的频率主义者和贝叶斯实现。我们通过模拟将贝叶斯和频率预测器与更简单的替代方法进行比较。我们将提出的贝叶斯和频率程序应用于爱荷华州安全带使用调查的数据。
{"title":"A unit-level one-inflated beta model for small area prediction of seat-belt use rates.","authors":"Zirou Zhou, Emily Berg","doi":"10.1080/02664763.2024.2426016","DOIUrl":"10.1080/02664763.2024.2426016","url":null,"abstract":"<p><p>We develop a unit-level one-inflated beta model for the purpose of small area estimation. Our specific interest is in estimation of seat-belt use rates for Iowa counties using data from the Iowa Seat-Belt Use Survey. As a result of small county sample sizes, small area estimation methods are needed. We propose frequentist and Bayesian implementations of a unit-level one-inflated beta model. We compare the Bayesian and frequentist predictors to simpler alternatives through simulation. We apply the proposed Bayesian and frequentist procedures to data from the Iowa Seat-Belt Use Survey.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 7","pages":"1381-1404"},"PeriodicalIF":1.1,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12117865/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144180828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A bivariate load-sharing model. 二元负荷分担模型。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-11-28 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2024.2428267
Debasis Kundu

The motivation of this work came from a data set obtained from an experiment performed on diabetic patients, with diabetic retinopathy disorder. The aim of this experiment is to test whether there is any significant difference between two different treatments which are being used for this disease. The two eyes can be considered as a two-component load-sharing system. In a two-component load-sharing system after the failure of one component, the surviving component has to shoulder extra load. Hence, it is prone to failure at an earlier time than what is expected under the original model. It may also happen sometimes that the failure of one component may release extra resources to the survivor, thus delaying the failure. In most of the existing literature, it has been assumed that at the beginning the lifetime distributions of the two components are independently distributed, which may not be very reasonable in this case. In this paper, we have introduced a new bivariate load-sharing model where the independence assumptions of the lifetime distributions of the two components at the beginning have been relaxed. In this present model, they may be dependent. Further, there is a positive probability that the two components may fail simultaneously. If the two components do not fail simultaneously, it is assumed that the lifetime of the surviving component changes based on the tampered failure rate assumption. The proposed bivariate distribution has a singular component. The likelihood inference of the unknown parameters has been provided. Simulation results and the analysis of the data set have been presented to show the effectiveness of the proposed model.

这项工作的动机来自于一项对糖尿病视网膜病变患者进行的实验数据集。这个实验的目的是测试治疗这种疾病的两种不同治疗方法之间是否存在显著差异。这两只眼睛可以看作是一个双组分负载分担系统。在双组件负荷分担系统中,当一个组件失效后,幸存的组件必须承担额外的负荷。因此,它容易在比原始模型预期更早的时间失败。有时候,一个组件的故障可能会向存活组件释放额外的资源,从而延迟故障的发生。在现有的大多数文献中,都假设在开始时这两个组成部分的寿命分布是独立分布的,在这种情况下,这可能不是很合理。在本文中,我们引入了一个新的二元负荷分担模型,其中两个组件的寿命分布在开始时的独立性假设被放宽。在目前的模型中,它们可能是相互依赖的。此外,两个组件同时失效的概率为正。如果两个组件没有同时发生故障,则假设幸存组件的生命周期根据篡改故障率假设而变化。所提出的二元分布具有奇异分量。给出了未知参数的似然推断。仿真结果和数据集分析表明了该模型的有效性。
{"title":"A bivariate load-sharing model.","authors":"Debasis Kundu","doi":"10.1080/02664763.2024.2428267","DOIUrl":"10.1080/02664763.2024.2428267","url":null,"abstract":"<p><p>The motivation of this work came from a data set obtained from an experiment performed on diabetic patients, with diabetic retinopathy disorder. The aim of this experiment is to test whether there is any significant difference between two different treatments which are being used for this disease. The two eyes can be considered as a two-component load-sharing system. In a two-component load-sharing system after the failure of one component, the surviving component has to shoulder extra load. Hence, it is prone to failure at an earlier time than what is expected under the original model. It may also happen sometimes that the failure of one component may release extra resources to the survivor, thus delaying the failure. In most of the existing literature, it has been assumed that at the beginning the lifetime distributions of the two components are independently distributed, which may not be very reasonable in this case. In this paper, we have introduced a new bivariate load-sharing model where the independence assumptions of the lifetime distributions of the two components at the beginning have been relaxed. In this present model, they may be dependent. Further, there is a positive probability that the two components may fail simultaneously. If the two components do not fail simultaneously, it is assumed that the lifetime of the surviving component changes based on the tampered failure rate assumption. The proposed bivariate distribution has a singular component. The likelihood inference of the unknown parameters has been provided. Simulation results and the analysis of the data set have been presented to show the effectiveness of the proposed model.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 7","pages":"1446-1469"},"PeriodicalIF":1.1,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12123949/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144199239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tests of covariate effects under finite Gaussian mixture regression models. 有限高斯混合回归模型下协变量效应的检验。
IF 1.2 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-11-27 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2024.2433567
Chong Gan, Jiahua Chen, Zeny Feng

Mixture of regression model is widely used to cluster subjects from a suspected heterogeneous population due to differential relationships between response and covariates over unobserved subpopulations. In such applications, statistical evidence pertaining to the significance of a hypothesis is important yet missing to substantiate the findings. In this case, one may wish to test hypotheses regarding the effect of a covariate such as its overall significance. If confirmed, a further test of whether its effects are different in different subpopulations might be performed. This paper is motivated by the analysis of Chiroptera dataset, in which, we are interested in knowing how forearm length development of bat species is influenced by precipitation within their habitats and living regions using finite Gaussian mixture regression (GMR) model. Since precipitation may have different effects on the evolutionary development of the forearm across the underlying subpopulations among bat species worldwide, we propose several testing procedures for hypotheses regarding the effect of precipitation on forearm length under finite GMR models. In addition to the real analysis of Chiroptera data, through simulation studies, we examine the performances of these testing procedures on their type I error rate, power, and consequently, the accuracy of clustering analysis.

由于未观察到的亚群体的反应和协变量之间的差异关系,混合回归模型被广泛用于从疑似异质群体中聚类受试者。在这种应用中,与假设的重要性有关的统计证据很重要,但缺少证实发现的证据。在这种情况下,人们可能希望检验关于协变量影响的假设,例如它的总体显著性。如果得到证实,可能会进行进一步的测试,以确定其对不同亚群的影响是否不同。基于对翼目目数据的分析,利用有限高斯混合回归(GMR)模型,研究了生境和生活区域降水对蝙蝠前臂长度发育的影响。由于降水可能对世界各地蝙蝠物种中潜在亚群的前臂进化发育有不同的影响,我们提出了几种测试程序,以验证有限GMR模型下降水对前臂长度影响的假设。除了实际分析翼目数据外,通过模拟研究,我们检验了这些测试程序在其I型错误率,功率以及聚类分析准确性方面的性能。
{"title":"Tests of covariate effects under finite Gaussian mixture regression models.","authors":"Chong Gan, Jiahua Chen, Zeny Feng","doi":"10.1080/02664763.2024.2433567","DOIUrl":"10.1080/02664763.2024.2433567","url":null,"abstract":"<p><p>Mixture of regression model is widely used to cluster subjects from a suspected heterogeneous population due to differential relationships between response and covariates over unobserved subpopulations. In such applications, statistical evidence pertaining to the significance of a hypothesis is important yet missing to substantiate the findings. In this case, one may wish to test hypotheses regarding the effect of a covariate such as its overall significance. If confirmed, a further test of whether its effects are different in different subpopulations might be performed. This paper is motivated by the analysis of Chiroptera dataset, in which, we are interested in knowing how forearm length development of bat species is influenced by precipitation within their habitats and living regions using finite Gaussian mixture regression (GMR) model. Since precipitation may have different effects on the evolutionary development of the forearm across the underlying subpopulations among bat species worldwide, we propose several testing procedures for hypotheses regarding the effect of precipitation on forearm length under finite GMR models. In addition to the real analysis of Chiroptera data, through simulation studies, we examine the performances of these testing procedures on their type I error rate, power, and consequently, the accuracy of clustering analysis.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 8","pages":"1571-1593"},"PeriodicalIF":1.2,"publicationDate":"2024-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12147513/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144266342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mixture mean residual life model for competing risks data with mismeasured covariates. 带有错测协变量的竞争风险数据的混合平均剩余寿命模型。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-11-22 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2024.2426015
Chyong-Mei Chen, Chih-Ching Lin, Chih-Cheng Wu, Jia-Ren Tsai

This paper proposes a mixture regression model for competing risks data, where the logistic regression model is specified for the marginal probabilities of the failure types and the mean residual lifetime (MRL) model is assumed for the failure time given the failure of interest. The estimating equations (EEs) are derived to infer the logistic regression and MRL model separately. We further consider the situation where the covariates are subject to measurement error. The presence of measurement error imposes extra challenges for the analysis of complex time-to-event data. By using the above EEs as the correction-amenable original estimating functions, we propose a corrected score estimation, which does not require specifying the distributions for unobserved error-prone covariates. The proposed estimators are shown to be consistent and asymptotically normally distributed. The performance of the method is investigated by intensive simulation studies and two real examples are presented to illustrate the proposed methods.

本文提出了一个竞争风险数据的混合回归模型,其中逻辑回归模型用于失效类型的边际概率,平均剩余寿命(MRL)模型用于失效时间。分别推导了逻辑回归和MRL模型的估计方程。我们进一步考虑协变量受测量误差影响的情况。测量误差的存在给复杂的时间到事件数据的分析带来了额外的挑战。通过使用上述EEs作为可校正的原始估计函数,我们提出了一种校正分数估计,它不需要指定未观察到的易出错协变量的分布。所提出的估计量是一致且渐近正态分布的。通过深入的仿真研究研究了该方法的性能,并给出了两个实例来说明所提出的方法。
{"title":"Mixture mean residual life model for competing risks data with mismeasured covariates.","authors":"Chyong-Mei Chen, Chih-Ching Lin, Chih-Cheng Wu, Jia-Ren Tsai","doi":"10.1080/02664763.2024.2426015","DOIUrl":"10.1080/02664763.2024.2426015","url":null,"abstract":"<p><p>This paper proposes a mixture regression model for competing risks data, where the logistic regression model is specified for the marginal probabilities of the failure types and the mean residual lifetime (MRL) model is assumed for the failure time given the failure of interest. The estimating equations (EEs) are derived to infer the logistic regression and MRL model separately. We further consider the situation where the covariates are subject to measurement error. The presence of measurement error imposes extra challenges for the analysis of complex time-to-event data. By using the above EEs as the correction-amenable original estimating functions, we propose a corrected score estimation, which does not require specifying the distributions for unobserved error-prone covariates. The proposed estimators are shown to be consistent and asymptotically normally distributed. The performance of the method is investigated by intensive simulation studies and two real examples are presented to illustrate the proposed methods.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 7","pages":"1361-1380"},"PeriodicalIF":1.1,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12117869/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144180957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Change-point detection of the Kumaraswamy skew-t distribution based on modified information criterion. 基于改进信息准则的库马拉斯瓦米偏t分布变点检测。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-11-22 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2024.2431743
Jun Wang, Wei Ning

In this paper, we study the change-point problem of the Kumaraswamy skew-t distribution. An approach based on the modified information criterion is proposed to detect the changes of the parameters of this distribution. Simulations have been conducted to investigate the performance of the proposed method. The proposed method is applied to real data to illustrate the detection procedure.

本文研究了Kumaraswamy偏t分布的变点问题。提出了一种基于改进信息准则的方法来检测该分布参数的变化。通过仿真研究了该方法的性能。通过对实际数据的分析,说明了该方法的检测过程。
{"title":"Change-point detection of the Kumaraswamy skew-t distribution based on modified information criterion.","authors":"Jun Wang, Wei Ning","doi":"10.1080/02664763.2024.2431743","DOIUrl":"10.1080/02664763.2024.2431743","url":null,"abstract":"<p><p>In this paper, we study the change-point problem of the Kumaraswamy skew-t distribution. An approach based on the modified information criterion is proposed to detect the changes of the parameters of this distribution. Simulations have been conducted to investigate the performance of the proposed method. The proposed method is applied to real data to illustrate the detection procedure.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 8","pages":"1561-1570"},"PeriodicalIF":1.1,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12147483/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144266338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semiparametric regression analysis of panel binary data with a dependent failure time. 具有相关失效时间的面板二值数据的半参数回归分析。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-11-19 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2024.2428266
Lei Ge, Yang Li, Jianguo Sun

In health and clinical research, panel binary data from recurrent events arise when subjects are surveyed to report occurrence statuses of recurrent events over fixed observation windows. In practice, such data can be cut short by a dependent failure event such as death. For the analysis of panel binary data, tools from generalized linear models overlook the recurrence nature of panel binary data, and other relevant literature does not accommodate the failure time. Motivated by the hospitalization data surveyed from the Health and Retirement Study, we propose a semiparametric joint-modeling-based procedure for analyzing panel binary data with a dependent failure time. For model fitting, we develop a computationally efficient EM algorithm and show the resulting estimates are consistent and asymptotically normal. Theoretical results are provided to enable valid inferences. Simulation studies have confirmed the performance of the proposed method in practical settings. The method is applied to assess important risk factors associated with incidences of hospitalization among the working elderly.

在健康和临床研究中,当调查对象在固定的观察窗口内报告复发事件的发生状态时,来自复发事件的面板二元数据就产生了。在实践中,这样的数据可能会被一个相关的失败事件(如死亡)打断。对于面板二值数据的分析,来自广义线性模型的工具忽略了面板二值数据的递归性,其他相关文献也没有考虑失效时间。基于健康与退休研究的住院数据,我们提出了一种基于半参数联合建模的方法来分析具有依赖失效时间的面板二进制数据。对于模型拟合,我们开发了一种计算效率高的EM算法,并证明了结果估计是一致的和渐近正态的。理论结果提供了有效的推论。仿真研究证实了该方法在实际环境中的性能。该方法用于评估与在职老年人住院发生率相关的重要危险因素。
{"title":"Semiparametric regression analysis of panel binary data with a dependent failure time.","authors":"Lei Ge, Yang Li, Jianguo Sun","doi":"10.1080/02664763.2024.2428266","DOIUrl":"10.1080/02664763.2024.2428266","url":null,"abstract":"<p><p>In health and clinical research, panel binary data from recurrent events arise when subjects are surveyed to report occurrence statuses of recurrent events over fixed observation windows. In practice, such data can be cut short by a dependent failure event such as death. For the analysis of panel binary data, tools from generalized linear models overlook the recurrence nature of panel binary data, and other relevant literature does not accommodate the failure time. Motivated by the hospitalization data surveyed from the Health and Retirement Study, we propose a semiparametric joint-modeling-based procedure for analyzing panel binary data with a dependent failure time. For model fitting, we develop a computationally efficient EM algorithm and show the resulting estimates are consistent and asymptotically normal. Theoretical results are provided to enable valid inferences. Simulation studies have confirmed the performance of the proposed method in practical settings. The method is applied to assess important risk factors associated with incidences of hospitalization among the working elderly.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 7","pages":"1423-1445"},"PeriodicalIF":1.1,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12117875/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144179896","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A bootstrap procedure to estimate the causal effect of a public policy, considering overlap and imperfect compliance. 一种评估公共政策因果效应的自举程序,考虑到重叠和不完全遵从。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-11-17 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2024.2428994
Stefano Cabras

This paper introduces a nonparametric bootstrap method for estimating the causal effects of public policy under the circumstances of imperfect compliance and overlap. It focuses on business investment subsidies in Sardinia by comparing firms eligible for the 1999 subsidies to those not, amid issues of imperfect compliance and overlapping programs. Bootstrap confidence intervals (CI) are proposed for the average effect of treatment on the sub-population of compliers. The obtained CIs are consistent across nominal levels and robust against data nonnormality; they show coverages of credible intervals close to nominal, suggesting effectiveness for assessing causal effects. Compared to other methods, the results of the new combination of a specific estimator for incompliance and the bootstrap align with those of more modern approaches such as Bayesian Additive Regression Trees and Causal forest.

本文介绍了在不完全服从和重叠情况下估计公共政策因果效应的非参数自举方法。该报告主要关注撒丁岛的商业投资补贴,通过比较1999年有资格获得补贴的公司和没有资格获得补贴的公司,解决了合规不完善和项目重叠的问题。对于治疗对编译者亚群的平均影响,提出了自举置信区间(CI)。所获得的ci在名义水平上是一致的,并且对数据异常具有鲁棒性;它们显示了接近名义可信区间的覆盖率,表明了评估因果效应的有效性。与其他方法相比,将特定估计器与bootstrap相结合的结果与更现代的方法(如贝叶斯加性回归树和因果森林)的结果一致。
{"title":"A bootstrap procedure to estimate the causal effect of a public policy, considering overlap and imperfect compliance.","authors":"Stefano Cabras","doi":"10.1080/02664763.2024.2428994","DOIUrl":"10.1080/02664763.2024.2428994","url":null,"abstract":"<p><p>This paper introduces a nonparametric bootstrap method for estimating the causal effects of public policy under the circumstances of imperfect compliance and overlap. It focuses on business investment subsidies in Sardinia by comparing firms eligible for the 1999 subsidies to those not, amid issues of imperfect compliance and overlapping programs. Bootstrap confidence intervals (CI) are proposed for the average effect of treatment on the sub-population of compliers. The obtained CIs are consistent across nominal levels and robust against data nonnormality; they show coverages of credible intervals close to nominal, suggesting effectiveness for assessing causal effects. Compared to other methods, the results of the new combination of a specific estimator for incompliance and the bootstrap align with those of more modern approaches such as Bayesian Additive Regression Trees and Causal forest.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 7","pages":"1470-1484"},"PeriodicalIF":1.1,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12117864/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144179895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A defective cure rate quantile regression model for male breast cancer data. 男性乳腺癌数据的缺陷治愈率分位数回归模型。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-11-14 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2024.2428272
Agatha Rodrigues, Patrick Borges, Bruno Santos

In this article, we particularly address the problem of assessing the impact of different prognostic factors, such as clinical stage and age, on the specific survival times of men with breast cancer when cure is a possibility. To this end, we developed a quantile regression model for survival data in the presence of long-term survivors based on the generalized Gompertz distribution in a defective version, which is conveniently reparametrized in terms of the q-th quantile and then linked to covariates via a logarithm link function. This proposal allows us to obtain how each variable affects the survival times in different quantiles. In addition, we are able to study the effects of covariates on the cure rate as well. We consider Markov Chain Monte Carlo methods to develop a Bayesian analysis in the proposed model and we evaluate its performance through Monte Carlo simulation studies. Finally, we illustrate the application of our model in a data set about male breast cancer from Brazil analyzed for the very first time.

在这篇文章中,我们特别讨论了评估不同预后因素(如临床分期和年龄)对男性乳腺癌患者在有可能治愈时的具体生存时间的影响。为此,我们基于有缺陷版本的广义Gompertz分布,为存在长期幸存者的生存数据开发了一个分位数回归模型,该模型可以方便地根据第q个分位数重新参数化,然后通过对数链接函数链接到协变量。这个建议使我们能够获得每个变量如何影响不同分位数的生存时间。此外,我们还可以研究协变量对治愈率的影响。我们考虑马尔可夫链蒙特卡罗方法在所提出的模型中进行贝叶斯分析,并通过蒙特卡罗仿真研究来评估其性能。最后,我们说明了我们的模型在巴西首次分析的男性乳腺癌数据集中的应用。
{"title":"A defective cure rate quantile regression model for male breast cancer data.","authors":"Agatha Rodrigues, Patrick Borges, Bruno Santos","doi":"10.1080/02664763.2024.2428272","DOIUrl":"10.1080/02664763.2024.2428272","url":null,"abstract":"<p><p>In this article, we particularly address the problem of assessing the impact of different prognostic factors, such as clinical stage and age, on the specific survival times of men with breast cancer when cure is a possibility. To this end, we developed a quantile regression model for survival data in the presence of long-term survivors based on the generalized Gompertz distribution in a defective version, which is conveniently reparametrized in terms of the <i>q</i>-th quantile and then linked to covariates via a logarithm link function. This proposal allows us to obtain how each variable affects the survival times in different quantiles. In addition, we are able to study the effects of covariates on the cure rate as well. We consider Markov Chain Monte Carlo methods to develop a Bayesian analysis in the proposed model and we evaluate its performance through Monte Carlo simulation studies. Finally, we illustrate the application of our model in a data set about male breast cancer from Brazil analyzed for the very first time.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 8","pages":"1485-1512"},"PeriodicalIF":1.1,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12147491/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144266336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient non-parametric estimation of variable productivity Hawkes processes. 可变生产率Hawkes过程的有效非参数估计。
IF 1.2 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-11-12 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2024.2426019
Sophie Phillips, Frederic Schoenberg

Several approaches to estimating the productivity function for a Hawkes point process with variable productivity are discussed, improved upon, and compared in terms of their root-mean-squared error and computational efficiency for various data sizes, and for binned as well as unbinned data. We find that for unbinned data, a regularized version of the analytic maximum likelihood estimator proposed by Schoenberg is the most accurate but is computationally burdensome. The unregularized version of the estimator is faster to compute but has lower accuracy, though both estimators outperform empirical or binned least squares estimators in terms of root-mean-squared error, especially when the mean productivity is 0.2 or greater. For binned data, binned least squares estimates are highly efficient both in terms of computation time and root-mean-squared error. An extension to estimating transmission time density is discussed, and an application to estimating the productivity of Covid-19 in the United States as a function of time from January 2020 to July 2022 is provided.

本文讨论了几种估算具有可变生产率的Hawkes点过程的生产率函数的方法,对其进行了改进,并根据其根均方误差和计算效率对各种数据大小进行了比较,并对分类和非分类数据进行了比较。我们发现,对于非装箱数据,Schoenberg提出的正则化版本的解析极大似然估计是最准确的,但计算量很大。非正则版本的估计器计算速度更快,但精度较低,尽管两种估计器在均方根误差方面都优于经验估计器或分箱最小二乘估计器,特别是当平均生产率为0.2或更高时。对于分类数据,分类最小二乘估计在计算时间和均方根误差方面都是非常有效的。讨论了估计传播时间密度的扩展,并提供了一个应用程序,用于估计2020年1月至2022年7月期间美国Covid-19的生产力作为时间函数。
{"title":"Efficient non-parametric estimation of variable productivity Hawkes processes.","authors":"Sophie Phillips, Frederic Schoenberg","doi":"10.1080/02664763.2024.2426019","DOIUrl":"10.1080/02664763.2024.2426019","url":null,"abstract":"<p><p>Several approaches to estimating the productivity function for a Hawkes point process with variable productivity are discussed, improved upon, and compared in terms of their root-mean-squared error and computational efficiency for various data sizes, and for binned as well as unbinned data. We find that for unbinned data, a regularized version of the analytic maximum likelihood estimator proposed by Schoenberg is the most accurate but is computationally burdensome. The unregularized version of the estimator is faster to compute but has lower accuracy, though both estimators outperform empirical or binned least squares estimators in terms of root-mean-squared error, especially when the mean productivity is 0.2 or greater. For binned data, binned least squares estimates are highly efficient both in terms of computation time and root-mean-squared error. An extension to estimating transmission time density is discussed, and an application to estimating the productivity of Covid-19 in the United States as a function of time from January 2020 to July 2022 is provided.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 7","pages":"1405-1422"},"PeriodicalIF":1.2,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12117874/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144181444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecasting economic growth by combining local linear and standard approaches. 结合本地线性和标准方法预测经济增长。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-11-08 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2024.2424920
Marlon Fritz, Sarah Forstinger, Yuanhua Feng, Thomas Gries

Today, developing economies are of major importance for global macroeconomic development. However, the empirical analysis and especially the forecasting of macroeconomic time series remain difficult due to a lack of sufficient data, data frequency, high volatility, and non-linear developments. These difficulties require more sophisticated approaches to obtain reliable forecasts. Therefore, we propose an improved forecasting method especially for growth data based on a data-driven local linear trend estimation with an extended iterative plug-in algorithm for determining the bandwidth endogenously. This approach allows a smooth trend estimation that takes care of temporary changes in trend processes. Further, the naïve random walk model is extended for forecasting by including a local linear, time-varying drift. We apply this method to GDP development for six developing and two advanced economies and compare different forecast combinations. The combinations that include the local linear approach and the random walk with a local linear trend improve forecasting accuracy and reduce variance.

今天,发展中经济体对全球宏观经济发展具有重要意义。然而,由于缺乏足够的数据、数据频率高、波动性大和非线性发展,宏观经济时间序列的实证分析特别是预测仍然很困难。这些困难需要更复杂的方法来获得可靠的预测。因此,我们提出了一种改进的预测方法,特别是基于数据驱动的局部线性趋势估计和扩展迭代插件算法来确定带宽的内源性。这种方法允许平滑的趋势估计,可以处理趋势过程中的临时变化。此外,naïve随机游走模型通过包含局部线性时变漂移来扩展预测。我们将这种方法应用于六个发展中经济体和两个发达经济体的GDP发展,并比较了不同的预测组合。将局部线性方法与具有局部线性趋势的随机漫步相结合,提高了预测精度,减小了方差。
{"title":"Forecasting economic growth by combining local linear and standard approaches.","authors":"Marlon Fritz, Sarah Forstinger, Yuanhua Feng, Thomas Gries","doi":"10.1080/02664763.2024.2424920","DOIUrl":"10.1080/02664763.2024.2424920","url":null,"abstract":"<p><p>Today, developing economies are of major importance for global macroeconomic development. However, the empirical analysis and especially the forecasting of macroeconomic time series remain difficult due to a lack of sufficient data, data frequency, high volatility, and non-linear developments. These difficulties require more sophisticated approaches to obtain reliable forecasts. Therefore, we propose an improved forecasting method especially for growth data based on a data-driven local linear trend estimation with an extended iterative plug-in algorithm for determining the bandwidth endogenously. This approach allows a smooth trend estimation that takes care of temporary changes in trend processes. Further, the naïve random walk model is extended for forecasting by including a local linear, time-varying drift. We apply this method to GDP development for six developing and two advanced economies and compare different forecast combinations. The combinations that include the local linear approach and the random walk with a local linear trend improve forecasting accuracy and reduce variance.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 7","pages":"1342-1360"},"PeriodicalIF":1.1,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12117868/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144182001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Applied Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1