Pub Date: 2025-01-01. Epub Date: 2024-12-14. DOI: 10.1007/s10985-024-09642-x
Walmir Dos Reis Miranda Filho, Fábio Nogueira Demarqui
We propose a new class of bivariate survival models based on the family of Archimedean copulas with margins modeled by the Yang and Prentice (YP) model. The Ali-Mikhail-Haq (AMH), Clayton, Frank, Gumbel-Hougaard (GH), and Joe copulas are employed to accommodate the dependency among marginal distributions. Baseline distributions are modeled semiparametrically by the Piecewise Exponential (PE) distribution and the Bernstein polynomials (BP). Inference procedures for the proposed class of models are based on the maximum likelihood (ML) approach. The new class of models possesses some attractive features: i) the ability to take into account survival data with crossing survival curves; ii) the inclusion of the well-known proportional hazards (PH) and proportional odds (PO) models as particular cases; iii) greater flexibility provided by the semiparametric modeling of the marginal baseline distributions; iv) the availability of closed-form expressions for the likelihood functions, leading to more straightforward inferential procedures. The properties of the proposed class are numerically investigated through an extensive simulation study. Finally, we demonstrate the versatility of our new class of models through the analysis of survival data involving patients diagnosed with ovarian cancer.
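As a rough illustration of the construction described above (not the authors' implementation), a bivariate survival function can be assembled by plugging piecewise exponential marginal survival functions into a Clayton copula; the YP regression structure on the margins is omitted, and the `cuts`/`rates` parameterization is a common but hypothetical choice:

```python
import numpy as np

def clayton_copula(u, v, theta):
    """Clayton copula C(u, v) = (u^-theta + v^-theta - 1)^(-1/theta), theta > 0."""
    return (u ** (-theta) + v ** (-theta) - 1.0) ** (-1.0 / theta)

def pe_survival(t, cuts, rates):
    """Survival function of a piecewise exponential distribution.
    `cuts` are interval boundaries (starting at 0), `rates` the constant
    hazard on each interval; S(t) = exp(-cumulative hazard at t)."""
    t = np.asarray(t, dtype=float)
    widths = np.diff(np.append(cuts, np.inf))
    # time spent in each interval, clipped to the interval width
    spent = np.clip(t[..., None] - cuts, 0.0, widths)
    return np.exp(-(spent * rates).sum(axis=-1))

def joint_survival(t1, t2, theta, cuts, rates1, rates2):
    """Bivariate survival S(t1, t2) = C(S1(t1), S2(t2)) under a Clayton copula."""
    return clayton_copula(pe_survival(t1, cuts, rates1),
                          pe_survival(t2, cuts, rates2), theta)
```

Any of the Archimedean generators named in the abstract (AMH, Frank, GH, Joe) could replace the Clayton form in `clayton_copula` without touching the marginal machinery.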
"A class of semiparametric models for bivariate survival data." Lifetime Data Analysis, pp. 102-125.
Pub Date: 2025-01-01. Epub Date: 2024-10-12. DOI: 10.1007/s10985-024-09639-6
Nicholas Hartman
Period-prevalent cohorts are often used for their cost-saving potential in epidemiological studies of survival outcomes. Under this design, prevalent patients allow for evaluations of long-term survival outcomes without the need for long follow-up, whereas incident patients allow for evaluations of short-term survival outcomes without the issue of left-truncation. In most period-prevalent survival analyses from the existing literature, patients have been recruited to achieve an overall sample size, with little attention given to the relative frequencies of prevalent and incident patients and their statistical implications. Furthermore, there are no existing methods available to rigorously quantify the impact of these relative frequencies on estimation and inference and incorporate this information into study design strategies. To address these gaps, we develop an approach to identify the optimal mix of prevalent and incident patients that maximizes precision over the entire estimated survival curve, subject to a flexible weighting scheme. In addition, we prove that inference based on the weighted log-rank test or Cox proportional hazards model is most powerful with an entirely prevalent or incident cohort, and we derive theoretical formulas to determine the optimal choice. Simulations confirm the validity of the proposed optimization criteria and show that substantial efficiency gains can be achieved by recruiting the optimal mix of prevalent and incident patients. The proposed methods are applied to assess waitlist outcomes among kidney transplant candidates.
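The design trade-off above can be caricatured with a toy grid search: given assumed per-subject variance contributions from incident and prevalent recruitment at a few time points, plus a weighting scheme, pick the incident fraction that minimizes the weighted total variance. The pooled form `var_i/p + var_p/(1-p)` is a simplifying assumption for illustration, not the paper's optimization criterion:

```python
import numpy as np

def optimal_incident_fraction(var_incident, var_prevalent, weights, grid=None):
    """Grid-search the fraction p of incident patients minimizing a weighted
    sum of pointwise variances over the survival curve.  `var_incident[k]`
    and `var_prevalent[k]` are assumed per-subject variance contributions at
    time point k; pooled variance at mix p is taken as the usual
    independent-strata form var_i/p + var_p/(1-p)."""
    if grid is None:
        grid = np.linspace(0.01, 0.99, 99)
    var_incident = np.asarray(var_incident)
    var_prevalent = np.asarray(var_prevalent)
    weights = np.asarray(weights)
    totals = [np.sum(weights * (var_incident / p + var_prevalent / (1 - p)))
              for p in grid]
    return grid[int(np.argmin(totals))]
```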
"Optimal survival analyses with prevalent and incident patients." Lifetime Data Analysis, pp. 24-51.
Pub Date: 2024-10-01. Epub Date: 2024-08-24. DOI: 10.1007/s10985-024-09632-z
Huazhen Yu, Rui Zhang, Lixin Zhang
This paper discusses regression analysis of current status data with dependent censoring, a problem that often occurs in many areas such as cross-sectional studies, epidemiological investigations and tumorigenicity experiments. Copula model-based methods are commonly employed to tackle this issue. However, these methods often face challenges in terms of model and parameter identification. The primary aim of this paper is to propose a copula-based analysis for dependent current status data, where the association parameter is left unspecified. Our method is based on a general class of semiparametric linear transformation models and parametric copulas. We demonstrate that the proposed semiparametric model is identifiable under certain regularity conditions from the distribution of the observed data. For inference, we develop a sieve maximum likelihood estimation method, using Bernstein polynomials to approximate the nonparametric functions involved. The asymptotic consistency and normality of the proposed estimators are established. Finally, to demonstrate the effectiveness and practical applicability of our method, we conduct an extensive simulation study and apply the proposed method to a real data example.
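Bernstein polynomial sieves of the kind mentioned above approximate a function on [0, 1] by weighting its values at grid points with binomial kernels; a minimal sketch of the basis (not the paper's sieve estimator):

```python
from math import comb

def bernstein_approx(f, m, x):
    """Degree-m Bernstein polynomial approximation of f on [0, 1]:
    B_m(f; x) = sum_k f(k/m) * C(m, k) * x^k * (1 - x)^(m - k).
    If f is monotone, so is B_m(f; .), which is why sieve estimators use
    this basis for monotone nuisance functions."""
    return sum(f(k / m) * comb(m, k) * x**k * (1 - x)**(m - k)
               for k in range(m + 1))
```

In sieve maximum likelihood, the values f(k/m) become free coefficients (ordered, to enforce monotonicity) and m grows slowly with the sample size.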
"Copula-based analysis of dependent current status data with semiparametric linear transformation model." Lifetime Data Analysis, pp. 742-775.
Pub Date: 2024-10-01. Epub Date: 2024-09-05. DOI: 10.1007/s10985-024-09633-y
Yei Eun Shin, Takumi Saegusa
The nested case-control (NCC) design is a cost-effective outcome-dependent design in epidemiology that collects all cases and a fixed number of controls at the time of each case's diagnosis from a large cohort. To address its inefficiency relative to full cohort studies, previous research developed various estimation methodologies, but changes to the design's formulation of risk sets were considered only in view of potential bias in the partial likelihood estimation. In this paper, we study a modified design that excludes previously selected controls from risk sets, in view of efficiency improvement as well as bias. To this end, we extend the inverse probability weighting method of Samuelsen, which was shown to outperform the partial likelihood estimator in the standard setting. We develop its asymptotic theory and a variance estimation of both regression coefficients and the cumulative baseline hazard function that takes account of the complex features of the modified sampling design. In addition to good finite-sample performance of the variance estimation, simulation studies show that the modified design with the proposed estimator is more efficient than the standard design. Examples are provided using data from the NIH-AARP Diet and Health Study.
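Samuelsen's weighting scheme for the standard (with-replacement) design can be sketched as follows; the modified without-replacement inclusion probabilities derived in the paper are more involved and are not reproduced here:

```python
import numpy as np

def samuelsen_weights(case_times, at_risk_sizes, censor_times, m):
    """Samuelsen-type IPW weights for a standard nested case-control design.

    For a non-case still at risk at each case time t_j, the chance of being
    sampled as one of the m controls is m / (n_j - 1), where n_j is the risk
    set size (including the case, which is excluded from sampling).  The
    inclusion probability is one minus the product of non-selection
    probabilities over case times before the subject's own exit time; the
    IPW weight is its reciprocal (infinite, i.e. unusable, for subjects who
    could never have been sampled)."""
    weights = []
    for t in censor_times:
        p_not = 1.0
        for t_case, n in zip(case_times, at_risk_sizes):
            if t_case <= t:  # subject was in the risk set at this case time
                p_not *= 1.0 - m / (n - 1.0)
        p_incl = 1.0 - p_not
        weights.append(1.0 / p_incl if p_incl > 0 else np.inf)
    return np.array(weights)
```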
"Nested case-control sampling without replacement." Lifetime Data Analysis, pp. 776-799. Open access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11502564/pdf/
Pub Date: 2024-10-01. Epub Date: 2024-05-28. DOI: 10.1007/s10985-024-09630-1
Dayu Sun, Yuanyuan Guo, Yang Li, Jianguo Sun, Wanzhu Tu
Panel count regression is often required in recurrent event studies, where the interest is to model the event rate. Existing rate models are unable to handle time-varying covariate effects due to theoretical and computational difficulties. Mean models provide a viable alternative but are subject to the constraints of the monotonicity assumption, which tends to be violated when covariates fluctuate over time. In this paper, we present a new semiparametric rate model for panel count data along with related theoretical results. For model fitting, we present an efficient EM algorithm with three different methods for variance estimation. The algorithm allows us to sidestep the challenges of numerical integration and difficulties with the iterative convex minorant algorithm. We show that the estimators are consistent and asymptotically normally distributed. Simulation studies confirm excellent finite-sample performance. To illustrate, we analyze data from a real clinical study of behavioral risk factors for sexually transmitted infections.
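The iterative convex minorant algorithm that the authors sidestep is rooted in isotonic regression; the pool-adjacent-violators step at its core looks like this (shown for intuition about why monotone mean-function fitting is hard, not as the paper's estimator):

```python
def pava(y, w=None):
    """Pool-adjacent-violators algorithm for (weighted) isotonic regression:
    the least-squares fit to y that is non-decreasing.  Adjacent values that
    violate monotonicity are pooled into weighted averages until none remain."""
    w = [1.0] * len(y) if w is None else list(w)
    blocks = []  # each block: [pooled value, total weight, run length]
    for v, wt in zip(y, w):
        blocks.append([float(v), float(wt), 1])
        # pool backwards while monotonicity is violated
        while len(blocks) > 1 and blocks[-2][0] > blocks[-1][0]:
            v2, w2, c2 = blocks.pop()
            v1, w1, c1 = blocks.pop()
            tot = w1 + w2
            blocks.append([(v1 * w1 + v2 * w2) / tot, tot, c1 + c2])
    fitted = []
    for v, _, c in blocks:
        fitted.extend([v] * c)
    return fitted
```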
"A flexible time-varying coefficient rate model for panel count data." Lifetime Data Analysis, pp. 721-741.
Pub Date: 2024-10-01. Epub Date: 2024-10-16. DOI: 10.1007/s10985-024-09636-9
Xingqiu Zhao
"Call for papers for a special issue on survival analysis in artificial intelligence." Lifetime Data Analysis, pp. 853-854.
Pub Date: 2024-10-01. Epub Date: 2024-10-04. DOI: 10.1007/s10985-024-09635-w
Esra Kürüm, Danh V Nguyen, Qi Qian, Sudipto Banerjee, Connie M Rhee, Damla Şentürk
Individuals with end-stage kidney disease (ESKD) on dialysis experience high mortality and excessive burden of hospitalizations over time relative to comparable Medicare patient cohorts without kidney failure. A key interest in this population is to understand the time-dynamic effects of multilevel risk factors that contribute to the correlated outcomes of longitudinal hospitalization and mortality. For this we utilize multilevel data from the United States Renal Data System (USRDS), a national database that includes nearly all patients with ESKD, where repeated measurements/hospitalizations over time are nested in patients and patients are nested within (health service) regions across the contiguous U.S. We develop a novel spatiotemporal multilevel joint model (STM-JM) that accounts for the aforementioned hierarchical structure of the data while considering the spatiotemporal variations in both outcomes across regions. The proposed STM-JM includes time-varying effects of multilevel (patient- and region-level) risk factors on hospitalization trajectories and mortality and incorporates spatial correlations across the spatial regions via a multivariate conditional autoregressive correlation structure. Efficient estimation and inference are performed via a Bayesian framework, where multilevel varying coefficient functions are targeted via thin-plate splines. The finite sample performance of the proposed method is assessed through simulation studies. An application of the proposed method to the USRDS data highlights significant time-varying effects of patient- and region-level risk factors on hospitalization and mortality and identifies specific time periods on dialysis and spatial locations across the U.S. with elevated hospitalization and mortality risks.
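The multivariate conditional autoregressive structure mentioned above rests on a precision matrix of the proper-CAR form; a univariate sketch (not the STM-JM's multivariate version):

```python
import numpy as np

def car_precision(W, rho, tau):
    """Precision matrix of a proper conditional autoregressive (CAR) model:
    Q = tau * (D - rho * W), where W is a symmetric 0/1 adjacency matrix
    over regions and D is diagonal with the neighbor counts.  For |rho| < 1
    the matrix is strictly diagonally dominant, hence positive definite,
    which keeps the implied spatial prior well defined."""
    W = np.asarray(W, dtype=float)
    D = np.diag(W.sum(axis=1))
    return tau * (D - rho * W)
```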
"Spatiotemporal multilevel joint modeling of longitudinal and survival outcomes in end-stage kidney disease." Lifetime Data Analysis, pp. 827-852. Open access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11502599/pdf/
Pub Date: 2024-09-13. DOI: 10.1007/s10985-024-09634-x
Suryo Adi Rakhmawan, Tahir Mahmood, Nasir Abbas, Muhammad Riaz
Forecasting mortality rates is crucial for evaluating life insurance company solvency, especially amid disruptions caused by phenomena like COVID-19. The Lee–Carter model is commonly employed in mortality modelling; however, extensions that can encompass count data with diverse distributions, such as the Generalized Autoregressive Score (GAS) model utilizing the COM–Poisson distribution, exhibit potential for enhancing time-to-event forecasting accuracy. Using mortality data from 29 countries, this research evaluates various distributions and determines that the COM–Poisson model surpasses the Poisson, binomial, and negative binomial distributions in forecasting mortality rates. The one-step forecasting capability of the GAS model offers distinct advantages, while the COM–Poisson distribution demonstrates enhanced flexibility and versatility by accommodating various distributions, including Poisson and negative binomial. Ultimately, the study determines that the COM–Poisson GAS model is an effective instrument for examining time series data on mortality rates, particularly when facing time-varying parameters and non-conventional data distributions.
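The COM-Poisson distribution underpinning the GAS model has pmf P(Y=y) proportional to λ^y / (y!)^ν, with a normalizing constant that has no closed form and is evaluated by truncation; a log-scale sketch (the truncation point `ymax` is an arbitrary choice here, not from the paper):

```python
import math

def com_poisson_pmf(y, lam, nu, ymax=100):
    """COM-Poisson pmf P(Y = y) = lam^y / (y!)^nu / Z(lam, nu), with the
    normalizing constant Z truncated at ymax terms and the sum done on the
    log scale for stability.  nu = 1 recovers the Poisson distribution;
    nu < 1 gives overdispersion, nu > 1 underdispersion."""
    log_terms = [k * math.log(lam) - nu * math.lgamma(k + 1)
                 for k in range(ymax + 1)]
    m = max(log_terms)
    log_z = m + math.log(sum(math.exp(t - m) for t in log_terms))
    return math.exp(y * math.log(lam) - nu * math.lgamma(y + 1) - log_z)
```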
"Unifying mortality forecasting model: an investigation of the COM–Poisson distribution in the GAS model for improved projections." Lifetime Data Analysis, vol. 60.
Pub Date: 2024-07-01. Epub Date: 2024-06-24. DOI: 10.1007/s10985-024-09631-0
Mei-Ling Ting Lee
"Special issue dedicated to Mitchell H. Gail, M.D. Ph.D." Lifetime Data Analysis, pp. 529-530.
Pub Date: 2024-07-01. Epub Date: 2024-05-08. DOI: 10.1007/s10985-024-09628-9
Yaqi Cao, Weidong Ma, Ge Zhao, Anne Marie McCarthy, Jinbo Chen
The added value of candidate predictors for risk modeling is routinely evaluated by comparing the performance of models with and without the candidate predictors. Such a comparison is most meaningful when the risks estimated by the two models are both unbiased in the target population. Very often, data for candidate predictors are sourced from nonrepresentative convenience samples. Updating the base model using the study data without acknowledging the discrepancy between the underlying distribution of the study data and that of the target population can lead to biased risk estimates and therefore an unfair evaluation of candidate predictors. To address this issue, assuming access to a well-calibrated base model, we propose a semiparametric method for model fitting that enforces good calibration. The central idea is to calibrate the fitted model against the base model by enforcing suitable constraints in maximizing the likelihood function. This approach enables unbiased assessment of the model improvement offered by candidate predictors without requiring a representative sample from the target population, thus overcoming a significant practical challenge. We study theoretical properties of the model parameter estimates, and demonstrate improvement in model calibration via extensive simulation studies. Finally, we apply the proposed method to data extracted from Penn Medicine Biobank to inform the added value of breast density for breast cancer risk assessment among Caucasian women.
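The constrained-ML idea can be caricatured for logistic regression with a single calibration-in-the-large constraint, using `scipy.optimize` (an assumed dependency); the paper's constraint set is richer than this one moment condition:

```python
import numpy as np
from scipy.optimize import minimize

def constrained_logistic_fit(X, y, base_risk):
    """Maximum likelihood logistic regression subject to one calibration
    constraint: the mean fitted risk must equal the mean risk from the
    (assumed well-calibrated) base model.  A simplified sketch of the
    constrained-ML idea only."""
    X1 = np.column_stack([np.ones(len(X)), X])  # prepend an intercept column

    def neg_loglik(beta):
        eta = X1 @ beta
        # -loglik = sum[ log(1 + exp(eta)) - y * eta ], computed stably
        return np.sum(np.logaddexp(0.0, eta) - y * eta)

    def calibration_gap(beta):
        fitted = 1.0 / (1.0 + np.exp(-(X1 @ beta)))
        return fitted.mean() - np.mean(base_risk)

    res = minimize(neg_loglik, np.zeros(X1.shape[1]), method="SLSQP",
                   constraints=[{"type": "eq", "fun": calibration_gap}])
    return res.x
```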
"A constrained maximum likelihood approach to developing well-calibrated models for predicting binary outcomes." Lifetime Data Analysis, pp. 624-648. Open access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11634939/pdf/