首页 > 最新文献

Lifetime Data Analysis最新文献

英文 中文
Median regression models for clustered, interval-censored survival data - An application to prostate surgery study. 聚类、间隔剔除生存数据的中位数回归模型——在前列腺手术研究中的应用。
IF 1.3 3区 数学 Q2 Mathematics Pub Date : 2022-10-01 Epub Date: 2022-08-07 DOI: 10.1007/s10985-022-09570-8
Debajyoti Sinha, Piyali Basak, Stuart R Lipsitz

Genitourinary surgeons and oncologists are particularly interested in whether a robotic surgery improves times to Prostate Specific Antigen (PSA) recurrence compared to a non-robotic surgery for removing the cancerous prostate. Time to PSA recurrence is an example of a survival time that is typically interval-censored between two consecutive clinical inspections with opposite test results. In addition, success of medical devices and technologies often depends on factors such as experience and skill level of the medical service providers, thus leading to clustering of these survival times. For analyzing the effects of surgery types and other covariates on median of clustered interval-censored time to post-surgery PSA recurrence, we present three competing novel models and associated frequentist and Bayesian analyses. The first model is based on a transform-both-sides of survival time with Gaussian random effects to account for the within-cluster association. Our second model assumes an approximate marginal Laplace distribution for the transformed log-survival times with a Gaussian copula to accommodate clustering. Our third model is a special case of the second model with Laplace distribution for the marginal log-survival times and Gaussian copula for the within-cluster association. Simulation studies establish the second model to be highly robust against extreme observations while estimating median regression coefficients. We provide a comprehensive comparison among these three competing models based on the model properties and the computational ease of their Frequentist and Bayesian analysis. We also illustrate the practical implementations and uses of these methods via analysis of a simulated clustered interval-censored data-set similar in design to a post-surgery PSA recurrence study.

泌尿生殖外科医生和肿瘤学家特别感兴趣的是,与非机器人手术相比,机器人手术是否能改善前列腺特异性抗原(PSA)的复发率。PSA复发的时间是生存时间的一个例子,通常在两次连续的临床检查结果相反的情况下进行间隔审查。此外,医疗设备和技术的成功往往取决于医疗服务提供者的经验和技能水平等因素,从而导致这些生存时间的聚类。为了分析手术类型和其他协变量对聚类间隔截除时间中位数对术后PSA复发的影响,我们提出了三个相互竞争的新模型以及相关的频率分析和贝叶斯分析。第一个模型是基于高斯随机效应的生存时间的转换,以解释簇内关联。我们的第二个模型假设了一个近似的边际拉普拉斯分布,用于转换后的对数生存时间,并使用高斯copula来适应聚类。我们的第三个模型是第二个模型的特殊情况,其边际对数生存时间为拉普拉斯分布,聚类内关联为高斯联结。模拟研究建立了第二个模型,在估计中位数回归系数时对极端观测具有高度鲁棒性。我们根据模型的性质以及它们的频率分析和贝叶斯分析的计算便利性,对这三种相互竞争的模型进行了全面的比较。我们还通过分析一个类似于手术后PSA复发研究的模拟聚类间隔剔除数据集来说明这些方法的实际实现和使用。
{"title":"Median regression models for clustered, interval-censored survival data - An application to prostate surgery study.","authors":"Debajyoti Sinha,&nbsp;Piyali Basak,&nbsp;Stuart R Lipsitz","doi":"10.1007/s10985-022-09570-8","DOIUrl":"https://doi.org/10.1007/s10985-022-09570-8","url":null,"abstract":"<p><p>Genitourinary surgeons and oncologists are particularly interested in whether a robotic surgery improves times to Prostate Specific Antigen (PSA) recurrence compared to a non-robotic surgery for removing the cancerous prostate. Time to PSA recurrence is an example of a survival time that is typically interval-censored between two consecutive clinical inspections with opposite test results. In addition, success of medical devices and technologies often depends on factors such as experience and skill level of the medical service providers, thus leading to clustering of these survival times. For analyzing the effects of surgery types and other covariates on median of clustered interval-censored time to post-surgery PSA recurrence, we present three competing novel models and associated frequentist and Bayesian analyses. The first model is based on a transform-both-sides of survival time with Gaussian random effects to account for the within-cluster association. Our second model assumes an approximate marginal Laplace distribution for the transformed log-survival times with a Gaussian copula to accommodate clustering. Our third model is a special case of the second model with Laplace distribution for the marginal log-survival times and Gaussian copula for the within-cluster association. Simulation studies establish the second model to be highly robust against extreme observations while estimating median regression coefficients. We provide a comprehensive comparison among these three competing models based on the model properties and the computational ease of their Frequentist and Bayesian analysis. We also illustrate the practical implementations and uses of these methods via analysis of a simulated clustered interval-censored data-set similar in design to a post-surgery PSA recurrence study.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40605557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bias correction via outcome reassignment for cross-sectional data with binary disease outcome. 通过结果重新分配对具有二元疾病结果的横断面数据进行偏倚校正。
IF 1.3 3区 数学 Q2 Mathematics Pub Date : 2022-10-01 Epub Date: 2022-06-24 DOI: 10.1007/s10985-022-09559-3
Mei-Cheng Wang, Yuxin Zhu

Cross-sectionally sampled data with binary disease outcome are commonly analyzed in observational studies to identify the relationship between covariates and disease outcome. A cross-sectional population is defined as a population of living individuals at the sampling or observational time. It is generally understood that binary disease outcome from cross-sectional data contains less information than longitudinally collected time-to-event data, but there is insufficient understanding as to whether bias can possibly exist in cross-sectional data and how the bias is related to the population risk of interest. Wang and Yang (2021) presented the complexity and bias in cross-sectional data with binary disease outcome with detailed analytical explorations into the data structure. As the distribution of the cross-sectional binary outcome is quite different from the population risk distribution, bias can arise when using cross-sectional data analysis to draw inference for population risk. In this paper we argue that the commonly adopted age-specific risk probability is biased for the estimation of population risk and propose an outcome reassignment approach which reassigns a portion of the observed binary outcome, 0 or 1, to the other disease category. A sign test and a semiparametric pseudo-likelihood method are developed for analyzing cross-sectional data using the OR approach. Simulations and an analysis based on Alzheimer's Disease data are presented to illustrate the proposed methods.

在观察性研究中,通常对具有二元疾病结局的横断面抽样数据进行分析,以确定协变量与疾病结局之间的关系。横断面人口被定义为在抽样或观察时间活的个体的人口。人们普遍认为,来自横断面数据的二元疾病结局所包含的信息少于纵向收集的事件时间数据,但对于横断面数据中是否可能存在偏倚以及偏倚如何与感兴趣的人群风险相关,人们的理解不足。Wang和Yang(2021)通过对数据结构的详细分析探索,提出了具有二元疾病结局的横断面数据的复杂性和偏倚性。由于横断面二值结果的分布与总体风险分布有很大的不同,在使用横断面数据分析进行总体风险推断时可能会产生偏差。在本文中,我们认为通常采用的年龄特异性风险概率对于估计人群风险是有偏差的,并提出了一种结果重新分配方法,该方法将观察到的二进制结果的一部分(0或1)重新分配给其他疾病类别。提出了一种符号检验和半参数伪似然方法,用于使用OR方法分析截面数据。基于阿尔茨海默病数据的仿真和分析说明了所提出的方法。
{"title":"Bias correction via outcome reassignment for cross-sectional data with binary disease outcome.","authors":"Mei-Cheng Wang,&nbsp;Yuxin Zhu","doi":"10.1007/s10985-022-09559-3","DOIUrl":"https://doi.org/10.1007/s10985-022-09559-3","url":null,"abstract":"<p><p>Cross-sectionally sampled data with binary disease outcome are commonly analyzed in observational studies to identify the relationship between covariates and disease outcome. A cross-sectional population is defined as a population of living individuals at the sampling or observational time. It is generally understood that binary disease outcome from cross-sectional data contains less information than longitudinally collected time-to-event data, but there is insufficient understanding as to whether bias can possibly exist in cross-sectional data and how the bias is related to the population risk of interest. Wang and Yang (2021) presented the complexity and bias in cross-sectional data with binary disease outcome with detailed analytical explorations into the data structure. As the distribution of the cross-sectional binary outcome is quite different from the population risk distribution, bias can arise when using cross-sectional data analysis to draw inference for population risk. In this paper we argue that the commonly adopted age-specific risk probability is biased for the estimation of population risk and propose an outcome reassignment approach which reassigns a portion of the observed binary outcome, 0 or 1, to the other disease category. A sign test and a semiparametric pseudo-likelihood method are developed for analyzing cross-sectional data using the OR approach. Simulations and an analysis based on Alzheimer's Disease data are presented to illustrate the proposed methods.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40395608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Marker-dependent observation and carry-forward of internal covariates in Cox regression. Cox回归中标记依赖观察及内协变量的结转。
IF 1.3 3区 数学 Q2 Mathematics Pub Date : 2022-10-01 Epub Date: 2022-06-20 DOI: 10.1007/s10985-022-09561-9
Richard J Cook, Jerald F Lawless, Bingfeng Xie

Studies of chronic disease often involve modeling the relationship between marker processes and disease onset or progression. The Cox regression model is perhaps the most common and convenient approach to analysis in this setting. In most cohort studies, however, biospecimens and biomarker values are only measured intermittently (e.g. at clinic visits) so Cox models often treat biomarker values as fixed at their most recently observed values, until they are updated at the next visit. We consider the implications of this convention on the limiting values of regression coefficient estimators when the marker values themselves impact the intensity for clinic visits. A joint multistate model is described for the marker-failure-visit process which can be fitted to mitigate this bias and an expectation-maximization algorithm is developed. An application to data from a registry of patients with psoriatic arthritis is given for illustration.

慢性疾病的研究通常涉及对标志物过程与疾病发生或进展之间的关系进行建模。在这种情况下,Cox回归模型可能是最常见和最方便的分析方法。然而,在大多数队列研究中,生物标本和生物标志物值只是间歇性地测量的(例如,在诊所就诊时),因此Cox模型通常将生物标志物值视为最近观察到的固定值,直到下次就诊时更新。当标记值本身影响门诊就诊的强度时,我们考虑这种惯例对回归系数估计器的极限值的含义。描述了标记-故障-访问过程的联合多状态模型,该模型可以拟合以减轻这种偏差,并开发了期望最大化算法。应用数据从注册的银屑病关节炎患者给出了说明。
{"title":"Marker-dependent observation and carry-forward of internal covariates in Cox regression.","authors":"Richard J Cook,&nbsp;Jerald F Lawless,&nbsp;Bingfeng Xie","doi":"10.1007/s10985-022-09561-9","DOIUrl":"https://doi.org/10.1007/s10985-022-09561-9","url":null,"abstract":"<p><p>Studies of chronic disease often involve modeling the relationship between marker processes and disease onset or progression. The Cox regression model is perhaps the most common and convenient approach to analysis in this setting. In most cohort studies, however, biospecimens and biomarker values are only measured intermittently (e.g. at clinic visits) so Cox models often treat biomarker values as fixed at their most recently observed values, until they are updated at the next visit. We consider the implications of this convention on the limiting values of regression coefficient estimators when the marker values themselves impact the intensity for clinic visits. A joint multistate model is described for the marker-failure-visit process which can be fitted to mitigate this bias and an expectation-maximization algorithm is developed. An application to data from a registry of patients with psoriatic arthritis is given for illustration.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40104617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Screening for chronic diseases: optimizing lead time through balancing prescribed frequency and individual adherence. 慢性疾病筛查:通过平衡处方频率和个人依从性来优化前置时间。
IF 1.3 3区 数学 Q2 Mathematics Pub Date : 2022-10-01 Epub Date: 2022-06-24 DOI: 10.1007/s10985-022-09563-7
John D Rice, Brent A Johnson, Robert L Strawderman

Screening for chronic diseases, such as cancer, is an important public health priority, but traditionally only the frequency or rate of screening has received attention. In this work, we study the importance of adhering to recommended screening policies and develop new methodology to better optimize screening policies when adherence is imperfect. We consider a progressive disease model with four states (healthy, undetectable preclinical, detectable preclinical, clinical), and overlay this with a stochastic screening-behavior model using the theory of renewal processes that allows us to capture imperfect adherence to screening programs in a transparent way. We show that decreased adherence leads to reduced efficacy of screening programs, quantified here using elements of the lead time distribution (i.e., the time between screening diagnosis and when diagnosis would have occurred clinically in the absence of screening). Under the assumption of an inverse relationship between prescribed screening frequency and individual adherence, we show that the optimal screening frequency generally decreases with increasing levels of non-adherence. We apply this model to an example in breast cancer screening, demonstrating how accounting for imperfect adherence affects the recommended screening frequency.

对癌症等慢性病进行筛查是一项重要的公共卫生优先事项,但传统上只关注筛查的频率或比率。在这项工作中,我们研究了坚持推荐筛查政策的重要性,并开发了新的方法,以便在依从性不完善时更好地优化筛查政策。我们考虑了一个具有四种状态(健康、不可检测的临床前、可检测的临床前、临床)的进行性疾病模型,并使用更新过程理论将其与随机筛查行为模型叠加,该模型允许我们以透明的方式捕捉对筛查程序的不完美依从性。我们表明,依从性的降低导致筛查项目的有效性降低,这里使用前置时间分布的要素进行量化(即,筛查诊断之间的时间和在没有筛查的情况下临床诊断的时间)。在假定规定筛查频率与个体依从性呈反比关系的前提下,我们发现最佳筛查频率通常随着不依从性水平的增加而降低。我们将该模型应用于乳腺癌筛查的一个例子,展示了不完美的依从性如何影响推荐的筛查频率。
{"title":"Screening for chronic diseases: optimizing lead time through balancing prescribed frequency and individual adherence.","authors":"John D Rice,&nbsp;Brent A Johnson,&nbsp;Robert L Strawderman","doi":"10.1007/s10985-022-09563-7","DOIUrl":"https://doi.org/10.1007/s10985-022-09563-7","url":null,"abstract":"<p><p>Screening for chronic diseases, such as cancer, is an important public health priority, but traditionally only the frequency or rate of screening has received attention. In this work, we study the importance of adhering to recommended screening policies and develop new methodology to better optimize screening policies when adherence is imperfect. We consider a progressive disease model with four states (healthy, undetectable preclinical, detectable preclinical, clinical), and overlay this with a stochastic screening-behavior model using the theory of renewal processes that allows us to capture imperfect adherence to screening programs in a transparent way. We show that decreased adherence leads to reduced efficacy of screening programs, quantified here using elements of the lead time distribution (i.e., the time between screening diagnosis and when diagnosis would have occurred clinically in the absence of screening). Under the assumption of an inverse relationship between prescribed screening frequency and individual adherence, we show that the optimal screening frequency generally decreases with increasing levels of non-adherence. We apply this model to an example in breast cancer screening, demonstrating how accounting for imperfect adherence affects the recommended screening frequency.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40334301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing dynamic covariate effects with survival data. 用生存数据评估动态协变量效应。
IF 1.3 3区 数学 Q2 Mathematics Pub Date : 2022-10-01 DOI: 10.1007/s10985-022-09571-7
Ying Cui, Limin Peng

Dynamic (or varying) covariate effects often manifest meaningful physiological mechanisms underlying chronic diseases. However, a static view of covariate effects is typically adopted by standard approaches to evaluating disease prognostic factors, which can result in depreciation of some important disease markers. To address this issue, in this work, we take the perspective of globally concerned quantile regression, and propose a flexible testing framework suited to assess either constant or dynamic covariate effects. We study the powerful Kolmogorov-Smirnov (K-S) and Cramér-Von Mises (C-V) type test statistics and develop a simple resampling procedure to tackle their complicated limit distributions. We provide rigorous theoretical results, including the limit null distributions and consistency under a general class of alternative hypotheses of the proposed tests, as well as the justifications for the presented resampling procedure. Extensive simulation studies and a real data example demonstrate the utility of the new testing procedures and their advantages over existing approaches in assessing dynamic covariate effects.

动态(或变化的)协变量效应往往表现出慢性疾病潜在的有意义的生理机制。然而,评估疾病预后因素的标准方法通常采用协变量效应的静态观点,这可能导致一些重要疾病标志物的贬值。为了解决这个问题,在这项工作中,我们采取了全球关注的分位数回归的角度,并提出了一个灵活的测试框架,适合于评估恒定或动态协变量效应。我们研究了强大的Kolmogorov-Smirnov (K-S)和cram - von Mises (C-V)型检验统计量,并开发了一个简单的重采样程序来处理它们复杂的极限分布。我们提供了严格的理论结果,包括极限零分布和在所提出的检验的一般类别的可选假设下的一致性,以及所提出的重采样程序的理由。广泛的模拟研究和一个真实的数据例子证明了新的测试程序的效用,以及它们在评估动态协变量效应方面比现有方法的优势。
{"title":"Assessing dynamic covariate effects with survival data.","authors":"Ying Cui,&nbsp;Limin Peng","doi":"10.1007/s10985-022-09571-7","DOIUrl":"https://doi.org/10.1007/s10985-022-09571-7","url":null,"abstract":"<p><p>Dynamic (or varying) covariate effects often manifest meaningful physiological mechanisms underlying chronic diseases. However, a static view of covariate effects is typically adopted by standard approaches to evaluating disease prognostic factors, which can result in depreciation of some important disease markers. To address this issue, in this work, we take the perspective of globally concerned quantile regression, and propose a flexible testing framework suited to assess either constant or dynamic covariate effects. We study the powerful Kolmogorov-Smirnov (K-S) and Cramér-Von Mises (C-V) type test statistics and develop a simple resampling procedure to tackle their complicated limit distributions. We provide rigorous theoretical results, including the limit null distributions and consistency under a general class of alternative hypotheses of the proposed tests, as well as the justifications for the presented resampling procedure. Extensive simulation studies and a real data example demonstrate the utility of the new testing procedures and their advantages over existing approaches in assessing dynamic covariate effects.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9901566/pdf/nihms-1867801.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10655787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Accounting for delayed entry into observational studies and clinical trials: length-biased sampling and restricted mean survival time. 考虑延迟进入观察性研究和临床试验:长度偏倚抽样和限制平均生存时间。
IF 1.3 3区 数学 Q2 Mathematics Pub Date : 2022-10-01 Epub Date: 2022-07-01 DOI: 10.1007/s10985-022-09562-8
Mei-Ling Ting Lee, John Lawrence, Yiming Chen, G A Whitmore

Individuals in many observational studies and clinical trials for chronic diseases are enrolled well after onset or diagnosis of their disease. Times to events of interest after enrollment are therefore residual or left-truncated event times. Individuals entering the studies have disease that has advanced to varying extents. Moreover, enrollment usually entails probability sampling of the study population. Finally, event times over a short to moderate time horizon are often of interest in these investigations, rather than more speculative and remote happenings that lie beyond the study period. This research report looks at the issue of delayed entry into these kinds of studies and trials. Time to event for an individual is modelled as a first hitting time of an event threshold by a latent disease process, which is taken to be a Wiener process. It is emphasized that recruitment into these studies often involves length-biased sampling. The requisite mathematics for this kind of sampling and delayed entry are presented, including explicit formulas needed for estimation and inference. Restricted mean survival time (RMST) is taken as the clinically relevant outcome measure. Exact parametric formulas for this measure are derived and presented. The results are extended to settings that involve study covariates using threshold regression methods. Methods adapted for clinical trials are presented. An extensive case illustration for a clinical trial setting is then presented to demonstrate the methods, the interpretation of results, and the harvesting of useful insights. The closing discussion covers a number of important issues and concepts.

在许多慢性疾病的观察性研究和临床试验中,个体在发病或诊断后很长时间才入组。因此,注册后感兴趣事件的时间是残差或左截短的事件时间。参与研究的个体都有不同程度的疾病进展。此外,入组通常需要对研究人群进行概率抽样。最后,在这些调查中,短期到中等时间范围内的事件时间通常是感兴趣的,而不是超出研究期的更具推测性和遥远的事件。这份研究报告着眼于延迟进入这类研究和试验的问题。个体的事件发生时间通过潜伏性疾病过程建模为事件阈值的首次到达时间,该过程被认为是维纳过程。需要强调的是,这些研究的招募通常涉及长度偏差抽样。给出了这种采样和延迟输入的必要数学,包括估计和推理所需的显式公式。限制平均生存时间(RMST)作为临床相关的结局指标。推导并给出了该测量的精确参数公式。使用阈值回归方法将结果扩展到涉及研究协变量的设置。提出了适用于临床试验的方法。一个广泛的案例说明,临床试验设置,然后提出演示的方法,结果的解释,并收获有用的见解。闭幕讨论涵盖了一些重要的问题和概念。
{"title":"Accounting for delayed entry into observational studies and clinical trials: length-biased sampling and restricted mean survival time.","authors":"Mei-Ling Ting Lee,&nbsp;John Lawrence,&nbsp;Yiming Chen,&nbsp;G A Whitmore","doi":"10.1007/s10985-022-09562-8","DOIUrl":"https://doi.org/10.1007/s10985-022-09562-8","url":null,"abstract":"<p><p>Individuals in many observational studies and clinical trials for chronic diseases are enrolled well after onset or diagnosis of their disease. Times to events of interest after enrollment are therefore residual or left-truncated event times. Individuals entering the studies have disease that has advanced to varying extents. Moreover, enrollment usually entails probability sampling of the study population. Finally, event times over a short to moderate time horizon are often of interest in these investigations, rather than more speculative and remote happenings that lie beyond the study period. This research report looks at the issue of delayed entry into these kinds of studies and trials. Time to event for an individual is modelled as a first hitting time of an event threshold by a latent disease process, which is taken to be a Wiener process. It is emphasized that recruitment into these studies often involves length-biased sampling. The requisite mathematics for this kind of sampling and delayed entry are presented, including explicit formulas needed for estimation and inference. Restricted mean survival time (RMST) is taken as the clinically relevant outcome measure. Exact parametric formulas for this measure are derived and presented. The results are extended to settings that involve study covariates using threshold regression methods. Methods adapted for clinical trials are presented. An extensive case illustration for a clinical trial setting is then presented to demonstrate the methods, the interpretation of results, and the harvesting of useful insights. The closing discussion covers a number of important issues and concepts.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40464059","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Mixture survival trees for cancer risk classification. 用于癌症风险分类的混合生存树。
IF 1.3 3区 数学 Q2 Mathematics Pub Date : 2022-07-01 DOI: 10.1007/s10985-022-09552-w
Beilin Jia, Donglin Zeng, Jason J Z Liao, Guanghan F Liu, Xianming Tan, Guoqing Diao, Joseph G Ibrahim

In oncology studies, it is important to understand and characterize disease heterogeneity among patients so that patients can be classified into different risk groups and one can identify high-risk patients at the right time. This information can then be used to identify a more homogeneous patient population for developing precision medicine. In this paper, we propose a mixture survival tree approach for direct risk classification. We assume that the patients can be classified into a pre-specified number of risk groups, where each group has distinct survival profile. Our proposed tree-based methods are devised to estimate latent group membership using an EM algorithm. The observed data log-likelihood function is used as the splitting criterion in recursive partitioning. The finite sample performance is evaluated by extensive simulation studies and the proposed method is illustrated by a case study in breast cancer.

在肿瘤学研究中,了解和表征患者之间的疾病异质性是非常重要的,这样可以将患者划分为不同的风险组,并在适当的时候识别出高危患者。然后,这些信息可以用于确定更均匀的患者群体,以开发精准医疗。本文提出了一种用于直接风险分类的混合生存树方法。我们假设患者可以被分为预先指定的风险组,其中每组有不同的生存概况。我们提出的基于树的方法是设计来估计潜在的群体成员使用EM算法。将观测数据的对数似然函数作为递归划分的分割准则。有限样本的性能通过广泛的模拟研究进行了评估,并提出了一种方法,说明了一个案例研究在乳腺癌。
{"title":"Mixture survival trees for cancer risk classification.","authors":"Beilin Jia,&nbsp;Donglin Zeng,&nbsp;Jason J Z Liao,&nbsp;Guanghan F Liu,&nbsp;Xianming Tan,&nbsp;Guoqing Diao,&nbsp;Joseph G Ibrahim","doi":"10.1007/s10985-022-09552-w","DOIUrl":"https://doi.org/10.1007/s10985-022-09552-w","url":null,"abstract":"<p><p>In oncology studies, it is important to understand and characterize disease heterogeneity among patients so that patients can be classified into different risk groups and one can identify high-risk patients at the right time. This information can then be used to identify a more homogeneous patient population for developing precision medicine. In this paper, we propose a mixture survival tree approach for direct risk classification. We assume that the patients can be classified into a pre-specified number of risk groups, where each group has distinct survival profile. Our proposed tree-based methods are devised to estimate latent group membership using an EM algorithm. The observed data log-likelihood function is used as the splitting criterion in recursive partitioning. The finite sample performance is evaluated by extensive simulation studies and the proposed method is illustrated by a case study in breast cancer.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10402927/pdf/nihms-1918361.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10207564","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Model selection among Dimension-Reduced generalized Cox models. 降维广义Cox模型的模型选择。
IF 1.3 3区 数学 Q2 Mathematics Pub Date : 2022-07-01 Epub Date: 2022-06-28 DOI: 10.1007/s10985-022-09565-5
Ming-Yueh Huang, Kwun Chuen Gary Chan

Conventional semiparametric hazards regression models rely on the specification of particular model formulations, such as proportional-hazards feature and single-index structures. Instead of checking these modeling assumptions one-by-one, we proposed a class of dimension-reduced generalized Cox models, and then a consistent model selection procedure among this class to select covariates with proportional-hazards feature and a proper model formulation for non-proportional-hazards covariates. In this class, the non-proportional-hazards covariates are treated in a nonparametric manner, and a partial sufficient dimension reduction is introduced to reduce the curse of dimensionality. A semiparametric efficient estimation is proposed to estimate these models. Based on the proposed estimation, we further constructed a cross-validation type criterion to consistently select the correct model among this class. Most importantly, this class of hazards regression models contains the fully nonparametric hazards regression model as the most saturated submodel, and hence no further model diagnosis is required. Overall speaking, this model selection approach is more effective than performing a sequence of conventional model checking. The proposed method is illustrated by simulation studies and a data example.

传统的半参数风险回归模型依赖于特定模型公式的规范,如比例风险特征和单指数结构。本文提出了一类降维广义Cox模型,以选择具有比例风险特征的协变量,并对非比例风险的协变量给出了合适的模型公式,而不是逐个检验这些建模假设。在本课程中,非比例风险协变量以非参数的方式处理,并引入了部分充分降维来减少维数的诅咒。提出了一种半参数有效估计方法来估计这些模型。基于所提出的估计,我们进一步构建了一个交叉验证类型准则,以便在这类模型中一致地选择正确的模型。最重要的是,这类风险回归模型包含了完全非参数风险回归模型作为最饱和的子模型,因此不需要进一步的模型诊断。总的来说,这种模型选择方法比执行一系列常规模型检查更有效。通过仿真研究和数据算例说明了该方法的有效性。
{"title":"Model selection among Dimension-Reduced generalized Cox models.","authors":"Ming-Yueh Huang,&nbsp;Kwun Chuen Gary Chan","doi":"10.1007/s10985-022-09565-5","DOIUrl":"https://doi.org/10.1007/s10985-022-09565-5","url":null,"abstract":"<p><p>Conventional semiparametric hazards regression models rely on the specification of particular model formulations, such as proportional-hazards feature and single-index structures. Instead of checking these modeling assumptions one-by-one, we proposed a class of dimension-reduced generalized Cox models, and then a consistent model selection procedure among this class to select covariates with proportional-hazards feature and a proper model formulation for non-proportional-hazards covariates. In this class, the non-proportional-hazards covariates are treated in a nonparametric manner, and a partial sufficient dimension reduction is introduced to reduce the curse of dimensionality. A semiparametric efficient estimation is proposed to estimate these models. Based on the proposed estimation, we further constructed a cross-validation type criterion to consistently select the correct model among this class. Most importantly, this class of hazards regression models contains the fully nonparametric hazards regression model as the most saturated submodel, and hence no further model diagnosis is required. Overall speaking, this model selection approach is more effective than performing a sequence of conventional model checking. The proposed method is illustrated by simulation studies and a data example.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40406545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Privacy-preserving estimation of an optimal individualized treatment rule: a case study in maximizing time to severe depression-related outcomes. 最佳个体化治疗规则的隐私保护估计:最大限度延长严重抑郁症相关结果的案例研究。
IF 1.2 3区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-07-01 Epub Date: 2022-05-02 DOI: 10.1007/s10985-022-09554-8
Erica E M Moodie, Janie Coulombe, Coraline Danieli, Christel Renoux, Susan M Shortreed

Estimating individualized treatment rules-particularly in the context of right-censored outcomes-is challenging because the treatment effect heterogeneity of interest is often small, thus difficult to detect. While this motivates the use of very large datasets such as those from multiple health systems or centres, data privacy may be of concern with participating data centres reluctant to share individual-level data. In this case study on the treatment of depression, we demonstrate an application of distributed regression for privacy protection used in combination with dynamic weighted survival modelling (DWSurv) to estimate an optimal individualized treatment rule whilst obscuring individual-level data. In simulations, we demonstrate the flexibility of this approach to address local treatment practices that may affect confounding, and show that DWSurv retains its double robustness even when performed through a (weighted) distributed regression approach. The work is motivated by, and illustrated with, an analysis of treatment for unipolar depression using the United Kingdom's Clinical Practice Research Datalink.

评估个体化治疗规则,特别是在权利审查结果的情况下,是一项挑战,因为感兴趣的治疗效果异质性通常很小,因此很难检测。虽然这促使人们使用非常大的数据集,例如来自多个卫生系统或中心的数据,但参与的数据中心可能会担心数据隐私,因为它们不愿意共享个人层面的数据。在这项关于抑郁症治疗的案例研究中,我们展示了分布式回归在隐私保护方面的应用,该应用与动态加权生存模型(DWSurv)相结合,以估计最佳的个性化治疗规则,同时掩盖个体水平的数据。在模拟中,我们证明了这种方法的灵活性,以解决可能影响混杂的局部治疗实践,并表明DWSurv即使通过(加权)分布式回归方法进行,也保持其双重稳健性。这项工作的动机是,使用英国临床实践研究数据链接对单极性抑郁症的治疗进行分析,并加以说明。
{"title":"Privacy-preserving estimation of an optimal individualized treatment rule: a case study in maximizing time to severe depression-related outcomes.","authors":"Erica E M Moodie, Janie Coulombe, Coraline Danieli, Christel Renoux, Susan M Shortreed","doi":"10.1007/s10985-022-09554-8","DOIUrl":"10.1007/s10985-022-09554-8","url":null,"abstract":"<p><p>Estimating individualized treatment rules-particularly in the context of right-censored outcomes-is challenging because the treatment effect heterogeneity of interest is often small, thus difficult to detect. While this motivates the use of very large datasets such as those from multiple health systems or centres, data privacy may be of concern with participating data centres reluctant to share individual-level data. In this case study on the treatment of depression, we demonstrate an application of distributed regression for privacy protection used in combination with dynamic weighted survival modelling (DWSurv) to estimate an optimal individualized treatment rule whilst obscuring individual-level data. In simulations, we demonstrate the flexibility of this approach to address local treatment practices that may affect confounding, and show that DWSurv retains its double robustness even when performed through a (weighted) distributed regression approach. The work is motivated by, and illustrated with, an analysis of treatment for unipolar depression using the United Kingdom's Clinical Practice Research Datalink.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.2,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10805063/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41217922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semi-supervised approach to event time annotation using longitudinal electronic health records. 使用纵向电子健康记录的事件时间注释的半监督方法。
IF 1.3 3区 数学 Q2 Mathematics Pub Date : 2022-07-01 DOI: 10.1007/s10985-022-09557-5
Liang Liang, Jue Hou, Hajime Uno, Kelly Cho, Yanyuan Ma, Tianxi Cai

Large clinical datasets derived from insurance claims and electronic health record (EHR) systems are valuable sources for precision medicine research. These datasets can be used to develop models for personalized prediction of risk or treatment response. Efficiently deriving prediction models using real world data, however, faces practical and methodological challenges. Precise information on important clinical outcomes such as time to cancer progression are not readily available in these databases. The true clinical event times typically cannot be approximated well based on simple extracts of billing or procedure codes. Whereas, annotating event times manually is time and resource prohibitive. In this paper, we propose a two-step semi-supervised multi-modal automated time annotation (MATA) method leveraging multi-dimensional longitudinal EHR encounter records. In step I, we employ a functional principal component analysis approach to estimate the underlying intensity functions based on observed point processes from the unlabeled patients. In step II, we fit a penalized proportional odds model to the event time outcomes with features derived in step I in the labeled data where the non-parametric baseline function is approximated using B-splines. Under regularity conditions, the resulting estimator of the feature effect vector is shown as root-n consistent. We demonstrate the superiority of our approach relative to existing approaches through simulations and a real data example on annotating lung cancer recurrence in an EHR cohort of lung cancer patients from Veteran Health Administration.

来自保险索赔和电子健康记录(EHR)系统的大型临床数据集是精准医学研究的宝贵资源。这些数据集可用于开发个性化预测风险或治疗反应的模型。然而,利用真实世界的数据有效地推导预测模型面临着实践和方法上的挑战。关于重要临床结果的精确信息,如癌症进展的时间,在这些数据库中并不容易获得。真实的临床事件时间通常不能根据简单的账单或程序代码的摘录很好地近似。然而,手动标注事件时间既费时又浪费资源。在本文中,我们提出了一种利用多维纵向电子病历记录的两步半监督多模态自动时间注释(MATA)方法。在第一步中,我们采用功能主成分分析方法来估计基于未标记患者的观察点过程的潜在强度函数。在步骤II中,我们将一个惩罚比例赔率模型拟合到事件时间结果中,该模型使用步骤I在标记数据中导出的特征,其中使用b样条近似非参数基线函数。在正则性条件下,得到的特征效应向量估计量为根n一致。我们通过模拟和退伍军人健康管理局肺癌患者EHR队列中肺癌复发注释的真实数据示例,证明了我们的方法相对于现有方法的优越性。
{"title":"Semi-supervised approach to event time annotation using longitudinal electronic health records.","authors":"Liang Liang,&nbsp;Jue Hou,&nbsp;Hajime Uno,&nbsp;Kelly Cho,&nbsp;Yanyuan Ma,&nbsp;Tianxi Cai","doi":"10.1007/s10985-022-09557-5","DOIUrl":"https://doi.org/10.1007/s10985-022-09557-5","url":null,"abstract":"<p><p>Large clinical datasets derived from insurance claims and electronic health record (EHR) systems are valuable sources for precision medicine research. These datasets can be used to develop models for personalized prediction of risk or treatment response. Efficiently deriving prediction models using real world data, however, faces practical and methodological challenges. Precise information on important clinical outcomes such as time to cancer progression are not readily available in these databases. The true clinical event times typically cannot be approximated well based on simple extracts of billing or procedure codes. Whereas, annotating event times manually is time and resource prohibitive. In this paper, we propose a two-step semi-supervised multi-modal automated time annotation (MATA) method leveraging multi-dimensional longitudinal EHR encounter records. In step I, we employ a functional principal component analysis approach to estimate the underlying intensity functions based on observed point processes from the unlabeled patients. In step II, we fit a penalized proportional odds model to the event time outcomes with features derived in step I in the labeled data where the non-parametric baseline function is approximated using B-splines. Under regularity conditions, the resulting estimator of the feature effect vector is shown as root-n consistent. We demonstrate the superiority of our approach relative to existing approaches through simulations and a real data example on annotating lung cancer recurrence in an EHR cohort of lung cancer patients from Veteran Health Administration.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":null,"pages":null},"PeriodicalIF":1.3,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10044535/pdf/nihms-1879201.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9194040","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
Lifetime Data Analysis
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1