首页 > 最新文献

Statistics in Medicine最新文献

英文 中文
A Precision Mixture Risk Model to Identify Adverse Drug Events in Subpopulations Using a Case-Crossover Design. 利用病例交叉设计识别亚人群药物不良事件的精确混合风险模型
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-11-30 Epub Date: 2024-09-19 DOI: 10.1002/sim.10216
Yi Shi, Michael T Eadon, Yao Chen, Anna Sun, Yuedi Yang, Chienwei Chiang, Macarius Donneyong, Jing Su, Pengyue Zhang

Despite the success of pharmacovigilance studies in detecting signals of adverse drug events (ADEs) from real-world data, the risks of ADEs in subpopulations warrant increased scrutiny to prevent them in vulnerable individuals. Recently, the case-crossover design has been implemented to leverage large-scale administrative claims data for ADE detection, while controlling both observed confounding effects and short-term fixed unobserved confounding effects. Additionally, as the case-crossover design only includes cases, subpopulations can be conveniently derived. In this manuscript, we propose a precision mixture risk model (PMRM) to identify ADE signals from subpopulations under the case-crossover design. The proposed model is able to identify signals from all ADE-subpopulation-drug combinations, while controlling for false discovery rate (FDR) and confounding effects. We applied the PMRM to an administrative claims data. We identified ADE signals in subpopulations defined by demographic variables, comorbidities, and detailed diagnosis codes. Interestingly, certain drugs were associated with a higher risk of ADE only in subpopulations, while these drugs had a neutral association with ADE in the general population. Additionally, the PMRM could control FDR at a desired level and had a higher probability to detect true ADE signals than the widely used McNemar's test. In conclusion, the PMRM is able to identify subpopulation-specific ADE signals from a tremendous number of ADE-subpopulation-drug combinations, while controlling for both FDR and confounding effects.

尽管药物警戒研究成功地从真实世界的数据中发现了药物不良事件(ADEs)的信号,但是亚人群中的 ADEs 风险仍值得加强关注,以防止易受影响的个体发生 ADEs。最近,人们开始采用病例交叉设计来利用大规模行政索赔数据检测 ADE,同时控制观察到的混杂效应和短期固定的未观察到的混杂效应。此外,由于病例交叉设计只包括病例,因此可以方便地得出亚人群。在本手稿中,我们提出了一种精确混合风险模型(PMRM),用于在病例交叉设计下从亚人群中识别 ADE 信号。所提出的模型能够识别来自所有 ADE-亚人群-药物组合的信号,同时控制虚假发现率 (FDR) 和混杂效应。我们将 PMRM 应用于行政索赔数据。我们发现了由人口统计学变量、合并症和详细诊断代码定义的亚人群中的 ADE 信号。有趣的是,某些药物仅在亚人群中与较高的 ADE 风险相关,而在一般人群中,这些药物与 ADE 的关系则是中性的。此外,PMRM 可以将 FDR 控制在理想水平,与广泛使用的 McNemar 检验相比,它检测到真实 ADE 信号的概率更高。总之,PMRM 能够从大量的 ADE-亚人群-药物组合中识别出亚人群特异性 ADE 信号,同时控制 FDR 和混杂效应。
{"title":"A Precision Mixture Risk Model to Identify Adverse Drug Events in Subpopulations Using a Case-Crossover Design.","authors":"Yi Shi, Michael T Eadon, Yao Chen, Anna Sun, Yuedi Yang, Chienwei Chiang, Macarius Donneyong, Jing Su, Pengyue Zhang","doi":"10.1002/sim.10216","DOIUrl":"10.1002/sim.10216","url":null,"abstract":"<p><p>Despite the success of pharmacovigilance studies in detecting signals of adverse drug events (ADEs) from real-world data, the risks of ADEs in subpopulations warrant increased scrutiny to prevent them in vulnerable individuals. Recently, the case-crossover design has been implemented to leverage large-scale administrative claims data for ADE detection, while controlling both observed confounding effects and short-term fixed unobserved confounding effects. Additionally, as the case-crossover design only includes cases, subpopulations can be conveniently derived. In this manuscript, we propose a precision mixture risk model (PMRM) to identify ADE signals from subpopulations under the case-crossover design. The proposed model is able to identify signals from all ADE-subpopulation-drug combinations, while controlling for false discovery rate (FDR) and confounding effects. We applied the PMRM to an administrative claims data. We identified ADE signals in subpopulations defined by demographic variables, comorbidities, and detailed diagnosis codes. Interestingly, certain drugs were associated with a higher risk of ADE only in subpopulations, while these drugs had a neutral association with ADE in the general population. Additionally, the PMRM could control FDR at a desired level and had a higher probability to detect true ADE signals than the widely used McNemar's test. In conclusion, the PMRM is able to identify subpopulation-specific ADE signals from a tremendous number of ADE-subpopulation-drug combinations, while controlling for both FDR and confounding effects.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":" ","pages":"5088-5099"},"PeriodicalIF":1.8,"publicationDate":"2024-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142295957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Latent Archetypes of the Spatial Patterns of Cancer. 癌症空间模式的潜在原型。
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-11-30 Epub Date: 2024-10-03 DOI: 10.1002/sim.10232
Thaís Pacheco Menezes, Marcos Oliveira Prates, Renato Assunção, Mônica Silva Monteiro De Castro

The cancer atlas edited by several countries is the main resource for the analysis of the geographic variation of cancer risk. Correlating the observed spatial patterns with known or hypothesized risk factors is time-consuming work for epidemiologists who need to deal with each cancer separately, breaking down the patterns according to sex and race. The recent literature has proposed to study more than one cancer simultaneously looking for common spatial risk factors. However, this previous work has two constraints: they consider only a very small (2-4) number of cancers previously known to share risk factors. In this article, we propose an exploratory method to search for latent spatial risk factors of a large number of supposedly unrelated cancers. The method is based on the singular value decomposition and nonnegative matrix factorization, it is computationally efficient, scaling easily with the number of regions and cancers. We carried out a simulation study to evaluate the method's performance and apply it to cancer atlas from the USA, England, France, Australia, Spain, and Brazil. We conclude that with very few latent maps, which can represent a reduction of up to 90% of atlas maps, most of the spatial variability is conserved. By concentrating on the epidemiological analysis of these few latent maps a substantial amount of work is saved and, at the same time, high-level explanations affecting many cancers simultaneously can be reached.

一些国家编辑的癌症地图集是分析癌症风险地域差异的主要资源。将观察到的空间模式与已知或假设的风险因素联系起来,对流行病学家来说是一项耗时的工作,他们需要根据性别和种族分别处理每种癌症的模式。最近有文献建议同时研究一种以上的癌症,寻找共同的空间风险因素。然而,以往的工作有两个限制因素:他们只考虑了极少数(2-4 种)之前已知具有共同风险因素的癌症。在本文中,我们提出了一种探索性方法,用于搜索大量本不相关的癌症的潜在空间风险因素。该方法基于奇异值分解和非负矩阵因式分解,计算效率高,很容易随着区域和癌症数量的增加而扩展。我们进行了一项模拟研究来评估该方法的性能,并将其应用于美国、英国、法国、澳大利亚、西班牙和巴西的癌症图谱。我们得出的结论是,只需极少量的潜在地图(可减少地图集地图的 90%),大部分空间变异性就能得到保留。通过集中精力对这些少量的潜在地图进行流行病学分析,可以节省大量的工作,同时还可以同时对许多癌症进行高层次的解释。
{"title":"Latent Archetypes of the Spatial Patterns of Cancer.","authors":"Thaís Pacheco Menezes, Marcos Oliveira Prates, Renato Assunção, Mônica Silva Monteiro De Castro","doi":"10.1002/sim.10232","DOIUrl":"10.1002/sim.10232","url":null,"abstract":"<p><p>The cancer atlas edited by several countries is the main resource for the analysis of the geographic variation of cancer risk. Correlating the observed spatial patterns with known or hypothesized risk factors is time-consuming work for epidemiologists who need to deal with each cancer separately, breaking down the patterns according to sex and race. The recent literature has proposed to study more than one cancer simultaneously looking for common spatial risk factors. However, this previous work has two constraints: they consider only a very small (2-4) number of cancers previously known to share risk factors. In this article, we propose an exploratory method to search for latent spatial risk factors of a large number of supposedly unrelated cancers. The method is based on the singular value decomposition and nonnegative matrix factorization, it is computationally efficient, scaling easily with the number of regions and cancers. We carried out a simulation study to evaluate the method's performance and apply it to cancer atlas from the USA, England, France, Australia, Spain, and Brazil. We conclude that with very few latent maps, which can represent a reduction of up to 90% of atlas maps, most of the spatial variability is conserved. By concentrating on the epidemiological analysis of these few latent maps a substantial amount of work is saved and, at the same time, high-level explanations affecting many cancers simultaneously can be reached.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":" ","pages":"5115-5137"},"PeriodicalIF":1.8,"publicationDate":"2024-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11583956/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142372925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Causal Inference Over a Subpopulation: The Effect of Malaria Vaccine in Women During Pregnancy. 子群体的因果推论:疟疾疫苗对孕期妇女的影响》。
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-11-30 Epub Date: 2024-10-07 DOI: 10.1002/sim.10228
Zonghui Hu, Dean Follmann

Preventing malaria during pregnancy is of critical importance, yet there are no approved malaria vaccines for pregnant women due to lack of efficacy results within this population. Conducting a randomized trial in pregnant women throughout the entire duration of pregnancy is impractical. Instead, a randomized trial was conducted among women of childbearing potential (WOCBP), and some participants became pregnant during the 2-year study. We explore a statistical method for estimating vaccine effect within the target subpopulation-women who can naturally become pregnant, namely, women who can become pregnant under a placebo condition-within the causal inference framework. Two vaccine effect estimators are employed to effectively utilize baseline characteristics and account for the fact that certain baseline characteristics were only available from pregnant participants. The first estimator considers all participants but can only utilize baseline variables collected from the entire participant pool. In contrast, the second estimator, which includes only pregnant participants, utilizes all available baseline information. Both estimators are evaluated numerically through simulation studies and applied to the WOCBP trial to assess vaccine effect against pregnancy malaria.

在怀孕期间预防疟疾至关重要,但目前还没有针对孕妇的疟疾疫苗获得批准,因为在这一人群中缺乏疗效结果。在整个孕期对孕妇进行随机试验是不切实际的。因此,我们在有生育能力的妇女(WOCBP)中开展了一项随机试验,一些参与者在为期 2 年的研究期间怀孕。我们在因果推论框架内探索了一种统计方法,用于估计目标亚群(即可以自然怀孕的女性,即在安慰剂条件下可以怀孕的女性)中的疫苗效应。为了有效利用基线特征,并考虑到只有怀孕参与者才能提供某些基线特征这一事实,我们采用了两种疫苗效应估计方法。第一种估计方法考虑了所有参与者,但只能利用从整个参与者库中收集的基线变量。与此相反,第二个估计器只包括怀孕的参与者,利用了所有可用的基线信息。我们通过模拟研究对这两种估计器进行了数值评估,并将其应用于世界妊娠期疟疾疫苗试验,以评估疫苗对妊娠期疟疾的效果。
{"title":"Causal Inference Over a Subpopulation: The Effect of Malaria Vaccine in Women During Pregnancy.","authors":"Zonghui Hu, Dean Follmann","doi":"10.1002/sim.10228","DOIUrl":"10.1002/sim.10228","url":null,"abstract":"<p><p>Preventing malaria during pregnancy is of critical importance, yet there are no approved malaria vaccines for pregnant women due to lack of efficacy results within this population. Conducting a randomized trial in pregnant women throughout the entire duration of pregnancy is impractical. Instead, a randomized trial was conducted among women of childbearing potential (WOCBP), and some participants became pregnant during the 2-year study. We explore a statistical method for estimating vaccine effect within the target subpopulation-women who can naturally become pregnant, namely, women who can become pregnant under a placebo condition-within the causal inference framework. Two vaccine effect estimators are employed to effectively utilize baseline characteristics and account for the fact that certain baseline characteristics were only available from pregnant participants. The first estimator considers all participants but can only utilize baseline variables collected from the entire participant pool. In contrast, the second estimator, which includes only pregnant participants, utilizes all available baseline information. Both estimators are evaluated numerically through simulation studies and applied to the WOCBP trial to assess vaccine effect against pregnancy malaria.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":" ","pages":"5193-5202"},"PeriodicalIF":1.8,"publicationDate":"2024-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11583954/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142393471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pairwise Accelerated Failure Time Regression Models for Infectious Disease Transmission in Close-Contact Groups With External Sources of Infection. 有外部传染源的密切接触群体中传染病传播的成对加速失败时间回归模型。
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-11-30 Epub Date: 2024-10-03 DOI: 10.1002/sim.10226
Yushuf Sharker, Zaynab Diallo, Wasiur R KhudaBukhsh, Eben Kenah

Many important questions in infectious disease epidemiology involve associations between covariates (e.g., age or vaccination status) and infectiousness or susceptibility. Because disease transmission produces dependent outcomes, these questions are difficult or impossible to address using standard regression models from biostatistics. Pairwise survival analysis handles dependent outcomes by calculating likelihoods in terms of contact interval distributions in ordered pairs of individuals. The contact interval in the ordered pair i j $$ ij $$ is the time from the onset of infectiousness in i $$ i $$ to infectious contact from i $$ i $$ to j $$ j $$ , where an infectious contact is sufficient to infect j $$ j $$ if they are susceptible. Here, we introduce a pairwise accelerated failure time regression model for infectious disease transmission that allows the rate parameter of the contact interval distribution to depend on individual-level infectiousness covariates for i $$ i $$ , individual-level susceptibility covariates for j $$ j $$ , and pair-level covariates (e.g., type of relationship). This model can simultaneously handle internal infections (caused by transmission between individuals under observation) and external infections (caused by environmental or community sources of infection). We show that this model produces consistent and asymptotically normal parameter estimates. In a simulation study, we evaluate bias and confidence interval coverage probabilities, explore the role of epidemiologic study design, and investigate the effects of model misspecification. We use this regression model to analyze household data from Los Angeles County during the 2009 influenza A (H1N1) pandemic, where we find that the ability to account for external sources of infection increases the statistical power to estimate the effect of antiviral prophylaxis.

传染病流行病学中的许多重要问题都涉及协变量(如年龄或疫苗接种状况)与传染性或易感性之间的关联。由于疾病传播会产生依赖性结果,这些问题很难或不可能用生物统计学的标准回归模型来解决。配对生存分析通过计算有序配对个体接触间隔分布的可能性来处理依赖性结果。有序配对 i j $$ ij $$ 中的接触间隔是指从 i $$ i $$ 开始感染到 i $$ i $$ 与 j $$ j $$ 发生感染性接触的时间,其中,如果 j $$ j $$ 是易感人群,则感染性接触足以感染他们。在此,我们引入了一个传染病传播的成对加速失败时间回归模型,该模型允许接触间隔分布的速率参数取决于 i $$ i $$ 的个体层面感染性协变量、j $$ j $$ 的个体层面易感性协变量和成对层面协变量(如关系类型)。该模型可同时处理内部感染(由观察对象之间的传播引起)和外部感染(由环境或社区感染源引起)。我们的研究表明,该模型可得出一致且渐近正态的参数估计值。在模拟研究中,我们对偏差和置信区间覆盖概率进行了评估,探讨了流行病学研究设计的作用,并研究了模型不规范的影响。我们使用该回归模型分析了 2009 年甲型 H1N1 流感大流行期间洛杉矶县的家庭数据,发现考虑外部感染源的能力提高了估计抗病毒预防效果的统计能力。
{"title":"Pairwise Accelerated Failure Time Regression Models for Infectious Disease Transmission in Close-Contact Groups With External Sources of Infection.","authors":"Yushuf Sharker, Zaynab Diallo, Wasiur R KhudaBukhsh, Eben Kenah","doi":"10.1002/sim.10226","DOIUrl":"10.1002/sim.10226","url":null,"abstract":"<p><p>Many important questions in infectious disease epidemiology involve associations between covariates (e.g., age or vaccination status) and infectiousness or susceptibility. Because disease transmission produces dependent outcomes, these questions are difficult or impossible to address using standard regression models from biostatistics. Pairwise survival analysis handles dependent outcomes by calculating likelihoods in terms of contact interval distributions in ordered pairs of individuals. The contact interval in the ordered pair <math> <semantics><mrow><mi>i</mi> <mi>j</mi></mrow> <annotation>$$ ij $$</annotation></semantics> </math> is the time from the onset of infectiousness in <math> <semantics><mrow><mi>i</mi></mrow> <annotation>$$ i $$</annotation></semantics> </math> to infectious contact from <math> <semantics><mrow><mi>i</mi></mrow> <annotation>$$ i $$</annotation></semantics> </math> to <math> <semantics><mrow><mi>j</mi></mrow> <annotation>$$ j $$</annotation></semantics> </math> , where an infectious contact is sufficient to infect <math> <semantics><mrow><mi>j</mi></mrow> <annotation>$$ j $$</annotation></semantics> </math> if they are susceptible. Here, we introduce a pairwise accelerated failure time regression model for infectious disease transmission that allows the rate parameter of the contact interval distribution to depend on individual-level infectiousness covariates for <math> <semantics><mrow><mi>i</mi></mrow> <annotation>$$ i $$</annotation></semantics> </math> , individual-level susceptibility covariates for <math> <semantics><mrow><mi>j</mi></mrow> <annotation>$$ j $$</annotation></semantics> </math> , and pair-level covariates (e.g., type of relationship). This model can simultaneously handle internal infections (caused by transmission between individuals under observation) and external infections (caused by environmental or community sources of infection). We show that this model produces consistent and asymptotically normal parameter estimates. In a simulation study, we evaluate bias and confidence interval coverage probabilities, explore the role of epidemiologic study design, and investigate the effects of model misspecification. We use this regression model to analyze household data from Los Angeles County during the 2009 influenza A (H1N1) pandemic, where we find that the ability to account for external sources of infection increases the statistical power to estimate the effect of antiviral prophylaxis.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":" ","pages":"5138-5154"},"PeriodicalIF":1.8,"publicationDate":"2024-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11583957/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142372926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Trivariate Joint Modeling for Family Data with Longitudinal Counts, Recurrent Events and a Terminal Event with Application to Lynch Syndrome. 具有纵向计数、复发事件和终末事件的家庭数据的三变量联合建模,并应用于林奇综合征。
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-11-20 Epub Date: 2024-09-15 DOI: 10.1002/sim.10210
Jingwei Lu, Grace Y Yi, Denis Rustand, Patrick Parfrey, Laurent Briollais, Yun-Hee Choi

Trivariate joint modeling for longitudinal count data, recurrent events, and a terminal event for family data has increased interest in medical studies. For example, families with Lynch syndrome (LS) are at high risk of developing colorectal cancer (CRC), where the number of polyps and the frequency of colonoscopy screening visits are highly associated with the risk of CRC among individuals and families. To assess how screening visits influence polyp detection, which in turn influences time to CRC, we propose a clustered trivariate joint model. The proposed model facilitates longitudinal count data that are zero-inflated and over-dispersed and invokes individual-specific and family-specific random effects to account for dependence among individuals and families. We formulate our proposed model as a latent Gaussian model to use the Bayesian estimation approach with the integrated nested Laplace approximation algorithm and evaluate its performance using simulation studies. Our trivariate joint model is applied to a series of 18 families from Newfoundland, with the occurrence of CRC taken as the terminal event, the colonoscopy screening visits as recurrent events, and the number of polyps detected at each visit as zero-inflated count data with overdispersion. We showed that our trivariate model fits better than alternative bivariate models and that the cluster effects should not be ignored when analyzing family data. Finally, the proposed model enables us to quantify heterogeneity across families and individuals in polyp detection and CRC risk, thus helping to identify individuals and families who would benefit from more intensive screening visits.

针对纵向计数数据、复发性事件和家庭数据的终末事件的三变量联合建模在医学研究中越来越受到关注。例如,林奇综合征(Lynch Syndrome,LS)患者家族罹患结直肠癌(CRC)的风险很高,其中息肉数量和结肠镜筛查次数与个人和家族罹患 CRC 的风险高度相关。为了评估筛查次数如何影响息肉检测,进而影响患上 CRC 的时间,我们提出了一个聚类三变量联合模型。该模型适用于零膨胀和过度分散的纵向计数数据,并引用个体特异性和家庭特异性随机效应来解释个体和家庭之间的依赖关系。我们将提议的模型表述为潜在高斯模型,使用贝叶斯估计方法和集成嵌套拉普拉斯近似算法,并通过模拟研究评估其性能。我们的三变量联合模型应用于纽芬兰省的 18 个家庭,将 CRC 的发生作为终结事件,结肠镜筛查就诊作为重复事件,每次就诊检测到的息肉数量作为具有过度分散性的零膨胀计数数据。我们的研究表明,我们的三变量模型比其他的二变量模型拟合得更好,而且在分析家族数据时不应忽略群集效应。最后,所提出的模型使我们能够量化不同家庭和个人在息肉检测和 CRC 风险方面的异质性,从而帮助确定哪些个人和家庭可以从更密集的筛查访问中获益。
{"title":"Trivariate Joint Modeling for Family Data with Longitudinal Counts, Recurrent Events and a Terminal Event with Application to Lynch Syndrome.","authors":"Jingwei Lu, Grace Y Yi, Denis Rustand, Patrick Parfrey, Laurent Briollais, Yun-Hee Choi","doi":"10.1002/sim.10210","DOIUrl":"10.1002/sim.10210","url":null,"abstract":"<p><p>Trivariate joint modeling for longitudinal count data, recurrent events, and a terminal event for family data has increased interest in medical studies. For example, families with Lynch syndrome (LS) are at high risk of developing colorectal cancer (CRC), where the number of polyps and the frequency of colonoscopy screening visits are highly associated with the risk of CRC among individuals and families. To assess how screening visits influence polyp detection, which in turn influences time to CRC, we propose a clustered trivariate joint model. The proposed model facilitates longitudinal count data that are zero-inflated and over-dispersed and invokes individual-specific and family-specific random effects to account for dependence among individuals and families. We formulate our proposed model as a latent Gaussian model to use the Bayesian estimation approach with the integrated nested Laplace approximation algorithm and evaluate its performance using simulation studies. Our trivariate joint model is applied to a series of 18 families from Newfoundland, with the occurrence of CRC taken as the terminal event, the colonoscopy screening visits as recurrent events, and the number of polyps detected at each visit as zero-inflated count data with overdispersion. We showed that our trivariate model fits better than alternative bivariate models and that the cluster effects should not be ignored when analyzing family data. Finally, the proposed model enables us to quantify heterogeneity across families and individuals in polyp detection and CRC risk, thus helping to identify individuals and families who would benefit from more intensive screening visits.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":" ","pages":"5000-5022"},"PeriodicalIF":1.8,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142295958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Functional Principal Component Analysis as an Alternative to Mixed-Effect Models for Describing Sparse Repeated Measures in Presence of Missing Data. 功能主成分分析作为混合效应模型的替代方法,用于描述缺失数据情况下的稀疏重复测量。
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-11-20 Epub Date: 2024-09-09 DOI: 10.1002/sim.10214
Corentin Ségalas, Catherine Helmer, Robin Genuer, Cécile Proust-Lima

Analyzing longitudinal data in health studies is challenging due to sparse and error-prone measurements, strong within-individual correlation, missing data and various trajectory shapes. While mixed-effect models (MM) effectively address these challenges, they remain parametric models and may incur computational costs. In contrast, functional principal component analysis (FPCA) is a non-parametric approach developed for regular and dense functional data that flexibly describes temporal trajectories at a potentially lower computational cost. This article presents an empirical simulation study evaluating the behavior of FPCA with sparse and error-prone repeated measures and its robustness under different missing data schemes in comparison with MM. The results show that FPCA is well-suited in the presence of missing at random data caused by dropout, except in scenarios involving most frequent and systematic dropout. Like MM, FPCA fails under missing not at random mechanism. The FPCA was applied to describe the trajectories of four cognitive functions before clinical dementia and contrast them with those of matched controls in a case-control study nested in a population-based aging cohort. The average cognitive declines of future dementia cases showed a sudden divergence from those of their matched controls with a sharp acceleration 5 to 2.5 years prior to diagnosis.

由于测量数据稀疏且容易出错、个体内部相关性强、数据缺失以及轨迹形状各异,分析健康研究中的纵向数据具有挑战性。虽然混合效应模型(MM)能有效解决这些难题,但它们仍然是参数模型,可能会产生计算成本。相比之下,函数主成分分析(FPCA)是一种针对规则和密集函数数据开发的非参数方法,能以较低的计算成本灵活描述时间轨迹。本文介绍了一项实证模拟研究,评估了 FPCA 在稀疏且易出错的重复测量中的表现,以及它与 MM 相比在不同缺失数据方案下的鲁棒性。研究结果表明,FPCA 非常适合因遗漏而导致的随机数据缺失,但涉及最频繁和系统性遗漏的情况除外。与 MM 一样,FPCA 在非随机缺失机制下也会失效。在一项嵌套于人口老龄化队列的病例对照研究中,应用 FPCA 描述了临床痴呆前四种认知功能的变化轨迹,并与匹配对照组的认知功能变化轨迹进行了对比。未来痴呆症病例的平均认知功能衰退与匹配对照组的平均认知功能衰退出现了突然的背离,在确诊前 5 到 2.5 年出现了急剧的加速。
{"title":"Functional Principal Component Analysis as an Alternative to Mixed-Effect Models for Describing Sparse Repeated Measures in Presence of Missing Data.","authors":"Corentin Ségalas, Catherine Helmer, Robin Genuer, Cécile Proust-Lima","doi":"10.1002/sim.10214","DOIUrl":"10.1002/sim.10214","url":null,"abstract":"<p><p>Analyzing longitudinal data in health studies is challenging due to sparse and error-prone measurements, strong within-individual correlation, missing data and various trajectory shapes. While mixed-effect models (MM) effectively address these challenges, they remain parametric models and may incur computational costs. In contrast, functional principal component analysis (FPCA) is a non-parametric approach developed for regular and dense functional data that flexibly describes temporal trajectories at a potentially lower computational cost. This article presents an empirical simulation study evaluating the behavior of FPCA with sparse and error-prone repeated measures and its robustness under different missing data schemes in comparison with MM. The results show that FPCA is well-suited in the presence of missing at random data caused by dropout, except in scenarios involving most frequent and systematic dropout. Like MM, FPCA fails under missing not at random mechanism. The FPCA was applied to describe the trajectories of four cognitive functions before clinical dementia and contrast them with those of matched controls in a case-control study nested in a population-based aging cohort. The average cognitive declines of future dementia cases showed a sudden divergence from those of their matched controls with a sharp acceleration 5 to 2.5 years prior to diagnosis.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":" ","pages":"4899-4912"},"PeriodicalIF":1.8,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142154969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Selection of number of clusters and warping penalty in clustering functional electrocardiogram. 功能性心电图聚类中的聚类数选择和扭曲惩罚
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-11-20 Epub Date: 2024-09-09 DOI: 10.1002/sim.10192
Wei Yang, Harold I Feldman, Wensheng Guo

Clustering functional data aims to identify unique functional patterns in the entire domain, but this can be challenging due to phase variability that distorts the observed patterns. Curve registration can be used to remove this variability, but determining the appropriate level of warping flexibility can be complicated. Curve registration also requires a target to which a functional object is aligned, typically the cross-sectional mean of functional objects within the same cluster. However, this mean is unknown prior to clustering. Furthermore, there is a trade-off between flexible warping and the number of resulting clusters. Removing more phase variability through curve registration can lead to fewer remaining variations in the functional data, resulting in a smaller number of clusters. Thus, the optimal number of clusters and warping flexibility cannot be uniquely identified. We propose to use external information to solve the identification issue. We define a cross validated Kullback-Leibler information criterion to select the number of clusters and the warping penalty. The criterion is derived from the predictive classification likelihood considering the joint distribution of both the functional data and external variable and penalizes the uncertainty in the cluster membership. We evaluate our method through simulation and apply it to electrocardiographic data collected in the Chronic Renal Insufficiency Cohort study. We identify two distinct clusters of electrocardiogram (ECG) profiles, with the second cluster exhibiting ST segment depression, an indication of cardiac ischemia, compared to the normal ECG profiles in the first cluster.

对功能数据进行聚类的目的是识别整个领域中的独特功能模式,但由于相位变异会扭曲观察到的模式,这可能具有挑战性。曲线配准可用于消除这种可变性,但确定适当程度的翘曲灵活性可能比较复杂。曲线配准还需要一个与功能对象对齐的目标,通常是同一群组中功能对象的横截面平均值。然而,在聚类之前,这个平均值是未知的。此外,在灵活翘曲和由此产生的聚类数量之间需要权衡。通过曲线配准去除更多的相位变异会导致功能数据中剩余的变异减少,从而导致聚类数量减少。因此,聚类的最佳数量和翘曲的灵活性无法唯一确定。我们建议使用外部信息来解决识别问题。我们定义了一个经过交叉验证的库尔贝克-莱伯勒信息准则来选择聚类数量和翘曲惩罚。该准则源于预测分类可能性,考虑了功能数据和外部变量的联合分布,并对群组成员的不确定性进行惩罚。我们通过模拟评估了我们的方法,并将其应用于慢性肾功能不全队列研究中收集的心电图数据。我们确定了两个不同的心电图(ECG)集群,与第一个集群中正常的心电图相比,第二个集群表现出 ST 段压低,这是心脏缺血的迹象。
{"title":"Selection of number of clusters and warping penalty in clustering functional electrocardiogram.","authors":"Wei Yang, Harold I Feldman, Wensheng Guo","doi":"10.1002/sim.10192","DOIUrl":"10.1002/sim.10192","url":null,"abstract":"<p><p>Clustering functional data aims to identify unique functional patterns in the entire domain, but this can be challenging due to phase variability that distorts the observed patterns. Curve registration can be used to remove this variability, but determining the appropriate level of warping flexibility can be complicated. Curve registration also requires a target to which a functional object is aligned, typically the cross-sectional mean of functional objects within the same cluster. However, this mean is unknown prior to clustering. Furthermore, there is a trade-off between flexible warping and the number of resulting clusters. Removing more phase variability through curve registration can lead to fewer remaining variations in the functional data, resulting in a smaller number of clusters. Thus, the optimal number of clusters and warping flexibility cannot be uniquely identified. We propose to use external information to solve the identification issue. We define a cross validated Kullback-Leibler information criterion to select the number of clusters and the warping penalty. The criterion is derived from the predictive classification likelihood considering the joint distribution of both the functional data and external variable and penalizes the uncertainty in the cluster membership. We evaluate our method through simulation and apply it to electrocardiographic data collected in the Chronic Renal Insufficiency Cohort study. We identify two distinct clusters of electrocardiogram (ECG) profiles, with the second cluster exhibiting ST segment depression, an indication of cardiac ischemia, compared to the normal ECG profiles in the first cluster.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":" ","pages":"4913-4927"},"PeriodicalIF":1.8,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11499710/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142154970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating analytic models for individually randomized group treatment trials with complex clustering in nested and crossed designs. 评估嵌套和交叉设计中具有复杂聚类的单独随机分组治疗试验的分析模型。
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-11-10 Epub Date: 2024-09-03 DOI: 10.1002/sim.10206
Jonathan C Moyer, Fan Li, Andrea J Cook, Patrick J Heagerty, Sherri L Pals, Elizabeth L Turner, Rui Wang, Yunji Zhou, Qilu Yu, Xueqi Wang, David M Murray

Many individually randomized group treatment (IRGT) trials randomly assign individuals to study arms but deliver treatments via shared agents, such as therapists, surgeons, or trainers. Post-randomization interactions induce correlations in outcome measures between participants sharing the same agent. Agents can be nested in or crossed with trial arm, and participants may interact with a single agent or with multiple agents. These complications have led to ambiguity in choice of models but there have been no systematic efforts to identify appropriate analytic models for these study designs. To address this gap, we undertook a simulation study to examine the performance of candidate analytic models in the presence of complex clustering arising from multiple membership, single membership, and single agent settings, in both nested and crossed designs and for a continuous outcome. With nested designs, substantial type I error rate inflation was observed when analytic models did not account for multiple membership and when analytic model weights characterizing the association with multiple agents did not match the data generating mechanism. Conversely, analytic models for crossed designs generally maintained nominal type I error rates unless there was notable imbalance in the number of participants that interact with each agent.

许多个体随机分组治疗(IRGT)试验将个体随机分配到研究臂,但通过共享代理(如治疗师、外科医生或培训师)提供治疗。随机化后的交互作用会诱发共享相同代理的参与者之间的结果测量相关性。代理人可以嵌套在试验臂中,也可以与试验臂交叉,参与者可以与单个代理人或多个代理人互动。这些复杂因素导致了模型选择的模糊性,但目前还没有系统性的工作来为这些研究设计确定合适的分析模型。为了填补这一空白,我们开展了一项模拟研究,以考察候选分析模型在嵌套设计和交叉设计中,在连续结果下,在多成员、单成员和单代理等复杂聚类情况下的表现。在嵌套设计中,当分析模型没有考虑多重成员时,以及当分析模型权重表征与多个代理的关联与数据生成机制不匹配时,观察到 I 类错误率大幅上升。相反,交叉设计的分析模型通常保持名义 I 型误差率,除非与每个代理互动的参与者人数明显失衡。
{"title":"Evaluating analytic models for individually randomized group treatment trials with complex clustering in nested and crossed designs.","authors":"Jonathan C Moyer, Fan Li, Andrea J Cook, Patrick J Heagerty, Sherri L Pals, Elizabeth L Turner, Rui Wang, Yunji Zhou, Qilu Yu, Xueqi Wang, David M Murray","doi":"10.1002/sim.10206","DOIUrl":"10.1002/sim.10206","url":null,"abstract":"<p><p>Many individually randomized group treatment (IRGT) trials randomly assign individuals to study arms but deliver treatments via shared agents, such as therapists, surgeons, or trainers. Post-randomization interactions induce correlations in outcome measures between participants sharing the same agent. Agents can be nested in or crossed with trial arm, and participants may interact with a single agent or with multiple agents. These complications have led to ambiguity in choice of models but there have been no systematic efforts to identify appropriate analytic models for these study designs. To address this gap, we undertook a simulation study to examine the performance of candidate analytic models in the presence of complex clustering arising from multiple membership, single membership, and single agent settings, in both nested and crossed designs and for a continuous outcome. With nested designs, substantial type I error rate inflation was observed when analytic models did not account for multiple membership and when analytic model weights characterizing the association with multiple agents did not match the data generating mechanism. Conversely, analytic models for crossed designs generally maintained nominal type I error rates unless there was notable imbalance in the number of participants that interact with each agent.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":" ","pages":"4796-4818"},"PeriodicalIF":1.8,"publicationDate":"2024-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142120561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-Dimensional Overdispersed Generalized Factor Model With Application to Single-Cell Sequencing Data Analysis. 应用于单细胞测序数据分析的高维过度分散广义因子模型
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-11-10 Epub Date: 2024-09-05 DOI: 10.1002/sim.10213
Jinyu Nie, Zhilong Qin, Wei Liu

The current high-dimensional linear factor models fail to account for the different types of variables, while high-dimensional nonlinear factor models often overlook the overdispersion present in mixed-type data. However, overdispersion is prevalent in practical applications, particularly in fields like biomedical and genomics studies. To address this practical demand, we propose an overdispersed generalized factor model (OverGFM) for performing high-dimensional nonlinear factor analysis on overdispersed mixed-type data. Our approach incorporates an additional error term to capture the overdispersion that cannot be accounted for by factors alone. However, this introduces significant computational challenges due to the involvement of two high-dimensional latent random matrices in the nonlinear model. To overcome these challenges, we propose a novel variational EM algorithm that integrates Laplace and Taylor approximations. This algorithm provides iterative explicit solutions for the complex variational parameters and is proven to possess excellent convergence properties. We also develop a criterion based on the singular value ratio to determine the optimal number of factors. Numerical results demonstrate the effectiveness of this criterion. Through comprehensive simulation studies, we show that OverGFM outperforms state-of-the-art methods in terms of estimation accuracy and computational efficiency. Furthermore, we demonstrate the practical merit of our method through its application to two datasets from genomics. To facilitate its usage, we have integrated the implementation of OverGFM into the R package GFM.

目前的高维线性因子模型无法解释不同类型的变量,而高维非线性因子模型往往忽略了混合型数据中存在的过度分散性。然而,在实际应用中,尤其是在生物医学和基因组学研究等领域,超分散现象十分普遍。针对这一实际需求,我们提出了一种超分散广义因子模型(OverGFM),用于对超分散混合型数据进行高维非线性因子分析。我们的方法包含一个额外的误差项,以捕捉仅靠因子无法解释的超分散性。然而,由于非线性模型中涉及两个高维潜在随机矩阵,这给计算带来了巨大挑战。为了克服这些挑战,我们提出了一种整合拉普拉斯和泰勒近似的新型变分电磁算法。该算法为复杂的变分参数提供了迭代显式解,并被证明具有出色的收敛特性。我们还开发了一种基于奇异值比率的标准,以确定最佳因子数。数值结果证明了这一标准的有效性。通过全面的模拟研究,我们表明 OverGFM 在估计精度和计算效率方面都优于最先进的方法。此外,我们还通过将该方法应用于两个基因组学数据集,证明了它的实用性。为了方便使用,我们将 OverGFM 的实现集成到了 R 软件包 GFM 中。
{"title":"High-Dimensional Overdispersed Generalized Factor Model With Application to Single-Cell Sequencing Data Analysis.","authors":"Jinyu Nie, Zhilong Qin, Wei Liu","doi":"10.1002/sim.10213","DOIUrl":"10.1002/sim.10213","url":null,"abstract":"<p><p>The current high-dimensional linear factor models fail to account for the different types of variables, while high-dimensional nonlinear factor models often overlook the overdispersion present in mixed-type data. However, overdispersion is prevalent in practical applications, particularly in fields like biomedical and genomics studies. To address this practical demand, we propose an overdispersed generalized factor model (OverGFM) for performing high-dimensional nonlinear factor analysis on overdispersed mixed-type data. Our approach incorporates an additional error term to capture the overdispersion that cannot be accounted for by factors alone. However, this introduces significant computational challenges due to the involvement of two high-dimensional latent random matrices in the nonlinear model. To overcome these challenges, we propose a novel variational EM algorithm that integrates Laplace and Taylor approximations. This algorithm provides iterative explicit solutions for the complex variational parameters and is proven to possess excellent convergence properties. We also develop a criterion based on the singular value ratio to determine the optimal number of factors. Numerical results demonstrate the effectiveness of this criterion. Through comprehensive simulation studies, we show that OverGFM outperforms state-of-the-art methods in terms of estimation accuracy and computational efficiency. Furthermore, we demonstrate the practical merit of our method through its application to two datasets from genomics. To facilitate its usage, we have integrated the implementation of OverGFM into the R package GFM.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":" ","pages":"4836-4849"},"PeriodicalIF":1.8,"publicationDate":"2024-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142141100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multilevel Longitudinal Functional Principal Component Model. 多层次纵向功能主成分模型。
IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-11-10 Epub Date: 2024-09-03 DOI: 10.1002/sim.10207
Wenyi Lin, Jingjing Zou, Chongzhi Di, Cheryl L Rock, Loki Natarajan

Sensor devices, such as accelerometers, are widely used for measuring physical activity (PA). These devices provide outputs at fine granularity (e.g., 10-100 Hz or minute-level), which while providing rich data on activity patterns, also pose computational challenges with multilevel densely sampled data, resulting in PA records that are measured continuously across multiple days and visits. On the other hand, a scalar health outcome (e.g., BMI) is usually observed only at the individual or visit level. This leads to a discrepancy in numbers of nested levels between the predictors (PA) and outcomes, raising analytic challenges. To address this issue, we proposed a multilevel longitudinal functional principal component analysis (mLFPCA) model to directly model multilevel functional PA inputs in a longitudinal study, and then implemented a longitudinal functional principal component regression (FPCR) to explore the association between PA and obesity-related health outcomes. Additionally, we conducted a comprehensive simulation study to examine the impact of imbalanced multilevel data on both mLFPCA and FPCR performance and offer guidelines for selecting optimal methods.

加速度计等传感设备被广泛用于测量体力活动(PA)。这些设备提供细粒度(如 10-100 Hz 或分钟级)的输出,在提供丰富的活动模式数据的同时,也给多层次密集采样数据的计算带来了挑战,导致 PA 记录在多天和多次访问中被连续测量。另一方面,标量健康结果(如体重指数)通常只能在个人或访问水平上观测到。这就导致了预测因子(PA)和结果之间嵌套层级数量的差异,给分析带来了挑战。为解决这一问题,我们提出了多层次纵向功能主成分分析(mLFPCA)模型,以直接模拟纵向研究中的多层次功能性 PA 输入,然后实施纵向功能主成分回归(FPCR)来探讨 PA 与肥胖相关健康结果之间的关联。此外,我们还进行了一项综合模拟研究,以检验不平衡多层次数据对 mLFPCA 和 FPCR 性能的影响,并为选择最佳方法提供指导。
{"title":"Multilevel Longitudinal Functional Principal Component Model.","authors":"Wenyi Lin, Jingjing Zou, Chongzhi Di, Cheryl L Rock, Loki Natarajan","doi":"10.1002/sim.10207","DOIUrl":"10.1002/sim.10207","url":null,"abstract":"<p><p>Sensor devices, such as accelerometers, are widely used for measuring physical activity (PA). These devices provide outputs at fine granularity (e.g., 10-100 Hz or minute-level), which while providing rich data on activity patterns, also pose computational challenges with multilevel densely sampled data, resulting in PA records that are measured continuously across multiple days and visits. On the other hand, a scalar health outcome (e.g., BMI) is usually observed only at the individual or visit level. This leads to a discrepancy in numbers of nested levels between the predictors (PA) and outcomes, raising analytic challenges. To address this issue, we proposed a multilevel longitudinal functional principal component analysis (mLFPCA) model to directly model multilevel functional PA inputs in a longitudinal study, and then implemented a longitudinal functional principal component regression (FPCR) to explore the association between PA and obesity-related health outcomes. Additionally, we conducted a comprehensive simulation study to examine the impact of imbalanced multilevel data on both mLFPCA and FPCR performance and offer guidelines for selecting optimal methods.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":" ","pages":"4781-4795"},"PeriodicalIF":1.8,"publicationDate":"2024-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142126751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Statistics in Medicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1