首页 > 最新文献

Journal of the Royal Statistical Society Series C-Applied Statistics最新文献

英文 中文
Bayesian modelling strategies for borrowing of information in randomised basket trials 随机篮子试验中信息借鉴的贝叶斯建模策略。
IF 1.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-10-28 DOI: 10.1111/rssc.12602
Luke O. Ouma, Michael J. Grayling, James M. S. Wason, Haiyan Zheng

Basket trials are an innovative precision medicine clinical trial design evaluating a single targeted therapy across multiple diseases that share a common characteristic. To date, most basket trials have been conducted in early-phase oncology settings, for which several Bayesian methods permitting information sharing across subtrials have been proposed. With the increasing interest of implementing randomised basket trials, information borrowing could be exploited in two ways; considering the commensurability of either the treatment effects or the outcomes specific to each of the treatment groups between the subtrials. In this article, we extend a previous analysis model based on distributional discrepancy for borrowing over the subtrial treatment effects (‘treatment effect borrowing’, TEB) to borrowing over the subtrial groupwise responses (‘treatment response borrowing’, TRB). Simulation results demonstrate that both modelling strategies provide substantial gains over an approach with no borrowing. TRB outperforms TEB especially when subtrial sample sizes are small on all operational characteristics, while the latter has considerable gains in performance over TRB when subtrial sample sizes are large, or the treatment effects and groupwise mean responses are noticeably heterogeneous across subtrials. Further, we notice that TRB, and TEB can potentially lead to different conclusions in the analysis of real data.

篮子试验是一种创新的精准医学临床试验设计,用于评估具有共同特征的多种疾病的单一靶向治疗。到目前为止,大多数篮子试验都是在早期肿瘤学环境中进行的,已经提出了几种允许在子树之间共享信息的贝叶斯方法。随着人们对实施随机篮子试验越来越感兴趣,信息借用可以通过两种方式加以利用;考虑治疗效果或每个治疗组特有的结果在子树之间的可公度。在这篇文章中,我们将先前基于亚治疗效应的借款分布差异的分析模型(“治疗效应借款”,TEB)扩展到亚治疗组反应的借款(“治疗反应借款”,TRB)。仿真结果表明,与不借款的方法相比,这两种建模策略都提供了实质性的收益。TRB的性能优于TEB,尤其是当减法样本量在所有操作特性上都很小时,而当减法样本大时,或者处理效果和分组平均响应在减法之间明显不同时,后者在性能上比TRB有相当大的提高。此外,我们注意到TRB和TEB在实际数据分析中可能会导致不同的结论。
{"title":"Bayesian modelling strategies for borrowing of information in randomised basket trials","authors":"Luke O. Ouma,&nbsp;Michael J. Grayling,&nbsp;James M. S. Wason,&nbsp;Haiyan Zheng","doi":"10.1111/rssc.12602","DOIUrl":"10.1111/rssc.12602","url":null,"abstract":"<p>Basket trials are an innovative precision medicine clinical trial design evaluating a single targeted therapy across multiple diseases that share a common characteristic. To date, most basket trials have been conducted in early-phase oncology settings, for which several Bayesian methods permitting information sharing across subtrials have been proposed. With the increasing interest of implementing randomised basket trials, information borrowing could be exploited in two ways; considering the commensurability of either the treatment effects or the outcomes specific to each of the treatment groups between the subtrials. In this article, we extend a previous analysis model based on distributional discrepancy for borrowing over the subtrial treatment effects (‘treatment effect borrowing’, TEB) to borrowing over the subtrial groupwise responses (‘treatment response borrowing’, TRB). Simulation results demonstrate that both modelling strategies provide substantial gains over an approach with no borrowing. TRB outperforms TEB especially when subtrial sample sizes are small on all operational characteristics, while the latter has considerable gains in performance over TRB when subtrial sample sizes are large, or the treatment effects and groupwise mean responses are noticeably heterogeneous across subtrials. Further, we notice that TRB, and TEB can potentially lead to different conclusions in the analysis of real data.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"71 5","pages":"2014-2037"},"PeriodicalIF":1.6,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9827857/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9147756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Combining cytotoxic agents with continuous dose levels in seamless phase I-II clinical trials 在无缝I-II期临床试验中将细胞毒性药物与连续剂量水平相结合。
IF 1.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-10-26 DOI: 10.1111/rssc.12598
José L. Jiménez, Mourad Tighiouart

Phase I-II cancer clinical trial designs are intended to accelerate drug development. In cases where efficacy cannot be ascertained in a short period of time, it is common to divide the study in two stages: (i) a first stage in which dose is escalated based only on toxicity data and we look for the maximum tolerated dose (MTD) set and (ii) a second stage in which we search for the most efficacious dose within the MTD set. Current available approaches in the area of continuous dose levels involve fixing the MTD after stage I and discarding all collected stage I efficacy data. However, this methodology is clearly inefficient when there is a unique patient population present across stages. In this article, we propose a two-stage design for the combination of two cytotoxic agents assuming a single patient population across the entire study. In stage I, conditional escalation with overdose control is used to allocate successive cohorts of patients. In stage II, we employ an adaptive randomisation approach to allocate patients to drug combinations along the estimated MTD curve, which is constantly updated. The proposed methodology is assessed with extensive simulations in the context of a real case study.

癌症I-II期临床试验设计旨在加速药物开发。在短期内无法确定疗效的情况下,通常将研究分为两个阶段:i)第一阶段,仅根据毒性数据增加剂量,我们寻找最大耐受剂量(MTD)集;ii)第二阶段,我们在MTD集中寻找最有效的剂量。在连续剂量水平领域,目前可用的方法包括在第一阶段后固定MTD,并丢弃所有收集的第一阶段疗效数据。然而,当跨阶段存在独特的患者群体时,这种方法显然效率低下。在这篇文章中,我们提出了两种细胞毒性药物组合的两阶段设计,假设整个研究中只有一个患者群体。在第一阶段,使用过量控制条件升级(EWOC)来分配连续的患者队列。在第二阶段,我们采用自适应随机化方法,沿着不断更新的估计MTD曲线将患者分配到药物组合中。在实际案例研究的背景下,通过广泛的模拟对所提出的方法进行了评估。
{"title":"Combining cytotoxic agents with continuous dose levels in seamless phase I-II clinical trials","authors":"José L. Jiménez,&nbsp;Mourad Tighiouart","doi":"10.1111/rssc.12598","DOIUrl":"10.1111/rssc.12598","url":null,"abstract":"<p>Phase I-II cancer clinical trial designs are intended to accelerate drug development. In cases where efficacy cannot be ascertained in a short period of time, it is common to divide the study in two stages: (i) a first stage in which dose is escalated based only on toxicity data and we look for the maximum tolerated dose (MTD) set and (ii) a second stage in which we search for the most efficacious dose within the MTD set. Current available approaches in the area of continuous dose levels involve fixing the MTD after stage I and discarding all collected stage I efficacy data. However, this methodology is clearly inefficient when there is a unique patient population present across stages. In this article, we propose a two-stage design for the combination of two cytotoxic agents assuming a single patient population across the entire study. In stage I, conditional escalation with overdose control is used to allocate successive cohorts of patients. In stage II, we employ an adaptive randomisation approach to allocate patients to drug combinations along the estimated MTD curve, which is constantly updated. The proposed methodology is assessed with extensive simulations in the context of a real case study.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"71 5","pages":"1996-2013"},"PeriodicalIF":1.6,"publicationDate":"2022-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9276318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Bayesian multi-level mixed-effects model for influenza dynamics 流感动力学的贝叶斯多级混合效应模型
IF 1.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-10-24 DOI: 10.1111/rssc.12603
Hanwen Huang

Influenza A viruses (IAV) are the only influenza viruses known to cause flu pandemics. Understanding the evolution of different sub-types of IAV on their natural hosts is important for preventing and controlling the virus. We propose a mechanism-based Bayesian multi-level mixed-effects model for characterising influenza viral dynamics, described by a set of ordinary differential equations (ODE). Both strain-specific and subject-specific random effects are included for the ODE parameters. Our models can characterise the common features in the population while taking into account the variations among individuals. The random effects selection is conducted at strain level through re-parameterising the covariance parameters of the corresponding random effect distribution. Our method does not need to solve ODE directly. We demonstrate that the posterior computation can proceed via a simple and efficient Markov chain Monte Carlo algorithm. The methods are illustrated using simulated data and a real data from a study relating virus load estimates from influenza infections in ducks.

甲型流感病毒(IAV)是已知唯一引起流感大流行的流感病毒。了解IAV不同亚型在其自然宿主上的进化对预防和控制病毒具有重要意义。我们提出了一个基于机制的贝叶斯多级混合效应模型来描述流感病毒动力学,该模型由一组常微分方程(ODE)描述。ODE参数包括特定于菌株和特定于主体的随机效应。我们的模型可以在考虑个体差异的同时,描绘出总体的共同特征。通过对随机效应分布的协方差参数重新参数化,在应变水平上进行随机效应选择。我们的方法不需要直接求解ODE。我们证明了后验计算可以通过一个简单有效的马尔可夫链蒙特卡罗算法进行。这些方法使用模拟数据和来自一项有关鸭子流感感染病毒载量估计的研究的真实数据来说明。
{"title":"Bayesian multi-level mixed-effects model for influenza dynamics","authors":"Hanwen Huang","doi":"10.1111/rssc.12603","DOIUrl":"10.1111/rssc.12603","url":null,"abstract":"<p>Influenza A viruses (IAV) are the only influenza viruses known to cause flu pandemics. Understanding the evolution of different sub-types of IAV on their natural hosts is important for preventing and controlling the virus. We propose a mechanism-based Bayesian multi-level mixed-effects model for characterising influenza viral dynamics, described by a set of ordinary differential equations (ODE). Both strain-specific and subject-specific random effects are included for the ODE parameters. Our models can characterise the common features in the population while taking into account the variations among individuals. The random effects selection is conducted at strain level through re-parameterising the covariance parameters of the corresponding random effect distribution. Our method does not need to solve ODE directly. We demonstrate that the posterior computation can proceed via a simple and efficient Markov chain Monte Carlo algorithm. The methods are illustrated using simulated data and a real data from a study relating virus load estimates from influenza infections in ducks.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"71 5","pages":"1978-1995"},"PeriodicalIF":1.6,"publicationDate":"2022-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76935397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Derivation of maternal dietary patterns accounting for regional heterogeneity 解释区域异质性的母体饮食模式的推导
IF 1.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-10-18 DOI: 10.1111/rssc.12604
Briana J. K. Stephenson, Amy H. Herring, Andrew F. Olshan

Latent class models are often used to characterise dietary patterns. Yet, when subtle variations exist across different sub-populations, overall population patterns can be masked and affect statistical inference on health outcomes. We address this concern with a flexible supervised clustering approach, introduced as Supervised Robust Profile Clustering, that identifies outcome-dependent population-based patterns, while partitioning out subpopulation pattern differences. Using dietary data from the 1997–2011 National Birth Defects Prevention Study, we determine how maternal dietary profiles associate with orofacial clefts among offspring. Results indicate mothers who consume a higher proportion of fruits and vegetables compared to land meats lower the proportion of progeny with orofacial cleft defect.

潜在类别模型通常用于描述饮食模式。然而,当不同亚群之间存在细微差异时,总体人口模式可能被掩盖,并影响对健康结果的统计推断。我们通过一种灵活的监督聚类方法来解决这个问题,该方法被称为监督鲁棒概要聚类,它可以识别结果依赖的基于种群的模式,同时划分出亚种群模式差异。利用1997-2011年国家出生缺陷预防研究的饮食数据,我们确定了母亲的饮食特征与后代的口面部裂之间的关系。结果表明,与陆地肉类相比,食用水果和蔬菜比例较高的母亲,其后代患口腔面部缺陷的比例较低。
{"title":"Derivation of maternal dietary patterns accounting for regional heterogeneity","authors":"Briana J. K. Stephenson,&nbsp;Amy H. Herring,&nbsp;Andrew F. Olshan","doi":"10.1111/rssc.12604","DOIUrl":"10.1111/rssc.12604","url":null,"abstract":"<p>Latent class models are often used to characterise dietary patterns. Yet, when subtle variations exist across different sub-populations, overall population patterns can be masked and affect statistical inference on health outcomes. We address this concern with a flexible supervised clustering approach, introduced as Supervised Robust Profile Clustering, that identifies outcome-dependent population-based patterns, while partitioning out subpopulation pattern differences. Using dietary data from the 1997–2011 National Birth Defects Prevention Study, we determine how maternal dietary profiles associate with orofacial clefts among offspring. Results indicate mothers who consume a higher proportion of fruits and vegetables compared to land meats lower the proportion of progeny with orofacial cleft defect.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"71 5","pages":"1957-1977"},"PeriodicalIF":1.6,"publicationDate":"2022-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76670681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Non-parametric calibration of multiple related radiocarbon determinations and their calendar age summarisation 多个相关放射性碳测定的非参数校准及其日历年龄汇总
IF 1.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-10-17 DOI: 10.1111/rssc.12599
Timothy J. Heaton
<p>Due to fluctuations in past radiocarbon (<math> <semantics> <mrow> <msup> <mrow></mrow> <mrow> <mn>14</mn> </mrow> </msup> </mrow> <annotation>$$ {}^{14} $$</annotation> </semantics></math>C) levels, calibration is required to convert <math> <semantics> <mrow> <msup> <mrow></mrow> <mrow> <mn>14</mn> </mrow> </msup> </mrow> <annotation>$$ {}^{14} $$</annotation> </semantics></math>C determinations <math> <semantics> <mrow> <msub> <mrow> <mi>X</mi> </mrow> <mrow> <mi>i</mi> </mrow> </msub> </mrow> <annotation>$$ {X}_i $$</annotation> </semantics></math> into calendar ages <math> <semantics> <mrow> <msub> <mrow> <mi>θ</mi> </mrow> <mrow> <mi>i</mi> </mrow> </msub> </mrow> <annotation>$$ {theta}_i $$</annotation> </semantics></math>. In many studies, we wish to calibrate a set of related samples taken from the same site or context, which have calendar ages drawn from the same shared, but unknown, density <math> <semantics> <mrow> <mi>f</mi> <mo>(</mo> <mi>θ</mi> <mo>)</mo> </mrow> <annotation>$$ fleft(theta right) $$</annotation> </semantics></math>. Calibration of <math> <semantics> <mrow> <msub> <mrow> <mi>X</mi> </mrow> <mrow> <mn>1</mn> </mrow> </msub> <mo>,</mo> <mi>…</mi> <mo>,</mo> <msub> <mrow> <mi>X</mi> </mrow> <mrow> <mi>n</mi> </mrow> </msub> </mrow> <annotation>$$ {X}_1,dots, {X}_n
由于过去放射性碳(14 $$ {}^{14} $$ C)水平的波动,需要校准转换14 $$ {}^{14} $$ C测定X i$$ {X}_i $$变成历法年龄θ I $$ {theta}_i $$。在许多研究中,我们希望校准来自同一地点或环境的一组相关样本,这些样本的日历年龄来自相同的共享但未知的密度f (θ) $$ fleft(theta right) $$。校准x1,…,X n $$ {X}_1,dots, {X}_n $$可以通过纳入样本相关的知识而得到显著改善。此外,对潜在的共享f (θ) $$ fleft(theta right) $$的概要估计可以提供关于人口规模/活动随时间变化的有价值的信息。目前的大多数方法都需要f (θ) $$ fleft(theta right) $$的参数说明,这通常是不合适的。我们使用Dirichlet过程混合模型开发了严格的非参数贝叶斯方法,并使用切片采样来解决14 $$ {}^{14} $$ C校准内的多模态典型问题。我们的方法同时校准了14个$$ {}^{14} $$ C测定集,并为未来样本的潜在日历年龄提供了预测性估计。在一项模拟研究中,我们表明,与单独校准每个14 $$ {}^{14} $$ C测定相比,使用我们的方法联合校准相关样品时,日历年龄估计的改善。我们还通过三个现实案例研究说明了预测性日历年龄估计的使用,以深入了解随时间变化的活动水平。
{"title":"Non-parametric calibration of multiple related radiocarbon determinations and their calendar age summarisation","authors":"Timothy J. Heaton","doi":"10.1111/rssc.12599","DOIUrl":"10.1111/rssc.12599","url":null,"abstract":"&lt;p&gt;Due to fluctuations in past radiocarbon (&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;msup&gt;\u0000 &lt;mrow&gt;&lt;/mrow&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mn&gt;14&lt;/mn&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/msup&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ {}^{14} $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt;C) levels, calibration is required to convert &lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;msup&gt;\u0000 &lt;mrow&gt;&lt;/mrow&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mn&gt;14&lt;/mn&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/msup&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ {}^{14} $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt;C determinations &lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;msub&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;X&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;i&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/msub&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ {X}_i $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt; into calendar ages &lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;msub&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;θ&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;i&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/msub&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ {theta}_i $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt;. In many studies, we wish to calibrate a set of related samples taken from the same site or context, which have calendar ages drawn from the same shared, but unknown, density &lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;f&lt;/mi&gt;\u0000 &lt;mo&gt;(&lt;/mo&gt;\u0000 &lt;mi&gt;θ&lt;/mi&gt;\u0000 &lt;mo&gt;)&lt;/mo&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ fleft(theta right) $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt;. Calibration of &lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;msub&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;X&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mn&gt;1&lt;/mn&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/msub&gt;\u0000 &lt;mo&gt;,&lt;/mo&gt;\u0000 &lt;mi&gt;…&lt;/mi&gt;\u0000 &lt;mo&gt;,&lt;/mo&gt;\u0000 &lt;msub&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;X&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;n&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/msub&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ {X}_1,dots, {X}_n","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"71 5","pages":"1918-1956"},"PeriodicalIF":1.6,"publicationDate":"2022-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://rss.onlinelibrary.wiley.com/doi/epdf/10.1111/rssc.12599","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80137362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Optimal approximate choice designs for a two-step coffee choice, taste and choice again experiment 最优近似选择设计为两步咖啡选择,口味和选择再次实验
IF 1.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-10-03 DOI: 10.1111/rssc.12601
Nedka Dechkova Nikiforova, Rossella Berni, Jesús Fernando López-Fidalgo

This work deals with consumers' preferences about coffee. Firstly, a choice experiment is performed on a sample of potential consumers. Following this, a sensory test involving the tasting of two varieties of coffee is carried out with the respondents, after which the same choice experiment is supplied to them again. An innovative approach for building heterogeneous choice designs is specifically developed for the case-study, based on approximate design theory and compound design criterion. Panel Mixed Logit models are used, thereby allowing for the inclusion of correlation among consumers' responses; choice-sets are supplied to a proportion of respondents according to optimal weights. The estimation results of the Panel Mixed Logit model are satisfactory, confirming the validity of the proposed approach.

这项工作涉及消费者对咖啡的偏好。首先,对潜在消费者样本进行选择实验。在此之后,对受访者进行了一项感官测试,包括品尝两种咖啡,之后再次向他们提供相同的选择实验。基于近似设计理论和复合设计准则,为案例研究开发了一种构建异质选择设计的创新方法。使用面板混合Logit模型,从而允许包含消费者的反应之间的相关性;选择集根据最优权重提供给一定比例的受访者。面板混合Logit模型的估计结果令人满意,验证了所提方法的有效性。
{"title":"Optimal approximate choice designs for a two-step coffee choice, taste and choice again experiment","authors":"Nedka Dechkova Nikiforova,&nbsp;Rossella Berni,&nbsp;Jesús Fernando López-Fidalgo","doi":"10.1111/rssc.12601","DOIUrl":"10.1111/rssc.12601","url":null,"abstract":"<p>This work deals with consumers' preferences about coffee. Firstly, a choice experiment is performed on a sample of potential consumers. Following this, a sensory test involving the tasting of two varieties of coffee is carried out with the respondents, after which the same choice experiment is supplied to them again. An innovative approach for building heterogeneous choice designs is specifically developed for the case-study, based on approximate design theory and compound design criterion. Panel Mixed Logit models are used, thereby allowing for the inclusion of correlation among consumers' responses; choice-sets are supplied to a proportion of respondents according to optimal weights. The estimation results of the Panel Mixed Logit model are satisfactory, confirming the validity of the proposed approach.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"71 5","pages":"1895-1917"},"PeriodicalIF":1.6,"publicationDate":"2022-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://rss.onlinelibrary.wiley.com/doi/epdf/10.1111/rssc.12601","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76942285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Flexible domain prediction using mixed effects random forests 使用混合效应随机森林的灵活域预测
IF 1.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-10-02 DOI: 10.1111/rssc.12600
Patrick Krennmair, Timo Schmid

This paper promotes the use of random forests as versatile tools for estimating spatially disaggregated indicators in the presence of small area-specific sample sizes. Small area estimators are predominantly conceptualised within the regression-setting and rely on linear mixed models to account for the hierarchical structure of the survey data. In contrast, machine learning methods offer non-linear and non-parametric alternatives, combining excellent predictive performance and a reduced risk of model-misspecification. Mixed effects random forests combine advantages of regression forests with the ability to model hierarchical dependencies. This paper provides a coherent framework based on mixed effects random forests for estimating small area averages and proposes a non-parametric bootstrap estimator for assessing the uncertainty of the estimates. We illustrate advantages of our proposed methodology using Mexican income-data from the state Nuevo León. Finally, the methodology is evaluated in model-based and design-based simulations comparing the proposed methodology to traditional regression-based approaches for estimating small area averages.

本文提倡使用随机森林作为在存在小区域特定样本量的情况下估计空间分类指标的通用工具。小面积估计值主要在回归设置中概念化,并依赖线性混合模型来解释调查数据的层次结构。相比之下,机器学习方法提供了非线性和非参数替代方案,结合了出色的预测性能和降低模型错误规范的风险。混合效应随机森林结合了回归森林的优点和对分层依赖关系建模的能力。本文提出了一种基于混合效应随机森林的小面积平均估计框架,并提出了一种用于估计不确定性的非参数自举估计方法。我们使用来自Nuevo州León的墨西哥收入数据来说明我们提出的方法的优点。最后,在基于模型和基于设计的模拟中对该方法进行了评估,并将该方法与传统的基于回归的小面积平均值估算方法进行了比较。
{"title":"Flexible domain prediction using mixed effects random forests","authors":"Patrick Krennmair,&nbsp;Timo Schmid","doi":"10.1111/rssc.12600","DOIUrl":"10.1111/rssc.12600","url":null,"abstract":"<p>This paper promotes the use of random forests as versatile tools for estimating spatially disaggregated indicators in the presence of small area-specific sample sizes. Small area estimators are predominantly conceptualised within the regression-setting and rely on linear mixed models to account for the hierarchical structure of the survey data. In contrast, machine learning methods offer non-linear and non-parametric alternatives, combining excellent predictive performance and a reduced risk of model-misspecification. Mixed effects random forests combine advantages of regression forests with the ability to model hierarchical dependencies. This paper provides a coherent framework based on mixed effects random forests for estimating small area averages and proposes a non-parametric bootstrap estimator for assessing the uncertainty of the estimates. We illustrate advantages of our proposed methodology using Mexican income-data from the state Nuevo León. Finally, the methodology is evaluated in model-based and design-based simulations comparing the proposed methodology to traditional regression-based approaches for estimating small area averages.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"71 5","pages":"1865-1894"},"PeriodicalIF":1.6,"publicationDate":"2022-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://rss.onlinelibrary.wiley.com/doi/epdf/10.1111/rssc.12600","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117390797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A Bayesian model for estimating Sustainable Development Goal indicator 4.1.2: School completion rates 估算可持续发展目标指标4.1.2:学校完成率的贝叶斯模型
IF 1.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-09-25 DOI: 10.1111/rssc.12595
Ameer Dharamshi, Bilal Barakat, Leontine Alkema, Manos Antoninis

Estimating school completion is crucial for monitoring Sustainable Development Goal (SDG) 4 on education. The recently introduced SDG indicator 4.1.2, defined as the percentage of children aged 3–5 years above the expected completion age of a given level of education that have completed the respective level, differs from enrolment indicators in that it relies primarily on household surveys. This introduces a number of challenges including gaps between survey waves, conflicting estimates, age misreporting and delayed completion. We introduce the Adjusted Bayesian Completion Rates (ABCR) model to address these challenges and produce the first complete and consistent time series for SDG indicator 4.1.2, by school level and sex, for 164 countries. Validation exercises indicate that the model appears well-calibrated and offers a meaningful improvement over simpler approaches in predictive performance. The ABCR model is now used by the United Nations to monitor completion rates for all countries with available survey data.

估计学校完成情况对于监测关于教育的可持续发展目标4至关重要。最近引入的可持续发展目标指标4.1.2定义为超过预期完成某一特定教育水平的3-5岁儿童完成相应教育水平的百分比,它与入学率指标不同,因为它主要依赖于住户调查。这带来了许多挑战,包括调查浪潮之间的差距、相互矛盾的估计、年龄错误报告和延迟完成。我们引入了调整贝叶斯完成率(ABCR)模型来应对这些挑战,并为164个国家的可持续发展目标指标4.1.2制作了第一个完整和一致的时间序列,按学校水平和性别分列。验证练习表明,该模型似乎经过了很好的校准,并在预测性能方面提供了比更简单的方法有意义的改进。联合国现在使用ABCR模式来监测所有拥有调查数据的国家的完成率。
{"title":"A Bayesian model for estimating Sustainable Development Goal indicator 4.1.2: School completion rates","authors":"Ameer Dharamshi,&nbsp;Bilal Barakat,&nbsp;Leontine Alkema,&nbsp;Manos Antoninis","doi":"10.1111/rssc.12595","DOIUrl":"10.1111/rssc.12595","url":null,"abstract":"<p>Estimating school completion is crucial for monitoring Sustainable Development Goal (SDG) 4 on education. The recently introduced SDG indicator 4.1.2, defined as the percentage of children aged 3–5 years above the expected completion age of a given level of education that have completed the respective level, differs from enrolment indicators in that it relies primarily on household surveys. This introduces a number of challenges including gaps between survey waves, conflicting estimates, age misreporting and delayed completion. We introduce the Adjusted Bayesian Completion Rates (ABCR) model to address these challenges and produce the first complete and consistent time series for SDG indicator 4.1.2, by school level and sex, for 164 countries. Validation exercises indicate that the model appears well-calibrated and offers a meaningful improvement over simpler approaches in predictive performance. The ABCR model is now used by the United Nations to monitor completion rates for all countries with available survey data.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"71 5","pages":"1822-1864"},"PeriodicalIF":1.6,"publicationDate":"2022-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://rss.onlinelibrary.wiley.com/doi/epdf/10.1111/rssc.12595","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72417219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Efficient estimation of the marginal mean of recurrent events 重复事件的边际均值的有效估计
IF 1.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-09-21 DOI: 10.1111/rssc.12586
Giuliana Cortese, Thomas H. Scheike

Recurrent events are often encountered in clinical and epidemiological studies where a terminal event is also observed. With recurrent events data it is of great interest to estimate the marginal mean of the cumulative number of recurrent events experienced prior to the terminal event. The standard nonparametric estimator was suggested in Cook and Lawless and further developed in Ghosh and Lin. We here investigate the efficiency of this estimator that, surprisingly, has not been studied before. We rewrite the standard estimator as an inverse probability of censoring weighted estimator. From this representation we derive an efficient augmented estimator using efficient estimation theory for right-censored data. We show that the standard estimator is efficient in settings with no heterogeneity. In other settings with different sources of heterogeneity, we show theoretically and by simulations that the efficiency can be greatly improved when an efficient augmented estimator based on dynamic predictions is employed, at no extra cost to robustness. The estimators are applied and compared to study the mean number of catheter-related bloodstream infections in heterogeneous patients with chronic intestinal failure who can possibly die, and the efficiency gain is highlighted in the resulting point-wise confidence intervals.

在临床和流行病学研究中经常遇到复发事件,在这些研究中也观察到终末事件。有了反复事件的数据,估计在结束事件之前经历的反复事件累积次数的边际平均值是非常有趣的。标准非参数估计量由Cook和Lawless提出,并由Ghosh和Lin进一步发展。我们在这里研究这个估计器的效率,令人惊讶的是,以前没有研究过。我们将标准估计量改写为一个逆概率的滤波加权估计量。在此基础上,利用有效估计理论导出了右截尾数据的有效增广估计量。我们证明了标准估计器在没有异质性的情况下是有效的。在具有不同异质性来源的其他设置中,我们从理论上和模拟中表明,当采用基于动态预测的有效增强估计器时,效率可以大大提高,而不会对鲁棒性造成额外损失。我们应用并比较了这些估计值来研究可能死亡的异质性慢性肠衰竭患者中导管相关血流感染的平均数量,并在所得的逐点置信区间中强调了效率的提高。
{"title":"Efficient estimation of the marginal mean of recurrent events","authors":"Giuliana Cortese,&nbsp;Thomas H. Scheike","doi":"10.1111/rssc.12586","DOIUrl":"10.1111/rssc.12586","url":null,"abstract":"<p>Recurrent events are often encountered in clinical and epidemiological studies where a terminal event is also observed. With recurrent events data it is of great interest to estimate the marginal mean of the cumulative number of recurrent events experienced prior to the terminal event. The standard nonparametric estimator was suggested in Cook and Lawless and further developed in Ghosh and Lin. We here investigate the efficiency of this estimator that, surprisingly, has not been studied before. We rewrite the standard estimator as an inverse probability of censoring weighted estimator. From this representation we derive an efficient augmented estimator using efficient estimation theory for right-censored data. We show that the standard estimator is efficient in settings with no heterogeneity. In other settings with different sources of heterogeneity, we show theoretically and by simulations that the efficiency can be greatly improved when an efficient augmented estimator based on dynamic predictions is employed, at no extra cost to robustness. The estimators are applied and compared to study the mean number of catheter-related bloodstream infections in heterogeneous patients with chronic intestinal failure who can possibly die, and the efficiency gain is highlighted in the resulting point-wise confidence intervals.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"71 5","pages":"1787-1821"},"PeriodicalIF":1.6,"publicationDate":"2022-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://rss.onlinelibrary.wiley.com/doi/epdf/10.1111/rssc.12586","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79223958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Contour models for physical boundaries enclosing star-shaped and approximately star-shaped polygons 星形多边形和近似星形多边形的物理边界的轮廓模型
IF 1.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-09-19 DOI: 10.1111/rssc.12592
Hannah M. Director, Adrian E. Raftery

Boundaries on spatial fields divide regions with particular features from surrounding background areas. Methods to identify boundary lines from interpolated spatial fields are well established. Less attention has been paid to how to model sequences of connected spatial points. Such models are needed for physical boundaries. For example, in the Arctic ocean, large contiguous areas are covered by sea ice, or frozen ocean water. We define the ice edge contour as the ordered sequences of spatial points that connect to form a line around set(s) of contiguous grid boxes with sea ice present. Polar scientists need to describe how this contiguous area behaves in present and historical data and under future climate change scenarios. We introduce the Gaussian Star-shaped Contour Model (GSCM) for modelling boundaries represented as connected sequences of spatial points such as the sea ice edge. GSCMs generate sequences of spatial points via generating sets of distances in various directions from a fixed starting point. The GSCM can be applied to contours that enclose regions that are star-shaped polygons or approximately star-shaped polygons. Metrics are introduced to assess the extent to which a polygon deviates from star-shapedness. Simulation studies illustrate the performance of the GSCM in different situations.

空间场的边界将具有特定特征的区域与周围的背景区域分开。从插值空间场中识别边界线的方法已经建立。如何对空间点的连通序列进行建模一直受到较少的关注。物理边界需要这样的模型。例如,在北冰洋,大片连续的区域被海冰或冰冻的海水覆盖。我们将冰边缘轮廓定义为空间点的有序序列,这些点围绕存在海冰的一组连续网格框连接形成一条线。极地科学家需要描述这片连续区域在当前和历史数据以及未来气候变化情景下的表现。我们引入了高斯星形轮廓模型(GSCM),用于将边界表示为空间点(如海冰边缘)的连接序列。GSCMs从固定的起始点出发,通过不同方向的距离生成集生成空间点序列。GSCM可以应用于包围星形多边形或近似星形多边形区域的轮廓。引入度量来评估多边形偏离星形的程度。仿真研究表明了GSCM在不同情况下的性能。
{"title":"Contour models for physical boundaries enclosing star-shaped and approximately star-shaped polygons","authors":"Hannah M. Director,&nbsp;Adrian E. Raftery","doi":"10.1111/rssc.12592","DOIUrl":"10.1111/rssc.12592","url":null,"abstract":"<p>Boundaries on spatial fields divide regions with particular features from surrounding background areas. Methods to identify boundary lines from interpolated spatial fields are well established. Less attention has been paid to how to model sequences of connected spatial points. Such models are needed for physical boundaries. For example, in the Arctic ocean, large contiguous areas are covered by sea ice, or frozen ocean water. We define the ice edge contour as the ordered sequences of spatial points that connect to form a line around set(s) of contiguous grid boxes with sea ice present. Polar scientists need to describe how this contiguous area behaves in present and historical data and under future climate change scenarios. We introduce the Gaussian Star-shaped Contour Model (GSCM) for modelling boundaries represented as connected sequences of spatial points such as the sea ice edge. GSCMs generate sequences of spatial points via generating sets of distances in various directions from a fixed starting point. The GSCM can be applied to contours that enclose regions that are star-shaped polygons or approximately star-shaped polygons. Metrics are introduced to assess the extent to which a polygon deviates from star-shapedness. Simulation studies illustrate the performance of the GSCM in different situations.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"71 5","pages":"1688-1720"},"PeriodicalIF":1.6,"publicationDate":"2022-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89579451","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the Royal Statistical Society Series C-Applied Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1