A mixed model formulation for designing cluster randomized trials with binary outcomes

T. Braun
{"title":"A mixed model formulation for designing cluster randomized trials with binary outcomes","authors":"T. Braun","doi":"10.1191/1471082X03st054oa","DOIUrl":null,"url":null,"abstract":"Cluster randomized trials (CRTs) are unlike traditional individually randomized trials because observations within the same cluster are positively correlated and the sample size (number of clusters) is relatively small. Although formulae for sample size and power estimates of CRT designs do exist, these formulae rely upon first-order asymptotic approximations for the distribution of the average intervention effect and are inaccurate for CRTs that have a small number of clusters. These formulae also assume that the intracluster correlation (ICC) is the same for each cluster in the CRT. However, for CRTs in which the clusters are classrooms or medical practices, the degree of ICC is often a factor of how many students are in each classroom or how many patients are in each practice. Specifically, smaller clusters are expected to have larger ICC than larger clusters. A weighted sum of the cluster means, D, is the statistic often used to estimate the average intervention effect in a CRT. Therefore, we propose that a saddlepoint approximation is a natural choice to approximate the distributions of the cluster means more precisely than a standard large-sample approximation. We parameterize the ICC for each cluster as a random effect with a predefined prior distribution that is dependent upon the size of each cluster. After integrating over the range of the random effect, we use Monte Carlo methods to generate sample cluster means, which are in turn used to approximate the distribution of D with saddlepoint methods. Through numerical examples and an actual application, we show that our method has accuracy that is equal to or better than that of existing methods. Futhermore, our method accommodates CRTs in which the correlation within cluster is expected to diminish with the cluster size.","PeriodicalId":354759,"journal":{"name":"Statistical Modeling","volume":"70 5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistical Modeling","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1191/1471082X03st054oa","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Cluster randomized trials (CRTs) are unlike traditional individually randomized trials because observations within the same cluster are positively correlated and the sample size (number of clusters) is relatively small. Although formulae for sample size and power estimates of CRT designs do exist, these formulae rely upon first-order asymptotic approximations for the distribution of the average intervention effect and are inaccurate for CRTs that have a small number of clusters. These formulae also assume that the intracluster correlation (ICC) is the same for each cluster in the CRT. However, for CRTs in which the clusters are classrooms or medical practices, the degree of ICC is often a factor of how many students are in each classroom or how many patients are in each practice. Specifically, smaller clusters are expected to have larger ICC than larger clusters. A weighted sum of the cluster means, D, is the statistic often used to estimate the average intervention effect in a CRT. Therefore, we propose that a saddlepoint approximation is a natural choice to approximate the distributions of the cluster means more precisely than a standard large-sample approximation. We parameterize the ICC for each cluster as a random effect with a predefined prior distribution that is dependent upon the size of each cluster. After integrating over the range of the random effect, we use Monte Carlo methods to generate sample cluster means, which are in turn used to approximate the distribution of D with saddlepoint methods. Through numerical examples and an actual application, we show that our method has accuracy that is equal to or better than that of existing methods. Futhermore, our method accommodates CRTs in which the correlation within cluster is expected to diminish with the cluster size.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
设计具有二元结果的聚类随机试验的混合模型
聚类随机试验(crt)不同于传统的单独随机试验,因为同一聚类内的观察结果呈正相关,而且样本量(聚类数量)相对较小。虽然确实存在CRT设计的样本量和功率估计公式,但这些公式依赖于平均干预效果分布的一阶渐近近似,对于具有少量簇的CRT是不准确的。这些公式还假设簇内相关性(ICC)对CRT中的每个簇都是相同的。然而,对于集群是教室或医疗实践的crt, ICC的程度通常是每个教室有多少学生或每个实践有多少病人的一个因素。具体来说,较小的集群预计比较大的集群具有更大的ICC。聚类均值的加权和D是通常用于估计CRT平均干预效果的统计量。因此,我们提出鞍点近似是比标准大样本近似更精确地近似聚类均值分布的自然选择。我们将每个集群的ICC参数化为具有预定义先验分布的随机效应,该分布取决于每个集群的大小。在随机效应的范围内积分后,我们使用蒙特卡罗方法生成样本聚类均值,然后使用鞍点方法近似D的分布。通过数值算例和实际应用表明,该方法的精度等于或优于现有方法。此外,我们的方法适用于簇内相关性随着簇大小而减小的crt。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Use of auxiliary data in semi-parametric spatial regression with nonignorable missing responses Bayesian modeling for genetic association in case-control studies: accounting for unknown population substructure GLMM approach to study the spatial and temporal evolution of spikes in the small intestine Comparing nonparametric surfaces Analyzing the emergence times of permanent teeth: an example of modeling the covariance matrix with interval-censored data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1