Three algorithms and SAS macros for estimating power and sample size for logistic models with one or more independent variables of interest in the presence of covariates.

Q2 Decision Sciences Source Code for Biology and Medicine Pub Date : 2014-11-15 eCollection Date: 2014-01-01 DOI:10.1186/1751-0473-9-24

David Keith Williams, Zoran Bursac

{"title":"Three algorithms and SAS macros for estimating power and sample size for logistic models with one or more independent variables of interest in the presence of covariates.","authors":"David Keith Williams, Zoran Bursac","doi":"10.1186/1751-0473-9-24","DOIUrl":null,"url":null,"abstract":"Background: Commonly when designing studies, researchers propose to measure several independent variables in a regression model, a subset of which are identified as the main variables of interest while the rest are retained in a model as covariates or confounders. Power for linear regression in this setting can be calculated using SAS PROC POWER. There exists a void in estimating power for the logistic regression models in the same setting.Methods: Currently, an approach that calculates power for only one variable of interest in the presence of other covariates for logistic regression is in common use and works well for this special case. In this paper we propose three related algorithms along with corresponding SAS macros that extend power estimation for one or more primary variables of interest in the presence of some confounders.Results: The three proposed empirical algorithms employ likelihood ratio test to provide a user with either a power estimate for a given sample size, a quick sample size estimate for a given power, and an approximate power curve for a range of sample sizes. A user can specify odds ratios for a combination of binary, uniform and standard normal independent variables of interest, and or remaining covariates/confounders in the model, along with a correlation between variables.Conclusions: These user friendly algorithms and macro tools are a promising solution that can fill the void for estimation of power for logistic regression when multiple independent variables are of interest, in the presence of additional covariates in the model.","PeriodicalId":35052,"journal":{"name":"Source Code for Biology and Medicine","volume":"9 ","pages":"24"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1751-0473-9-24","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Source Code for Biology and Medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/1751-0473-9-24","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2014/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"Decision Sciences","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Commonly when designing studies, researchers propose to measure several independent variables in a regression model, a subset of which are identified as the main variables of interest while the rest are retained in a model as covariates or confounders. Power for linear regression in this setting can be calculated using SAS PROC POWER. There exists a void in estimating power for the logistic regression models in the same setting.

Methods: Currently, an approach that calculates power for only one variable of interest in the presence of other covariates for logistic regression is in common use and works well for this special case. In this paper we propose three related algorithms along with corresponding SAS macros that extend power estimation for one or more primary variables of interest in the presence of some confounders.

Results: The three proposed empirical algorithms employ likelihood ratio test to provide a user with either a power estimate for a given sample size, a quick sample size estimate for a given power, and an approximate power curve for a range of sample sizes. A user can specify odds ratios for a combination of binary, uniform and standard normal independent variables of interest, and or remaining covariates/confounders in the model, along with a correlation between variables.

Conclusions: These user friendly algorithms and macro tools are a promising solution that can fill the void for estimation of power for logistic regression when multiple independent variables are of interest, in the presence of additional covariates in the model.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

三种算法和SAS宏用于估计在协变量存在下具有一个或多个感兴趣的自变量的逻辑模型的功率和样本大小。

背景:通常在设计研究时，研究人员建议在回归模型中测量几个独立变量，其中一个子集被确定为感兴趣的主要变量，而其余的则作为协变量或混杂因素保留在模型中。在这种情况下，线性回归的功率可以使用SAS PROC Power计算。在相同设置下，逻辑回归模型的估计能力存在空白。方法:目前，在逻辑回归中存在其他协变量时，仅计算一个感兴趣变量的功率的方法是常用的，并且适用于这种特殊情况。在本文中，我们提出了三种相关的算法以及相应的SAS宏，这些宏在存在一些混杂因素的情况下扩展了对一个或多个感兴趣的主要变量的功率估计。结果:提出的三种经验算法采用似然比检验为用户提供给定样本量的功率估计，给定功率的快速样本量估计以及样本量范围内的近似功率曲线。用户可以为感兴趣的二元、统一和标准正态自变量的组合指定比值比，以及模型中剩余的协变量/混杂因素，以及变量之间的相关性。结论:这些用户友好的算法和宏观工具是一个很有前途的解决方案，可以填补在模型中存在额外协变量的情况下，当多个自变量感兴趣时，逻辑回归功率估计的空白。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Source Code for Biology and Medicine Decision Sciences-Information Systems and Management

自引率

0.00%

发文量

期刊介绍： Source Code for Biology and Medicine is a peer-reviewed open access, online journal that publishes articles on source code employed over a wide range of applications in biology and medicine. The journal"s aim is to publish source code for distribution and use in the public domain in order to advance biological and medical research. Through this dissemination, it may be possible to shorten the time required for solving certain computational problems for which there is limited source code availability or resources.

期刊最新文献

2DKD: a toolkit for content-based local image search. Computing and graphing probability values of pearson distributions: a SAS/IML macro. iPBAvizu: a PyMOL plugin for an efficient 3D protein structure superimposition approach Social support for collaboration and group awareness in life science research teams. MZPAQ: a FASTQ data compression tool.