Source Code for Biology and Medicine | |
Three algorithms and SAS macros for estimating power and sample size for logistic models with one or more independent variables of interest in the presence of covariates | |
Zoran Bursac1  David Keith Williams2  | |
[1] Division of Biostatistics and Center for Population Studies, Department of Preventive Medicine, College of Medicine, University of Tennessee Health Science Center, Memphis, TN, USA;Department of Biostatistics, University of Arkansas for Medical Sciences, Little Rock, AR, USA | |
关键词: Sample size; Power; Logistic regression; | |
Others : 1139283 DOI : 10.1186/1751-0473-9-24 |
|
received in 2014-06-19, accepted in 2014-10-27, 发布年份 2014 | |
【 摘 要 】
Background
Commonly when designing studies, researchers propose to measure several independent variables in a regression model, a subset of which are identified as the main variables of interest while the rest are retained in a model as covariates or confounders. Power for linear regression in this setting can be calculated using SAS PROC POWER. There exists a void in estimating power for the logistic regression models in the same setting.
Methods
Currently, an approach that calculates power for only one variable of interest in the presence of other covariates for logistic regression is in common use and works well for this special case. In this paper we propose three related algorithms along with corresponding SAS macros that extend power estimation for one or more primary variables of interest in the presence of some confounders.
Results
The three proposed empirical algorithms employ likelihood ratio test to provide a user with either a power estimate for a given sample size, a quick sample size estimate for a given power, and an approximate power curve for a range of sample sizes. A user can specify odds ratios for a combination of binary, uniform and standard normal independent variables of interest, and or remaining covariates/confounders in the model, along with a correlation between variables.
Conclusions
These user friendly algorithms and macro tools are a promising solution that can fill the void for estimation of power for logistic regression when multiple independent variables are of interest, in the presence of additional covariates in the model.
【 授权许可】
2014 Williams and Bursac; licensee BioMed Central Ltd.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
20150321091733760.pdf | 895KB | download | |
Figure 1. | 56KB | Image | download |
【 图 表 】
Figure 1.
【 参考文献 】
- [1]SAS Institute Inc: SAS/STAT User’s Guide, Version 9.3. Cary, NC: SAS Institute Inc; 2013.
- [2]Maxwell SE: Sample size and multiple regression analysis. Psychol Methods 2000, 5:434-458.
- [3]Hsieh FY, Block DA, Larsen MD: A simple method of sample size calculation for linear and logistic regression. Stat Med 1998, 17:1623-1634.
- [4]Hintze J: PASS 2012. Kaysville, UT: NCSS, LLC; 2012. www.ncss.com webcite
- [5]Castelloe JM: Sample Size Computations and Power Analysis with the SAS System. In Proceedings of the Twenty-Fifth Annual SAS Users Group International Conference, Paper 265–25. Cary, NC: SAS Institute Inc; 2000.
- [6]Friendly M: Visualizing Categorical Data. Cary, NC: SAS Institute; 2000. http://www.math.yorku.ca/SCS/vcd/powerlog.html webcite
- [7]Demidenko E: Sample size determination for logistic regression revisited. Stat Med 2007, 26:3385-3397. http://www.dartmouth.edu/~eugened/power-samplesize.php webcite
- [8]Shieh G: On power and sample size calculations for likelihood ratio test in generalized linear models. Biometrics 2000, 56(4):1192-1196.