期刊论文详细信息
BMC Medical Research Methodology
The need to balance merits and limitations from different disciplines when considering the stepped wedge cluster randomized trial design
Hendrik Koffijberg2  Johannes B. Reitsma4  Johannes J. M. van Delden3  Karel G. M. Moons4  Rieke van der Graaf3  Ingeborg van der Tweel1  Esther de Hoop1 
[1] Department of Biostatistics and Research Support, University Medical Center Utrecht, Julius Center for Health Sciences and Primary Care, Utrecht 3508, GA, The Netherlands;Department of Health Technology Assessment, University Medical Center Utrecht, Julius Center for Health Sciences and Primary Care, Utrecht 3508, GA, The Netherlands;Department of Medical Humanities, University Medical Center Utrecht, Julius Center for Health Sciences and Primary Care, Utrecht 3508, GA, The Netherlands;Department of Epidemiology, University Medical Center Utrecht, Julius Center for Health Sciences and Primary Care, Utrecht 3508, GA, The Netherlands
关键词: Biostatistics;    Research ethics;    Health economics;    Cluster randomized trial;    Stepped wedge design;    Epidemiologic research design;   
Others  :  1230332
DOI  :  10.1186/s12874-015-0090-2
 received in 2015-05-04, accepted in 2015-10-19,  发布年份 2015
【 摘 要 】

Background

Various papers have addressed pros and cons of the stepped wedge cluster randomized trial design (SWD). However, some issues have not or only limitedly been addressed. Our aim was to provide a comprehensive overview of all merits and limitations of the SWD to assist researchers, reviewers and medical ethics committees when deciding on the appropriateness of the SWD for a particular study.

Methods

We performed an initial search to identify articles with a methodological focus on the SWD, and categorized and discussed all reported advantages and disadvantages of the SWD. Additional aspects were identified during multidisciplinary meetings in which ethicists, biostatisticians, clinical epidemiologists and health economists participated. All aspects of the SWD were compared to the parallel group cluster randomized design. We categorized the merits and limitations of the SWD to distinct phases in the design and conduct of such studies, highlighting that their impact may vary depending on the context of the study or that benefits may be offset by drawbacks across study phases. Furthermore, a real-life illustration is provided.

Results

New aspects are identified within all disciplines. Examples of newly identified aspects of an SWD are: the possibility to measure a treatment effect in each cluster to examine the (in)consistency in effects across clusters, the detrimental effect of lower than expected inclusion rates, deviation from the ordinary informed consent process and the question whether studies using the SWD are likely to have sufficient social value. Discussions are provided on e.g. clinical equipoise, social value, health economical decision making, number of study arms, and interim analyses.

Conclusions

Deciding on the use of the SWD involves aspects and considerations from different disciplines not all of which have been discussed before. Pros and cons of this design should be balanced in comparison to other feasible design options as to choose the optimal design for a particular intervention study.

【 授权许可】

   
2015 de Hoop et al.

附件列表
Files Size Format View
Fig. 1. 11KB Image download
Fig. 1. 11KB Image download
【 图 表 】

Fig. 1.

Fig. 1.

【 参考文献 】
  • [1]Donner A, Klar N. Design and analysis of cluster randomization trials in health research. 1st ed. Arnold, London; 2000.
  • [2]Rietbergen C, Moerbeek M. The design of cluster randomized crossover trials. J Educ Behav Stat. 2011; 36:472-90.
  • [3]Baer DM, Wolf MM, Risley TR. Some current dimensions of applied behavior analysis. J Appl Behav Anal. 1968; 1:91-7.
  • [4]Hawkins NG, Sanson-Fisher RW, Shakeshaft A, D’Este C, Green LW. The multiple baseline design for evaluating population-based research. Am J Prev Med. 2007; 33:162-8.
  • [5]Rhoda DA, Murray DM, Andridge RR, Pennell ML, Hade EM. Studies with staggered starts: multiple baseline designs and group-randomized trials. Am J Public Health. 2011; 101:2164-9.
  • [6]Brown CA, Lilford RJ. The stepped wedge trial design: a systematic review. BMC Med Res Methodol. 2006; 6:54. BioMed Central Full Text
  • [7]Sanson-Fisher RW, D’Este CA, Carey ML, Noble N, Paul CL. Evaluation of systems-oriented public health interventions: alternative research designs. Annu Rev Public Health. 2014; 35:9-27.
  • [8]The Gambia hepatitis intervention study. Cancer Res. 1987; 47:5782-7.
  • [9]Hussey MA, Hughes JP. Design and analysis of stepped wedge cluster randomized trials. Contemp Clin Trials. 2007; 28:182-91.
  • [10]Ukoumunne O, Gulliford M, Chinn S, Sterne J, Burney P. Methods for evaluating area-wide and organisation-based interventions in health and health care: a systematic review. Health Technol Assess. 1999; 3:98.
  • [11]Brown C, Hofer T, Johal A, Thomson R, Nicholl J, Franklin BD, et al. An epistemology of patient safety research: a framework for study design and interpretation. Part 2. Study design. Qual Saf Health Care. 2008;17:163–9.
  • [12]Mdege ND, Man MS, Brown CA T n, Torgerson DJ. Systematic review of stepped wedge cluster randomized trials shows that design is particularly used to evaluate interventions during routine implementation. J Clin Epidemiol. 2011; 64:936-48.
  • [13]Zhan Z, van den Heuvel ER, Doornbos PM, Burger H, Verberne CJ, Wiggers T, et al. Strengths and weaknesses of a stepped wedge cluster randomized design: its application in a colorectal cancer follow-up study. J Clin Epidemiol. 2014;67:454–61.
  • [14]Mdege ND, Man MS, Taylor nee Brown CA, Torgerson DJ. There are some circumstances where the stepped-wedge cluster randomized trial is preferable to the alternative: no randomized trial at all. Response to the commentary by Kotz and colleagues. J Clin Epidemiol. 2012; 65:1253-4.
  • [15]Craig P, Dieppe P, Macintyre S, Michie S, Nazareth I, Petticrew M. Developing and evaluating complex interventions: the new Medical Research Council guidance. Br Med J. 2008; 337:a1655.
  • [16]Keriel-Gascou M, Buchet-Poyau K, Rabilloud M, Duclos A, Colin C. A stepped wedge cluster randomized trial is preferable for assessing complex health interventions. J Clin Epidemiol. 2014; 67:831-3.
  • [17]Mdege ND, Kanaan M. Response to Keriel-Gascou et al. Addressing assumptions on the stepped wedge randomized trial design. J Clin Epidemiol. 2014; 67:833-4.
  • [18]Kotz D, Spigt M, Arts IC, Crutzen R, Viechtbauer W. Use of the stepped wedge design cannot be recommended: a critical appraisal and comparison with the classic cluster randomized controlled trial design. J Clin Epidemiol. 2012; 65:1249-52.
  • [19]Hemming K, Girling A, Martin J, Bond SJ. Stepped wedge cluster randomized trials are efficient and provide a method of evaluation without which some interventions would not be evaluated. J Clin Epidemiol. 2013; 66:1058-9.
  • [20]Kotz D, Spigt M, Arts IC, Crutzen R, Viechtbauer W. The stepped wedge design does not inherently have more power than a cluster randomized controlled trial. J Clin Epidemiol. 2013; 66:1059-60.
  • [21]Viechtbauer W, Kotz D, Spigt M, Arts IC, Crutzen R. Response to Keriel-Gascou et al.: Higher efficiency and other alleged advantages are not inherent to the stepped wedge design. J Clin Epidemiol. 2014; 67:834-6.
  • [22]Kotz D, Spigt M, Arts IC, Crutzen R, Viechtbauer W. Researchers should convince policy makers to perform a classic cluster randomized controlled trial instead of a stepped wedge design when an intervention is rolled out. J Clin Epidemiol. 2012; 65:1255-6.
  • [23]Hutson AD, Reid ME. The utility of partial cross-over designs in early phase randomized prevention trials. Control Clin Trials. 2004; 25:493-501.
  • [24]Freedman B. Equipoise and the ethics of clinical research. N Engl J Med. 1987; 317:141-5.
  • [25]Macklin R, Shepherd L. Informed consent and standard of care: what must be disclosed. Am J Bioeth. 2013; 13:9-13.
  • [26]Emanuel EJ, Wendler D, Grady C. What makes clinical research ethical? JAMA. 2000; 283:2701-11.
  • [27]Drummond MF, Sculpher MJ, Torrance GW, O’Brien BJ, Stoddart GL. Methods for the economic evaluation of health care programmes. Oxford University Press, Oxford; 2005.
  • [28]Claxton K, Palmer S, Longworth L, Bojke L, Griffin S, McKenna C, et al. Informing a decision framework for when NICE should recommend the use of health technologies only in the context of an appropriately designed programme of evidence development. Health Technol Assess. 2012;16:1–323.
  • [29]Eckermann S, Willan AR. Expected value of information and decision making in HTA. Health Econ. 2007; 16:195-209.
  • [30]Claxton KP, Sculpher MJ. Using value of information analysis to prioritise health research: some lessons from recent UK experience. Pharmacoeconomics. 2006; 24:1055-68.
  • [31]Claxton K, Griffin S, Koffijberg H, McKenna C. Expected health benefits of additional evidence: principles, methods and applications. Center for Health Economics, University of York, York; 2013.
  • [32]de Smet AM, Kluytmans JA, Cooper BS, Mascini EM, Benus RF, van der Werf TS et al.. Decontamination of the digestive tract and oropharynx in ICU patients. N Engl J Med. 2009; 360:20-31.
  • [33]Huis A, Schoonhoven L, Grol R, Borm G, Adang E, Hulscher M, et al. Helping hands: a cluster randomised trial to evaluate the effectiveness of two different strategies for promoting hand hygiene in hospital nurses. Implement Sci. 2011;6:101.
  • [34]de Hoop E, Woertman W, Teerenstra S. The stepped wedge cluster randomized trial always requires fewer clusters but not always fewer measurements, that is, participants than a parallel cluster randomized trial in a cross-sectional design. J Clin Epidemiol. 2013; 66:1428.
  • [35]Woertman W, de Hoop E, Moerbeek M, Zuidema SU, Gerritsen DL, Teerenstra S. Stepped wedge designs could reduce the required sample size in cluster randomized trials. J Clin Epidemiol. 2013; 66:752-8.
  • [36]Hemming K, Girling A. The efficiency of stepped wedge vs. cluster randomized trials: Stepped wedge studies do not always require a smaller sample size. J Clin Epidemiol. 2013; 66:1427-8.
  • [37]Hemming K, Haines TP, Chilton PJ, Girling AJ, Lilford RJ. The stepped wedge cluster randomised trial: rationale, design, analysis, and reporting. Br Med J. 2015; 350:h391.
  • [38]Weijer C, Grimshaw JM, Eccles MP, McRae AD, White A, Brehaut JC, Taljaard M. The Ottawa statement on the ethical design and conduct of cluster randomized trials. PLoS Med. 2012; 9:e1001346.
  • [39]Guyatt GH, Oxman AD, Kunz R, Woodcock J, Brozek J, Helfand M et al.. GRADE guidelines: 7. Rating the quality of evidence-inconsistency. J Clin Epidemiol. 2011; 64:1294-302.
  • [40]Zou GY, Donner A, Klar N. Group sequential methods for cluster randomization trials with binary outcomes. Clin Trials. 2005; 2:479-87.
  • [41]Hayes RJ, Moulton LH. Cluster randomised trials. CRC Press, Boca Raton; 2009.
  • [42]Poldervaart JM, Reitsma JB, Koffijberg H, Backus BE, Six AJ, Doevendans PA, et al. The impact of the HEART risk score in the early assessment of patients with acute chest pain: design of a stepped wedge, cluster randomised trial. BMC Cardiovasc Disord. 2013;13:77.
  • [43]Six AJ, Cullen L, Backus BE, Greenslade J, Parsonage W, Aldous S, et al. The HEART score for the assessment of patients with chest pain in the emergency department: a multinational validation study. Crit Pathw Cardiol. 2013;12:121–6.
  • [44]Backus BE, Six AJ, Kelder JC, Mast TP, van den Akker F, Mast EG, et al. Chest pain in the emergency room: a multicenter validation of the HEART Score. Crit Pathw Cardiol. 2010;9:164–9.
  • [45]de Araujo GP, Ferreira J, Aguiar C, Seabra-Gomes R. TIMI, PURSUIT, and GRACE risk scores: sustained prognostic value and interaction with revascularization in NSTE-ACS. Eur Heart J. 2005; 26:865-72.
  • [46]Antman EM, Cohen M, Bernink PJ, McCabe CH, Horacek T, Papuchis G, et al. The TIMI risk score for unstable angina/non-ST elevation MI: a method for prognostication and therapeutic decision making. JAMA. 2000;284:835–42.
  • [47]Fox KA, Dabbous OH, Goldberg RJ, Pieper KS, Eagle KA, Van de WF et al.. Prediction of risk of death and myocardial infarction in the six months after presentation with acute coronary syndrome: prospective multinational observational study (GRACE). Br Med J. 2006; 333:1091.
  • [48]Hemming K, Lilford R, Girling AJ. Stepped-wedge cluster randomised controlled trials: a generic framework including parallel and multiple-level designs. Stat Med. 2015; 34:181-96.
  • [49]Hemming K, Girling A. A menu-driven facility for power and detectable-difference calculations in stepped-wedge cluster-randomized trials. Stata J. 2014; 14:363-80.
  • [50]Duncan GJ, Kalton G. Issues of design and analysis of surveys across time. Int Stat Rev. 1987; 55:97-117.
  • [51]Feldman HA, Mckinlay SM. Cohort versus cross-sectional design in large field trials - precision, sample-size, and a unifying model. Stat Med. 1994; 13:61-78.
  • [52]Frison L, Pocock SJ. Repeated measures in clinical-trials - analysis using mean summary statistics and its implications for design. Stat Med. 1992; 11:1685-704.
  • [53]van der Tweel I, van der Graaf R. Issues in the use of stepped wedge cluster and alternative designs in the case of pandemics. Am J Bioeth. 2013; 13:23-4.
  • [54]Handley MA, Schillinger D, Shiboski S. Quasi-experimental designs in practice-based research settings: design and implementation considerations. J Am Board Fam Med. 2011; 24:589-96.
  • [55]Fatemi Y, Jacobson RM. The stepped wedge cluster randomized trial and its potential for child health services research: a narrative review. Acad Pediatr. 2015; 15:128-33.
  • [56]Pearson D, Torgerson D, McDougall C, Bowles R. Parable of two agencies, one of which randomizes. An Am Acad Polit Soc Sci. 2010; 628:11-29.
  • [57]van der Graaf R, Koffijberg H, Grobbee DE, de Hoop E, Moons KGM, van Thiel GJMW, et al. Rethinking the ethics of cluster randomized trials: a refinement of the Ottawa statement. J Clin Epidemiol. 2015;68:1108–14.
  文献评价指标  
  下载次数:21次 浏览次数:32次