Making trials matter: pragmatic and explanatory trials and the problem of applicability
Trials volume 10, Article number: 37 (2009)
Randomised controlled trials are the best research design for decisions about the effect of different interventions but randomisation does not, of itself, promote the applicability of a trial's results to situations other than the precise one in which the trial was done. While methodologists and trialists have rightly paid great attention to internal validity, much less has been given to applicability.
This narrative review is aimed at those planning to conduct trials, and those aiming to use the information in them. It is intended to help the former group make their trials more widely useful and to help the latter group make more informed decisions about the wider use of existing trials. We review the differences between the design of most randomised trials (which have an explanatory attitude) and the design of trials more able to inform decision making (which have a pragmatic attitude) and discuss approaches used to assert applicability of trial results.
If we want evidence from trials to be used in clinical practice and policy, trialists should make every effort to make their trial widely applicable, which means that more trials should be pragmatic in attitude.
The statistical experiment, or, as we know it in medicine, the randomised controlled trial (RCT) is among the more beautiful intersections between man and mathematics. RCTs minimise the risk of bias (threats to internal validity), particularly selection bias [1, 2] and are thus the best research design for decisions about the effect of different interventions, be they treatments, therapies or delivery methods and policies. But, as Cochrane noted, there is a snag: randomisation does not, of itself, promote external validity; that is, the applicability of a trial's results to situations other than the precise one in which the trial was done . It is thus possible for a trial to be free of bias but be of no relevance beyond the immediate setting, patients, and practitioners among whom it was conducted. This question of applicability is central to those who have to choose between therapies for groups of patients (policymakers), for their own patients (clinicians) or for themselves (patients and families). How likely is it, these decision makers may ask, that this treatment (apparently successful in this trial or review) will achieve important benefits in my context, administered to me by my clinicians, by me to my patients, or by clinicians to patients in my organisation? In other words, 'Are these published findings applicable to my decision?'.
This narrative review is aimed at those planning to conduct trials, and those aiming to use the information in them. It is intended to help the former group make their trials more widely useful and to help the latter group make more informed decisions about the wider use of existing trials. We review the differences between the design of most RCTs and the design of trials more able to inform decision making, discuss some approaches used to assert applicability and end by proposing:
That applicability should be explicitly considered by trialists as they plan their trial, and by decision makers when gathering evidence to guide decisions.
That trialists explicitly design their trials to produce results that are more widely applicable than is the case at present.
That trialists can and should report their trials in ways that make it easier for others to make judgements about their applicability.
That decision makers seek trials with a pragmatic attitude to inform choices that directly affect clinical care, health services delivery and health policies.
That researchers conduct, and research funders fund, empirical research to understand the major determinants of applicability.
Explanatory and pragmatic attitudes to trials
Two French statisticians, Schwartz and Lellouch, were acutely aware over 40 years ago of the limited applicability of many trial results beyond the artificial, 'laboratory' environment of the trial . They proposed a distinction between trials aimed at confirming a physiological hypothesis, precisely specified as a causal relationship between administration of an intervention and some physiological outcome (which they called 'explanatory') and the entirely different group of trials aimed at informing a clinical, health service or policy decision, where this decision involves the choice between two or more interventions (called 'pragmatic').
While explanatory trials have an important role in providing knowledge concerning the effects of precisely defined interventions applied to select groups under optimal conditions, healthcare interventions are seldom given under such circumstances [5, 6]. Moreover, inadequate consideration of applicability is the most frequent criticism by clinicians of randomised trials, systematic reviews and guidelines [7, 8]. For example, a clinician considering a treatment for secondary prevention of stroke might read results from the Heart Outcomes Prevention Evaluation (HOPE) trial and wonder what to make of the long list of exclusion criteria, the exclusion of nearly 1 in 10 of the remaining patients because of non-adherence, side-effects or withdrawal of consent in the pretrial run-in phase, and the use of placebo as comparator rather than aspirin . Calls for more trials with wide applicability have come both from those interested in improved treatment for clinical problems [10–12] and those interested in health policy [13, 14].
Schwartz and Lellouch characterised pragmatism as an attitude to trial design rather than a characteristic of the trial itself. Although some authors appear to suggest that a trial is either explanatory or pragmatic , there is a continuum rather than a dichotomy between explanatory and pragmatic trials with the pragmatic attitude explicitly favouring design choices that maximise applicability of the trial's results to usual care settings. As Schwartz and Lellouch wrote:
' [m]ost trials done hitherto have adopted the explanatory approach without question; the pragmatic approach would often have been more justifiable'.
As summarised in [16–18], we are aware of only a single study that has attempted to identify pragmatic trials (identified using: MeSH term 'clinical trial', keyword 'pragmatic' and authors' judgement that identified studies described clinical trials with a pragmatic attitude) and it found just 95 published between 1976 and 2002 . Since PubMed identifies over 168,000 RCTs for that period, trials with a pragmatic attitude are clearly the exception even if we make allowances for Vallvé et al.'s rather narrow search . This is at least in part due to the requirements of regulatory agencies, especially the US Food and Drug Administration (FDA) [16–18]. Although the FDA offers little guidance on the design of trials, what guidance there is argues against trials with a pragmatic attitude: ' [T]here are numerous ways of conducting a study that can obscure differences between treatments, such as poor diagnostic criteria, poor methods of measurement, poor compliance, medication errors, or poor training of observers. As a general statement, carelessness of all kinds will tend to obscure differences between treatments. Where the objective of a study is to show a difference, investigators have powerful stimuli toward assuring study excellence' . The FDA equates explanatory design choices with study excellence, thereby favouring trials that lack the attributes needed to support decisions about the applicability of a treatment or therapy to usual practice [16–18]. Conversely, the clinical, policy and funding decision makers who are expected to use these trials for real world funding and clinical decision making are not convinced of their relevance and applicability to their patients or settings .
How might a trial with a pragmatic attitude be more helpful to policymakers, clinicians and patients than an explanatory trial? Below we recount two trials demonstrating some of the problems created when trials are not widely applicable.
Consider the Vioxx Gastrointestinal Outcomes Research (VIGOR) trial, which assessed whether rofecoxib (Vioxx) was associated with a lower incidence of upper gastrointestinal events than the non-selective non-steroidal anti-inflammatory drug (NSAID) naproxen among patients with rheumatoid arthritis . The patients with rheumatoid arthritis included in this trial were highly selected; in particular, those with recent cardiovascular events and those taking aspirin were excluded. Patients were followed up for an average of 8 months. Despite VIGOR showing an increased risk of cardiovascular events in patients taking rofecoxib, this was attributed to the protective effect of naproxen. It was not until a later trial of rofecoxib with longer follow-up, the Adenomatous Polyp Prevention on Vioxx (APPROVe) trial, modified its protocol to include patients at higher baseline risk for a cardiovascular event to be enrolled that the increased cardiovascular risk became undeniable and rofecoxib was withdrawn from the market . Had VIGOR taken a more pragmatic approach to participant selection and follow-up, it is likely that the balance of benefit and harm for refecoxib would have been evaluated differently, and far fewer people would have been exposed to these risks.
The National Institute of Neurological Disorders and Stroke (NINDS) trial found a benefit for thrombolytic therapy when used with acute ischemic stroke patients who have had symptoms for less than 3 h . However, in clinical practice, a minority of patients present within 3 h. Moreover, the recruitment protocol of the NINDS trial required 50% of participants to have presented within 1.5 h, a group that is almost non-existent in practice . These design features practically guaranteed that the result of this trial would have poor applicability to the patients more typically seen in clinical practice, necessitating trials with wider inclusion criteria such as International Stroke Trial 3 (IST-3) .
The problem of applicability is enlarged when we consider guidelines, where many trials contribute to each recommendation. Travers et al. looked at the extent to which community-based asthma and chronic obstructive pulmonary disease (COPD) patients, respectively, would be eligible for the 17 major trials cited in the Global Initiative for Asthma (GINA) guidelines  or the 18 major trials cited in the Global Initiative for Chronic Obstructive Lung Disease (GOLD) guidelines . Of the 749 individuals responding to their survey, a median 4% with current asthma (range 0% to 36%) and a median 6% (range 0% to 43%) with current asthma on treatment met the eligibility criteria for the GINA trials. For the GOLD guidelines, a median 5% (range 0% to 20%) with COPD and a median 5% (range 0% to 9%) of those with COPD receiving treatment met inclusion criteria. Such restrictive entry criteria make it very difficult for clinicians to use such guidelines.
Using narrow inclusion criteria for a trial may be appropriate if: (a) there is evidence to support a strong relationship between the selection criteria and treatment response, (b) the criteria used for selection are reasonably common among the typical patient population, both for reasons of trial feasibility and for reasons of population impact, and (c) a typical clinician caring for patients with this condition could easily use these criteria to select patients. In the absence of selection criteria with these traits, the best estimate of treatment effect under real world conditions, is, for any individual patient, the average treatment effect of an intervention on an unselected group of patients with that condition, rather than the treatment effect found in a small and narrow subgroup, defined by multiple exclusion criteria. There are three reasons for this counterintuitive conclusion. Firstly, any single patient to whom we wish to apply the results of a trial is far more likely to be found within the ranks of unselected patients included in a pragmatic trial, than in the highly selected patients of an explanatory trial. Secondly, even though we have substantial epidemiological knowledge about the prognostic factors for disease incidence and outcomes in a population, we have far less knowledge of the clinical and biological characteristics of patients, which determine their treatment response. Thirdly, even in those instances where we know a prognostic factor which influences treatment response, few such factors are overwhelmingly powerful, and sufficiently common to be relevant on a large scale. For those few that are, it is rare to find any which can be implemented in a programmatic fashion once the intervention is proven for the group with that factor.
Our point is not that there should be no explanatory trials. One can argue that most first trials of a healthcare intervention with an obvious and well understood mechanism of action should be small, rapidly conducted pilot trials towards the explanatory end of the explanatory-pragmatic continuum . If this trial rules out a benefit in a select group of patients, treated under ideal conditions, who are thought, based on mechanistic reasoning, to be most likely to benefit then there is no need for more trials. But if the intervention does show a benefit it is still unclear whether it works in the real world, which is why a trial at the pragmatic end of the explanatory-pragmatic continuum is then needed. This trial should involve participants (both clinicians and patients) who are like those for whom the intervention is relevant in the real, messy world of clinical practice . This pragmatic trial should use the current accepted treatment as the comparator, require no more financial or staff resources than are currently available in the type of practice or clinic expected to deliver the new intervention, and the trial should measure an outcome that is of immediate importance to both patients and clinicians .
If an intervention is not well understood, or if the intervention has been used in another indication, and the mechanism by which it will provide benefit on a new indication is not clear, then a more pragmatic trial is the place to start. This can be followed, if subgroup analysis reveals startlingly different results in some participant groups, by a trial in which all participants lie within the group who obtained unique benefit.
There is broad agreement on the type of design decisions that make a trial explanatory or pragmatic in attitude [4, 11, 14, 15, 28–32], and Table 1 shows some key differences. Trialists who describe their trials as pragmatic have made design decisions that they believe will make it more likely their trial will achieve its purpose of informing real world decision about which among the alternative treatments to choose [33–37].
But how does a trialist with this goal in mind know that his or her trial does indeed have the right design for its purpose? There are at least two tools available to help trialists and others judge where on the explanatory-pragmatic continuum a trial is best placed though they have somewhat different aims. The first, developed by Gartlehner et al.  characterised trials as efficacy (explanatory) or effectiveness (pragmatic) trials, and was designed to classify trials for systematic review and to help clinicians judge the applicability of trial results. The tool has seven criteria considered relevant to judgements as to where a trial is placed on the efficacy-effectiveness continuum. These include considerations of the trial setting, its inclusion criteria, the choice of health outcome and the length of follow-up. The authors asked the directors of 12 evidence-based practice centres (centres that conduct systematic reviews) in the USA and Canada to nominate 6 trials each: 4 effectiveness studies and 2 efficacy trials. Then, 2 independent raters applied the tool's 7 criteria to the 24 trials that met the study's eligibility criteria. A score of 6 criteria met gave the best balance between sensitivity and specificity for identifying effectiveness trials; at this threshold the tool identified 13 of 18 trials judged to be effectiveness trials by the 12 directors. Used in this dichotomous fashion, however, the tool does tend to reinforce the misconception that a trial is either explanatory or pragmatic, rather than acknowledging that there is a continuum. It has the added problem that one criterion is whether or not the setting of the trial is in primary care, implying that a trial conducted in, say, a referral hospital cannot be oriented towards asking questions of real world effectiveness, even though many patients are treated in such settings.
A more recent tool, the pragmatic-explanatory continuum indicator summary (PRECIS) , (figure 1) is intended to be used by trialists designing a trial to assess the degree to which their design decisions align with the trial's stated purpose. This tool has 10 dimensions based on trial design decisions (for example, participant and practitioner expertise, flexibility with which the intervention can be delivered and choice of comparator), and presents these on a graphical, 10-spoked 'wheel'. A highly pragmatic trial is out at the rim, while explanatory trials are nearer the hub. Table 1 compares the highly pragmatic Directly Observed Treatment (DOT) trial  with the highly explanatory North American Symptomatic Carotid Endarterectomy Trial (NASCET) . The advantage of this graph is that it quickly highlights inconsistencies in how the 10 dimensions will be managed in a trial. For example, if the DOT trial had intensely monitored compliance and intervened when it faltered, a single glance at the wheel would have immediately identified this inconsistency with the trial's otherwise pragmatic attitude. This allows trialists to make adjustments, if possible and appropriate, to the design to obtain greater consistency with their trial's purpose.
While these tools do allow a trialist to assess the likely impact of his or her design decisions on the trial's ability to achieve its purpose, they do not address an important observation made by Karanicolas et al. that how pragmatic a trial is depends on perspective and context . There is disagreement as to which of perspective and context is the more important [40–43], although both are clearly relevant to someone trying to interpret a trial result. However, while perspective is a feature of the individual reading the trial report and hard for trialists to predict, context (the distinctive features of a trial's setting, participants, clinicians and other staff) is a feature of the trial itself and should be within the capabilities of trialists to describe. We would argue, therefore, that trialists should not worry about trying to guess the various perspectives of those making decisions but should instead do all they can to describe the context of their trial.
Two examples will help to illustrate this point. In a Dutch pragmatic trial comparing web-based self-help for problem drinking with a six-page, web-based psychoeducational brochure on alcohol , one of the inclusion criteria for the trial was that participants must have internet access. While this might not be considered a restrictive criterion in the Netherlands, where internet penetration in 2007 was 88% , this would not be the case in, say, Poland, where internet penetration is just under 30% . From the perspective of a clinician or policymaker in Poland, one could imagine that a trial conducted in the Dutch context is more explanatory, given the limited penetration of the internet in Poland. Trialists based in the Netherlands cannot be expected to know the perspectives of decision makers in Poland but they can be expected to recognise that an internet penetration of around 88% is part of the context of their trial and report it, which was not performed in this example . Another example is a pragmatic trial performed in Quebec, Canada, which compared a pharmacist-managed anticoagulation service with usual care delivered by general practitioners . Although these trialists found that care was similar for both groups, they also found that pharmacist-managed care was more expensive. However, as the authors report, this is context-specific because the comparator care provided by physicians was performed through telephone consultation, for which physicians receive no monetary compensation in Quebec. Without this contextual information some readers may conclude that the intervention is not applicable to their contexts; with it they may see an opportunity for improving the delivery of care.
Decisions about applicability depend on readers being able to assess the feasibility of the intervention in their own context . However, understanding what comprises the intervention (and often the comparator) is not always a simple matter of reading the trial report . Detailed reporting of the content of interventions, especially complex, non-pharmacological ones, is often poor [49–51]. For example, 41 of 80 published descriptions of studies selected for abstraction by the journal Evidence-Based Medicine from October 2005 to October 2006 failed to adequately describe all elements of the intervention . A study of 47 trials involving nurses found that information about the nurses delivering interventions (for example, qualifications, experience, training) was often lacking . This is important contextual information without which it is extremely difficult for readers to make informed judgements about applicability; indeed, it may be impossible.
The recent Consolidated Standards of Reporting Trials (CONSORT) Statement extension for the reporting of pragmatic trials should go some way to improving the reporting of contextual information , especially the recommendation for reporting information about the participants and on applicability (or generalisabilty) of the trial findings. Initiatives such as the Workgroup for Intervention Development and Evaluation Research (WIDER) , the CONSORT extension for non-pharmacological treatments  and the Standards for QUality Improvement Reporting Excellence (SQUIRE) Statement  are likely to help others to both judge the applicability of an intervention to their own setting and implement it should they choose to. A trial report with a poor description of the intervention is effectively rendered useless because implementing it elsewhere becomes a matter of guesswork. Readers need to know 'who, what, when and where' .
For pragmatic trials, where the intention is to interfere as little as possible with the usual process of care, understanding context is essential. But how are its effects measured? Despite its importance, there is little work exploring how context might influence the results of a trial, or the feasibility of widespread implementation.
The Normalisation Process Model [55–58] may be able to help. The model was developed to guide the design of evaluations of the implementation of complex interventions but applies equally to the study of simple interventions that have complex requirements of the healthcare system needed to deliver them. It may also be adapted to guide the investigation of the feasibility of interventions in advance of their implementation, as it assists in the systematic and comprehensive mapping of the human, organisational and resource changes that an intervention will require. Some interventions can only be implemented with major structural or organisational changes to healthcare delivery; trials evaluating these interventions might be called 'aspirational'. The Normalisation Process Model could help to identify such interventions and allow trialists and others to better judge whether the required changes are feasible on a wide scale and whether the likely benefit of the intervention justifies making them. Documents linked to trial reports could provide empirical data, both quantitative and qualitative, on the features of health care providers, patients or working practices which influenced the observed results, putting judgements about the feasibility of interventions (and hence applicability of the trial's results) on a firmer basis.
The applicability of trial results can also be estimated through statistical modelling. Here the influence of one or more features of a trial, such as the selection of participants, is investigated using statistical techniques to see how sensitive the trial result is to the feature or features being varied. For example, Greenhouse et al. have developed techniques for making what they call generalisabilty judgements, which are based on comparisons between RCT participants and individuals included in large surveys, databases and epidemiologic studies that are known to be representative of the population of interest . These authors were interested in a question familiar to users of trials with a pragmatic attitude: how similar are the trial's participants to those of the target population in general? This is clearly relevant to applicability. Greenhouse et al. compared the demographic profiles of youths included in trials of antidepressants with the profiles of depressed adolescents contained in a national database, the Youth Risk Behaviour Survey. Although both the trial and survey populations were found to be similar for most demographic characteristics, the rate of suicidal ideation and suicidal behaviours (the trial's primary outcomes) in the trial participants was found to be about half the adjusted rate among depressed adolescents in the USA as estimated from the national database (3.6% vs 7.1%) . This difference appeared to be due to trials excluding adolescents considered to be at high risk of suicide. Although one might reach the same general conclusion of limited applicability after using, say, the PRECIS tool , the technique used by Greenhouse et al. provides a quantitative estimate of applicability, at least with regard to participant selection. Other aspects of applicability have also been considered using quantitative methods. Yamaguchi and Ohashi used a proportional hazards model to investigate the influence of treatment-by-centre and baseline risk on the trial result in a multicentre superficial bladder cancer trial . Yamaguchi and Ohashi found that although there was some variation between centres, especially in the baseline risk, this made little difference to the estimate of treatment effect. While we shouldn't overstate the predictive power of modelling on the benefits of an intervention applied outside the trial's original context, it does have a role to play.
An internally valid trial that has poor applicability, or is reported in such a way that it is difficult or impossible for others to make judgements about its applicability, is a lost opportunity to influence clinical practice and healthcare delivery. It is worth repeating a line from Rothwell's 2005 Lancet paper: 'Lack of consideration of external validity is the most frequent criticism by clinicians of RCTs, systematic reviews, and guidelines' . An increase in the number of well-designed trials with a pragmatic attitude is surely needed. Perhaps the FDA and other regulatory authorities might also consider revisiting the gap in their regulations on the design of trials whose goal is to support decision making. The FDA's dismissal of much of the reality of real-life clinical practice as carelessness to be avoided in a trial does not help a trialist who wants to design a trial that can be used by policymakers and clinicians to decide which of several competing treatments they should be using in the unkempt world of usual practice.
Some trials aim to provide data on whether an intervention can be effective under optimal conditions; these trials have an explanatory attitude. Others aim to show that an intervention is effective in real and far from ideal conditions; these trials have a pragmatic attitude. Both attitudes have their place. However, we believe that:
More trials should have a pragmatic attitude.
Trialists should give as much care and attention to issues of applicability as they already do to issues of internal validity.
So, what should trialists do to improve the applicability of their trials? Trialists should routinely ask themselves at the design stage of their trial 'Who are the people I expect to use the results of my trial and what can I do to make sure that these people will not be forced to dismiss my trial as irrelevant to them, their patients, or their healthcare systems?' Rothwell gives a good list of issues that affect applicability , and Table 1's 'Pragmatic attitude' column gives pointers to design issue that can increase applicability, as does the CONSORT extension for pragmatic trials . The PRECIS tool  and that by Gertlehner et al.  can help trialists to match design to purpose. While there is some evidence suggesting factors that have influenced applicability, there is not enough empirical study of this question, and we are in need of a body of work similar to that performed over the past decades on internal validity. We would suggest that attention is given to the following:
Summarising existing evidence on the relevance of trials to decision making within a trial's own context and, if available, on relevance to other contexts.
Developing a methodology for identifying contextual factors of importance and estimating their influence on applicability.
Developing standards for describing and reporting contextual information.
Wells provides a list of research recommendations linked to trials of complex interventions, which is also relevant .
Users of trial reports need to make judgements about the applicability of the results to their own context, a task to which those designing the trial often give insufficient thought. If we want evidence from trials to be used in clinical practice (and we do), trialists should make every effort to make their trial widely applicable, which means that more trials should be pragmatic in attitude [16–18]. Trialists should not give policymakers, clinicians and patients reason to ignore research evidence.
Altman DG, Bland JM: Statistics notes. Treatment allocation in controlled trials: why randomise?. BMJ. 1999, 318: 1209-
Guyatt GH, Haynes RB, Jaeschke RZ, Cook DJ, Green L, Naylor CD, Wilson MC, Richardson WS: Users' Guides to the Medical Literature: XXV. Evidence-based medicine: principles for applying the Users' Guides to patient care. Evidence-Based Medicine Working Group. JAMA. 2000, 284: 1290-1296. 10.1001/jama.284.10.1290.
Cochrane AL: Effectiveness and efficiency. Random reflections on health services. 1972, London, UK: Nuffield Provincial Hospitals Trust
Schwartz D, Lellouch J: Explanatory and pragmatic attitudes in therapeutical trials. J Chronic Dis. 1967, 20: 637-648. 10.1016/0021-9681(67)90041-0.
Coca SG, Krumholz HM, Garg AX, Parikh CR: Underrepresentation of renal disease in randomized controlled trials of cardiovascular disease. JAMA. 2006, 296: 1377-1384. 10.1001/jama.296.11.1377.
Lee PY, Alexander KP, Hammill BG, Pasquali SK, Peterson ED: Representation of elderly persons and women in published randomized trials of acute coronary syndromes. JAMA. 2001, 286: 708-713. 10.1001/jama.286.6.708.
Glasgow R, Davidson K, Dobkin P, Ockene J, Spring B: Practical behavioural trials to advance evidence-based behavioural medicine. Ann Behav Med. 2006, 31: 5-13. 10.1207/s15324796abm3101_3.
Rothwell PM: External validity of randomised controlled trials: "To whom do the results of this trial apply?". Lancet. 2005, 365: 82-93. 10.1016/S0140-6736(04)17670-8.
Bosch J, Yusuf S, Pogue J, Sleight P, Lonn E, Rangoonwala B, Davies R, Ostergren J, Probstfield J, HOPE Investigators: Heart outcomes prevention evaluation. Use of ramipril in preventing stroke: double blind randomised trial. BMJ. 2002, 324: 699-10.1136/bmj.324.7339.699.
Rothwell PM: Factors that can affect the external validity of randomised controlled trials. PLoS Clinical Trials. 2006, 1: e9-10.1371/journal.pctr.0010009.
Hotopf M, Churchill R, Glyn L: Pragmatic randomised controlled trials in psychiatry. Br J Psychiatry. 1999, 175: 217-223. 10.1192/bjp.175.3.217.
Marson A, Kadir Z, Chadwick D: Large pragmatic randomised studies of new antiepileptic drugs are needed. BMJ. 1997, 314: 1764-
Lavis JN, Posada FB, Haines A, Osei E: Use of research to inform public policymaking. Lancet. 2004, 364: 1615-1621. 10.1016/S0140-6736(04)17317-0.
Tunis SR, Stryer DB, Clancy CM: Practical clinical trials: increasing the value of clinical research for decision making in clinical and health policy. JAMA. 2003, 290: 1624-1632. 10.1001/jama.290.12.1624.
Godwin M, Ruhland L, Casson I, MacDonald S, Delva D, Birtwhistle R, Lam M, Seguin R: Pragmatic controlled clinical trials in primary care: the struggle between external and internal validity. BMC Med Res Methodol. 2003, 3: 28-10.1186/1471-2288-3-28.
Zwarenstein M, Treweek S: What kind of randomized trials do we need?. J Clin Epi. 2009, 62 (2): 461-463. 10.1016/j.jclinepi.2009.01.011.
Zwarenstein M, Treweek S: What kind of randomized trials do we need?. CMAJ. 2009, 180: 998-1000.
Zwarenstein M, Treweek S: What kind of randomized trials do patients and clinicians need?. Ann Intern Med. 2009, 150: JC5-2, JC5-3.
Vallvé C: A critical review of the pragmatic clinical trial [in Spanish]. Med Clin (Barc). 2003, 27: 384-388. 10.1157/13052554.
US Food and Drug Administration, Guidance for Institutional Review Boards and Clinical Investigators 1998 Update. http://www.fda.gov/ScienceResearch/SpecialTopics/RunningClinicalTrials/GuidancesInformationSheetsandNotices/ucm117847.htm
Bombardier C, Laine L, Reicin A, Shapiro D, Burgos-Vargas R, Davis B, Day R, Ferraz MB, Hawkey CJ, Hochberg MC, Kvien TK, Schnitzer TJ, VIGOR Study Group: Comparison of upper gastrointestinal toxicity of rofecoxib and naproxen in patients with rheumatoid arthritis. N Engl J Med. 2000, 343: 1520-1528. 10.1056/NEJM200011233432103.
Bresalier RS, Sandler RS, Quan H, Bolognese JA, Oxenius B, Horgan K, Lines C, Riddell R, Morton D, Lanas A, Konstam MA, Baron JA, Adenomatous Polyp Prevention on Vioxx (APPROVe) Trial Investigators: Cardiovascular events associated with rofecoxib in a colorectal adenoma chemoprevention trial. N Engl J Med. 2005, 352: 1092-1102. 10.1056/NEJMoa050493.
The National Institute of Neurological Disorders and Stroke rt-PA Stroke Study Group: Tissue plasminogen activator for acute ischemic stroke. New Engl J Med. 1995, 333: 1581-1587. 10.1056/NEJM199512143332401.
Hoffman JR: Tissue plasminogen activator for acute ischemic stroke: is the CAEP Position Statement too negative?. CJEM. 2001, 3 (3): 183-185.
Sandercock P, Lindley R, Wardlaw J, Dennis M, Lewis S, Venables G, Kobayashi A, Czlonkowska A, Berge E, Slot KB, Murray V, Peeters A, Hankey G, Matz K, Brainin M, Ricci S, Celani MG, Righetti E, Cantisani T, Gubitz G, Phillips S, Arauz A, Prasad K, Correia M, Lyrer P, the IST-3 collaborative group: The third international stroke trial (IST-3) of thrombolysis for acute ischaemic stroke. Trials. 2008, 9: 37-
Travers J, Marsh S, Williams M, Weatherall M, Caldwell B, Shirtcliffe P, Aldington S, Beasley R: External validity of randomised controlled trials in asthma: to whom do the results of the trials apply?. Thorax. 2007, 62: 219-223. 10.1136/thx.2006.066837.
Travers J, Marsh S, Caldwell B, Williams M, Aldington S, Weatherall M, Shirtcliffe P, Beasley R: External validity of randomized controlled trials in COPD. Respir Med. 2007, 101: 1313-1320. 10.1016/j.rmed.2006.10.011.
Sackett DL: Explanatory vs. management trials. Clinical Epidemiology: How to Do Clinical Practice Research. Edited by: Haynes RB, Sackett DL, Guyatt GH, Tugwell P. 2005, Philadelphia, PA: Lippincott, Williams and Wilkins
Thorpe KE, Zwarenstein M, Oxman AD, Treweek S, Furberg CD, Altman DG, Tunis S, Bergel E, Harvey I, Magid DJ, Chalkidou K: A pragmatic-explanatory continuum indicator summary (PRECIS): a tool to help trial designers. CMAJ. 2009, 180: E47-57.
Gartlehner G, Hansen RA, Nissman D, Lohr KN, Carey TS: A simple and valid tool distinguished efficacy from effectiveness studies. J Clin Epidemiol. 2006, 59: 1040-1048. 10.1016/j.jclinepi.2006.01.011.
Alford L: On differences between explanatory and pragmatic clinical trials. NZ J Physiother. 2007, 35: 12-16.
Hotopf M: The pragmatic randomised controlled trial. Advan Psychiatr Treat. 2002, 8 (5): 326-333. 10.1192/apt.8.5.326.
Cuthbertson BH, Rattray J, Johnston M, Wildsmith A, Wilson E, Hernendez R, Ramsey C, Hull AM, Norrie J, Campbell M: A pragmatic randomised, controlled trial of intensive care follow up programmes in improving longer-term outcomes from critical illness. The PRACTICAL study (study protocol). BMC Health Serv Res. 2007, 7: 116-10.1186/1472-6963-7-116.
Hutchings J, Bywater T, Daley D, Gardner F, Whitaker C, Jones K, Eames C, Edwards RT: Parenting intervention in Sure Start services for children at risk of developing conduct disorder: pragmatic randomised controlled trial. BMJ. 2007, 334: 678-10.1136/bmj.39126.620799.55.
Mutrie N, Campbell AM, Whyte F, McConnachie A, Emslie C, Lee L, Kearney N, Walker A, Ritchie D: Benefits of supervised group exercise programme for women being treated for early stage breast cancer: pragmatic randomised controlled trial. BMJ. 2007, 334: 517-10.1136/bmj.39094.648553.AE.
Hay EM, Paterson SM, Lewis M, Hosie G, Croft P: Pragmatic randomised controlled trial of local corticosteroid injection and naproxen for treatment of lateral epicondylitis of elbow in primary care. BMJ. 1999, 319: 964-968.
Steinsbekk A, Fonnebo V, Lewith G, Bentzen N: Homeopathic care for the prevention of upper respiratory tract infections in children: a pragmatic, randomised, controlled trial comparing individualised homeopathic care and waiting-list controls. Complement Ther Med. 2005, 13: 231-238. 10.1016/j.ctim.2005.06.007.
Zwarenstein M, Schoeman JH, Vundule C, Lombard CJ, Tatley M: Randomised controlled trial of self-supervised and directly observed treatment of tuberculosis. Lancet. 1998, 352: 1340-1343. 10.1016/S0140-6736(98)04022-7.
North American Symptomatic Carotid Endarterectomy Trial Collaborators: Beneficial effect of carotid endarterectomy in symptomatic patients with high-grade carotid stenosis. N Engl J Med. 1991, 325: 445-453.
Karanicolas PJ, Montori VM, Devereaux PJ, Schünermann H, Guyatt GH: A new "mechanistic-practical" framework for designing and interpreting randomized trials. J Clin Epidemiol. 2009, 62: 479-484. 10.1016/j.jclinepi.2008.02.009.
Oxman AD, Lombard C, Treweek S, Gagnier JJ, Maclure M, Zwarenstein M: Why we will remain pragmatists: four problems with the impractical mechanistic framework and a better solution. J Clin Epidemiol. 2009, 62: 485-488. 10.1016/j.jclinepi.2008.08.015.
Karanicolas PJ, Montori VM, Devereaux PJ, Schünermann H, Guyatt GH: The practicalists' response. J Clin Epidemiol. 2009, 62: 489-494. 10.1016/j.jclinepi.2008.08.013.
Oxman AD, Lombard C, Treweek S, Gagnier JJ, Maclure M, Zwarenstein M: A pragmatic resolution. J Clin Epidemiol. 2009, 62: 495-498. 10.1016/j.jclinepi.2008.08.014.
Riper H, Kramer J, Smit F, Conijn B, Schippers G, Cuijpers P: Web-based self-help for problem drinkers: a pragmatic randomized trial. Addiction. 2007, 103 (2): 218-227. 10.1111/j.1360-0443.2007.02063.x.
Internet World Stats Netherlands Internet Usage Stats and Telecom Reports. http://www.internetworldstats.com/eu/nl.htm
Internet World Stats Poland Internet Usage Stats and Telecom Reports. http://www.internetworldstats.com/eu/pl.htm
Lalonde L, Martineau J, Blais N, Montigny M, Ginsberg J, Fournier M, Berbiche D, Vanier MC, Blais L, Perreault S, Rodrigues I: Is long-term pharmacist-managed anticoagulation service efficient? A pragmatic randomized controlled trial. Am Heart J. 2008, 156: 148-154. 10.1016/j.ahj.2008.02.009.
Zwarenstein M, Treweek S, Gagnier J, Altman DG, Maclure M, Tunis S, Haynes B, Oxman AD, Moher D: Improving the reporting of pragmatic trials: an extension of the CONSORT Statement. BMJ. 2008, 337: a2390-10.1136/bmj.a2390.
Glasziou P, Meats E, Heneghan C, Shepperd S: What is missing from descriptions of treatment in trials and reviews?. BMJ. 2008, 336: 1472-1474. 10.1136/bmj.39590.732037.47.
Groves T: How to do it. BMJ. 2007, 335:
Boutron I, Moher D, Altman DG, Schulz KF, Ravaud P: Extending the CONSORT statement to randomized trials of nonpharmacologic treatment: explanation and elaboration. Ann Intern Med. 2008, 148: W60-66.
Lindsay B: Randomized controlled trials of socially complex nursing interventions: creating bias and unreliability?. J Adv Nurs. 2004, 45: 84-94. 10.1046/j.1365-2648.2003.02864.x.
Ogrinc G, Mooney SE, Estrada C, Foster T, Goldmann D, Hall LW, Huizinga MM, Liu SK, Mills P, Neily J, Nelson W, Pronovost PJ, Provost L, Rubenstein LV, Speroff T, Splaine T, Thomson R, Tomolo AM, Watts B: The SQUIRE (Standards for QUality Improvement Reporting Excellence) guidelines for quality improvement reporting: explanation and elaboration. Qual Saf Health Care. 2008, 17: i13-32. 10.1136/qshc.2008.029058.
May C: A rational model for assessing and evaluating complex interventions in health care. BMC Health Serv Res. 2006, 6: 1-11. 10.1186/1472-6963-6-86.
May C, Finch T, Mair F, Ballini L, Dowrick C, Eccles M, Gask L, MacFarlane A, Murray E, Rapley T, Rogers A, Treweek S, Wallace P, Anderson G, Burns J, Heaven B: Understanding the implementation of complex interventions in health care: the normalisation process model. BMC Health Serv Res. 2007, 7: 148-10.1186/1472-6963-7-148.
May C, Mair FS, Dowrick CF, Finch TL: Process evaluation of complex interventions in primary care: understanding trials using the normalization process model. BMC Fam Pract. 2007, 8: 42-10.1186/1471-2296-8-42.
May C, Finch T: Implementation, embedding, and integration: an outline of normalization process theory. Sociology. 2009,
Greenhouse JB, Kaizar EE, Kelleher K, Seltman H, Gardner W: Generalizing from clinical trial data: a case study. The risk of suicidality among pediatric antidepressant users. Stat Med. 2008, 27: 1801-1813. 10.1002/sim.3218.
Yamaguchi T, Ohashi Y: Investigating centre effects in a multi-centre clinical trial of superficial bladder cancer. Stat Med. 1999, 18: 1961-1971. 10.1002/(SICI)1097-0258(19990815)18:15<1961::AID-SIM170>3.0.CO;2-3.
Wells EM: Behind the scenes of randomised trials of complex interventions – insiders reveal the importance of context. PhD Thesis. 2007, University of Dundee; School of Nursing and Midwifery
The authors declare that they have no competing interests.
Both authors contributed to the design of the paper. ST wrote the first draft. Both authors critically reviewed and edited drafts and read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Treweek, S., Zwarenstein, M. Making trials matter: pragmatic and explanatory trials and the problem of applicability. Trials 10, 37 (2009). https://doi.org/10.1186/1745-6215-10-37
- Chronic Obstructive Pulmonary Disease
- Pragmatic Trial
- Explanatory Trial
- Directly Observe Treatment