Skip to main content

Clinical trial results in context: comparison of baseline characteristics and outcomes of 38,510 RECOVERY trial participants versus a reference population of 346,271 people hospitalised with COVID-19 in England

Abstract

Background

Randomised trials are essential to reliably assess medical interventions. Nevertheless, interpretation of such studies, particularly when considering absolute effects, is enhanced by understanding how the trial population may differ from the populations it aims to represent.

Methods

We compared baseline characteristics and mortality of RECOVERY participants recruited in England (n = 38,510) with a reference population hospitalised with COVID-19 in England (n = 346,271) from March 2020 to November 2021. We used linked hospitalisation and mortality data for both cohorts to extract demographics, comorbidity/frailty scores, and crude and age- and sex-adjusted 28-day all-cause mortality.

Results

Demographics of RECOVERY participants were broadly similar to the reference population, but RECOVERY participants were younger (mean age [standard deviation]: RECOVERY 62.6 [15.3] vs reference 65.7 [18.5] years) and less frequently female (37% vs 45%). Comorbidity and frailty scores were lower in RECOVERY, but differences were attenuated after age stratification. Age- and sex-adjusted 28-day mortality declined over time but was similar between cohorts across the study period (RECOVERY 23.7% [95% confidence interval: 23.3–24.1%]; vs reference 24.8% [24.6–25.0%]), except during the first pandemic wave in the UK (March–May 2020) when adjusted mortality was lower in RECOVERY.

Conclusions

Adjusted 28-day mortality in RECOVERY was similar to a nationwide reference population of patients admitted with COVID-19 in England during the same period but varied substantially over time in both cohorts. Therefore, the absolute effect estimates from RECOVERY were broadly applicable to the target population at the time but should be interpreted in the light of current mortality estimates.

Trial registration

ISRCTN50189673- Feb. 04, 2020, NCT04381936- May 11, 2020.

Peer Review reports

Background

Randomised controlled trials (RCTs) are essential to reliably evaluate safety and efficacy of health interventions [1, 2]. The use of randomisation (with allocation concealment) minimises the risk of bias, but, inevitability, due to eligibility criteria, trial participants are rarely representative of the populations whose treatment they aim to inform. Nonetheless, the proportional estimates of treatment effects from the trial are usually generalisable to the broader population, unless there are good grounds for believing there may be systematic differences in the effectiveness of the intervention or in the biology of the target disease outside of the trial setting (e.g. the advent of a new variant that renders a pathogen resistant to the particular drug that was studied) [3]. However, the estimates of absolute harm and benefit generated by such trials may not be directly generalisable, and assessment of the absolute rates of the relevant outcomes in the target population is useful to understand the likely absolute effects of the intervention in clinical practice [4, 5].

The Randomised Evaluation of COVID-19 Therapy (RECOVERY) trial is a randomised, controlled, open-label, pragmatic, platform trial of potential therapies for patients hospitalised with COVID-19 [6]. Eligibility criteria were broad and simple (i.e. hospitalisation for suspected or confirmed COVID-19), and trial procedures streamlined to be feasible in local practice. Data collection by trial staff, using dedicated case report forms (CRF), focused on the minimum information needed and was complemented with extensive linkage to several healthcare systems data sources in the UK. The trial took place in all acute UK National Health Service (NHS) hospitals, and in several other countries globally.

Here, we aimed to compare the baseline characteristics (demographics and comorbidities) and all-cause 28-day mortality (the trial primary outcome) for RECOVERY participants with a reference population hospitalised with COVID-19, within England.

Methods

RECOVERY cohort

The RECOVERY trial design has been described previously [6]. Briefly, RECOVERY recruited patients admitted to hospital with confirmed or suspected COVID-19 who were considered suitable for inclusion by their attending clinical team. Recruitment was not targeted to any particular subgroups or aimed at achieving a representative sample of the target population; the aim was to recruit a large number of participants rapidly. Randomisation was performed via a short online CRF in which essential baseline data are collected. Follow-up data were collected using a simple CRF upon death, hospital discharge, or at 28 days from randomisation (whichever occurs sooner). In the UK, these data were complemented with linkage to national healthcare systems data sources. The protocol, data analysis plan, baseline characteristics and outcome derivation documentation, and published results are openly available at www.recoverytrial.net, and the trial is registered with ISRCTN (50189673) and ClinicalTrials.gov (NCT04381936). Written informed consent was obtained from all the patients or from a legal representative if they were unable to provide consent. The RECOVERY trial has been approved by the UK Medicines and Healthcare products Regulatory Agency and the Cambridge East Research Ethics Committee (reference 20/EE/0101).

For this analysis, we included all RECOVERY participants recruited in England who had not withdrawn consent and had available healthcare systems data on hospital admissions (Hospital Episode Statistics [HES]) [7], with or without mortality data from official death records (Civil Registrations) [8]. We excluded children aged < 16 years due to difficulties in accessing linked healthcare systems data in this group in RECOVERY. HES data contained information on admissions to all NHS hospitals in England (using standardised coding practices since the 1990s), namely admission and discharge dates and relevant diagnostic and procedure codes. Diagnostic codes are recorded using the International Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) clinical terminology and can be assigned a position from 1 to 20; codes in position 1 usually indicate the primary cause of admission (or main cause of extension of hospital stay) [9]. Civil Registrations included information on date of death and underlying and contributing causes of death (also coded using ICD-10). HES and Civil Registrations were linked and supplied by NHS England [10].

Reference population

To derive a reference population of people hospitalised with COVID-19 in England (thus potentially eligible for RECOVERY), we used an anonymised database covering the entirety of England which includes linked HES and Civil Registrations data continuously collected since 1999. These data were linked and supplied by NHS England, and are analysed at the University of Oxford [11]. More information can be found in the NHS England Data Uses Register at http://digital.nhs.uk/services/data-access-request-service-dars/data-uses-register (reference: DARS-NIC-315419-F3W7K). Approval for the use of the datasets was provided by the Central and South Bristol Research Ethics Committee (ref 04/Q2006/176).

The reference population was ascertained based on the presence of a COVID-19 ICD-10 code (U071—‘COVID-19, Virus identified’, or U072—‘COVID-19, Virus not identified’) [12]. This approach was informed by preliminary cross-validation work (Additional file 1: Annex III) using linked HES and SARS-CoV-2 testing data for RECOVERY participants, which showed 92% of RECOVERY participants recruited in England with a positive SARS-CoV-2 test (as captured in NHS England’s COVID-19 Second Generation Surveillance System—SGSS dataset) [13] had an admission in the HES data which included one of these codes in the primary diagnostic position. We therefore restricted our reference population to individuals with relevant ICD-10 codes in the primary position to avoid inclusion of people in whom COVID-19 was not the main reason for care. The RECOVERY cohort is largely contained within the reference population, but given the anonymised nature of the national datasets it was not possible to identify them.

Analysis period

For each individual in RECOVERY and the reference population, we assigned an index date as the start of the earliest HES episode with U071/U072 in the first diagnostic position. For RECOVERY participants with index dates before 1 March 2020 (indicating long episodes before inclusion in the study; n = 22) or no COVID-19 codes in their HES records (n = 1465), we used randomisation date as the index date. We then restricted our analysis period to index dates between 1 March 2020 and 30 November 2021 inclusive. These analyses were not extended beyond this time-point as the launch of the high-dose dexamethasone comparison in the UK (only suitable to patients with oxygen or ventilation requirements) resulted in more selected patient populations being included in the trial [14].

Baseline characteristic and outcomes

We used HES data in both cohorts to extract baseline clinical characteristics and demographics including age, sex, ethnicity, deprivation (quintile of Index of Multiple Deprivation 2019) [15], geographical location, Charlson Comorbidity Score [16, 17] and its components, and Hospital Frailty Risk Score [18]. Comorbidities were defined as the presence of a relevant ICD-10 code in any diagnostic position recorded within 5 years before the index date (i.e. excluding the index episode). Further methodological details, including the ICD-10 codes used, are provided in Additional file 1: Annex I. Geographical location data (including for deprivation assessments) were extracted from HES records and ascertained from full postcode in the RECOVERY HES data and lower-super output area of the postcode in the national HES data.

For outcomes, we calculated all-cause mortality within 28 days using linked HES and Civil Registrations data. Ascertainment of fact and date of death was based on these linked data sources (derivation methodology described elsewhere) [19]. We considered death records occurring in either healthcare systems data source. We ignored reports of deaths of RECOVERY participants recorded only on the CRF data as there were no CRF data for the reference population.

Statistical analyses

This analysis is limited to RECOVERY participants in England with available HES data. To assess how this selection may have affected the cohort characteristics, we first compared the characteristics of those recruited in England with those recruited in other UK nations (using CRF data for all characteristics except ethnicity, and healthcare systems data in each nation for ethnicity). We then compared the characteristics of RECOVERY participants recruited in England who had available HES data with those who did not (using CRF data for all characteristics except ethnicity, for which we used healthcare systems data from primary care).

We compared baseline characteristics and 28-day mortality of the RECOVERY cohort with those of the reference population, in each case restricted to England only. Age was stratified into 4 groups: < 60, 60–69, 70–79, and ≥ 80. We presented continuous parameters as mean with standard deviation (SD) or median with interquartile range (IQR) as appropriate (with visual assessment of frequency distribution for normality) and frequency counts and percentage distribution for categorical parameters. We compared age, sex, and region of residence by calculating a representativeness ratio—defined as the proportion of people within RECOVERY in each category divided by the proportion of people within the reference population in the same category—and presented these along with 95% confidence intervals [95% CI] [20]. We also calculated a recruitment ratio defined as number of individuals included in RECOVERY divided by the number of individuals in the reference population. We then aggregated individuals in each cohort into three-month periods and conducted the same calculations as above for each time period separately.

The primary RECOVERY trial outcome of 28-day all-cause mortality was calculated starting from the index date in both cohorts, overall and over time (by three-month periods). We presented crude and age- and sex-adjusted mortality rates with 95% CI, with adjustment performed using direct standardisation methods [20] (i.e. applying RECOVERY mortality rates to the reference population age and sex composition using the age groups mentioned above). Further methodological details are provided in Additional file 1: Annex I.

We used Stata v17/MP to derive baseline characteristics and outcomes in HES and Civil Registrations data in both cohorts and R v4.2.1 for all subsequent data management, statistical analysis, and plotting (further details are provided in Additional file 1: Annex I).

Results

Baseline characteristics

Up until 1st September 2022, RECOVERY recruited 46,010 participants, of which 44,766 in the UK and 39,952 in England. Of these, 39,304 (98.4%) had available HES data, and 38,780 were recruited within the analysis period (1 March 2020–30 November 2021). After excluding participants aged below 16 at the index date, a total of 38,510 participants were finally included in our analysis (Fig. 1). RECOVERY participants recruited in other UK nations had generally similar characteristics to those recruited in England (Additional file 1: Supplementary Table S1). People with no HES data available were younger, less frequently of white ethnicity, and had generally lower comorbidity burden and need for respiratory support at randomisation (Additional file 1: Supplementary Table S2). The reference population included 346,271 individuals (Fig. 1); for every 100 people admitted with COVID-19 in England, 11 participants were recruited to RECOVERY. When considering geographical region, the proportion of relevant patients recruited to RECOVERY in London, West Midlands, and Yorkshire and The Humber was lower than in the other England regions (Fig. 2).

Fig. 1
figure 1

CONSORT diagram depicting the cohort derivation process. ‘a’ symbol indicates the following: randomised up until 1 September 2022. ‘b’ symbol indicates the following: index date is the episode start date for the earliest episode with a COVID-19 ICD-10 code in the primary diagnostic position. ‘c’ symbol indicates the following: up to June 2022 (latest data included in the raw extract)

Fig. 2
figure 2

Geographical representativeness of the RECOVERY trial cohort in comparison with the national reference population. Number of RECOVERY participants plotted at the location of the recruiting NHS Trust hospital site. Recruitment ratios were calculated by dividing the number of RECOVERY participants recruited in each region by the number of individuals in the reference population in the same region and are presented by region. The average recruitment ratio across all English regions was 11.1%. There were 1097 and 2409 individuals with missing residential area in HES data in the RECOVERY and the reference population cohort, respectively

Table 1 shows the baseline characteristics of both cohorts. RECOVERY participants were less frequently female (RECOVERY 37% vs reference population 45%) and were on average slightly younger than the reference population (mean age [SD]: 62.6 [15.3] vs 65.7 [18.5] years), with people aged 80 + and women underrepresented in RECOVERY throughout the analysis period (Additional file 1: Supplementary Figure S1 and Supplementary Table S3). RECOVERY participants were more frequently of White background (83% vs 79%) (Table 1 and Additional file 1: Supplementary Figure S2) but had similar deprivation status overall and throughout the study period (Additional file 1: Supplementary Figure S3).

Table 1 Baseline cohort characteristics

With respect to clinical conditions, RECOVERY participants had a lower prevalence of comorbidity (median Charlson Comorbidity Score [IQR]: RECOVERY 3.0 [1.0–5.0] vs reference population 4.0 [1.0–6.0]) and were less frail (median Hospital Frailty Risk Score [IQR]: 5.1 [1.8–11.4] vs 6.3 [1.8–16.3]) These differences were largely explained by the age structure of the two cohorts, with small differences remaining in the prevalence of some comorbidities, including cardiovascular disease, congestive heart failure, and dementia, after accounting for age (Additional file 1: Supplementary Figures S4-S6).

Outcomes

Overall, the crude all-cause 28-day mortality in RECOVERY was 20.6% (95% CI: 20.2–21.0%) and 24.8% (95% CI: 24.6–25.0%) in the reference population, with mortality decreasing substantially in both cohorts from March 2021 onwards. After standardising the RECOVERY cohort to the age-sex composition of the national reference population, 28-day mortality in RECOVERY was similar to the reference population (23.7%, 95% CI: 23.3–24.1%; Fig. 3). Age-stratified mortality rates were similar between the two cohorts, with the exception of March–May 2020 where mortality was lower in RECOVERY (Additional file 1: Supplementary Figures S7-S8 and Supplementary Table S4). When mortality was assessed separately by comorbidity level and age, the difference in 28-day mortality between the two cohorts in March–May 2020 appeared to be mostly driven by older and more comorbid patients (Additional file 1: Supplementary Figure S9).

Fig. 3
figure 3

All-cause 28-day mortality over time in RECOVERY and the reference population. Twenty-eight-day mortality is the proportion of people with death recorded within 28 days of their index date (with 95% confidence intervals included). Adjustment performed by applying RECOVERY 28-day mortality to an age- (5-year bands) and sex-standardised population using the reference population, in a rolling basis within each time period (for 28-day mortality and age and sex breakdown)

Discussion

This study compared the characteristics of RECOVERY trial participants with people admitted to hospital due to COVID-19 in England. Our main findings were that RECOVERY participants were generally similar, but slightly younger, less frequently female, and had an overall lower comorbidity and frailty burden, much of which attributable to age differences. After adjustment for age and sex, 28-day mortality in the RECOVERY cohort was similar to that in the wider population of patients admitted to hospital with COVID-19 in England. This pattern was observed throughout the period studied, with the exception of March–May 2020 (corresponding to the first COVID-19 wave in the UK) when, even after adjusting for age and sex, 28-day mortality in RECOVERY was slightly lower than the reference population. The reasons for this are not fully explained by differences in measured frailty or comorbidity as assessed in our analyses and may be attributable to factors not captured in the datasets available in this study.

Older adults are frequently underrepresented in trials [21] and have been excluded from over half of COVID-19 clinical trials and all major vaccine trials [22]. Although RECOVERY does not have an upper age limit (and some participants were aged over 100 years old), in our study, RECOVERY participants were on average 3 years younger, with underrepresentation of people aged ≥ 80. RECOVERY participants were also less frequently female (37% vs 45%), but it is not possible to identify the underlying reasons for this in the available data. However, this is similar to results found in other trials and may be due to under-recruitment of older patients (who are more frequently women) [23, 24]. Of note, we found important differences in recruitment across different geographical regions, with the recruitment ratio (the number of individuals included in RECOVERY divided by the number of individuals in the reference population) ranging from 7.4 to 16.4%. The reasons for this are likely to be complex, including issues related to local research infrastructure and funding, competing studies, demand on local clinical services and clinician and patient willingness to engage with research. Data on these parameters are not available for this study, but these differences merit further investigation. We also found that comorbidity and frailty scores were lower in the RECOVERY cohort compared with the reference population. Most of these differences were attributable to age composition, but within older age groups, comorbidities and the overall frailty risk scores remained slightly higher in the reference population. Clinical decision making about eligibility for randomised trials will inevitably result in differences between the trial cohort and the target population; however, the proportional estimates of treatment effect from trials are usually generalisable, unless there are substantial differences in the biology of the target disease or the effectiveness of the intervention in the non-trial context [4, 5].

While crude 28-day all-cause mortality was lower in RECOVERY, age- and sex-adjusted mortality were generally similar, with similar trends in both cohorts over time. The reduction seen from March 2021 onwards, consistent with previous reports [25], may represent the effect of SARS-CoV-2 vaccination uptake, which greatly reduced the likelihood not only of hospital admission but also of death following hospitalisation [26, 27]. Overall, the absolute effect estimates generated by RECOVERY were generalisable to the national population during the period studied. However, secular trends in mortality rates should be considered and the best estimate of the likely absolute effect size in current clinical practice requires application of the proportional treatment effect from the RECOVERY trial to current absolute event rates among patients hospitalised with COVID-19 [4, 5].

Our study has a number of limitations. We were not able to determine baseline respiratory status (which has been shown to be an important determinant of the proportional and absolute benefits of corticosteroid treatment) [28] in our reference cohort, since there was low agreement between respiratory support status extracted from HES alone and that collected in the trial (based on a larger number of linked data sources) and used in published analyses (Additional file 1: Annex IV). We also cannot be certain whether our reference population had clinically significant COVID-19, although we have mitigated this by including only people with a relevant ICD-10 code in the primary diagnostic position. Our analysis was restricted to people admitted in England. Baseline characteristics were similar when comparing RECOVERY participants recruited across all UK nations, but may differ from non-UK countries. Finally, our analysis was restricted to the period from March 2020 to November 2021, due to changes to trial eligibility which could not be replicated in the reference population with the available data. However, recruitment to RECOVERY declined significantly from December 2021 onwards (along with national COVID-19 admissions), so that extending the analysis period to the time of writing would add only a small number of additional deaths (~ 4%), which were unlikely to meaningfully influence interpretation of our results.

Conclusion

The RECOVERY trial recruited a broad patient population that was generally representative of people admitted to hospital due to COVID-19 in England during the same period, with respect to both baseline characteristics and subsequent mortality. Twenty-eight-day mortality declined substantially in both the RECOVERY and reference populations throughout the period studied. Estimates of current mortality rates from healthcare systems data combined with the proportional treatment effects from trials are needed to estimate the likely absolute effects of the treatments tested within current practice.

Availability of data and materials

The RECOVERY trial protocol, consent form, statistical analysis plan, definition and derivation of clinical characteristics and outcomes, training materials, regulatory documents, and other relevant study materials are available online at www.recoverytrial.net. Data will be made available in line with the Nuffield Department of Population Health policy and procedures. Those wishing to request access should complete the form available at http://www.ndph.ox.ac.uk/data-access and email it to data.access@ndph.ox.ac.uk. Nationwide anonymised English mortality (Civil Registrations) and hospitalisations (HES) data used to derive the reference population can be obtained upon application to NHS England at www.digital.nhs.uk. The statistical programming code used in this work is available for inspection and reuse at https://github.com/gpessoaamorim/recovery-generalizability-representativeness. 

Abbreviations

95% CI:

95% Confidence intervals

CRF:

Case report form

HES:

Hospital Episode Statistics

ICD-10:

International Classification of Diseases and Related Health Problems, Tenth Revision

IQR:

Interquartile range

NHS:

National Health Service

RCTs:

Randomised controlled trials

RECOVERY:

Randomised Evaluation of COVID-19 Therapy

SD:

Standard deviation

References

  1. Concato J, Shah N, Horwitz RI. Randomized, controlled trials, observational studies, and the hierarchy of research designs. N Engl J Med. 2000;342(25):1887–92.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  2. Good Clinical Trials Collaborative. Good Clinical Trials Collaborative. Guidelines for good randomized clinical trials. 2023. Available from: www.goodtrials.org. Cited 2023 Apr 25.

  3. Collins R, Reith C, Emberson J, Armitage J, Baigent C, Blackwell L, et al. Interpretation of the evidence for the efficacy and safety of statin therapy. Lancet. 2016;388(10059):2532–61.

    Article  CAS  PubMed  Google Scholar 

  4. MacMahon S, Collins R. Reliable assessment of the effects of treatment on mortality and major morbidity, II: observational studies. Lancet. 2001;357(9254):455–62.

    Article  CAS  PubMed  Google Scholar 

  5. Collins R, MacMahon S. Reliable assessment of the effects of treatment on mortality and major morbidity, I: clinical trials. Lancet. 2001;357(9253):373–80.

    Article  CAS  PubMed  Google Scholar 

  6. Pessoa-Amorim G, Campbell M, Fletcher L, Horby P, Landray M, Mafham M, et al. Making trials part of good clinical care: lessons from the RECOVERY trial. Future Healthc J. 2021Jul;8(2):e243–50.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Herbert A, Wijlaars L, Zylbersztejn A, Cromwell D, Hardelid P. Data resource profile: hospital episode statistics admitted patient care (HES APC). Int J Epidemiol. 2017Aug 1;46(4):1093–1093i.

    Article  PubMed  PubMed Central  Google Scholar 

  8. NHS Digital. Civil Registration – Deaths. 2022. Available from: https://digital.nhs.uk/services/data-access-request-service-dars/dars-products-and-services/data-set-catalogue/civil-registration-deaths. Cited 2022 Oct 11.

  9. NHS Digital. National Clinical Coding Standards ICD-10. 5th ed. 2021. Available from: https://classbrowser.nhs.uk/ref_books/ICD-10_2021_5th_Ed_NCCS.pdf.

  10. NHS Digital. NHS Digital. Available from: https://digital.nhs.uk/. Cited 2022 Oct 11.

  11. Big Data Institute, University of Oxford. Unit of Health Care Epidemiology. 2022. Available from: https://www.bdi.ox.ac.uk/research/unit-of-health-care-epidemiology. Cited 2022 Oct 11.

  12. World Health Organization. Emergency use ICD codes for COVID-19 disease outbreak (who.int). 2021. Available from: https://www.who.int/standards/classifications/classification-of-diseases/emergency-use-icd-codes-for-covid-19-disease-outbreak. Cited 2022 Oct 11.

  13. NHS England. COVID-19 Second Generation Surveillance System (SGSS). 2023. Available from: https://digital.nhs.uk/services/data-access-request-service-dars/dars-products-and-services/data-set-catalogue/covid-19-second-generation-surveillance-system-sgss. Cited 2023 Apr 25.

  14. RECOVERY Collaborative Group. Higher dose corticosteroids in patients admitted to hospital with COVID-19 who are hypoxic but not requiring ventilatory support (RECOVERY): a randomised, controlled, open-label, platform trial. Lancet. 2023;401(10387):1499–507.

    Article  Google Scholar 

  15. Ministry of Housing, Communities & Local Government. English indices of deprivation 2019. 2019. Available from: https://www.gov.uk/government/statistics/english-indices-of-deprivation-2019. Cited 2022 Oct 11.

  16. Charlson ME, Pompei P, Ales KL, MacKenzie CR. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis. 1987;40(5):373–83.

    Article  CAS  PubMed  Google Scholar 

  17. Quan H, Sundararajan V, Halfon P, Fong A, Burnand B, Luthi JC, et al. Coding algorithms for defining comorbidities in ICD-9-CM and ICD-10 administrative data. Med Care. 2005;43(11):1130–9.

    Article  PubMed  Google Scholar 

  18. Gilbert T, Neuburger J, Kraindler J, Keeble E, Smith P, Ariti C, et al. Development and validation of a Hospital Frailty Risk Score focusing on older people in acute care settings using electronic hospital records: an observational study. Lancet Lond Engl. 2018May 5;391(10132):1775–82.

    Article  Google Scholar 

  19. NHS Digital. A guide to linked mortality data from Hospital Episode Statistics and the Office for National Statistics. 2015. Available from: https://nhs-prod.global.ssl.fastly.net/binaries/content/assets/website-assets/data-and-information/data-tools-and-services/data-services/hospital-episode-statistics/hes-ons_linked_mortality_data_guide.pdf. Cited 2022 Dec 6.

  20. Kirkwood BR, Sterne JAC. Essential medical statistics. 2nd ed. Malden, Massachusetts, USA: Blackwell Science; 2003. p. 156–264.

    Google Scholar 

  21. Marum RJ. Underrepresentation of the elderly in clinical trials, time for action. Br J Clin Pharmacol. 2020;86(10):2014–6.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Helfand BKI, Webb M, Gartaganis SL, Fuller L, Kwon CS, Inouye SK. The exclusion of older persons from vaccine and treatment trials for coronavirus disease 2019—missing the target. JAMA Intern Med. 2020;180(11):1546.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Daitch V, Turjeman A, Poran I, Tau N, Ayalon-Dangur I, Nashashibi J, et al. Underrepresentation of women in randomized controlled trials: a systematic review and meta-analysis. Trials. 2022;23(1):1038.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Holtzman JN, Kaur G, Power JE, Barkhordarian M, Mares A, Goyal A, et al. Underrepresentation of women in late-breaking cardiovascular clinical trials. J Womens Health 2002. 2023;32(6):635–40.

    Article  Google Scholar 

  25. Gray WK, Navaratnam AV, Day J, Wendon J, Briggs TWR. COVID-19 hospital activity and in-hospital mortality during the first and second waves of the pandemic in England: an observational study. Thorax. 2021. https://doi.org/10.1136/thoraxjnl-2021–218025.

  26. Au WY, Cheung PPH. Effectiveness of heterologous and homologous COVID-19 vaccine regimens: living systematic review with network meta-analysis. BMJ. 2022May;31(377):e069989.

    Article  Google Scholar 

  27. Lopez Bernal J, Andrews N, Gower C, Robertson C, Stowe J, Tessier E, et al. Effectiveness of the Pfizer-BioNTech and Oxford-AstraZeneca vaccines on COVID-19 related symptoms, hospital admissions, and mortality in older adults in England: test negative case-control study. BMJ. 2021May;13:n1088.

    Article  Google Scholar 

  28. RECOVERY Collaborative Group, Horby P, Lim WS, Emberson JR, Mafham M, Bell JL, et al. Dexamethasone in hospitalized patients with COVID-19. N Engl J Med. 2021;384(8):693–704.

    Article  Google Scholar 

Download references

Acknowledgements

The authors wish to thank the many thousands of RECOVERY participants, as well as the doctors, nurses, pharmacists, other allied health professionals, and research administrators at National Health Service hospital organisations across the whole of the UK, supported by staff at the National Institute for Health and Care Research (NIHR) Clinical Research Network, NHS DigiTrials, Public Health England, Department of Health & Social Care, the Intensive Care National Audit & Research Centre, Public Health Scotland, National Records Service of Scotland, the Secure Anonymised Information Linkage at the University of Swansea, and the NHS in England, Scotland, Wales, and Northern Ireland. This work uses data provided by patients and collected by the NHS as part of their care and support.

Funding

This work was supported by grants to the University of Oxford from UK Research and Innovation and NIHR (MC_PC_19056), the Wellcome Trust (grant reference 222406/Z/20/Z) through the COVID-19 Therapeutics Accelerator, the Oxford BHF Centre of Research Excellence (grant reference RE/18/3/34214), and Health Data Research UK and by core funding provided by the NIHR Oxford Biomedical Research Centre, the Wellcome Trust, the Bill and Melinda Gates Foundation, the Foreign, Commonwealth and Development Office, the Medical Research Council Population Health Research Unit, the NIHR Health Protection Unit in Emerging and Zoonotic Infections, and NIHR Clinical Trials Unit Support Funding; and by PhD funding from the Medical Research Council Population Health Research Unit at the University of Oxford (to GPA and MC) and by a PhD studentship (grant reference MR/L004933/2) funded by the MRC Network of Hubs for Trials Methodology Research (HTMR) (to CH). For the purpose of open access, the authors have applied a Creative Commons Attribution (CC BY) licence to any Author Accepted Manuscript version arising.

Author information

Authors and Affiliations

Authors

Contributions

This manuscript was initially drafted by GPA, RG, MC, CH, and MM and further developed and approved by all authors. GPA, RG, MC, CH, and MM conceptualised and designed this analysis. GPA and RG performed the data derivation and analysis. GPA performed the data visualisation. GPA, RG, CC, WS, AK, DM, RW, MN, KW, and MC contributed to the data acquisition, management, and processing, together with the broader RECOVERY Collaborative Group. All authors contributed to data interpretation and critical review and revision of the manuscript.

Corresponding author

Correspondence to Guilherme Pessoa-Amorim.

Ethics declarations

Ethics approval and consent to participate

Written informed consent was obtained from all RECOVERY participants or from a legal representative if they were unable to provide consent. The RECOVERY trial has been approved by the UK Medicines and Healthcare products Regulatory Agency and the Cambridge East Research Ethics Committee (reference 20/EE/0101). The dataset used to build the reference population by the University of Oxford has been approved by the Central and South Bristol Research Ethics Committee (reference 04/Q2006/176).

Consent for publication

Not applicable.

Competing interests

Roche, AbbVie, Regeneron, and GSK have provided study drugs for evaluation in the RECOVERY trial. MM is an applicant on research grants from Novartis and Novo Nordisk (unrelated to this work). MJL is in receipt of grants to University of Oxford from Novartis and Boehringer Ingelheim and grants to Protas from Regeneron, Sanofi, Moderna, FluLab, Google Ventures, and Schmidt Futures (all unrelated to this work).

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

13063_2024_8273_MOESM1_ESM.docx

Additional file 1: Annex I. Supplementary methods. Annex II. Supplementary tables and figures. Supplementary Figure S1. Age and sex representativeness of RECOVERY in comparison with the reference population. Supplementary Figure S2. Ethnicity over time in the RECOVERY cohort in comparison with the reference population, by age. Supplementary Figure S3. Deprivation over time in the RECOVERY cohort in comparison with the reference population, by age. Supplementary Figure S4. Charlson Comorbidity Score (excluding age) and Hospital Frailty Risk Score in the RECOVERY cohort and the reference population, by age groups. Supplementary Figure S5. Hospital Frailty Risk Score over time in the RECOVERY cohort and the reference population, by age groups. Supplementary Figure S6. Prevalence of select comorbidities in the RECOVERY cohort and reference population, by age groups. Supplementary Figure S7. 28-day mortality in the RECOVERY HES and All-England HES populations (stratified by age and sex). Supplementary Figure S8. 28-day mortality over time in RECOVERY and the reference population (by age groups). Supplementary Figure S9. Mortality rates over time in RECOVERY and the reference population, by age, comorbidity, and frailty. Supplementary Table S1. Baseline characteristics of the RECOVERY population recruited in the UK, grouped by nation (using data from the case report form only, except where stated). Supplementary Table S2. Baseline characteristics of the RECOVERY population recruited in England, grouped by HES linkage status (using data from the case report form only, except where stated). Supplementary Table S3. Number of individuals included in RECOVERY versus reference population over time, by age groups. Supplementary Table S4. 28-day mortality along time in the RECOVERY HES and All-England HES cohorts (split by age groups). Supplementary Table S5. Cross-coding of COVID-19 in HES and SGSS data within the RECOVERY population. Supplementary Table S6. Cross-coding of COVID-19 in HES (disaggregated across different ICD-10 codes) and SGSS data within the RECOVERY population. Supplementary Table S7. IMV coding cross tabulation (HES vs CRF)—15 days before randomisation. Supplementary Table S8. IMV coding cross tabulation (HES vs CRF)—30 days before randomisation. Supplementary Table S9. NIV coding cross tabulation (HES vs CRF)—15 days before randomisation. Supplementary Table S10. NIV coding cross tabulation (HES vs CRF)—30 days before randomisation. Annex III. Cross-coding of COVID-19 coding in Hospital Episode Statistics versus testing data in SGSS within the RECOVERY population. Annex IV. Cross-coding of invasive and non-invasive mechanical ventilation in the RECOVERY case report form versus Hospital Episode Statistics data

Additional file 2.

Additional file 3.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pessoa-Amorim, G., Goldacre, R., Crichton, C. et al. Clinical trial results in context: comparison of baseline characteristics and outcomes of 38,510 RECOVERY trial participants versus a reference population of 346,271 people hospitalised with COVID-19 in England. Trials 25, 429 (2024). https://doi.org/10.1186/s13063-024-08273-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13063-024-08273-9

Keywords