Revisiting the multi-armed bandit model for the optimal design of clinical trials: benefits and drawbacks

Villar, Sofia S; Bowden, Jack; Wason, James

doi:10.1186/1745-6215-14-S1-P36

Volume 14 Supplement 1

2nd Clinical Trials Methodology Conference: Methodology Matters

Poster presentation
Open access
Published: 29 November 2013

Revisiting the multi-armed bandit model for the optimal design of clinical trials: benefits and drawbacks

Sofia S Villar^1,2,
Jack Bowden¹ &
James Wason¹

Trials volume 14, Article number: P36 (2013) Cite this article

1194 Accesses
Metrics details

In a traditional randomised clinical trial, patients are allocated to an experimental treatment or the standard therapy arm - with equal probability - for its entire duration. However, using data from past patients to change allocation probabilities for future patients can increase the average number of patients who receive the best treatment. A drawback of doing this is that the power of the trial can be considerably reduced. The Bernoulli Multiarmed bandit problem (MABP) is an idealised model that illustrates such a conflict. For such model, Gittins & Jones (1974) provided a rule that maximises the expected patient benefit in a large trial.

Bandit models for clinical trials have been extensively studied in theory, yet the resulting schemes have been rarely used in practice. There are many reasons for this, however the power limitations to detect a significant treatment effect is a major drawback. In this presentation, we discuss allocating patients between several experimental treatments and a shared control group in a multi-arm trial, using the Gittins index. We modify the MABP design so that the shared control group is allocated separately to protect the power of the trial. The design provides considerable gains not only in efficiency over separate randomised trials but also in the average proportion of patients allocated to the best experimental treatment. This design can be advantageously extended for trials in rare diseases, where it becomes particularly relevant to optimize the choice of treatment both for patients in the trial and for those to treat after it.

Author information

Authors and Affiliations

MRC Biostatistics Unit, Cambridge, UK
Sofia S Villar, Jack Bowden & James Wason
Mathematics and Statistics Department, Lancaster University, Lancaster, UK
Sofia S Villar

Authors

Sofia S Villar
View author publications
You can also search for this author in PubMed Google Scholar
Jack Bowden
View author publications
You can also search for this author in PubMed Google Scholar
James Wason
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Villar, S.S., Bowden, J. & Wason, J. Revisiting the multi-armed bandit model for the optimal design of clinical trials: benefits and drawbacks. Trials 14 (Suppl 1), P36 (2013). https://doi.org/10.1186/1745-6215-14-S1-P36

Download citation

Published: 29 November 2013
DOI: https://doi.org/10.1186/1745-6215-14-S1-P36

2nd Clinical Trials Methodology Conference: Methodology Matters

Revisiting the multi-armed bandit model for the optimal design of clinical trials: benefits and drawbacks

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

Trials

Contact us

2nd Clinical Trials Methodology Conference: Methodology Matters

Revisiting the multi-armed bandit model for the optimal design of clinical trials: benefits and drawbacks

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Trials

Contact us