Randomized controlled trials: who fails run-in?

Background Early identification of participants at risk of run-in failure (RIF) may present opportunities to improve trial efficiency and generalizability. Methods We conducted a partial factorial-design, randomized, controlled trial of calcium and vitamin D to prevent colorectal adenoma recurrence at 11 centers in the United States. At baseline, participants completed two self-administered questionnaires (SAQs) and a questionnaire administered by staff. Participants in the full factorial randomization (calcium, vitamin D, both, or neither) received a placebo during a 3-month single-blinded run-in; women electing to take calcium enrolled in a two-group randomization (calcium with vitamin D, or calcium alone) and received calcium during the run-in. Using logistic regression models, we examined baseline factors associated with RIF in three subgroups: men (N = 1606) and women (N = 301) in the full factorial randomization and women in the two-group randomization (N = 666). Results Overall, 314/2573 (12 %) participants failed run-in; 211 (67 %) took fewer than 80 % of their tablets (poor adherence), and 103 (33 %) withdrew or were uncooperative. In multivariable models, 8- to 13-fold variation was seen by study center in odds of RIF risk in the two largest groups. In men, RIF decreased with age (adjusted odds ratio [OR] per 5 years 0.85 [95 % confidence interval, CI; 0.76–0.96]) and was associated with being single (OR 1.65 [95 % CI; 1.10–2.47]), not graduating from high school (OR 2.77 [95 % CI; 1.58–4.85]), and missing SAQ data (OR 1.97 [1.40–2.76]). Among women, RIF was associated primarily with health-related factors; RIF risk was lower with higher physical health score (OR 0.73 [95 % CI; 0.62–0.86]) and baseline multivitamin use (OR 0.44 [95 % CI; 0.26–0.75]). Women in the 5-year colonoscopy surveillance interval were at greater risk of RIF than those with 3-year follow-up (OR 1.91 [95 % CI; 1.08–3.37]), and the number of prescription medicines taken was also positively correlated with RIF (p = 0.03). Perceived toxicities during run-in were associated with 12- to 29-fold significantly increased odds of RIF. Conclusions There were few common baseline predictors of run-in failure in the three randomization groups. However, heterogeneity in run-in failure associated with study center, and missing SAQ data reflect potential opportunities for intervention to improve trial efficiency and retention. Trial registration ClinicalTrials.gov: NCT00153816. Registered September 2005. Electronic supplementary material The online version of this article (doi:10.1186/s13063-016-1451-9) contains supplementary material, which is available to authorized users.


Background
The run-in period of a trial is a participatory phase between enrollment and randomization to determine participants' eligibility to continue in the trial.At the end of run-in, participants are randomized only if they meet prespecified criteria.A common use of the run-in period is to identify and exclude individuals who are likely to adhere poorly to the trial protocol.When used appropriately, this helps to minimize dropouts after randomization [1] and maximize trial efficiency and statistical power in the estimation of efficacy, although it may impair external validity if run-in failures (RIFs) are systematically different than those retained in the trial.Other uses of a run-in period are to identify and remove "placebo responders," to identify the best effective or tolerated dose for each participant, to select participants with a good clinical response to (or tolerance of) the active treatment [2], or to establish baseline measurements for comparison after the intervention has been applied [3].Such uses of a run-in period may introduce various forms of bias [4].The run-in also offers time before randomization for participants to change their minds about taking part and for investigators to verify eligibility, for example, through review of enrollment blood test results and medical records.
In a trial whose placebo run-in is specifically designed to assess adherence and select adherent participants, participants may be largely responsible for their ineligibility, e.g., due to poor pill-taking or changing their mind about participation; in other cases they may have been good candidates to take part if factors beyond their control had not intervened, e.g., abnormal blood test results.Simplistically, we can think of these groups as voluntary and involuntary RIFs, respectively.An understanding of the characteristics associated with voluntary RIF may increase efficiency in trial planning, help determine enrollment targets for subgroups at risk of failing run-in, or be used to identify participants who might be retained in the trial using motivational strategies.
During a multicenter, randomized, placebo-controlled trial of calcium and vitamin D in the chemoprevention of colorectal adenoma recurrence [5], we examined the characteristics of participants who became "voluntary run-in failures" after an approximately 3-month singleblinded placebo run-in period.

Methods
We conducted a randomized, double-blinded, placebocontrolled multicenter trial of daily supplementation with 1000 IU vitamin D 3 and/or 1200 mg elemental calcium as calcium carbonate, for the prevention of large bowel adenomas [5].Participants were recruited between 2004 and 2008 at 11 clinical centers in the continental United States and Puerto Rico following a complete colonoscopy, during which at least one colorectal adenoma was removed and none remained after the procedure.Eligible participants aged 45-75 years were in good general health and had no contraindications to calcium or vitamin D, no familial colorectal cancer syndromes, and no history of serious gastrointestinal disease.All participants provided written informed consent; the research was approved by the Committee for the Protection of Human Subjects at Dartmouth College and by Institutional Review Boards (IRBs) at each clinical center (see Additional file 1 for the list).The trial is reported according to the Consolidated Standards of Reporting Trials CONSORT (see Additional file 2 for the CONSORT checklist).

Enrollment and run-in
During the informed consent process at enrollment, participants received an explanation about the study procedures, including randomized allocation to the study agents, and were provided a copy of the signed and dated informed consent form.Participants were mailed the SF-36 short form health survey and food frequency questionnaires to complete at home and bring to the enrollment interview.During a 2-to 3-hour in-person interview, they completed a detailed intake questionnaire including questions on their beliefs about the study interventions and the health effects of vitamin D: preference: "If you could choose, which kind of pill would you like to receive during the study?";efficacy beliefs: "How likely do you think it is that vitamin D supplements [are helpful in preventing colon polyps]/[improve general health]/[improve pain in bones and joints]/[improve mood or depression]/[cause constipation]?"; and allocation belief: "If you were to place a bet, which pill would you bet you'll be given during the trial?".The latter question was poorly received by many participants and was removed from the survey after several months of use.
Participants were given a 7-day pill dispenser and their first bottle of study tablets containing placebo tablets, or calcium tablets for those in the two-group randomization, dispensed in a single-blinded manner.Information about the contents of the medicine bottle was shown on the label as: "This bottle contains one of the following: calcium, vitamin D, vitamin D + calcium, or placebo" or, for the two-group randomization, "This bottle contains one of the following: calcium, or vitamin D + calcium", with instructions to take one tablet twice a day with food.The granulation and coating of the placebo tablets were composed of inactive compounds such as lactose (no more than 600 mg), cellulose, polymers, and minerals.All study tablets were manufactured for the study and looked similar.Participants were asked to take the first tablet during the enrollment interview; in some cases the first tablet was taken at home, and the participant was asked to confirm this by postcard or telephone.Participants were also given a study brochure, including general instructions on taking the tablets and specific instructions on staggered administration if they were taking medicines that might interact with calcium.Participants were asked to discontinue any personal calcium-or vitamin D-containing supplements and, for the duration of the trial, were offered a supply of multivitamins that excluded those ingredients.The label on the multivitamin bottle identified these as "Multivitamin supplement" and included a complete list of ingredients.Participants were counseled on the major dietary sources of calcium and vitamin D and were asked to avoid regularly consuming large amounts of such foods.
Blood was drawn at enrollment and tested for calcium, 25-hydroxyvitamin D (25-OH D), and creatinine levels.Where possible, an appointment was scheduled for a telephone interview in 2 to 3 months' time.Participants received $100 at completion of the enrollment interview.Study coordinators and investigators were aware of the placebo (or calcium, in the two-group randomization) run-in, but participants were not informed about the run-in period.During run-in, medical records were reviewed and blood test results became available; individuals found to have disqualifying medical conditions or abnormal blood results (including serum 25-OH D levels <12 ng/ml) were ineligible for randomization.Throughout the trial, perceived toxicity reports were completed when participants reported symptoms that they attributed to the study tablets.

Randomization
After a single-blinded run-in period of approximately 3 months (56-84 days), coordinators confirmed that the eligibility criteria necessary for randomization had been met, including a one-hour telephone interview to determine patient-reported eligibility criteria.They obtained the self-reported number of tablets left in the participant's bottle, calculated the percentage of tablets taken, and disqualified from randomization anyone with selfreported adherence below 80 %.RIFs were generally discontinued from the study before completion of the full questionnaire.In rare cases, coordinators used their discretion, e.g., to retain a non-adherent participant who had misunderstood the dose during run-in, or to exclude a participant with unusual circumstances unlikely to adversely affect participation.
Eligible participants who confirmed their ongoing commitment to the study were block-randomized to treatment in a double-blinded manner, using a webbased random number generator, stratified by study center, sex, and colonoscopy interval (3 or 5 years).Those in the full factorial randomization were randomly assigned to one of four treatment groups (calcium, vitamin D, both, or placebo).Those in the two-group randomization (i.e., women who were not willing to forgo calcium supplementation) were randomized to vitamin D or placebo and were all provided with calcium.Study treatment was scheduled to continue until the surveillance colonoscopy either 3 or 5 years after the qualifying examination, according to recommendations by each participant's gastroenterologist.

Definitions: voluntary and involuntary run-in failures
RIFs were defined as individuals who enrolled and consented to participate in the trial but were not randomized after the run-in period.We defined "voluntary" RIFs as participants with some degree of control over the factors that prevented their randomization, i.e., those who declined to continue; those who took <80 % of their study tablets; and those who could not be reached for the telephone interview to be randomized."Involuntary" RIFs were defined as participants whose removal from the trial was beyond their control, e.g., those whose safety blood tests were abnormal; those confirmed after enrollment as having a disqualifying medical condition; and others who changed residence after enrollment.Had the disqualifying medical information been available earlier, these participants would have been ineligible for enrollment.We focus on voluntary RIFs throughout this paper.

Statistical methods
We reasoned that the determinants of voluntary RIF are likely to vary in different trials, depending on the type of intervention offered and the characteristics of participants who enroll in a particular trial.For this reason, a priori we chose to analyze three groups of participants separately to illustrate what might happen in parallel "trials" with minor differences in participant characteristics: (M4) men in the full factorial randomization; (F4) women in the full factorial randomization; and (F2) women in the two-group randomization, i.e., women who chose to take calcium.We reasoned that the predictors of RIF in a single, pooled analysis would lack external validity, whereas common factors identified independently in the three subgroups might generate more plausible hypotheses that could be tested in future, similar trials.
For each group of participants, we looked for univariate associations between baseline factors and RIF using chi-square, t test, and analysis of variance.All variables that had a p value <0.1 in univariate analyses were added to a "full" logistic regression model.The final model was obtained by removing all variables in the full model with p > 0.05 in order to give a minimum set of variables that together contribute significantly to the model.If significant univariately, the reported perceived toxicity (the only post-enrollment factor examined) was added to the final model to allow us to assess how perceived toxicity influences the effect of the final model factors.For women in the two-group randomization, multivitamin use, calcium supplementation, and vitamin D supplementation were collinear, and we chose to include only multivitamin use in the full model.

Post hoc analyses
In general, the analysis plan and potential predictors of RIF were selected a priori; however, during the analysis we discovered substantial variability in RIF according to which study center the participant had been recruited.This was unexpected and led us to undertake several post hoc analyses.(1) First, we recognized other variables that deserved investigation and tested those in the models.These were: imperfect completion of the interviewer-administered enrollment questionnaire or the self-administered SF-36 and food frequency questionnaires; the participant's willingness to set a date at enrollment for the next interview; the study experience of the enrolling staff member in prior polyp prevention studies (dichotomous) and the total number of participants they enrolled during the entire study.For logistical reasons, we were only able to measure this by summing enrollments retrospectively, understanding that total accrual did not reflect staff experience up to the point at which each individual was enrolled.The variables used in these post hoc analyses are identified as such in tables.(2) We reasoned that the observed effect of the study center may reflect a range of factors from regional differences among participants (e.g., education, race) to differences in methods used by study staff; we examined this possibility by building multivariable models without the center variable.(3) Finally, we investigated the importance of RIF rates at each center for long-term trial efficiency by assessing the proportion failing run-in at each center in relation to three measures of postrandomization adherence, using scatterplots and Kendall's tau-b.
Odds ratios are presented with 95 % confidence intervals.A single p value for each categorical variable was estimated using a likelihood ratio test.Analyses were done using SAS (version 9) and STATA (version 14).
Treating the three randomization groups as though they represent three separate study populations, the univariate analyses identified several variables that were associated with a higher proportion of RIF in each of the three groups: non-white race (overall proportions in RIF and     1).In all three groups, participants at the enrollment interview who confirmed a date and time for the next appointment were less likely to fail run-in (overall 11 % versus 17 %), but this was only statistically significant in the largest group, M4 (p < 0.001).Five additional factors were significantly associated with RIF in univariate analyses in two of the three groups: Hispanic ethnicity, study center, a lower SF-36 mental health score, non-use of multivitamins before the study, and answering "Don't know" or refusing to answer one or more questions during the in-person enrollment interview.Generally, participants' preferences and beliefs about the properties of the study tablets did not substantially influence their probability of being randomized (Table 2).At study entry, 63 % of participants believed that calcium and vitamin D were very or somewhat likely to prevent colorectal polyps, 34 % did not know, and 3 % thought this unlikely.Overall, a majority of participants believed that calcium and vitamin D were likely to improve general health (86 %), improve pain in bones and joints (63 %), or improve mood (32 %); 20 % believed the study agents were likely to cause constipation.In univariate analyses, beliefs about the effectiveness of calcium and vitamin D and the baseline guess about allocation were not significantly associated with RIF.Although RIF risk tended to be lower in participants who would prefer to receive both calcium and vitamin D (overall 11 % versus 14 %), this was not statistically significant.
The multivariable models developed for the three groups had few similarities (Tables 3, 4, 5).Study center was significantly associated with RIF in M4 and F2, with >8-fold variation in odds of RIF among the 11 centers in M4, and >13-fold in F2.In M4, men who missed or refused any question in the self-administered SF-36 or food frequency questionnaires had more than twice the odds of failing runin (adjusted odds ratio [OR] 1.97; 95 % confidence interval [CI] 1.40-2.76).Younger men were more likely to fail runin (adjusted OR per 5 years of age 0.85; 95 % CI 0.76-0.96),as well as single or divorced men (adjusted OR 1.65; 95 % CI 1.10-2.47)or men who had not graduated high school (OR 2.77; 95 % CI 1.58-4.85).Among women in the full factorial randomization (F4), RIF was more likely in those reporting use of no prescription medicines or three or more (p = 0.03).Among women in the two-group randomization (F2), in addition to study center, RIF was inversely associated with regular use of multivitamins at baseline (adjusted OR 0.44; 95 % CI 0.26-0.75)and SF-36 Physical Component Summary (PCS) measure (adjusted OR 0.73; 95 % CI 0.62-0.86).Women in the 5-year colonoscopy cycle had almost twice the odds of run-in failure as those with 3-year recommended follow-up (adjusted OR 1.91; 95 % CI 1.08-3.37).
Perceived toxicities (PTs) were reported during run-in by 34 (1.8 %) participants in the full factorial randomization and by 12 (1.8 %) women in the two-group randomization.In the two larger groups, study center was associated with an 8-to 13-fold variation in odds of RIF, even after adjustment for other factors.This substantial effect may reflect differences in the methods used by study staff at each center, or it may be due to residual confounding by medical, educational, cultural, or other characteristics of the participants.With the available data, we could not identify differences in methods between centers.However, in post hoc analyses designed to explore the substantial heterogeneity in RIF by center, we found that building the model without center led, among men, to stronger associations between run-in failure and coordinator experience, timely scheduling of the next interview, and participant education.Center may represent a mix of factors including participant and coordinator characteristics, participants' uncertainty about their commitment, and competing constraints on their time (e.g., by employment).Center has been associated with RIF risk in other settings [6,7], and this association may be worth exploring in future trials.Will a center with more RIFs subsequently have better adherence and endpoint ascertainment because participants were more  Refused any questions during enrollment intake questionnaire d 0.01 differences among the participants who enrolled in each region.Among men, those who were younger, single, and had not graduated high school had the greatest risk of failing run-in.One possible explanation is that those who were working had less time to commit to the study than retired participants, but we did not collect employment data.In addition, men had twice the risk of RIF if they overlooked or refused to answer one or more questions in two self-administered questionnaires (SAQs).This finding might be explained by an association between perfect completion of the SAQ and an individual's motivations underlying trial participation.However, an alternative explanation is differences in quality checking by study staff; those who more effectively identified missing SAQ questions and had participants correct them may have also been better at motivating enrollees during enrollment and run-in.With our data, we could not distinguish these two possibilities.The first suggests that a SAQ could be developed to help identify individuals at risk of run-in failure and target them for intervention.The second could be addressed via improvements in staff training and motivational protocols.
In women, health-related factors tended to be associated with RIF.In the full factorial randomization, RIF was most frequent among women taking no prescription medications at baseline.Among women in the twogroup randomization, RIF was less common among women who took vitamin supplements at baseline, had better SF-36 physical health scores, and a 3-year (rather than a 5-year) trial participation, which represents not only a shorter commitment to trial participation, but also higher risk adenomas that require more intensive follow-up.We found no convincing evidence that the successful negotiation of run-in was associated with participants' expectations of health benefits from the agents, which tablets they would prefer if given the choice, or which agents they guessed they would be given for the study.Other studies have found that trial participation is associated with altruistic factors [8][9][10][11], so it is possible that individuals with stronger feelings about the study agents were less likely to enroll in the first place.Predictors of RIF have varied in previous trials.RIF was associated with lower Karnofsky performance score and lower education level in a head and neck cancer chemoprevention trial [12], and with younger age, not working, and smoking, in a trial of pregnant women to prevent adverse neonatal outcomes [7].Interestingly, in the latter study, the proportions of RIFs ranged from 20-40 % in five clinics but was lowest in one clinic where participants were told they would receive sugar pills during run-in.Another trial of motivational interventions to reduce blood pressure found no significant characteristics of RIFs, but the sample was small [1].
In our trial and others, the purpose of the adherence run-in is to help researchers identify and randomize the participants most likely to follow trial protocol.Its potential advantages include the retention of good long-term adherers [13], improved efficiency [14], improved internal validity in the estimation of efficacy, and greater statistical precision [15].There is certainly evidence that removing poor adherers can change a study's effect estimates.For example, in a trial of lovastatin to reduce cholesterol levels, Davis et al. found that participants who would have failed run-in had an adherence criterion been applied, experienced 17.8 % smaller cholesterol reductions than more adherent participants [6].By retaining these individuals in the trial, the proportion of less educated participants was increased (improving generalizability), but the overall measure of lovastatin's effectiveness was 5.2 % lower than it would otherwise have been.Pablos-Mendez discussed two comparable trials of aspirin to prevent myocardial infarction [2].The first trial, in American physicians, included a run-in and reported 90 % adherence over 5 years and a significant risk reduction of 44 % [16].The other trial, in British physicians, did not include a run-in, and reported adherence of 70 % over 6 years and a nonsignificant risk reduction of 3 % [17].These findings support the use of run-in to improve adherence, provided that the goal is to estimate efficacy rather than effectiveness.
One concern about the adherence run-in is that individuals who adhere poorly to treatment are very different than those who adhere well, with respect to health and other characteristics; poor adherers have worse health outcomes during clinical trials independent of assignment to the active or placebo group [18][19][20][21][22][23], and their exclusion may therefore reduce generalizability.Further, improvements in efficiency offered by the runin may be attenuated if the exclusion criteria misclassify individuals who would have gone on to complete the trial successfully [14].Although early trial adherence and longer term adherence tend to be highly correlated [7], one study showed that this was true for less educated participants but not for more educated ones [6].The implication is that exclusion of better educated participants who adhere poorly during run-in may be relatively inefficient because their longer term adherence is less accurately predicted.However, although the factors associated with adherence in different settings have been studied extensively [24,25], RIF is not simply an adherence issue; it also gives participants an opportunity to reconsider their enrollment ("buyer's remorse").In one study, when patients were interviewed within a month of enrollment in a variety of clinical trials, 16 (12 %) had already considered dropping out, and 4 of those continued because of a sense of obligation, despite a preference to withdraw [26].In our study, 21 % of RIFs (4 % of enrollees medically eligible for randomization) clearly stated that they did not want to continue participating, and a further 12 % were uncooperative or could not be contacted.But among the two-thirds of RIFs we attributed to poor adherence, ambivalence towards trial participation may have been the underlying cause of that poor adherence in some cases.It is also possible that the $100 incentive payment was the primary motivation for enrollment; this may account for some early dropouts.
Future studies of run-in might collect more granular data to distinguish poor adherence from disinclination to continue in the trial.Potential strategies to address both problems may include increased communication with participants during run-in to elicit questions and concerns, motivate participants in their pill-taking routine, and develop a rapport that encourages individuals to stay in the trial.One option might be a two-stage run-in.For example, in our trial, participants could have been telephoned one week after enrollment to assess early adherence, identify cases of "buyer's remorse," and provide explanations and motivational counseling where appropriate.After 3 months, persistent non-adherers could be excluded from randomization as usual, and randomization could be stratified according to whether motivational counseling was initiated, to assess the impact of the strategy by subgroup.This approach could be extended to study the ideal frequency of participant contact or counseling needed to retain promising participants but not reluctant enrollees with poor long-term prospects of adherence.
Forty-six participants (2 % of enrollees and 15 % of RIFs) contacted their coordinator to report perceived toxicities (PTs) during run-in, and almost two-thirds of those became RIFs.In the full factorial randomization, participants received placebo during run-in and PTs were associated with a 12-fold increased odds of RIF.In the two-group randomization, women receiving calcium and reporting a PT experienced a 29-fold increase in odds of RIF.However, the risk of PT was similar in those given placebo or calcium during run-in.When a participant develops any new symptom by chance during run-in, they may attribute it to the study intervention; alternatively, the PT may be a nocebo phenomenon arising from expectations of an adverse effect upon starting a new treatment.Up to a quarter of patients in the placebo group in previous trials have reported symptoms after randomization, with a wide range of frequency across trials [27][28][29].Some trials deliberately exclude participants who show a placebo response (or a coincident improvement in health) during run-in, but they may be criticized for poorer generalizability [4].What we observed was an increased tendency for participants with a nocebo response (or coincidental development of symptoms) to fail run-in.Both types of exclusion involve a change in health while taking placebo: the first, determined by the investigator, excludes those with beneficial changes in health while taking placebo, while the second type, determined by the participant, excludes those with adverse changes in health.Both are likely to affect generalizability.Although PTs were very strongly associated with RIF, it is unclear how to address this, because if the blind must be maintained, participants cannot be told that they were taking placebo during run-in.
This study was limited by differences in sample size in the three subgroups, which will have affected the power to consistently detect factors with common specific effect sizes and meant that our multivariable modeling did not directly compare RIF rates in men and women.In addition, we chose to only consider main effects in our analyses and did not model any interactions between potential predictor variables.A further limitation was our use of self-reported adherence via tablet counts, which, although superior to some methods, is still subject to misclassification [30].Our study was a relatively long-term (3-to 5-year) trial using agents (calcium and vitamin D) that are generally thought to have few, if any, adverse side effects.As a chemoprevention trial, it recruited individuals with a specific interest in preventive health strategies, and our study was also limited to adults aged 45-75 in good general health.The study may therefore have limited generalizability to other types of trials, although this would be more of a concern if we had found a consistent set of RIF predictors in the three subgroups and were proposing to extend our findings to diverse trial settings.

Conclusions
The lack of a clear, single set of predictors of run-in failures among our three subgroups suggests that the search for general predictors of run-in failure in trials will not be straightforward.However, substantially different risks associated with study center and missing self-administered questionnaire data reflect opportunities for further study with the goal of identifying interventions that might improve trial efficiency and retention.Perceived toxicities during a placebo run-in represent the strongest risk factor for run-in failure, but perhaps are the most difficult to address in a blinded study.The loss of 12 % of participants due to voluntary factors represents considerable expense as well as loss of generalizability, and the search for a balance between optimal efficiency and generalizability via the adherence run-in deserves further attention.

Table 1
Participant characteristics at enrollment for randomized participants and voluntary run-in failures

Table 1
Participant characteristics at enrollment for randomized participants and voluntary run-in failures (Continued)

Table 1
Participant characteristics at enrollment for randomized participants and voluntary run-in failures (Continued)

Table 1
Participant characteristics at enrollment for randomized participants and voluntary run-in failures (Continued) Missing data are shown where values are not zero.pvalues are based on non-missing data, except for race a Fisher exact test b p value from Kruskal-Wallis test because SF-36 scores are skewed c Not a priori potential predictors dExcludes questions asked as part of a skip pattern (N = 43 questions for men and 47 questions for women) and questions about beliefs in which "Don't know" was an allowable response

Table 3
Logistic regression models of voluntary run-in failure: full factorial study males

Table 3
Logistic regression models of voluntary run-in failure: full factorial study males (Continued) a Included were all variables that had p < 0.1 from Table1and baseline factors from Table2b Final model plus perceived toxicity, the post-enrollment factor from Table2c Not a priori potential predictors stringently selected, or does a high RIF rate indicate a local problem that will persist throughout the study?Our exploratory post hoc data hinted that centers with more RIFs had lower rates of post-randomization adherence and trial completion; this might suggest local differences by center, e.g., in study methods, or in intrinsic

Table 4
Logistic regression models of run-in failure: full factorial study females a Included were all variables that had p < 0.1 from Table1and baseline factors from Table2b Final model plus perceived toxicity, the post-enrollment factor from Table2c Not a priori potential predictors

Table 5
Logistic regression models of run-in failure: two-group randomization females

Table 5
Logistic regression models of run-in failure: two-group randomization females (Continued)Included were all variables that had p < 0.1 from Table1and baseline factors from a

Table 2 b
Final model plus perceived toxicity, the post-enrollment factor from

Table 2 c
Baseline multivitamin, calcium, and vitamin D supplement use were collinear variables; the first was included in the model d Not a priori potential predictors