 Methodology
 Open access
 Published:
A novel method for interpreting survival analysis data: description and test on three major clinical trials on cardiovascular prevention
Trials volume 21, Article number: 578 (2020)
Abstract
Background
Major results of randomized clinical trials on cardiovascular prevention are currently provided in terms of relative or absolute risk reductions, including also the number needed to treat (NNT), incorrectly implying that a treatment might prevent the occurrence of the outcome/s under investigation. Provided that these results are based on survival analysis, the primary measure of which is timetothe outcome and not the outcome itself, we sought an alternative method to describe, analyse and interpret clinical trial results consistent with this assumption, so as to better define qualitative and quantitative heterogeneity of various therapeutic strategies in terms of their effects and costs.
Methods
The original KaplanMeier graphs of three major positive cardiovascular prevention trials (PROVEIT, LIFE and HOPE) were captured from the PDF images of the article and then digitalized. We calculated the difference between the placebo and active treatment curves and plotted it as a function of time to describe the eventfree time gain (TimeGain) produced by the active treatment. By calculating the exposure to the active treatment in terms of months (MoT) as a function of time and dividing it for the corresponding timedependent number of eventfree years gained (i.e. months/12), we described the kinetics of the pharmacoeconomic index MoT/y^{+}. The same procedure was repeated replacing MoT with the actual number of patients being treated at each time point as a function of time to obtain the NNT to gain 1 eventfree year (NNT/y^{+}) curve.
Results
The TimeGain curves depict the kinetics of the treatmentrelated effect over time and possess the peculiar feature of being smooth and accurately fitted by secondorder polynomial functions (a*time^{2} + b*time); similarly, also the MoT/y^{+} and NNT/y^{+} curves can be accurately fitted by power functions (a*time^{b}).
These curves and indices allow to fully appreciate the quantitative and qualitative heterogeneity, both in terms of effects and costs, of the different therapeutic strategies adopted in the three trials.
Conclusions
With our novel method, by exploiting original KaplanMeier curves from three major clinical trials on cardiovascular prevention, we generate new information on the actual consequences of choosing a therapeutic strategy vs another, thus ultimately providing the clinical gain in terms of timedependent functions. Accurately assessing clinically and economic meaningful results from any intervention trial reporting positive results through this approach, facilitates objective comparisons and increases reliability in predicting survival among the various therapeutic options provided.
Trial registration
PROVEIT (Pravastatin or Atorvastatin Evaluation and Infection Therapy (TIMI22), Clinical trial registration number: NCT00382460, date of registration: September 29, 2006, study start date: November 2000).
LIFE (Losartan Intervention For Endpoint Reduction in Hypertension (LIFE) Study, Clinical trial registration number: NCT00338260, date of registration: June 20, 2006, study start date: June 1995).
HOPE (Heart Outcomes Prevention Evaluation; we could not find Clinical trial registration number and date of registration).
Introduction
The standard presentation of randomized clinical trials (RCT) comparing treatment strategies is based on hazard ratios (HR) and eventually on the number needed to treat (NNT). When a strategy is shown to be superior, the message conveyed is that it reduces the risk to develop (i.e. it prevents) the outcome under consideration. Assuming that in most studies the primary outcome variable is the timetotheevent and not the event itself, the message may be misleading. In addition, the true benefit of any intervention, when effective, consists of the longer length of eventfree time experienced by the cohort when compared to that on the alternative treatment. Extending eventfree life is not equivalent to prevent (avoid) the occurrence of the event. This is particularly evident when the outcome is unavoidable like death, but it also holds true for any other event that is likely to occur in the specific population, namely like cardiovascular (CV) events in highrisk groups.
The need to move beyond the HR, as highlighted in recent oncology literature [1], is not only based on the elusive clinical implications of the HR itself, but also on the formal requirement that the ratio of the two hazard functions is constant over time, which is seldom verified and not always selfevident. Median survival [2], accelerated failuretime model [3, 4] and quantile regression [5] have been proposed as alternatives to overcome these limitations [1, 2, 6]; however, they require that at least 50% of the sample population experienced the outcome of interest, and in case of CV prevention trials, this is seldom the case. Projecting halflives beyond the duration of the trial is an alternative; however, it relies on the assumption that the slope of the survival curve over time follows a predictable function, which has been demonstrated to be incorrect by MeierKriesche et al [7]. These authors suggested that the difference in the area under the KaplanMeier (KM) curves of the two treatment groups can provide a better and more reliable estimate for the treatment effect, especially for shortterms studies. The need for alternatives to the HR based on the analysis of time, particularly for noninferiority trials, has been elegantly highlighted in recent paper by Uno et al. [8] who purposefully stated that “... the patients’ exposure times are more clinically important than the observed number of event” and it is underscored that unlike proportional hazards, the analysis of time has also the advantage of not requiring specific assumptions. Lytsy et al. [9] proposed to adopt a nonparametric description of the time course of the delay of events. This parameter has the merit of focusing on the timetotheevent rather than on the event itself, but has the flaw of not considering the exposures’ times; therefore, it remains essentially descriptive.
Pharmacoeconomy is a second perspective to be taken into account. Focusing on timetotheevent is instrumental for accurate costbenefit analysis allowing estimates over time and, eventually, their prediction beyond the duration of the study, which becomes especially relevant when comparing novel and more expensive treatments to standard ones.
Finally, a third important issue is the comparison among studies. To rely only on crude numbers such as HR and NNT, especially for studies with different lengths, might not only be misleading but, most importantly, does not convey all the information generated by all data collected throughout the study.
Our objective was to extract all the information contained in a clinical trial’s KM curves. To this aim, we approached the trial as a biological experiment where an intervention is applied over time (exposure) in order to produce a response (eventfree time gain), which might follow a specific kinetics. Subsequently, from these doseresponse curves, we derived indices able to effectively describe the trial’s results in pharmacoeconomic terms, allowing comparisons among different interventions.
To do so, we here present a novel method that by exploiting KM curves enables a clinically meaningful representation of the major results generated by any positive clinical trial. The method, named PISA (Pragmatic Interpretation of Survival Analysis), is described in detail and tested on PROVEIT [10], LIFE [11] and HOPE [12], three major, heterogeneous and positive CV prevention clinical trials.
Methods
Trial selection
The criteria were as follows: (a) active treatment vs standard or headtohead comparison, (b) superiority clearly demonstrated, (c) highquality KM images, (d) CV outcomes, (e) studies lasting more than 2 years, (f) continuous treatment throughout the study, (g) population size greater than 4000.
Data extraction
As illustrated in Fig. 1, original inverse KM graphs (i.e. indicating cumulative incidence instead of eventfree survival) were first captured from the PDF of the article as highdefinition images (.png) and then converted into data using the UNSCANIT Graph Digitizer software (Silk Scientific, Inc. Orem, Utah USA). After several attempts, we selected the following software parameters for line following being the more efficient: (1) “point assignment”: midline of upper and lower surface of the curve; (2) “line follow algorithm”: sloped; (3) sampling: 1 scanner unit (which ranged from 0.027 to 0.081 months among the three studies). Digitalized data (408–1080 couples of time and % values) were visualized and misplaced points at visual inspection (< 0.1%) were manually shifted or erased and replaced. In order to provide homogeneous data sets, to be transferred in an ad hoc built spreadsheet for subsequent calculations, time (x axis) spacing was then forced to 0.25 months yielding sets of 120–266 couples and the few (< 1%) missing incidence (y axis) data were automatically interpolated with the linear method using the closest upper and lower points. To provide an internal validation, we tested the method through two different operators in four different days: the % of total variance attributable to the intra and interobserver variability was < 0.1%. We could not perform any external validation (i.e. accuracy of extracted data with respect to the original ones) since the original data were not available; however its accuracy can be visually appreciated from Supplementary Figure 1.
Data calculation of the PISA approach
A full glossary with definitions and mathematical formulae is provided in Supplementary Figure 2.
 a)
TimeGain curves
To assess the benefit (eventfree time gain) of the treatments, we drew the TimeGain curves. First, we calculated the integrals of each inverse KM applying piecewise integration using the trapezoid rule with equal x segments (0.25 months) and then we plotted it as a function of time. This function (TimeLost) represents the time course of the units of time (months) spent after the specific event has occurred that progressively accumulate during the followup in a group of 100 individuals at baseline. The difference of the placebo and active treatment TimeLost curves (i.e. the difference between the two KM areas under the curve), also plotted as a function of time, describes the eventfree time (TimeGain) that is produced by the active treatment. Positive values indicate a protective effect of the active treatment vs control, whereas negative values represent a harmful effect (i.e. a loss of eventfree time).
The TimeGain curve, when the active treatment is effective (i.e. the two original inverse KM curves separate with time), has a peculiar characteristic: it follows a kinetics that can be accurately described by a secondorder polynomial function forced to pass through the origin (a*time^{2} + b*time). This was verified also in other CV outcome clinical trials of heterogeneous durations in whom different types of treatments were effective like UKPDS34, STENO2, CIBISII, EMPAREG OUTCOME, CANVAS, LEADER and SUSTAIN6 (Supplementary Figure 3).
In order to facilitate comparisons, to avoid loss of timedependent information and to attenuate the uncertainties introduced by the progressive reduction of the number of the subjects in followup, we exploited the peculiar kinetics of the curves to perform curve fitting on data. The interval on which the fit was performed was from time 0 to the time when 50% of the total population was still being followed or, when this was not possible (as in the HOPE trial, where data on the percentage of population in followup was not available) at 50% of the length of followup. Data fit, performed using standard regression procedures, was then accepted only for regression coefficient values > 0.95. The resulting equations were used to generate TimeGain f50% curves and the math coefficients describing all studies; for the PROVEIT (the shorter of the tree), the fit was used to extrapolate data beyond the actual duration of the study.
 b)
Months of treatment per eventfree years (MoT/y^{+}).
The cost effectiveness of any treatment results from the ratio between drug exposure and clinical benefit and is a function of time. Accordingly, we generated the pharmacoeconomic index MoT/y^{+} (Fig. 3), which represent the number of months of treatment of the entire cohort that is necessary to gain 1 year of eventfree life at any given time during the trial. This was obtained as the exposure to the active treatment, expressed as total months of treatment and calculated as the integral (area under the curve) of the percent of subjects without the event (i.e. KM survival curves), divided for the corresponding timedependent number of years (i.e. months/12) gained as a consequence of the treatment. To avoid loss of information, as described in section “TimeGain curves”, we also calculated the MoT/y^{+}f50%, where the values of years gained were assessed through the TimeGain f50% function instead of the actual TimeGain curve.
The MoT/y^{+}f50% curve is initially very noisy since the denominator is extremely small; however, when the value of 6 months of TimeGain is achieved, it follows very closely a power kinetics (a*time^{b}). To maximize the precision of the fit, the interval on which it was performed was therefore from when the value of 6 months gained was reached to the end of the followup. In order to allow homogeneous comparisons among the trials, the equation obtained through this curve fitting (eMoT/y^{+}) was used to calculate the data at fixed time points (eMoT/y^{+}@2 years and eMoT/y^{+}@6 years).
 c)
Number needed to treat per eventfree year (NNT/y^{+}).
Since the NNT is an index extensively used in the medical literature, and also easily understood by the medical community, we generated another pharmacoeconomic index, the NNT/y^{+}, that represents the number of subjects who need to be treated to gain 1 year of eventfree time at any given time during the trial.
It was calculated with the same procedure adopted for the MoT/y^{+}, replacing the MoT with the actual number of patients being treated at each time point (obtained by dividing the MoT for the time of the study) to obtain the NNT to gain one eventfree year (NNT/y^{+}f50%) and the corresponding eNNT/y^{+}@2 years and eNNT/y^{+}@6 years indices.
Results
The TimeGain curves
The length of followup, the absolute rate and the kinetics of the event accrual as well as the effect of the intervention were rather different among the three clinical trials used to test the PISA method (Fig. 2). The event accumulation was not linear over time and five to threefold greater in the PROVEIT study with respect to LIFE and HOPE trials. The time necessary for the effect of the treatment to emerge (i.e. before the KM curves start to diverge) was longer for HOPE, but only in this study the separation of the curves showed a stable trend to increase over time. Despite this heterogeneity, the calculated TimeGain curves were able to describe the kinetics of the treatmentrelated benefit over time and showed constant characteristics. They were all smooth and could be accurately described—through nonlinear regression—by a secondorder polynomial function. The accuracy of this fit is demonstrated by the high regression coefficient values calculated within the interval used for the fit (R^{2}value 0t_{50%}) (Table 1). The ability of the fit in predicting the actual TimeGain well beyond the time window used for its calculation is particularly evident in HOPE and PROVEIT, in which the t_{50%} fall at 50 and 70%, respectively, of the whole study duration (comparison of continuous and dotted lines in Fig. 2 and comparisons of TimeGain R^{2} values 0t_{50%} vs 0T in Table 1). The heterogeneity of true clinical benefit produced by the three interventions clearly emerges from the comparisons of TimeGain@6 years, being in HOPE double and in PROVEIT fourfold than what is observed in LIFE (Table 1), a difference, which cannot be entirely appreciated by comparing usual indices of intervention efficacy (relative risk and absolute risk reduction).
The MoT/y^{+} curves
The MoT/y^{+} curves (Fig. 3) show that the number of months of treatment necessary to gain 1 year of eventfree life decline rapidly during the first 24 months of the trial and slowly thereafter. The kinetics of this index in all the three studies is well represented by a negative power function whose adequacy is supported by the high regression coefficient values that were calculated on the whole available study data set (Table 1). The extreme difference in costbenefit of the three different interventions applied to the three different populations clearly emerges: the ranking being 1:2:4 in PROVEIT, HOPE and LIFE.
The NNT/y^{+} curves
The NNT/y^{+} shows the number of subjects who need to be treated to gain 1 year of eventfree time at any given time during the trial. This index rapidly decreases over the first 24 months; after which, its slope progressively approaches zero. The differences among the studies lie in the readiness of emergence of the effect (time at 200) and in the initial slope affecting the time lag necessary for the index to reach the same values. For example, in PROVEIT, HOPE and LIFE trial values of 200 and 50 are reached at 6 and 13.3, 16 and 29.3 and 13 and 32.5 months, respectively. All the curves are well fitted by negative power functions with a high accuracy as shown by the regression coefficient values (Table 1). Again the projections at 6 years show large differences among the studies.
Discussion
Our main finding is that the PISA method, by reconstructing from KM curves, the timedependent relationship between real exposure and real benefit, allows a true, accurate and meaningful representation of the results of any RCT. Indeed, three apparently similar RCT emerge as very heterogeneous when dissected in the clinical (TimeGain curves) and pharmacoeconomic (MoT/y^{+} and NNT/y^{+}) domain. We also unrevealed that the most relevant clinical and economic indices of any positive clinical trial follow a predictable kinetics, a property that facilitates data calculation and allows data modelling.
The need to improve the information generated by RCT, already presented more than 60 years ago [13] and recently prompted by authors particularly active in oncology [1, 6, 14,15,16], has emerged also for the limited reliability of risk ratio, and also NNT, in adequately representing the clinical significance of the eventually successful intervention. This integrated index is based on the assumption, seldom verified, of constancy throughout the study, does not take into consideration the kinetics (i.e. timedependency) of the effect and implies the possibility to prevent an outcome, which for populations bearing a high risk for that event, is misleading. What is generated by a successful intervention is a gain in eventfree time that is enjoyed by the whole cohort as a unit according to a specific kinetics. Any intervention, in fact, will have a different interaction with time depending on the mechanism/s it interferes with to slow down the progression of the disease. Unfortunately, CV prevention trials that are designed to accumulate a given number of events—in order for the risk ratio to achieve statistical significance—tend to shorten the study duration by increasing the sample size making it difficult to adequately take into account the timedependent effects and eventually appreciate the differences among interventions.
The time gained by the cohort receiving the superior treatment is indeed a function of time, which can be precisely calculated as the difference between the areas of the KM incidence curves. This peculiar curve was already described as difference between restricted mean survival time (RMST) curves by Zhao et al. [17], but the peculiarity of its kinetics and its modelling has never been evaluated. These curves have the rather unexpected property of being smooth and to follow a consistent mathematical pattern, which can be accurately described by a secondorder polynomial function. Clearly, the upraise (slope) of the time gain is not expected to continue indefinitely but only until approximately 50% of the subjects have developed the outcome, after which the curve will tend to plateau (as the number of subjects who can benefit from the treatment is progressively reduced). This pattern is shared by all CV prevention positive studies regardless of the type of intervention as shown in this analysis and also verified in other studies using antithrombotic or antidiabetic drugs. The firstorder coefficient represents the linear and constant gain over time (as would be generated two KM parallel curves) and the secondorder coefficient represent the timedependent progression, i.e. the angle between the two KM curves. The different kinetics, particularly the onset, depends to some extent on the absolute level of risk of the population, but also on whether, and to what extent, the intervention also interferes with the clinical emergence (firstorder coefficient) of the background disease, or mainly with its progression (secondorder coefficient). In our analysis, the different kinetics of cholesterollowering (PROVEIT) and blood pressurelowering drugs (LIFE and HOPE) is evident both in qualitative and in quantitative terms. Interestingly, although the three interventions evaluated yielded comparable RR reductions (and comparable time gains at the end of the study), the time gain of the PROVEIT at 2 years was 2.4fold higher with respect to the other two studies (Table 1).
By using the PISA method, not only the effect of an intervention can be accurately described in the time domain, but also the exposure can be precisely measured throughout the study. From the incidence curves, we can calculate the complementary survival curves, which represent the true exposure of each cohort to the treatment. By doing this, we obtain two timedependent indices (MoT/y^{+} and NNT/y^{+}), which allow to appreciate how the cost effectiveness of the intervention changes with time. These curves also have the property of being smooth and to follow a kinetics that is very well described by a mathematical function. The major characteristics of these curves that reflect the cost normalized per unit of gain are that they display a rapid decline in the first 12–24 months and then slowly tend to approach a plateau. By describing objectively how the cost efficacy improves over time, these curves allow to compare the different interventions. The achievement of costefficacy level of 1000 MoT/y^{+} is reached after 6.3, 39.5 and > 72 months in PROVEIT, HOPE and LIFE trials respectively. Whereas the broader primary composite outcome of the PROVEIT in part justifies its quicker and greater cost effectiveness, the difference between HOPE and LIFE for the identical composite outcome is a fact to be taken into account in costefficacy analysis of the two different therapeutic strategies. Another relevant aspect that emerges from the inspection of the curves is that the cost efficacy of the HOPE tend to increase progressively (lower MoT/y^{+} values) with time. Therefore, the heterogeneity of the trials can be well appreciated both in absolute terms and also in the kinetics. Thanks to the curve fitting, it is possible to compare NNT at the same time (Table 1) and also to estimate how long would it take for an intervention to reach an NNT which is considered cost effective. For example, before the value of NNT/y^{+} goes below 50, it is necessary to wait 13.3, 29.3 and 32.5 months for PROVEIT, HOPE and LIFE respectively. Due to the different kinetics of the curves, whereas at 2 years the NNT/y^{+} is similar for HOPE and LIFE the projections at 6 years indicate a two/fourfold difference among the studies. Clearly, this does not translate into a superiority of an intervention since the study design, outcomes and population were different, but would be a relevant information in case of studies sharing similar patient’s characteristics, similar design and identical outcomes. In addition, since the index is normalized for 1 year of eventfree life, this allows an easy quantification of the real benefit; in economic terms, it is in fact possible to calculate the value of this unit, while this is more difficult when, as it is commonly done, the unit of gain is expressed as 1 event less, which again implies the concept that this event is really avoided, while it is more likely to be postponed. The novel index we here propose MoT/y^{+} in our opinion is particularly meaningful. It is based on a more accurate representation of the exposure (MoT), which taking simultaneously into consideration both the number of patients being treated and the duration of the treatment, allows a fair comparison among the studies also when having different durations. As evident from Table 1, while at study end the NNT/y^{+} of the three trials was similar, the corresponding values of MoT/y^{+} indicate a clear difference among the three interventions. In addition, due to its slower kinetics MoT/y^{+} allows a better discrimination in the late, and also economically more relevant, part of the study.
Another major asset of the PISA method is that it allows to estimate the results of any positive trial beyond its actual duration. Although we fully recognize the limitations inherent to the extrapolation of the data, we base our confidence on the reliability of the results on the following arguments. First, the time gain functions of cardiovascular prevention trials with durations up to 10 years (UKPDS34, STENO2) all share the same secondorder polynomial pattern and within the same RCT this pattern is present both on compound and individual outcomes. Second, although the data fit is based only on a portion of the whole duration of the study (t_{0}–t_{50%}), the extrapolation beyond t_{50%} tend to closely represent the real data and this in the HOPE study holds true over a period of 27 months. Third, the extremely large sample size, characteristic of most CV prevention studies, increases the accuracy with which the effect is measured particularly in the early part of the study, on which the data fit is based.
The major limitation of the PISA method is that it does not allow to calculate the error of the estimates when performed on secondary data. Indeed, data at the patient level would be necessary. This would allow calculation of the 95% confidence intervals of the KaplanMeier curves and the parameters estimates using the 2 extreme pairs (97.5% active vs 2.5% control and 2.5% active vs 97.5% control) to produce the 95% confidence intervals. Of note, if the original KaplanMeier curves are presented with their confidence boundaries, it would be possible to calculate the error without accessing to patientlevel data. Alternatively, provided that methods have been recently developed to calculate reliable confidence intervals from summary data [18], this limitation might be overcome.
In conclusion, the PISA method dissects the whole information enclosed in KM curves. Being simple and easily reproducible, it might improve survival data analysis by providing clear definition of trial results, allowing fair comparisons between similar RCT and performing accurate predictions beyond the trial duration.
Abbreviations
 RCT:

Randomized clinical trials
 HR:

Hazard ratios
 NNT:

Number needed to treat
 CV:

Cardiovascular
 KM:

KaplanMeier
 PISA:

Pragmatic Interpretation of Survival Analysis
 MoT:

Months of treatment
References
Uno H, Claggett B, Tian L, et al. Moving beyond the hazard ratio in quantifying the betweengroup difference in survival analysis. J Clin Oncol. 2014;32:2380–5.
Mathew A, Pandey M, Murthy NS. Survival analysis: caveats and pitfalls. Eur J Surg Oncol. 1999;25:321–9.
George B, Seals S, Aban I. Survival analysis and regression models. J Nucl Cardiol. 2014;21:686–94.
Wei LJ. The accelerated failure time model: a useful alternative to the Cox regression model in survival analysis. Stat Med. 1992;11:1871–9.
Koenker RHK. Quantile regression. J Econ Perspect. 2001;15:143–56.
Royston P, Parmar MK. Restricted mean survival time: an alternative to the hazard ratio for the design and analysis of randomized trials with a timetoevent outcome. BMC Med Res Methodol. 2013;13:152.
MeierKriesche HU, Schold JD, Kaplan B. Longterm renal allograft survival: have we made significant progress or is it time to rethink our analytic and therapeutic strategies? Am J Transplant. 2004;4:1289–95.
Uno H, Wittes J, Fu H, et al. Alternatives to hazard ratios for comparing the efficacy or safety of therapies in noninferiority studies. Ann Intern Med. 2015;163:127–34.
Lytsy P, Berglund L, Sundstrom J. A proposal for an additional clinical trial outcome measure assessing preventive effect as delay of events. Eur J Epidemiol. 2012;27:903–9.
Cannon CP, Braunwald E, McCabe CH, et al. Intensive versus moderate lipid lowering with statins after acute coronary syndromes. N Engl J Med. 2004;350:1495–504.
Dahlöf B, Devereux RB, Kjeldsen SE, et al. Cardiovascular morbidity and mortality in the losartan intervention for endpoint reduction in hypertension study (LIFE): a randomised trial against atenolol. Lancet. 2002;359:995–1003.
Yusuf S, Sleight P, Pogue J, Bosch J, Davies R, Dagenais G. Effects of an angiotensinconvertingenzyme inhibitor, ramipril, on cardiovascular events in highrisk patients. N Engl J Med. 2000;342:145–53.
Irwin JO. The standard error of an estimate of expectation of life, with special reference to expectation of tumourless life in experiments with mice. J Hyg (Lond). 1949;47:188.
A'Hern RP. Restricted mean survival time: an obligatory end point for timetoevent analysis in cancer trials? J Clin Oncol. 2016;34:3474–6.
Trinquart L, Jacot J, Conner SC, Porcher R. Comparison of treatment effects measured by the hazard ratio and by the ratio of restricted mean survival times in oncology randomized controlled trials. J Clin Oncol. 2016;34:1813–9.
Zhao L, Tian L, Uno H, et al. Utilizing the integrated difference of two survival functions to quantify the treatment contrast for designing, monitoring, and analyzing a comparative clinical study. Clin Trials. 2012;9:570–7.
Zhao L, Claggett B, Tian L, et al. On the restricted mean survival time curve in survival analysis. Biometrics. 2016;72:215–21.
Guyot P, Ades AE, Ouwens MJ, Welton NJ. Enhanced secondary analysis of survival data: reconstructing the data from published KaplanMeier survival curves. BMC Med Res Methodol. 2012;12:9.
Acknowledgements
Not applicable.
Funding
No funds were used to support this manuscript.
Author information
Authors and Affiliations
Contributions
AM performed data extraction and data analysis, produced the figures and the tables and drafted the manuscript. DT contributed to data analysis and interpretation. AN conceived and designed the manuscript, performed data extraction and contributed to interpret the data and revised the manuscript. The authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Additional file 1: Supplementary Figure 1.
Data extraction. The figure shows the accuracy of the data extraction using the UNSCANIT Graph Digitalizer software on the KaplanMeier of PROVEIT, LIFE and HOPE trials, respectively for 4pMACE*, 3pMACE and 3pMACE outcomes. Point assignment was set at the mid line of upper and lower surface of the curve.
Additional file 2: Supplementary Figure 2.
Glossary. Full glossary with definitions and mathematical formulae.
Additional file 3: Supplementary Figure 3.
Time Gain observed and fitted curves from other major trials. Time Gain curves (continuous line) and fitted Time Gain curves (f50%; dotted lines) with the extrapolation beyond time at which less than 50% of the cohort was in followup (indicated by arrows) throughout the duration of the studies for various outcomes of some CV prevention trials: UKPDS34, STENO2, CIBISII, EMPAREG OUTCOME, CANVAS, LEADER, SUSTAIN6. With regards to CIBISII and SUSTAIN6 trial (due to population in followup > 50% at the end of the trial) the fit and the extrapolation were performed at the end of the trial (indicated by arrows). The polynomial second order function obtained by performing curve fitting is displayed for each outcome along with the R^{2}. a) UPKPDS34, Death from any cause outcome; b) STENO2, CV events outcome: a composite of cardiovascular disease events that included death from cardiovascular causes, nonfatal stroke, nonfatal myocardial infarction, coronaryartery bypass grafting, percutaneous coronary intervention or revascularization for peripheral atherosclerotic arterial disease, and amputation because of ischemia; c) CIBIS II: Death from any cause outcome; d) EMPAREG OUTCOME, CV Death outcome; e) EMPAREG OUTCOME: Heart Failure outcome, f) CANVAS: Death from any cause outcome; g) LEADER: 3pMACE outcome, h) SUSTAIN6: 3pMACE outcome.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Mengozzi, A., Tricò, D. & Natali, A. A novel method for interpreting survival analysis data: description and test on three major clinical trials on cardiovascular prevention. Trials 21, 578 (2020). https://doi.org/10.1186/s1306302004511y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s1306302004511y