Research Methods & Reporting

Reporting of surrogate endpoints in randomised controlled trial reports (CONSORT-Surrogate): extension checklist with explanation and elaboration

BMJ 2024; 386 doi: https://doi.org/10.1136/bmj-2023-078524 (Published 09 July 2024) Cite this as: BMJ 2024;386:e078524

Linked Research Methods and Reporting

Reporting of surrogate endpoints in randomised controlled trial protocols (SPIRIT-Surrogate): extension checklist with explanation and elaboration

Anthony Muchai Manyara, honorary research fellow and senior research associate1 2,
Philippa Davies, senior research associate3,
Derek Stewart, patient and public involvement partner4,
Christopher J Weir, professor5,
Amber E Young, honorary professor3,
Jane Blazeby, professor3 6 7,
Nancy J Butcher, assistant professor8 9,
Sylwia Bujkiewicz, professor10,
An-Wen Chan, professor11 12,
Dalia Dawoud, associate director13 14,
Martin Offringa, professor8 15,
Mario Ouwens, group director of biostatistics16,
Asbjørn Hróbjartsson, professor and head of centre17 18,
Alain Amstutz, postdoctoral researcher19 20 21,
Luca Bertolaccini, deputy director22,
Vito Domenico Bruno, cardiac surgeon23,
Declan Devane, professor and director24 25,
Christina D C M Faria, associate professor26,
Peter B Gilbert, professor27,
Ray Harris4,
Marissa Lassere, staff specialist rheumatologist28,
Lucio Marinelli, associate professor29 30,
Sarah Markham, visiting researcher4 31,
John H Powers III, professor32,
Yousef Rezaei, general practitioner and research fellow33 34 35,
Laura Richert, professor36,
Falk Schwendicke, director37,
Larisa G Tereshchenko, associate professor38,
Achilles Thoma, clinical professor39,
Alparslan Turan, professor40,
Andrew Worrall4,
Robin Christensen, professor41,
Gary S Collins, professor42,
Joseph S Ross, professor43 44,
Rod S Taylor, professor1 45,
Oriana Ciani, associate professor46

¹MRC/CSO Social and Public Health Sciences Unit, School of Health and Wellbeing, University of Glasgow, Glasgow, UK
²Global Health and Ageing Research Unit, Bristol Medical School, University of Bristol, Bristol, UK
³Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
⁴Patient author, UK
⁵Edinburgh Clinical Trials Unit, Usher Institute, University of Edinburgh, Edinburgh, UK
⁶Bristol NIHR Biomedical Research Centre, Bristol, UK
⁷University Hospitals Bristol and Weston NHS Foundation Trust, Bristol, UK
⁸Child Health Evaluative Sciences, Hospital for Sick Children Research Institute, Toronto, ON, Canada
⁹Department of Psychiatry, University of Toronto, Toronto, ON, Canada
¹⁰Biostatistics Research Group, Department of Population Health Sciences, University of Leicester, Leicester, UK
¹¹Women’s College Research Institute, Toronto, ON, Canada
¹²Department of Medicine, University of Toronto, Toronto, ON, Canada
¹³Science, Evidence, and Analytics Directorate, Science Policy and Research Programme, National Institute for Health and Care Excellence, London, UK
¹⁴Faculty of Pharmacy, Cairo University, Cairo, Egypt
¹⁵Department of Paediatrics, University of Toronto, Toronto, ON, Canada
¹⁶AstraZeneca, Mölndal, Sweden
¹⁷Centre for Evidence-Based Medicine Odense and Cochrane Denmark, Department of Clinical Research, University of Southern Denmark, Odense, Denmark
¹⁸Open Patient data Explorative Network, Odense University hospital, Odense, Denmark
¹⁹CLEAR Methods Centre, Division of Clinical Epidemiology, Department of Clinical Research, University Hospital Basel and University of Basel, Basel, Switzerland
²⁰Oslo Centre for Biostatistics and Epidemiology, Oslo University Hospital, Oslo, Norway
²¹Bristol Medical School, University of Bristol, Bristol, UK
²²Department of Thoracic Surgery, IEO, European Institute of Oncology IRCCS, Milan, Italy
²³IRCCS Galeazzi-Sant’Ambrogio Hospital, Department of Minimally Invasive Cardiac Surgery, Milan, Italy
²⁴University of Galway, Galway, Ireland
²⁵Health Research Board-Trials Methodology Research Network, University of Galway, Galway, Ireland
²⁶Department of Physical Therapy, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
²⁷Fred Hutchinson Cancer Centre, Seattle, WA, USA
²⁸St George Hospital and School of Population Health, University of New South Wales, Sydney, NSW, Australia
²⁹Department of Neuroscience, Rehabilitation, Ophthalmology, Genetics, Maternal and Child Health, University of Genova, Genoa, Italy
³⁰IRCCS Ospedale Policlinico San Martino, Genoa, Italy
³¹Department of Biostatistics and Health Informatics, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
³²George Washington University School of Medicine, Washington, DC, USA
³³Heart Valve Disease Research Centre, Rajaie Cardiovascular Medical and Research Centre, Iran University of Medical Sciences, Tehran, Iran
³⁴Ardabil University of Medical Sciences, Ardabil, Iran
³⁵Behyan Clinic, Pardis New Town, Tehran, Iran
³⁶University of Bordeaux, Centre d’Investigation Clinique-Epidémiologie Clinique 1401, Research in Clinical Epidemiology and in Public Health and European Clinical Trials Platform & Development/French Clinical Research Infrastructure Network, Institut National de la Santé et de la Recherche Médicale/Institut Bergonié/Centre Hospitalier Universitaire Bordeaux, Bordeaux, France
³⁷Charité Universitätsmedizin Berlin, Berlin, Germany
³⁸Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
³⁹McMaster University, Hamilton, ON, Canada
⁴⁰Department of Outcomes Research, Anaesthesiology Institute, Cleveland Clinic, OH, USA
⁴¹Section for Biostatistics and Evidence-Based Research, the Parker Institute, Bispebjerg and Frederiksberg Hospital, Copenhagen and Research Unit of Rheumatology, Department of Clinical Research, University of Southern Denmark, Odense University Hospital, Odense, Denmark
⁴²UK EQUATOR Centre, Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford, UK
⁴³Department of Health Policy and Management, Yale School of Public Health, New Haven, CT, USA
⁴⁴Section of General Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
⁴⁵Robertson Centre for Biostatistics, School of Health and Well Being, University of Glasgow, Glasgow, UK
⁴⁶Centre for Research on Health and Social Care Management, Bocconi University, Milan 20136, Italy

Correspondence to: O Ciani oriana.ciani{at}unibocconi.it (or @OrianaCiani on Twitter)

Accepted 30 April 2024

Randomised controlled trials commonly use surrogate endpoints to substitute for a target outcome (an outcome of direct interest and relevance to trial participants, clinicians, and other stakeholders, eg, all cause mortality) to improve their efficiency (through shorter trial duration, reduced sample size, and thus lower research costs), or for ethical or practical reasons. But reliance on surrogate endpoints can increase the uncertainty of an intervention’s treatment effect and can fail to provide adequate information on intervention harms, which has led to calls for improved reporting of trials using surrogate endpoints. This report presents a consensus driven reporting guideline for trials using surrogate endpoints as the primary outcomes: the CONSORT (Consolidated Standards of Reporting Trials) extension checklist, CONSORT-Surrogate. The extension includes nine items modified from the CONSORT 2010 checklist and two new items. Examples and explanations for each item are provided. We recommend that all stakeholders (including trial investigators and sponsors, journal editors and peer reviewers, research ethics reviewers, and funders) use this extension when reporting trials that use surrogate endpoints. Use of this checklist will improve transparency, interpretation, and usefulness of trial findings, and ultimately reduce research waste.

Evidence from well designed, conducted, and reported randomised controlled trials (referred to as trials in this article) assessing the effect of an intervention on the target outcome of interest (eg, all cause mortality) is required to determine the efficacy or effectiveness of interventions.1 Inadequate reporting of trials reduces their usefulness for decision making and, thus, contributes to the rising problem of research waste.2 3 Using reporting guidelines has been shown to improve the usefulness of trial evidence and reduce research waste.3 The CONSORT (Consolidated Standards of Reporting Trials) statement is a 25 item checklist widely used for reporting parallel group trials.4 While the CONSORT checklist has improved the completeness of trial reports,5 it is not adequate for all types of trials. Consequently, CONSORT extensions (checklists with modified or new items) have been developed (eg, CONSORT-PRO (patient reported outcomes),6 CONSORT-Outcomes7). However, none of the existing extensions provides specific guidance for trials that use surrogate endpoints. Surrogate endpoints act as a substitute in trials for target outcomes.8 9 Table 1 lists some examples of surrogate endpoints applicable in trials.

Table 1

Examples of surrogate endpoints in randomised controlled trials


Surrogate endpoints are frequently used to improve trial efficiency (eg, to shorten duration of follow-up, reduce sample size, and, thus, lower overall trial costs) among other feasibility, practical, ethical, and scientific reasons.16 Depending on the disease or health area and the definition of a surrogate endpoint, it has been estimated that 20-78% of trials use surrogate endpoints as primary outcomes.17 18 19 20 However, in the absence of data on target outcomes, the use of surrogate endpoints in trials can be controversial and has fundamental limitations for clinical and policy decision making, such as increased uncertainty about the intervention’s true effect on the target outcome (and clinical efficacy or effectiveness, and cost effectiveness) and failure to provide adequate information on intervention harms, given the typically smaller sample size and shorter follow-up period of such trials.16 Consequently, there have been calls for better reporting of trials that rely on surrogate endpoints, including an explicit statement and rationale for the use of a surrogate endpoint and consideration of their potential limitations.20 21 22 23 Considering the ongoing inadequacies in reporting trials using surrogate endpoints, the SPIRIT/CONSORT-Surrogate project was formed to develop extensions for SPIRIT and CONSORT for trials using a surrogate endpoint as a primary outcome (video 1). The SPIRIT-Surrogate extension is presented in Manyara et al.24 In this article, we present the CONSORT-Surrogate extension checklist along with an elaboration and explanation document. Table 2 provides a glossary of terminology used in the extension.

Table 2

Glossary of terminology used in the CONSORT-Surrogate extension


Video 1

Surrogate outcomes

Summary points

Randomised controlled trials often rely on surrogate endpoints to replace a target outcome of interest, particularly in the regulatory approval and health technology assessment of drugs and biological agents
Use of surrogate endpoints in trials might be misleading in terms of claims of intervention efficacy or effectiveness on target outcomes, and by providing limited information on harms
This article describes the CONSORT-Surrogate extension, a guideline to improve reporting of trial reports using a surrogate endpoint as a primary outcome to consequently inform better patient care, healthcare decisions, and policies
Trial authors, journal editors, and reviewers should use the CONSORT-Surrogate extension to improve the reporting of relevant trial reports and so enhance completeness, transparency, replicability of methods, interpretation, and usefulness of findings

Scope and use of CONSORT-Surrogate

Box 1 summarises the scope and use of the CONSORT-Surrogate extension. The extension should be used to report all trial types and phases that use surrogate endpoints (based on any definition) as primary outcome(s), including when a surrogate endpoint is used as part of a composite outcome. Given that primary outcomes drive evaluation of interventions and trial conclusions, the focus of the extension is on this aspect. The extension provides the minimum recommended items to report, but authors can provide additional information that helps with transparency of surrogate endpoint trials and interpretation of results. Importantly, the extension does not mandate trial teams to change their design or plans to fit with recommended items: authors should just be explicit about what was done or planned, but are strongly encouraged to consider implementing all items when possible. Appendix table A1 presents key methodological considerations for the design and reporting of surrogate endpoints in trial reports, which inform the extension items.

Box 1

Summary of scope and use of CONSORT-Surrogate extension

Eligibility for use

All intervention randomised controlled trials using surrogate endpoints (based on any definition) as primary outcome(s). Includes instances when surrogate endpoints are part of a primary composite outcome.

Minimum requirement

The extension is the minimum set of items to be reported but authors can provide more information for improved transparency, clarity, and interpretation of findings.

Surrogate validation methods are out of scope

The appraisal of surrogate validation methods or metrics to use or cite is out of the scope of this extension.

Target outcome(s)

Trial teams should consider collecting target outcomes (as secondary outcome(s)) and reporting their intervention effects. Such information can support subsequent surrogate endpoint validation analyses and assessment of potential intervention harms.

Flexibility in order of reporting items

Items can be combined or reported in different sections to those items suggested in the extension. The specific item sections are recommendations rather than requirements.

Extrapolation of extension items

The extension was developed for trials, but could be relevant to report non-randomised trials, observational studies, and other studies using surrogate endpoints.

CONSORT=Consolidated Standards of Reporting Trials.


Development of CONSORT-Surrogate extension

Development of the CONSORT-Surrogate extension, undertaken alongside the SPIRIT-Surrogate extension, followed four phases informed by the EQUATOR (Enhancing the QUAlity and Transparency Of health Research) network guidance for developing health reporting guidelines.28 The development was pre-registered on the EQUATOR network website29 and the protocol published.30 Phase 1 involved literature reviews aimed at synthesising reporting items of trials using surrogate endpoints from current literature and identifying surrogate content experts (scoping review); and identifying trial investigators of recent trials using surrogate endpoints as primary outcomes for invitation to an e-Delphi survey (targeted review). The protocol for the literature reviews has been published elsewhere.31 The scoping review search was undertaken between March and May 2022, and 90 documents were included after screening. Data on definitions, limitations, acceptability, and guidance were extracted and used to generate 17 trial reporting items. The findings of the scoping review, including the 17 generated items, have been published elsewhere.16 After a project team discussion, 13 items were taken forward for rating in the e-Delphi survey.

Phase 2 involved rating of potential reporting items in a two round, e-Delphi survey using a 9 point Likert scale (1-3: not important; 4-6: important but not critical; 7-9: critical) on the DelphiManager software (version 5.0), maintained by the COMET initiative (Core Outcome Measures in Effectiveness Trials; https://www.comet-initiative.org/delphimanager/). The first round was open from 24 August to 10 October 2022; and the second round from 31 October to 11 December 2022. Participants were identified through various ways: contacting authors of relevant articles from the literature reviews; project team professional contacts; calls for participants made in conferences and meetings, social media, and distributed through professional organisations and networks (listed in appendix 2).

A total of 212 eligible participants registered to participate, with 195 (92%) rating the items in the first round and 176 (83%) in the second round. Participants represented 30 countries and encompassed a multidisciplinary group of stakeholders, including trial investigators, trial methodologists (including statisticians), trial managers, clinicians and allied health professionals, surrogate content experts, journal editors, patient and public partners, regulators, and payers or health technology assessment experts, ethics committee and funding panel members. Appendix tables A2, A3, A4, and A5 list the characteristics of participants.

Consensus thresholds for inclusion of items were ≥70% score of 7-9 and <15% score of 1-3, consensus thresholds for exclusion were ≥70% score of 1-3 and <15% score of 7-9, and no consensus for inclusion or exclusion was the failure to achieve either threshold.30 Thirteen items for CONSORT-Surrogate were rated in round one and 14 items in round two (additional item was suggested by participants in round one). Eight items achieved consensus thresholds in round one and a further two items in round two while there was no consensus for four items after both rounds (appendix table A6).
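The consensus rule described above can be written out as a short function. The following Python sketch is illustrative only and is not the project's own software: the function name `classify_item` and the example ratings are hypothetical, and only the thresholds (≥70% and <15%) are taken from the text.

```python
def classify_item(scores):
    """Classify one e-Delphi item from its 1-9 Likert ratings.

    Thresholds follow the text: inclusion needs >=70% of ratings in 7-9
    and <15% in 1-3; exclusion needs >=70% in 1-3 and <15% in 7-9.
    """
    n = len(scores)
    pct_critical = 100 * sum(7 <= s <= 9 for s in scores) / n
    pct_not_important = 100 * sum(1 <= s <= 3 for s in scores) / n
    if pct_critical >= 70 and pct_not_important < 15:
        return "include"
    if pct_not_important >= 70 and pct_critical < 15:
        return "exclude"
    return "no consensus"


# Hypothetical ratings from 10 participants for three items.
print(classify_item([8, 8, 9, 7, 8, 9, 7, 8, 5, 5]))  # 80% critical, 0% not important
print(classify_item([2, 1, 3, 2, 2, 1, 3, 5, 5, 5]))  # 70% not important, 0% critical
print(classify_item([8, 8, 9, 7, 8, 9, 7, 2, 2, 2]))  # 70% critical but 30% not important
```

The third item shows why both conditions matter: it meets the 70% threshold for "critical" ratings but fails the <15% condition, so it would be carried forward as "no consensus".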

Phase 3 was a hybrid consensus meeting held on 13 and 14 March 2023 at the University of Glasgow, Glasgow, UK and via Zoom. Meeting delegates included 13 project team members and an invited subset of 20 stakeholders who had participated in the e-Delphi survey. The four items that did not reach consensus in the e-Delphi survey were discussed and voted on (using https://www.mentimeter.com/). Consensus was predefined as ≥70% voting to include or exclude an item. All four items achieved consensus: two for inclusion and two for exclusion (appendix table A7). For items that reached consensus, meeting delegates also fine-tuned wording and merging of items and discussed free text comments provided from e-Delphi surveys.

Phase 4 is an ongoing knowledge translation that includes dissemination and implementation of extensions. Dissemination efforts have included publication of short articles to publicise the project32 33 34 35 36; publication of protocols30 31; and presentations in meetings and conferences. The completed checklist was piloted by eight trial investigators who had conducted at least one trial, by providing them with a published trial and asking them to note whether extension items were reported. All items were clear, and no changes were made as a result of the pilot exercise.

Structure of the CONSORT-Surrogate extension

The extension consists of a checklist that is accompanied by an explanation and an elaboration section to provide rationale and clarification on modified or new items. Additionally, exemplars reporting the extension items are provided. For items that have not been extended, users should refer to the CONSORT 2010 explanation and elaboration document.4 We used 12 published trial reports to provide at least one example of reporting in each of the 11 CONSORT-Surrogate extension items. Eight (67%) of the trial reports used as examples were identified from a targeted review of trial protocols published between January 2017 and June 2022 in six general medical journals and the rest were identified from exploratory searches in PubMed database. The example text in this article includes a “ref” in superscript to indicate cited references within the examples. We have supplemented some examples by adding terms and recommendations to enhance their use. Abbreviations have also been spelt out in the examples where necessary. Use of any of the examples does not imply our support for the trial findings, conclusions, or endorsement of the interventions evaluated. Furthermore, it is not possible to identify and list examples from all disease and research areas that should use this extension. Therefore, trial teams can use examples provided as a guide on how the items can be reported in their own disease or research area. The identification of examples for nearly all extension items demonstrates the feasibility of implementing the extensions in trial reports.

Despite extensive efforts, which included reviewing trial reports from targeted reviews and seeking exemplars from colleagues, we were unable to find an example that effectively implemented one item: informing participants that the trial used a surrogate endpoint. Therefore, together with patient and public partners who are coauthors of this extension (DS, RH, SM, AW), we have modified a quote from a published protocol to demonstrate how this item can be reported in a trial report (item 26a.1).

CONSORT-Surrogate extension

Table 3 compares the CONSORT 2010 checklist with the extension items in the CONSORT-Surrogate checklist. Appendix 3 presents a combined CONSORT 2010 and CONSORT-Surrogate checklist, which can be downloaded and completed separately.

Table 3

Comparison of items from CONSORT 2010 and CONSORT-Surrogate extension


Title and abstract

Item 1b (extended)

CONSORT 2010 item 1b

Structured summary of trial design, methods, results, and conclusions.4

For specific guidance, see CONSORT for abstracts.37

CONSORT-Surrogate extension item 1b.1

State (a) that the primary outcome is a surrogate endpoint, and (b) the target outcome(s) whose intervention effect is being substituted for.

Examples of CONSORT-Surrogate item 1b.1

Example 1

“The primary outcome was the peak change of urinary neutrophil gelatinase-associated lipocalin within 48 h, a surrogate marker [endpoint] of kidney injury.”38 (We have added the word “endpoint” and recommend its use.)

Example 2

“To evaluate the effects of the dipeptidyl peptidase-4 (DPP-4) inhibitor linagliptin on aortic pulse wave velocity (PWV) as a surrogate [endpoint] marker of arterial stiffness and early atherosclerosis in people with early type 2 diabetes.”39 (We recommend the use of the term “surrogate endpoint” rather than “marker”.)

Explanation

Well written trial abstracts provide an initial assessment to readers to decide whether to read or access the full report and, in some cases, they can solely inform healthcare decisions.37 40 Explicit mention of certain items in abstracts is important for database indexing37 and consequent retrieval of trials for secondary research such as surrogate endpoint validation. Despite their importance, space restrictions require that abstracts only present the key information of a trial.41 In addition to using CONSORT for abstracts,37 trial authors should be explicit about the use of a surrogate endpoint as a primary outcome and the target outcome being substituted for. Given the varying structures of abstracts and limited space,40 41 authors can report this item in various ways, as seen from examples provided.

Introduction

Background and objectives (extended)

CONSORT 2010 item 2a

Scientific background and explanation of rationale.

See CONSORT 2010.4

CONSORT 2010 item 2b

Specific objectives or hypotheses.

See CONSORT 2010.4

CONSORT-Surrogate extension item 2.1

State (a) that the primary outcome is a surrogate endpoint, and (b) the target outcome(s) whose intervention effect is being substituted for.

Example of CONSORT-Surrogate extension item 2.1

“PWV [aortic pulse wave velocity] is an integrated index of arterial function and structure and hence a [surrogate endpoint] marker of early atherosclerosis ^ref. A higher PWV is associated with a more stiffened artery and an increased risk of CV [cardiovascular] events^ref.”39 (We have added the word “surrogate” in the example and recommend its use when reporting the item. We also recommend use of the term “surrogate endpoint” and specific citation of the reference supporting the validity of surrogate endpoint (see explanation of item 6a.2).)

Explanation

The introduction outlines the reasons for conducting the trial by summarising current evidence and knowledge gaps being filled4 40; see the REPORT guide for more information on introduction content and structure.40 The introduction gives readers a general outline of the trial report4 40 and allows journal editors and reviewers to assess the importance of a trial report.40 Therefore, authors need to be explicit about using a surrogate endpoint and about the target outcome whose treatment effect is being substituted for. Given that introduction sections in final trial publications can be shorter than for protocol publications,41 a brief statement of the primary outcome being a surrogate endpoint and the associated target outcome would be sufficient for this item. Authors can outline more details on the surrogate endpoint(s) selected, including their justification, in the methods section (see items 6a.1 and 6a.2). However, authors could summarise these items in the introduction if doing so gives readers a better sense of the context or importance of the trial. Finally, because introductions have different structures and word lengths, authors can report the item when reporting either CONSORT 2010 item 2a or item 2b.

Methods

Outcomes

CONSORT 2010 item 6a (extended)

Completely defined prespecified primary and secondary outcome measures, including how and when they were assessed.

See CONSORT 20104 and CONSORT-Outcomes extension.7

CONSORT-Surrogate extension item 6a.1

State the practical or scientific reason(s) for using a surrogate endpoint as a primary outcome.

CONSORT-Surrogate extension item 6a.2

Justification for selected surrogate: (a) evidence (or lack of evidence) of surrogate endpoint validation; and (b) evidence (or lack of evidence) of validity being specific to setting and context used (eg, intervention; disease; population).

Example of CONSORT-Surrogate item 6a.1

“We used surrogate endpoints for this trial because of a number of practical constraints, including the trial cost, rapidly evolving evidence in this field, and concern about the feasibility of conducting a long-term intervention in a vulnerable population. However, the endpoints selected have been validated as having prognostic significance for CVD [cardiovascular] events.”42

Explanation

Given limitations associated with surrogate endpoints,16 authors should inform readers of the scientific or practical reason(s) for using them. A commonly cited reason for use of surrogate endpoints is trial efficiency: shorter follow-up and smaller sample size. This use can be ideal for early phase trials where the focus is on demonstrating biological activity and informing the need for future trials powered on target outcomes.16 Also, primary prevention trials can require a long time to accrue, and trials of rare diseases often have access to only small trial populations.16 Additionally, surrogate endpoints have been widely used in regulatory approval settings as part of expedited or accelerated approval for conditions with high unmet medical need in serious and life threatening diseases.8 16 43 Further, target outcomes might not be ideal in certain interventional contexts; for example, participant reported outcomes can be challenging to collect in paediatric trials44 involving newborn babies or very young children (aged <7 years), where observer reported outcomes are needed. The practical or scientific reasons for using surrogate endpoints highlighted here and elsewhere16 might not be exhaustive.

Reporting this item provides readers with a justification of using surrogate endpoint(s) as a primary outcome and contextualising the importance of the trial. However, adequate reporting of this item does not preclude authors from addressing item 6a.2 on the validity of the surrogate endpoint selected (see explanation for item 6a.2).

Examples of CONSORT-Surrogate item 6a.2

Example 1

“The primary end point for these trials was chosen in agreement with the US Food and Drug Administration. Although no published data specifically document overt clinical benefits related to a 30% or greater reduction of PTH [parathyroid hormone], several observational studies have shown that PTH concentrations greater than 600 pg/mL [as a surrogate endpoint] are associated with higher rates of [target outcomes] death, cardiovascular events, and fracture than PTH concentrations in the range of 150 to 300 pg/mL.^refs”45 (We have added words to the quote in square brackets and recommend their use when reporting the item.)

Example 2

“The primary efficacy endpoint was the change in daytime ambulatory systolic blood pressure from baseline to 2 months. Systolic blood pressure is a validated surrogate endpoint for prediction of cardiovascular events and mortality based on a meta-analysis of 123 blood pressure lowering drug trials, with 613,815 participants, demonstrating a strong association between the treatment effect on systolic blood pressure and cardiovascular events ^ref. Specifically, meta-regression showed relative risk reductions for major cardiovascular disease events (P<0.0001), stroke (P<0.0001), heart failure (P<0.0001), and all-cause mortality (P=0.014) to be proportional to the magnitude of the systolic blood pressure reduction achieved. However, risk reductions for various diseases differed across drug classes, and more evidence is needed to establish that the validity of blood pressure lowering to predict benefit in cardiovascular events and mortality holds when renal denervation is used.” (This example was written by the authors on the basis of a published trial10 and the meta-analysis46 cited by that trial, which reported a strong association between the treatment effect on the surrogate endpoint (difference in systolic blood pressure) and the target outcome (relative risk for cardiovascular and all cause mortality) across randomised controlled trials of blood pressure lowering drugs; see explanation for item 6a.2.)

Explanation

Surrogate endpoints should be validated before they are used. Validation is determining whether the intervention’s effect on the surrogate endpoint predicts the intervention effect on the target outcome.47 48 While a detailed discussion of surrogate validation is beyond the scope of this extension, we signpost readers to several articles on surrogate validation methods,47 48 49 50 51 52 53 54 55 56 frameworks for evaluating validity of evidence,21 57 58 59 and recently, a checklist to report surrogate validation.60 In brief, validation should demonstrate both a strong association between the surrogate endpoint and the target outcome (the so-called individual level association) and a tight correlation between the treatment effect on the surrogate and the treatment effect on the target outcome (the so-called trial level association).47 48

For instance, example 1 for item 6a.2 fails to achieve this desired level of evidence because it is based only on an observational association between PTH (parathyroid hormone; surrogate endpoint) and the target outcomes of mortality and major clinical events. In contrast, example 2 cites the association in treatment effect between the surrogate of systolic blood pressure and target outcome of mortality, based on a meta-analysis (regression) of randomised controlled trials. To allow readers to fully judge the strength of evidence for the validity of a surrogate endpoint, authors should provide some key meta-regression metrics, that is: the slope coefficient (and 95% confidence interval) of the linear relation between the treatment effect of the surrogate and the target outcome, the strength of the association such as Spearman’s correlation coefficient (ρ) or R², and the surrogate treatment effect or prediction intervals (see item 7a.1). Illustration of these metrics for blood pressure and cardiovascular events can be found in the article by Lassere et al.58
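To make the metrics named above concrete, the following Python sketch computes a slope (with an approximate 95% confidence interval), R², and Spearman’s ρ from trial level treatment effects. It is a simplified illustration under stated assumptions, not a validation method endorsed by this extension: it uses unweighted least squares, ignores the estimation error in each trial’s effects (which dedicated meta-regression methods account for), and uses a normal approximation for the interval. The function names and the data are hypothetical.

```python
from statistics import NormalDist, mean


def ranks(values):
    """Return 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    out = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            out[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return out


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def trial_level_metrics(surrogate_effects, target_effects):
    """Unweighted regression of target effects on surrogate effects."""
    x, y = surrogate_effects, target_effects
    n = len(x)
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sxx
    intercept = my - slope * mx
    rss = sum((b - (intercept + slope * a)) ** 2 for a, b in zip(x, y))
    se_slope = (rss / (n - 2) / sxx) ** 0.5
    z = NormalDist().inv_cdf(0.975)  # assumption: normal approximation
    return {
        "slope": slope,
        "slope_ci95": (slope - z * se_slope, slope + z * se_slope),
        "r_squared": pearson(x, y) ** 2,
        "spearman_rho": pearson(ranks(x), ranks(y)),
    }


# Hypothetical data: difference in systolic blood pressure (mm Hg) and
# log relative risk of cardiovascular events in five imaginary trials.
sbp_difference = [-2.0, -5.0, -8.0, -10.0, -12.0]
log_relative_risk = [-0.05, -0.10, -0.18, -0.20, -0.27]
print(trial_level_metrics(sbp_difference, log_relative_risk))
```

With so few trials, the normal approximation makes the interval narrower than a t based interval would be, which is one reason a real analysis would rely on established meta-regression software.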

Surrogate endpoint validation in trials needs to be better reported. An analysis of 626 trials published in 2005 and 2006 found that only 34% (37/109) of trial reports that used a surrogate endpoint as a primary outcome discussed its validity.61 In cancer, where several surrogate validation studies have been published, a systematic review indicated relatively poor validity of surrogate endpoints: 52% of surrogate endpoints used in trials had a low correlation in their treatment effect with the target outcome of overall survival (r ≤0.7), with only 23% demonstrating a high correlation (r ≥0.85). Surrogate validation models often provide the opportunity to predict the treatment effect on the target outcome in new trials for which the effect on the surrogate endpoint has been estimated. It is therefore important to quantify the accuracy of the predictions made.60 Leave-one-out cross validation, and even external validation using new trials published after the model was fitted or trials whose individual patient data were not available for model estimation, are essential to assess the model’s predictive performance and calibration.62 Trial authors should therefore be explicit on the surrogate endpoint validity evidence (or lack of it). Over the years, many statistical approaches to surrogate validation have been proposed54 63 (some of which are summarised in box 2). The approach underpinning the selection of the surrogate endpoint should be presented in detail.
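To make the idea of leave-one-out cross validation concrete, the following minimal sketch refits a simple trial level regression with each trial omitted in turn and compares the predicted with the observed treatment effect on the target outcome in the omitted trial. It assumes an unweighted linear model and uses invented data; the function names are hypothetical, and published validation studies generally use more elaborate models.

```python
from statistics import mean


def fit_line(x, y):
    """Ordinary least squares fit; returns (intercept, slope)."""
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sxx
    return my - slope * mx, slope


def leave_one_out_errors(surrogate_effects, target_effects):
    """Prediction error (observed minus predicted) for each omitted trial."""
    x, y = list(surrogate_effects), list(target_effects)
    errors = []
    for i in range(len(x)):
        # Fit the model on every trial except trial i.
        intercept, slope = fit_line(x[:i] + x[i + 1:], y[:i] + y[i + 1:])
        predicted = intercept + slope * x[i]
        errors.append(y[i] - predicted)
    return errors


# Hypothetical trial level treatment effects (surrogate, target outcome).
surrogate = [-2.0, -4.0, -5.0, -7.0, -10.0, -12.0]
target = [-0.03, -0.10, -0.08, -0.17, -0.21, -0.30]

errors = leave_one_out_errors(surrogate, target)
mean_absolute_error = mean(abs(e) for e in errors)
print("leave-one-out errors:", errors)
print("mean absolute error:", mean_absolute_error)
```

Small prediction errors across the omitted trials would support the predictive performance of the model, whereas external validation would repeat the comparison in trials that played no part in model fitting.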

Box 2

Summary of statistical approaches for surrogate endpoint validation

Selected and non-exhaustive statistical methods and general approaches for evaluating the validity of surrogate endpoints in the assessment of treatment efficacy that have emerged over the past four decades.

Prentice’s criteria53

In pioneering work published in 1989, Prentice proposed three criteria for valid hypothesis testing extrapolation (rejecting the null hypothesis of no treatment effect on the surrogate endpoint implies rejecting the null hypothesis of no treatment effect on the target outcome):

The effect of the surrogate endpoint on the true endpoint does not vary with randomisation group;
The surrogate endpoint affects the true endpoint;
The effect of treatment on the surrogate endpoint changes the average effect of treatment on true endpoint.

The Prentice criteria remain conceptually important but are of limited usefulness in practice.

Principal stratification64

This method maintains that causal effects should be the basis for surrogate endpoint evaluations, where the causal effect is a comparison between treatment groups of the potential outcomes on the same set of individuals. Two requirements are needed for surrogate validity: causal necessity, which requires that an effect of treatment on the target outcome can only exist if treatment has also affected the surrogate; and statistical generalisability, which requires good predictive performance of the surrogate for the target outcome in a future study in which only the surrogate is observed.

Meta-analytical regression based approach47 65

This approach relies on two stage, joint modelling of the surrogate and target outcome in a multi-trial (randomised trials) setting. Surrogacy is established on the basis of the coefficient of determination between the surrogate and target outcome at the individual patient level (individual level R²), and the coefficient of determination between the treatment effect on the surrogate and on the target outcome at the trial level (trial level R²). Alternatively, the surrogate threshold effect has been proposed as a practical measure to define the minimum level of treatment effect required on the surrogate to conclude that a significant treatment effect would also be present on the target outcome.66 Extensions of these meta-analytical methods based on information theory have been proposed as the preferred approach under the causal association paradigm.67

Bayesian approaches

While a bayesian approach will be readily applicable to all the methodologies outlined above, the most commonly used models are the meta-analytical fixed (independent) effects model proposed by Daniels and Hughes63 and a bayesian random effects meta-analysis to model trial level effects on the target outcome and surrogate endpoint.51 More recently, bayesian multivariate meta-analytical methods to take into account the association between the treatment effects on the surrogate and target outcomes have been proposed specifically for regulatory and reimbursement decision making.51

Additionally, evidence of surrogate validity in one trial context (eg, sufficiently similar population, intervention, disease, control, and setting) might not generalise to another.16 For example, a systematic review of studies evaluating the validity of progression-free survival as a surrogate endpoint for overall survival found that trial level validity varied across the intervention evaluated, cancer localisation, and stage.68 The magnitude of weight loss assessed using body mass index, which predicts a morbidity or mortality benefit, often depends on disease or obesity related complications, the individual’s age, and their baseline obesity level.69 70 Therefore, trial investigators should justify the surrogate endpoint based on evidence of surrogate validity (or lack of it) in the context used (see example 2 for item 6a.2 on validity, which is specific to different diseases but acknowledges the lack of evidence for the specific intervention being tested).

Sample size

CONSORT 2010 item 7a (extended)

How sample size was determined.

See CONSORT 2010.4

CONSORT-Surrogate extension item 7a.1

Clarify if sample size was estimated to demonstrate that a minimum effect on the surrogate endpoint would be predictive of a benefit on the target outcome(s).

Examples of CONSORT-Surrogate item 7a.1

Example 1

“Because previously published data suggested a low overall incidence of CA-AKI [contrast associated-acute kidney injury] ^ref at our centre, we chose NGAL [neutrophil gelatinase-associated lipocalin] as primary outcome parameter. A formal power calculation was not performed for the primary endpoint of this exploratory study, because of a lack of suitable data on preventive therapy studies with rhC1INH at time of study design and therefore the use of potentially poor estimates of parameters for sample size calculations. In analogy to previous interventional studies using different prophylactic regimens ^refs and similar surrogate parameters of renal function, we calculated that 40 subjects are required in each study arm to allow for the detection of a difference in mean urinary peak NGAL concentration of 100 ng/mL assuming a standard deviation of 150 ng/mL, a power of 80%, and a 2-sided type 1 error of 5%. This difference has been shown to be predictive of [the target outcome:] AKI ^ref.”38 (We have added words in square brackets and recommend their use. Given the exploratory context of this trial, authors use metrics from an observational study; however, trial teams should aim to use metrics drawn directly from surrogate validation studies.)

Example 2

“The assumptions for the power calculation (threshold of a 40-m increase as the [surrogate threshold effect] minimal clinically important improvement in 6-minute walk test distance, with an SD [standard deviation] of 80m) were based on (1) a meta-regression of prior randomized clinical trials in patients with pulmonary arterial hypertension ^ref (due to the lack of such data in patients with HFpEF [heart failure with preserved ejection fraction]) and (2) clinical consensus among members of the trial’s steering committee.”71 (We recommend using the term “surrogate threshold effect” rather than “minimal clinically important improvement,” which is consistent with the cited surrogate validation study.)

Explanation

Trial sample size determination must be appropriately justified and adequately reported, including details of the target effect size and allowance for participant attrition.4 72 Trials with a primary outcome that is a surrogate endpoint should consider their choice of a target effect size based on metrics of surrogate validity. For example, a commonly reported validation metric is the minimum treatment effect on the surrogate endpoint necessary to predict a treatment benefit on the target outcome, known as the surrogate threshold effect.58 66 The concept of surrogate threshold effect was used in example 2 of item 7a.1, although the authors acknowledge that this is derived from a different patient population (owing to data unavailability for their trial population).71 In contrast, in example 1 of item 7a.1,38 other metrics of surrogate validity are used: justification of the difference in the surrogate endpoint that is predictive of the target outcome comes from a prospective study that used cut-off thresholds derived from a receiver operating characteristic curve.73

In some instances, owing to the absence of previous surrogate endpoint validation, it might not be possible for authors to use surrogate validity metrics formally to determine the sample size. However, trial investigators could consider prospectively validating their chosen surrogate endpoint if the data are available (see item 26.2). Furthermore, given that surrogate endpoints are mainly used to improve trial efficiency (ie, allow for a smaller sample size compared with using target outcomes), authors are encouraged to determine the sample size for both the surrogate endpoint and target outcome. If the sample size based on the treatment effect on the target outcome is the same as (or smaller than) that based on the surrogate endpoint, then sufficient justification for the choice of surrogate as the primary outcome should be provided. Finally, whether validity metrics are used or not, authors should discuss the interpretation of findings in the context of using a surrogate endpoint and its known validity (see item 22.1), including how the predicted effect on the target outcome and its uncertainty (reflected by its confidence interval) has been derived.
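As a worked illustration of how a target difference on a surrogate endpoint translates into a sample size, the sketch below applies the standard normal approximation for comparing two means with equal group sizes. It is an assumption that this formula is appropriate here: the quoted trial reports do not state which formula was used, and the function names and the 15% attrition figure are hypothetical. The inputs are the differences, standard deviations, power, and type 1 error quoted in examples 1 and 2 of item 7a.1.

```python
import math
from statistics import NormalDist


def n_per_arm(target_difference, sd, power=0.80, alpha=0.05):
    """Participants per arm for a two-sided comparison of two means
    (normal approximation, equal allocation, common standard deviation)."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_beta = NormalDist().inv_cdf(power)
    return math.ceil(2 * ((z_alpha + z_beta) * sd / target_difference) ** 2)


def inflate_for_attrition(n, attrition):
    """Increase the per-arm sample size to allow for an expected dropout proportion."""
    return math.ceil(n / (1 - attrition))


# Example 1 inputs: 100 ng/mL difference in peak NGAL, SD 150 ng/mL.
ngal_n = n_per_arm(target_difference=100, sd=150)

# Example 2 inputs: 40 m difference in 6-minute walk distance, SD 80 m.
walk_n = n_per_arm(target_difference=40, sd=80)

# Hypothetical 15% attrition applied to the second calculation.
walk_n_with_attrition = inflate_for_attrition(walk_n, attrition=0.15)

print(ngal_n, walk_n, walk_n_with_attrition)
```

Under these assumptions the approximation gives 36 and 63 participants per arm, respectively. These values need not match the figures reported by the trials, whose exact formulas and attrition allowances are not given in the quoted passages. Running the same function with the expected effect on the target outcome allows the two sample sizes to be compared, as encouraged above.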

Results

Outcomes and estimation

CONSORT 2010 item 17a (extended)

For each primary and secondary outcome, results for each group, and the estimated effect size and its precision (such as 95% confidence interval).

See CONSORT 2010.4

CONSORT-Surrogate extension item 17a.1

If the primary outcome is a composite outcome that includes a surrogate endpoint, report the intervention effect on all components.

Examples of CONSORT-Surrogate item 17a.1

See table 4 and table 5 for examples.

Table 4

Reporting individual components of an example composite measure: the American College of Rheumatology (ACR) response. Table generated using results from van de Putte et al,74 with permission from BMJ Publishing Group

Table 5

Reporting of individual components of a composite measure (progression-free survival) in per protocol population, including recurrence or death. Table adapted from Parekh et al,75 with permission from Elsevier and Copyright Clearance Centre

Explanation

A composite outcome comprises two or more component outcomes (eg, the proportion of participants who had raised systolic blood pressure, experienced a non-fatal stroke, or died). Experience of any one of the components is considered as experience of the composite outcome.25 The considerations for using composite outcomes in trials are discussed in detail elsewhere and are out of scope of this extension.25 76 77 78

An audit of trials published in 2008-10 found that of 106 trials that used a composite outcome, 28% (n=30) included a surrogate endpoint as one of the components.79 Authors are encouraged to separately report the treatment effects on each component of a composite outcome. Reporting of this item applies to explicit composite outcomes (ie, “the primary outcome was a composite outcome [components a or b or c]”); and composite measures (table 4 and table 5). Composite measures and scales combine outcomes that are or include surrogate endpoint(s) such as disease or progression-free survival in cancer (disease recurrence or progression measured using tumour size or death)80; or clinical cure in infectious diseases, measured through clinician assessed response, and radiographical or microbiological criteria.81 82 This item should preferably be included in the main text of the trial report rather than in supplementary files or appendices.

Discussion

Interpretation

CONSORT 2010 item 22 (extended)

Interpretation is consistent with results, balancing benefits and harms, and considering other relevant evidence.

Refer to CONSORT 2010.4 Reporting of the three subsequent CONSORT-Surrogate extension items can be done together and in any order.

CONSORT-Surrogate extension item 22.1

Interpretation of findings of the trial in the context of using a surrogate primary endpoint, including its known validity for intervention effects on the target outcome and the potential benefit-risk assessments of the tested intervention for participants.

CONSORT-Surrogate extension item 22.2

Comment on whether the trial design (including sample size and follow-up period), given the use of a surrogate endpoint, adequately captures the potential harms of the intervention being tested.

CONSORT-Surrogate extension item 22.3

State what the plans are to conduct subsequent analyses/studies to verify current findings on the target outcome(s).

Examples of CONSORT-Surrogate item 22.1

Example 1

“With the detection rate for invasive breast cancer representing an early screening surrogate parameter, results from TOSYMA [trial name] point towards possible effects of digital breast tomosynthesis on long-term screening benefits. An absolute increase in the detection rate of invasive breast cancer for early tumour stages in the screening phase of TOSYMA, presumably indicating diagnostic improvements, might be expected to reduce the incidence of advanced breast cancers in screened populations and, thus, potentially exert positive effects on breast cancer mortality ^refs. However, increased detection of small size cancers at screening without a reduction in the incidence of invasive interval cancers among screen negative women in the 2-year interval up to the subsequent screening examination would raise questions regarding the screening benefit and possible overdiagnosis ^refs.”83

Example 2: Combining items 22.1 and 22.2

“If maintained in the long term as highlighted by the 3-year report of the Global SYMPLICITY Registry ^ref as well as the 12-month results of the RADIANCE-HTN SOLO study ^ref, the average 9.0 mm Hg reduction in [the surrogate endpoint of] office systolic blood pressure we observed after renal denervation in patients with resistant hypertension who are at high risk of a cardiovascular event, ^ref is of a magnitude previously associated with a reduction in [target outcomes:] stroke, coronary heart disease, heart failure, and all-cause mortality for antihypertensive drug therapy ^ref. A reduction in both cardiovascular and cerebrovascular events might also be expected if we confirm our previous observation in the RADIANCE-HTN SOLO trial of a reduced visit-to-visit variability in blood pressure after renal denervation ^ref.”10 (We have added the words in square brackets and recommend their use when reporting the item.)

Explanation

This item recommends that authors consider the main limitations of the specific surrogate endpoints used and discuss the implications for interpreting the trial findings. Specifically, readers should be informed what the intervention effect observed on the surrogate endpoint implies for the target outcome, drawing from current validity evidence (or lack of it), and for the potential overall intervention benefit-risk balance. In case of good predictive performance of surrogate validation models in the setting of interest, researchers should provide the predicted effect on the target outcome, together with a measure of uncertainty (eg, confidence intervals) and the actual prediction equation used. The following surrogate validation studies illustrate this level of reporting.84 85 86 Reporting of this item is important to inform how stakeholders use the trial findings to guide practice and policy.

Trials that collect data on both surrogate endpoint and target outcome with a more extended follow-up and larger sample sizes can be less speculative about the intervention benefit-risk balance. Irrespective of sample sizes and follow-up time, trial authors should report the treatment effects on the target outcome(s) when collected. Adequate reporting of other extension items on surrogate validity (item 6a.2) and potential harms (item 22.2) will enable adequate reporting of this item.

Example of CONSORT-Surrogate item 22.2

“Additional follow-up will be required to determine whether the blood pressure lowering effect of ultrasound renal denervation remains durable over time, especially when patients receive additional antihypertensive medications (particularly the aldosterone antagonist spironolactone) to control their blood pressure in both masked (2-6 months) and unmasked conditions (after 6 months) ^ref. Although adverse events were infrequent, longer follow-up of this trial and more treated patients will be necessary to provide additional safety data.”10

Explanation

While trial treatment effects on a surrogate endpoint can indicate a potentially positive impact of an intervention, longer term trial follow-up or introduction of the intervention into routine practice could demonstrate the intervention to be harmful.87 In 1996, Fleming and DeMets described several examples where drugs had been approved on the basis of a positive treatment effect on a surrogate endpoint, only to be shown later to cause overall harm to patients and the public.88 These examples included suppression of arrhythmia (abnormal heart rhythm), where drugs to reduce arrhythmias (considered to be a surrogate endpoint for cardiovascular related mortality) were later found to increase mortality.88 More recent examples include a diabetes treatment (rosiglitazone) approved based on blood glucose reduction (a surrogate for serious diabetic complications, cardiovascular events, and death) that was later found to be associated with increased hospital admission for heart failure and increased heart attacks.89 Also, in the BELLINI trial, a drug (venetoclax) that improved progression-free survival (a surrogate for overall survival) in relapsed or refractory multiple myeloma patients was found to be associated with higher mortality.90

These harms could be due to various reasons, including unintended consequences of the intervention that are not mediated through the surrogate endpoint or known disease causal pathways, and the possibility that the intervention might not have a positive effect on the surrogate endpoint for the same people for whom the surrogate endpoint positively correlates with the target outcome.87 88 When a surrogate endpoint is used as the primary outcome, we recommend collecting the target outcome as a secondary outcome, especially if it could inform potential harmful effects of the intervention and would override results based on the surrogate endpoint. For example, the BELLINI trial captured the harm of the intervention, leading to its early termination, because it used progression-free survival as a primary outcome but also collected overall survival as a secondary outcome.90 91

Examples of CONSORT-Surrogate item 22.3

Example 1: Reporting subsequent analyses

“At a median follow-up of nearly 17 months, overall survival [target outcome] was not yet mature; however, fewer deaths occurred in the KdD [carfilzomib, dexamethasone, and daratumumab] group (19%) versus the Kd group [carfilzomib and dexamethasone] (23%), and a trend towards an overall survival benefit for KdD versus Kd was observed (appendix p 7). Overall survival will be reassessed in a subsequent pre-planned analysis.”92 (We have added the words in square brackets and recommend their use when reporting the item.)

Example 2: Combining items 22.1, 22.2, and 22.3 (reports ongoing study)

“This study is limited in that direct evaluation of the effect of vosoritide treatment on final adult height and how this relates to functionality, quality of life, and activities of daily living in people with achondroplasia cannot be evaluated at this time. In addition, whether treatment with vosoritide will ameliorate the medical complications associated with achondroplasia and decrease the need for surgical interventions is unknown.

“Concerns around these limitations are shared by some in the short-statured community, and their support groups, who consider that a treatment that only increases height [the surrogate endpoint] in achondroplasia is not a priority, and that the [target outcomes of] short term and long term health of individuals must also be enhanced. These perspectives are balanced by the views of some participants in this trial and their families, who agree that while better health is an important outcome, increased height in and of itself will facilitate better access to the environment, less discrimination, and higher self-esteem. To address these limitations, concerns, and unanswered questions, an ongoing, open-label, phase 3, extension study (ClinicalTrials.gov number, NCT03424018) will continue to evaluate the balance of benefits and harms of vosoritide until the patients reported in this study reach final adult height. This study will collect data regarding vosoritide therapy on wider health measures including quality of life, activities of daily living, and frequency and type of medical and surgical interventions compared with registry data of untreated children with achondroplasia. This long term study will also provide data on whether treatment of children with achondroplasia with vosoritide will result in a pubertal growth spurt, which appears to be absent in this condition ^ref and provide the opportunity to detect any harms associated with long term therapy.”93 (We have added the words in square brackets and recommend their use when reporting the item.)

Explanation

This item builds on the previous item to inform readers of subsequent analyses or studies to verify current findings (on observed benefit, lack of benefit, or harms) using a target outcome. These could include extended follow-up of the trial population to confirm the intervention effect on target outcome and evidence from surrogate endpoint validation studies. A survey of cardiovascular trials using surrogate endpoints as primary outcomes and published in three high impact journals between 1990 and 2011 found that only 27% had subsequent trials to verify findings using a target outcome.94 In cancer, a retrospective analysis of drug approvals by the US Food and Drug Administration (FDA) found that 56% of accelerated approvals and 37% of traditional approvals were not supported by strong surrogate validation evidence.95 Nevertheless, only 45% of the approvals had subsequent analysis on the target outcome of overall survival.95 Lack of subsequent studies to verify the effect could extend beyond cardiovascular diseases, cancer, and drug related interventions, and could lead to the continued use of interventions that have no benefit.23

We acknowledge that the conduct of subsequent trial analyses or additional studies depends on several factors, including feasibility and availability of research funding. Furthermore, plans to conduct such future analyses or studies could change over time. Nevertheless, we recommend that authors are transparent in reporting this item, that is, by giving an explicit statement of no plans (with justification), a description of plans (including planned follow-up beyond the study period, or a planned or in-progress confirmatory target outcome trial), or a description of initial plans that have changed.

Other information

New items

CONSORT-Surrogate extension item 26.1

State whether and how trial participants were engaged and informed before enrolment that the trial was designed to evaluate an intervention’s effect using a surrogate endpoint.

CONSORT-Surrogate extension item 26.2

If surrogate endpoint and target outcome data were collected in the trial, state the open access arrangements for the data for future secondary research.

Example of CONSORT-Surrogate item 26.1

“All participants [received] adequate information about the nature, purpose, possible risks, and benefits of the trial [given the use of a surrogate endpoint as the primary outcome], and alternative therapeutic choices using an informed consent protocol approved by the IRB [institutional review board]. All participants [were] given ample time and opportunity to ask questions and consider participation in the trial.”96 (This example did not implement the item but has been used to show how the item can be reported using the words in square brackets. It was also taken from a protocol and has been modified to show past tense.)

Explanation

Public engagement (also known as community engagement) is listening to, interacting with, and connecting with members of the public to share research activity or benefits, discuss relevant issues (such as ethics), or obtain input on preliminary research ideas.97 Patient and public involvement is focused on a specific study and involves conduct of research with or by members of the public (rather than “for,” “to,” or “about” members of the public).97 Public engagement is vital not only for the planning and conduct of trials but also for the translation of trial findings and greater benefit for trial participants and the public.98 99 Public engagement and informed consent are mutually supportive issues with the same goal: maximising social value of research conducted in a respectful manner.100 101 102 Informed consent is a legal and ethical requirement of research involving human participants before they enter the study.103 104 It involves adequately informing participants of trial details including the anticipated benefits and potential risks of participation.103 105 Therefore, for trials using surrogate endpoints as primary outcomes, informed consent provides an avenue to engage trial participants on the use of surrogate endpoints and their risks and benefits or ideally to continue ongoing engagement. However, evidence from early phase trials (many of which might rely on surrogate endpoints) suggests that participant risk-benefit communication is suboptimal. A survey of 172 early phase trials’ informed consent documents found that only 45% specified the outcome of mentioned health benefits, and only 63% mentioned the likelihood of health risks of which only half were specific on whether risks would be due to research procedures or potentially beneficial interventions.105

Informing trial participants that the study used a surrogate endpoint (and related limitations) is critical to informed consent.106 We have discussed this item in detail in the SPIRIT-Surrogate extension, including suggestions on implementing it. Authors should justify trials that do not implement the item.

Examples of CONSORT-Surrogate item 26.2

Example 1: Data available on request

“Data Sharing Statement: The complete deidentified patient data set will be made available upon publication to researchers whose proposed use of the data has been approved. Requests should be sent to ctu.beatlupus{at}ucl.ac.uk.”107 (The trial’s primary outcome was the surrogate endpoint of levels of anti-double stranded DNA antibodies in serum IgG, and disease flares were the target outcome which was a secondary outcome.)

Example 2: Data available via an intermediary

“Janssen has an agreement with the Yale Open Data Access (YODA) Project to serve as the independent review panel for the evaluation of requests for clinical study reports and participant-level data from investigators and physicians for scientific research that will advance medical knowledge and public health. Data will be made available following publication and approval by YODA of any formal requests with a defined analysis plan. For more information on this process or to make a request, please visit the Yoda Project site at https://yoda.yale.edu. The data sharing policy of Janssen Pharmaceutical Companies of Johnson & Johnson is available at https://www.janssen.com/clinical-trials/transparency.”108 (This phase 1 trial had safety as the primary outcome but is an example of the deposit of data in an intermediary.)

Explanation

We have already highlighted the importance of collecting target outcome data when a surrogate endpoint is used as a primary outcome: it can allow for surrogate endpoint validation and can contribute to monitoring intervention harms. Therefore, we encourage trial teams to collect target outcomes as secondary outcomes. A key challenge of undertaking surrogate validation studies is limited access to individual participant data from completed studies.109 Therefore, sharing surrogate endpoint and target outcome data, when collected, allows leveraging the trial dataset to advance the surrogate validation field.

Adequate reporting of this item (ie, statements that data will be available) is not enough: trial investigator teams should be genuinely committed to sharing their datasets. Recent surveys of published trials found that access to individual patient level data was overwhelmingly low (<25%) despite most trial authors having declared an intention to share the data.110 111 There could be challenges to data sharing, including risk of loss of participant confidentiality, perceived risk of inappropriate use of data, and competition from peers with access to the data.110 112 Consequently, when data sharing is impossible or only possible for part of the data, authors should be explicit about it and provide a justification.

Conclusion

Trials using surrogate endpoints need better, more transparent reporting. The CONSORT-Surrogate extension provides the minimum reporting requirements for trial reports and publications that have used surrogate endpoints as primary outcomes. Proper application of CONSORT-Surrogate should improve reporting of such trials, aiding interpretation of findings to inform practice and policy. The extension should be used along with the main CONSORT 2010 reporting guideline.

The CONSORT-Surrogate extension can contribute to reduction of research waste.3 Nevertheless, while many journals endorse using the main CONSORT checklists, very few endorse using extensions.1 We therefore call on all stakeholders (including funders, journal editors, and reviewers) to encourage use of the CONSORT-Surrogate extension. However, we acknowledge that use of the extension does not rule out other sources of research waste, including wrong choice of research question, biases, or poor design.2 Specifically, trial teams and readers should note that bias in surrogate endpoint measurement contributes to poor prediction of intervention effects on target outcomes.16 Finally, adequate reporting of all items in this extension does not preclude trial investigator teams and the wider scientific community from directly assessing and reporting intervention effects on target outcomes, whenever possible.

Ethics statements

Ethical approval

The project received ethical approval on 24 May 2022 from the ethics committee of the University of Glasgow College of Medical, Veterinary, and Life Sciences (project No 200210151). All participants gave informed consent before taking part in the study.

Data availability statement

Additional data are available on request from the corresponding author. After publication of all of the project’s manuscripts, data will be deposited in the UK Data Archive, and will be accessible through its standard end user licence (this would require users to log in to the UK Data Service).

Acknowledgments

We thank all the professional networks, organisations, and groups (listed in the appendix) that helped with circulation of our mobilisation calls for participants, and all participants in the development of this extension who completed the e-Delphi survey and piloted the extensions (listed in alphabetical order in the appendix). We are indebted to Amber E Young (University of Bristol), who passed away in September 2022, for her contribution to the planning and conduct of the SPIRIT/CONSORT-Surrogate project.

SPIRIT/CONSORT-Surrogate project team: (project management group) Anthony Muchai Manyara, Philippa Davies, Derek Stewart, Christopher J Weir, Amber E Young, Jane Blazeby, Rod S Taylor, Oriana Ciani; (executive committee) Nancy J Butcher, Sylwia Bujkiewicz, An-Wen Chan, Gary S Collins, Dalia Dawoud, Martin Offringa, Mario Ouwens, Joseph S Ross.

SPIRIT/CONSORT-Surrogate consensus group: Robin Christensen, Marissa Lassere, Asbjørn Hróbjartssson, Oriana Ciani, Derek Stewart (co-chair, patient and public involvement lead), Jane Blazeby, Joseph S Ross (co-chair), Mario Ouwens, Anthony Muchai Manyara, Rod S Taylor, Alain Amstutz, Luca Bertolaccini, Vito Domenico Bruno, Sylwia Bujkiewicz, Gary S Collins (co-chair), Philippa Davies, Dalia Dawoud, Declan Devane, Christina D C M Faria, Peter B Gilbert, Ray Harris, Lucio Marinelli, Sarah Markham, Martin Offringa, John H Powers, Yousef Rezaei, Laura Richert, Falk Schwendicke, Larisa G Tereshchenko, Achilles Thoma, Alparslan Turan, Christopher J Weir, Andrew Worrall.

Footnotes

Contributors: AMM, RST, and OC are joint first authors. PD, CJW, AEY, RST, and OC were involved in funding acquisition. AMM, PD, DS, CJW, AEY, RST, and OC were involved in study conception and design. AMM, PD, DS, CJW, AEY, JB, RST, OC, NJB, SB, A-WC, GSC, DD, MOf, MOu, and JSR contributed to the methodology. NJB, SB, A-WC, GSC, DD, MOf, MOu, and JSR supervised the project. AMM, RST, and OC curated the data and conducted formal analysis. AMM, RST, OC, SM, DS, RH, and AW were responsible for the first draft of the manuscript. All authors critically reviewed the first draft and approved the final version. AMM, RST, and OC are the guarantors. The corresponding author confirms that all listed authors meet authorship criteria and that no others meeting the criteria have been omitted.
Funding: The research was funded as part of the development of SPIRIT and CONSORT extensions by the UK Medical Research Council (grant MR/V038400/1). GSC was supported by Cancer Research UK (programme grant C49297/A27294). JB was supported by the NIHR Bristol Biomedical Research Centre. SB was supported by the UK Medical Research Council (MR/T025166/1) and Leicester NIHR Biomedical Research Centre. AA receives his salary from the Research Fund Junior Researchers of the University of Basel. RC acknowledges that the Section for Biostatistics and Evidence-Based Research (Parker Institute, Bispebjerg and Frederiksberg Hospital) is supported by core grants. CDCMF receives research productivity fellowships from the Oak Foundation (OCAY-18-774-OFIL) and National Council for Scientific and Technological Development (CNPq/Brazil grant 08516/2021-4). The views expressed in this article are those of the authors and not their employers or funders. The funders had no role in the design and conduct of the study; the data collection, management, analysis, and interpretation; the drafting, review, or approval of the manuscript; or the decision to submit the manuscript for publication. This article reflects the personal views of the authors, the Delphi participants, and the consensus meeting delegates, and may not represent the views of the broader stakeholder groups, authors’ institutions, or other affiliations.
Competing interests: All authors have completed the ICMJE uniform disclosure form at www.icmje.org/disclosure-of-interest/ and declare: support from the UK Medical Research Council for the submitted work. SB is a member of the NICE Decision Support Unit and NICE Guidelines Technical Support Unit; has served as a paid consultant, providing methodological advice, to NICE, Roche, IQVIA, and RTI Health Solutions; received payments for educational events from Roche; and has received research funding from European Federation of Pharmaceutical Industries and Associations and Johnson & Johnson. MOu works for and has shares in AstraZeneca. JSR is a deputy editor at JAMA, was formerly an associate editor at The BMJ, and is co-founder (unpaid) of medRxiv; has received research support through Yale University from Johnson & Johnson to develop methods of clinical trial data sharing, from the Medical Device Innovation Consortium as part of the National Evaluation System for Health Technology, from the Food and Drug Administration for the Yale-Mayo Clinic Center for Excellence in Regulatory Science and Innovation programme (U01FD005938), from the Agency for Healthcare Research and Quality (R01HS022882), from the National Heart, Lung and Blood Institute of the National Institutes of Health (R01HS025164, R01HL144644), and from Arnold Ventures; was an expert witness at the request of Relator’s attorneys, the Greene Law Firm, in a qui tam suit alleging violations of the False Claims Act and Anti-Kickback Statute against Biogen that was settled in September 2022. NJB has received consulting fees from Nobias Therapeutics. AA and YR are associate editors at BMC Trials. OC is an associate editor for Value in Health and has received consulting fees from MSD and Janssen. RC is a founding member of the OMERACT Technical Advisory Group, which might be perceived as a possible conflict of interest. RH has shares in Johnson & Johnson. 
JHP has been a consultant for AdaptivePhage, Arrevus, Atheln, BavariaNordic, Cellularity, Eicos, Evofem, Eyecheck, Gilead, GSK, Mustang, OPKO, Otsuka, Resolve, Romark, SpineBioPPharma, UTIlity, and Vir. GSC is a statistics editor for The BMJ and director of the UK EQUATOR Centre. CJW has undertaken consultancy for AB Science, for which his institution has received a fee. DD is an associate editor of Value in Health.
Transparency: The guarantors affirm that this manuscript is an honest, accurate, and transparent account of the study being reported; that no important aspects of the study have been omitted; and that any discrepancies from the study as planned (and, if relevant, registered) have been explained.
Patient and public involvement: Four patient and public involvement (PPI) partners (DS, SM, RH, AW) were consensus meeting delegates and are coauthors of this extension. DS has been a member of the project team and the project PPI lead. Nineteen PPI partners participated in the e-Delphi survey.
Dissemination to participants and related patient and public communities: The SPIRIT-Surrogate and CONSORT-Surrogate extensions will be disseminated to the public through press releases, presentations at conferences, video tutorials, and plain language summaries posted on websites and social media.
Provenance and peer review: Not commissioned; externally peer reviewed.

This is an Open Access article distributed in accordance with the terms of the Creative Commons Attribution (CC BY 4.0) license, which permits others to distribute, remix, adapt and build upon this work, for commercial use, provided the original work is properly cited. See: http://creativecommons.org/licenses/by/4.0/.

References

↵
1. Junqueira DR,
2. Zorzela L,
3. Golder S,
4. et al.,
5. CONSORT Harms Group
. CONSORT Harms 2022 statement, explanation, and elaboration: updated guideline for the reporting of harms in randomised trials. BMJ2023;381:e073725. doi:10.1136/bmj-2022-073725. pmid:37094878
OpenUrl FREE Full Text
↵
1. Chalmers I,
2. Glasziou P
. Avoidable waste in the production and reporting of research evidence. Lancet2009;374:86-9. doi:10.1016/S0140-6736(09)60329-9. pmid:19525005
OpenUrl CrossRef PubMed Web of Science
↵
1. Glasziou P,
2. Altman DG,
3. Bossuyt P,
4. et al
. Reducing waste from incomplete or unusable reports of biomedical research. Lancet2014;383:267-76. doi:10.1016/S0140-6736(13)62228-X. pmid:24411647
OpenUrl CrossRef PubMed Web of Science
↵
1. Moher D,
2. Hopewell S,
3. Schulz KF,
4. et al
. CONSORT 2010 explanation and elaboration: updated guidelines for reporting parallel group randomised trials. BMJ2010;340:c869. doi:10.1136/bmj.c869. pmid:20332511
OpenUrl FREE Full Text
↵
1. Turner L,
2. Shamseer L,
3. Altman DG,
4. Schulz KF,
5. Moher D
. Does use of the CONSORT Statement impact the completeness of reporting of randomised controlled trials published in medical journals? A Cochrane review. Syst Rev2012;1:60. doi:10.1186/2046-4053-1-60. pmid:23194585
OpenUrl CrossRef PubMed
↵
1. Calvert M,
2. Blazeby J,
3. Altman DG,
4. Revicki DA,
5. Moher D,
6. Brundage MD,
7. CONSORT PRO Group
. Reporting of patient-reported outcomes in randomized trials: the CONSORT PRO extension. JAMA2013;309:814-22. doi:10.1001/jama.2013.879. pmid:23443445
OpenUrl CrossRef PubMed Web of Science
↵
1. Butcher NJ,
2. Monsour A,
3. Mew EJ,
4. et al
. Guidelines for Reporting Outcomes in Trial Reports: The CONSORT-Outcomes 2022 Extension. JAMA2022;328:2252-64. doi:10.1001/jama.2022.21022. pmid:36511921
OpenUrl CrossRef PubMed
↵
FDA-NIH Biomarker Working Group. BEST (biomarkers, endpoints, and other tools) resource. 2016. https://www.ncbi.nlm.nih.gov/books/NBK326791/pdf/Bookshelf_NBK326791.pdf.
↵
EUnetHTA. EUnetHTA 21 – Individual Practical Guideline Document D4.4 – OUTCOMES (ENDPOINTS), 2023.
↵
1. Azizi M,
2. Sanghvi K,
3. Saxena M,
4. et al.,
5. RADIANCE-HTN investigators
. Ultrasound renal denervation for hypertension resistant to a triple medication pill (RADIANCE-HTN TRIO): a randomised, multicentre, single-blind, sham-controlled trial. Lancet2021;397:2476-86. doi:10.1016/S0140-6736(21)00788-1. pmid:34010611
OpenUrl CrossRef PubMed
1. Skoulidis F,
2. Li BT,
3. Dy GK,
4. et al
. Sotorasib for Lung Cancers with KRAS p.G12C Mutation. N Engl J Med2021;384:2371-81. doi:10.1056/NEJMoa2103695. pmid:34096690
OpenUrl CrossRef PubMed
1. Wright ME,
2. Delacroix E,
3. Sonneville KR,
4. et al
. Reducing paediatric overweight and obesity through motivational interviewing: study protocol for a randomised controlled trial in the AAP PROS research network. BMJ Open2020;10:e035720. doi:10.1136/bmjopen-2019-035720. pmid:32723736
OpenUrl Abstract/FREE Full Text
1. Chan A-W,
2. Tetzlaff JM,
3. Gøtzsche PC,
4. et al
. SPIRIT 2013 explanation and elaboration: guidance for protocols of clinical trials. BMJ2013;346:e7586. doi:10.1136/bmj.e7586. pmid:23303884
OpenUrl Abstract/FREE Full Text
1. Mayo-Wilson E,
2. Fusco N,
3. Li T,
4. Hong H,
5. Canner JK,
6. Dickersin K,
7. MUDS investigators
. Multiple outcomes and analyses in clinical trials create challenges for interpretation and research synthesis. J Clin Epidemiol2017;86:39-50. doi:10.1016/j.jclinepi.2017.05.007. pmid:28529187
OpenUrl CrossRef PubMed
1. Zarin DA,
2. Tse T,
3. Williams RJ,
4. Califf RM,
5. Ide NC
. The ClinicalTrials.gov results database--update and key issues. N Engl J Med2011;364:852-60. doi:10.1056/NEJMsa1012065. pmid:21366476
OpenUrl CrossRef PubMed Web of Science
↵
1. Manyara AM,
2. Davies P,
3. Stewart D,
4. et al
. Definitions, acceptability, limitations, and guidance in the use and reporting of surrogate end points in trials: a scoping review. J Clin Epidemiol2023;160:83-99. doi:10.1016/j.jclinepi.2023.06.013. pmid:37380118
OpenUrl CrossRef PubMed
↵
1. Patel RB,
2. Vaduganathan M,
3. Samman-Tahhan A,
4. et al
. Trends in Utilization of Surrogate Endpoints in Contemporary Cardiovascular Clinical Trials. Am J Cardiol2016;117:1845-50. doi:10.1016/j.amjcard.2016.03.021. pmid:27085935
OpenUrl CrossRef PubMed
↵
1. Ciani O,
2. Buyse M,
3. Garside R,
4. et al
. Meta-analyses of randomized controlled trials show suboptimal validity of surrogate outcomes for overall survival in advanced colorectal cancer. J Clin Epidemiol2015;68:833-42. doi:10.1016/j.jclinepi.2015.02.016. pmid:25863582
OpenUrl CrossRef PubMed
↵
1. Gandhi GY,
2. Murad MH,
3. Fujiyoshi A,
4. et al
. Patient-important outcomes in registered diabetes trials. JAMA2008;299:2543-9. doi:10.1001/jama.299.21.2543. pmid:18523223
OpenUrl CrossRef PubMed Web of Science
↵
1. la Cour JL,
2. Brok J,
3. Gøtzsche PC
. Inconsistent reporting of surrogate outcomes in randomised clinical trials: cohort study. BMJ2010;341:c3653. doi:10.1136/bmj.c3653. pmid:20719823
OpenUrl Abstract/FREE Full Text
↵
1. Ciani O,
2. Buyse M,
3. Drummond M,
4. Rasi G,
5. Saad ED,
6. Taylor RS
. Time to Review the Role of Surrogate End Points in Health Policy: State of the Art and the Way Forward. Value Health2017;20:487-95. doi:10.1016/j.jval.2016.10.011. pmid:28292495
OpenUrl CrossRef PubMed
↵
1. Ciani O,
2. Buyse M,
3. Garside R,
4. et al
. Comparison of treatment effect sizes associated with surrogate and final patient relevant outcomes in randomised controlled trials: meta-epidemiological study. BMJ2013;346:f457. doi:10.1136/bmj.f457. pmid:23360719
OpenUrl Abstract/FREE Full Text
↵
1. Dawoud D,
2. Naci H,
3. Ciani O,
4. Bujkiewicz S
. Raising the bar for using surrogate endpoints in drug regulation and health technology assessment. BMJ2021;374:n2191. doi:10.1136/bmj.n2191. pmid:34526320
OpenUrl FREE Full Text
↵
1. Manyara AM,
2. Davies P,
3. Stewart D,
4. et al
. Reporting of surrogate endpoints in randomised controlled trial protocols (SPIRIT-Surrogate): extension checklist with explanation and elaboration. BMJ2024;386:e078525. doi:10.1136/bmj-2023-078525 pmid:33507252
OpenUrl FREE Full Text
↵
1. Ferreira-González I,
2. Permanyer-Miralda G,
3. Busse JW,
4. et al
. Methodologic discussions for using and interpreting composite endpoints are limited, but still identify major concerns. J Clin Epidemiol2007;60:651-7, discussion 658-62. doi:10.1016/j.jclinepi.2006.10.020. pmid:17573977
OpenUrl CrossRef PubMed
1. Butcher NJ,
2. Mew EJ,
3. Monsour A,
4. Chan AW,
5. Moher D,
6. Offringa M
. Outcome reporting recommendations for clinical trial protocols and reports: a scoping review. Trials2020;21:620. doi:10.1186/s13063-020-04440-w pmid:32641085
OpenUrl CrossRef PubMed
1. Ciani O,
2. Manyara AM,
3. Davies P,
4. et al
. A framework for the definition and interpretation of the use of surrogate endpoints in interventional trials. EClinicalMedicine2023;65:102283. doi:10.1016/j.eclinm.2023.102283 pmid:37877001
OpenUrl CrossRef PubMed
↵
1. Moher D,
2. Schulz KF,
3. Simera I,
4. Altman DG
. Guidance for developers of health research reporting guidelines. PLoS Med2010;7:e1000217. doi:10.1371/journal.pmed.1000217. pmid:20169112
OpenUrl CrossRef PubMed
↵
EQUATOR. CONSORT-SURROGATE – CONSORT extension for trials using surrogate primary endpoints 2022. https://www.equator-network.org/library/reporting-guidelines-under-development/reporting-guidelines-under-development-for-clinical-trials/#SURROGATE.
↵
1. Manyara AM,
2. Davies P,
3. Stewart D,
4. et al
. Protocol for the development of SPIRIT and CONSORT extensions for randomised controlled trials with surrogate primary endpoints: SPIRIT-SURROGATE and CONSORT-SURROGATE. BMJ Open2022;12:e064304. doi:10.1136/bmjopen-2022-064304. pmid:36220321
OpenUrl CrossRef PubMed
↵
1. Manyara AM,
2. Davies P,
3. Stewart D,
4. et al
. Scoping and targeted reviews to support development of SPIRIT and CONSORT extensions for randomised controlled trials with surrogate primary endpoints: protocol. BMJ Open2022;12:e062798. doi:10.1136/bmjopen-2022-062798. pmid:36229145
OpenUrl Abstract/FREE Full Text
↵
1. Ciani O,
2. Manyara A,
3. Taylor RS
. Need for better reporting of trials with surrogate endpoints: SPIRIT|CONSORT-SURROGATE extensions. J Epidemiol Community Health2022;76:769-70. doi:10.1136/jech-2022-219294. pmid:35750481
OpenUrl FREE Full Text
↵
1. Ciani O,
2. Manyara AM,
3. Chan A-W,
4. Taylor RS,
5. SPIRIT-SURROGATE/CONSORT-SURROGATE project group
. Surrogate endpoints in trials: a call for better reporting. Trials2022;23:991. doi:10.1186/s13063-022-06904-7. pmid:36503559
OpenUrl CrossRef PubMed
↵
1. Ciani O,
2. Manyara AM,
3. Taylor RS
. Surrogate endpoints in trials-a call for better reporting. BMJ2022;378:o1912. doi:10.1136/bmj.o1912. pmid:35905986
OpenUrl FREE Full Text
↵
1. Ciani O,
2. Manyara AM,
3. Taylor RS
. Surrogate end points in cardio-thoracic trials: a call for better reporting and improved interpretation of trial findings. Eur J Cardiothorac Surg2022;62:ezac449. doi:10.1093/ejcts/ezac449. pmid:36112148
OpenUrl CrossRef PubMed
↵
1. Manyara AM,
2. Ciani O,
3. Taylor RS
. A call for better reporting of trials using surrogate primary endpoints. Alzheimers Dement (N Y)2022;8:e12340. doi:10.1002/trc2.12340. pmid:35910671
OpenUrl CrossRef PubMed
↵
1. Hopewell S,
2. Clarke M,
3. Moher D,
4. et al.,
5. CONSORT Group
. CONSORT for reporting randomized controlled trials in journal and conference abstracts: explanation and elaboration. PLoS Med2008;5:e20. doi:10.1371/journal.pmed.0050020. pmid:18215107
OpenUrl CrossRef PubMed
↵
1. Panagiotou A,
2. Trendelenburg M,
3. Heijnen IAFM,
4. et al
. A Randomized Trial of Recombinant Human C1-Esterase-Inhibitor in the Prevention of Contrast-Induced Kidney Injury. JACC Cardiovasc Interv2020;13:833-42. doi:10.1016/j.jcin.2019.11.021. pmid:32171721
OpenUrl Abstract/FREE Full Text
↵
1. de Boer SA,
2. Heerspink HJL,
3. Juárez Orozco LE,
4. et al
. Effect of linagliptin on pulse wave velocity in early type 2 diabetes: A randomized, double-blind, controlled 26-week trial (RELEASE). Diabetes Obes Metab2017;19:1147-54. doi:10.1111/dom.12925. pmid:28244635
OpenUrl CrossRef PubMed
↵
1. Bandholm T,
2. Thorborg K,
3. Ardern CL,
4. Christensen R,
5. Henriksen M
. Writing up your clinical trial report for a scientific journal: the REPORT trial guide for effective and transparent research reporting without spin. Br J Sports Med2022;56:683-91. doi:10.1136/bjsports-2021-105058. pmid:35193854
OpenUrl Abstract/FREE Full Text
↵
1. Bauchner H,
2. Golub RM,
3. Fontanarosa PB
. Reporting and Interpretation of Randomized Clinical Trials. JAMA2019;322:732-5. doi:10.1001/jama.2019.12056. pmid:31454026
OpenUrl CrossRef PubMed
↵
1. Howard BV,
2. Roman MJ,
3. Devereux RB,
4. et al
. Effect of lower targets for blood pressure and LDL cholesterol on atherosclerosis in diabetes: the SANDS randomized trial. JAMA2008;299:1678-89. doi:10.1001/jama.299.14.1678. pmid:18398080
OpenUrl CrossRef PubMed Web of Science
↵
1. Schuster Bruce C,
2. Brhlikova P,
3. Heath J,
4. McGettigan P
. The use of validated and nonvalidated surrogate endpoints in two European Medicines Agency expedited approval pathways: A cross-sectional study of products authorised 2011-2018. PLoS Med2019;16:e1002873. doi:10.1371/journal.pmed.1002873 pmid:31504034
OpenUrl CrossRef PubMed
↵
1. Hey SP,
2. Kesselheim AS,
3. Patel P,
4. Mehrotra P,
5. Powers JH 3rd.
. US Food and Drug Administration Recommendations on the Use of Surrogate Measures as End Points in New Anti-infective Drug Approvals. JAMA Intern Med2020;180:131-8. doi:10.1001/jamainternmed.2019.5451. pmid:31710344
OpenUrl CrossRef PubMed
↵
1. Block GA,
2. Bushinsky DA,
3. Cheng S,
4. et al
. Effect of Etelcalcetide vs Cinacalcet on Serum Parathyroid Hormone in Patients Receiving Hemodialysis With Secondary Hyperparathyroidism: A Randomized Clinical Trial. JAMA2017;317:156-64. doi:10.1001/jama.2016.19468. pmid:28097356
OpenUrl CrossRef PubMed
↵
1. Ettehad D,
2. Emdin CA,
3. Kiran A,
4. et al
. Blood pressure lowering for prevention of cardiovascular disease and death: a systematic review and meta-analysis. Lancet2016;387:957-67. doi:10.1016/S0140-6736(15)01225-8. pmid:26724178
OpenUrl CrossRef PubMed
↵
1. Buyse M,
2. Molenberghs G,
3. Burzykowski T,
4. Renard D,
5. Geys H
. The validation of surrogate endpoints in meta-analyses of randomized experiments. Biostatistics2000;1:49-67. doi:10.1093/biostatistics/1.1.49. pmid:12933525
OpenUrl CrossRef PubMed
↵
1. Buyse M,
2. Saad ED,
3. Burzykowski T,
4. Regan MM,
5. Sweeney CS
. Surrogacy Beyond Prognosis: The Importance of “Trial-Level” Surrogacy. Oncologist2022;27:266-71. doi:10.1093/oncolo/oyac006. pmid:35380717
OpenUrl CrossRef PubMed
↵
1. Alonso A,
2. Van der Elst W,
3. Molenberghs G,
4. Buyse M,
5. Burzykowski T
. On the relationship between the causal-inference and meta-analytic paradigms for the validation of surrogate endpoints. Biometrics2015;71:15-24. doi:10.1111/biom.12245. pmid:25274284
OpenUrl CrossRef PubMed
↵
1. Alonso A,
2. Van der Elst W,
3. Molenberghs G,
4. Buyse M,
5. Burzykowski T
. An information-theoretic approach for the evaluation of surrogate endpoints based on causal inference. Biometrics2016;72:669-77. doi:10.1111/biom.12483. pmid:26864244
OpenUrl CrossRef PubMed
↵
1. Bujkiewicz S,
2. Jackson D,
3. Thompson JR,
4. et al
. Bivariate network meta-analysis for surrogate endpoint evaluation. Stat Med2019;38:3322-41. doi:10.1002/sim.8187 pmid:31131475
OpenUrl CrossRef PubMed
↵
1. Papanikos T,
2. Thompson JR,
3. Abrams KR,
4. et al
. Bayesian hierarchical meta-analytic methods for modeling surrogate relationships that vary across treatment classes using aggregate data. Stat Med2020;39:1103-24. doi:10.1002/sim.8465 pmid:31990083
OpenUrl CrossRef PubMed
↵
1. Prentice RL
. Surrogate endpoints in clinical trials: definition and operational criteria. Stat Med1989;8:431-40. doi:10.1002/sim.4780080407. pmid:2727467
OpenUrl CrossRef PubMed Web of Science
↵
1. Weir CJ,
2. Taylor RS
. Informed decision-making: Statistical methodology for surrogacy evaluation and its role in licensing and reimbursement assessments. Pharm Stat2022;21:740-56. doi:10.1002/pst.2219. pmid:35819121
OpenUrl CrossRef PubMed
↵
1. Weir CJ,
2. Walley RJ
. Statistical evaluation of biomarkers as surrogate endpoints: a literature review. Stat Med2006;25:183-203. doi:10.1002/sim.2319. pmid:16252272
OpenUrl CrossRef PubMed Web of Science
↵
1. Molenberghs G,
2. Burzykowski T,
3. Alonso A,
4. et al
. The meta-analytic framework for the evaluation of surrogate endpoints in clinical trials. J Stat Plan Inference2008;138:432-49. doi:10.1016/j.jspi.2007.06.005.
OpenUrl CrossRef
↵
Institute for Quality and Efficiency in Health Care (IQWiG). Validity of surrogate endpoints in oncology: Executive summary of rapid report A10-05, Version 1.1. Institute for Quality and Efficiency in Health Care: Executive Summaries. 2005.
↵
1. Lassere MN,
2. Johnson KR,
3. Schiff M,
4. Rees D
. Is blood pressure reduction a valid surrogate endpoint for stroke prevention? An analysis incorporating a systematic review of randomised controlled trials, a by-trial weighted errors-in-variables regression, the surrogate threshold effect (STE) and the Biomarker-Surrogacy (BioSurrogate) Evaluation Schema (BSES). BMC Med Res Methodol2012;12:27. doi:10.1186/1471-2288-12-27. pmid:22409774
OpenUrl CrossRef PubMed
↵
1. Boissel JP,
2. Collet JP,
3. Moleur P,
4. Haugh M
. Surrogate endpoints: a basis for a rational approach. Eur J Clin Pharmacol1992;43:235-44. doi:10.1007/BF02333016. pmid:1425885
OpenUrl CrossRef PubMed Web of Science
↵
1. Xie W,
2. Halabi S,
3. Tierney JF,
4. et al
. A Systematic Review and Recommendation for Reporting of Surrogate Endpoint Evaluation Using Meta-analyses. JNCI Cancer Spectr2019;3:pkz002. doi:10.1093/jncics/pkz002. pmid:31360890
OpenUrl CrossRef PubMed
↵
1. la Cour JL,
2. Brok J,
3. Gøtzsche PC
. Inconsistent reporting of surrogate outcomes in randomised clinical trials: cohort study. BMJ2010;341:c3653. doi:10.1136/bmj.c3653. pmid:20719823
OpenUrl Abstract/FREE Full Text
↵
1. Buyse M,
2. Molenberghs G,
3. Paoletti X,
4. et al
. Statistical evaluation of surrogate endpoints with examples from cancer clinical trials. Biom J2016;58:104-32. doi:10.1002/bimj.201400049. pmid:25682941
OpenUrl CrossRef PubMed
↵
1. Daniels MJ,
2. Hughes MD
. Meta-analysis for the evaluation of potential surrogate markers. Stat Med1997;16:1965-82. doi:10.1002/(SICI)1097-0258(19970915)16:17<1965::AID-SIM630>3.0.CO;2-M. pmid:9304767
OpenUrl CrossRef PubMed Web of Science
↵
1. Frangakis CE,
2. Rubin DB
. Principal stratification in causal inference. Biometrics2002;58:21-9. doi:10.1111/j.0006-341X.2002.00021.x. pmid:11890317
OpenUrl CrossRef PubMed Web of Science
↵
1. Tibaldi F,
2. Abrahantes JC,
3. Molenberghs G,
4. et al
. Simplified hierarchical linear models for the evaluation of surrogate endpoints. J Stat Comput Simul2003;73:643-58. doi:10.1080/0094965031000062177.
OpenUrl CrossRef
↵
1. Burzykowski T,
2. Buyse M
. Surrogate threshold effect: an alternative measure for meta-analytic surrogate endpoint validation. Pharm Stat2006;5:173-86. doi:10.1002/pst.207. pmid:17080751
OpenUrl CrossRef PubMed Web of Science
↵
1. Alonso A,
2. Molenberghs G
. Surrogate marker evaluation from an information theory perspective. Biometrics2007;63:180-6. doi:10.1111/j.1541-0420.2006.00634.x. pmid:17447943
OpenUrl CrossRef PubMed Web of Science
↵
1. Belin L,
2. Tan A,
3. De Rycke Y,
4. Dechartres A
. Progression-free survival as a surrogate for overall survival in oncology trials: a methodological systematic review. Br J Cancer2020;122:1707-14. doi:10.1038/s41416-020-0805-y. pmid:32214230
OpenUrl CrossRef PubMed
↵
1. Haase CL,
2. Lopes S,
3. Olsen AH,
4. Satylganova A,
5. Schnecke V,
6. McEwan P
. Weight loss and risk reduction of obesity-related outcomes in 0.5 million people: evidence from a UK primary care database. Int J Obes (Lond)2021;45:1249-58. doi:10.1038/s41366-021-00788-4. pmid:33658682
OpenUrl CrossRef PubMed
↵
1. Winter JE,
2. MacInnis RJ,
3. Wattanapenpaiboon N,
4. Nowson CA
. BMI and all-cause mortality in older adults: a meta-analysis. Am J Clin Nutr2014;99:875-90. doi:10.3945/ajcn.113.068122. pmid:24452240
OpenUrl Abstract/FREE Full Text
↵
1. Shah SJ,
2. Voors AA,
3. McMurray JJV,
4. et al
. Effect of Neladenoson Bialanate on Exercise Capacity Among Patients With Heart Failure With Preserved Ejection Fraction: A Randomized Clinical Trial. JAMA2019;321:2101-12. doi:10.1001/jama.2019.6717. pmid:31162568
OpenUrl CrossRef PubMed
↵
1. Cook JA,
2. Julious SA,
3. Sones W,
4. et al
. DELTA² guidance on choosing the target difference and undertaking and reporting the sample size calculation for a randomised controlled trial. BMJ2018;363:k3750. doi:10.1136/bmj.k3750. pmid:30560792
OpenUrl FREE Full Text
↵
1. Tasanarong A,
2. Hutayanon P,
3. Piyayotai D
. Urinary Neutrophil Gelatinase-Associated Lipocalin predicts the severity of contrast-induced acute kidney injury in chronic kidney disease patients undergoing elective coronary procedures. BMC Nephrol2013;14:270. doi:10.1186/1471-2369-14-270. pmid:24305547
OpenUrl CrossRef PubMed
↵
1. van de Putte LB,
2. Atkins C,
3. Malaise M,
4. et al
. Efficacy and safety of adalimumab as monotherapy in patients with rheumatoid arthritis for whom previous disease modifying antirheumatic drug treatment has failed. Ann Rheum Dis2004;63:508-16. doi:10.1136/ard.2003.013052. pmid:15082480
OpenUrl Abstract/FREE Full Text
↵
1. Parekh DJ,
2. Reis IM,
3. Castle EP,
4. et al
. Robot-assisted radical cystectomy versus open radical cystectomy in patients with bladder cancer (RAZOR): an open-label, randomised, phase 3, non-inferiority trial. Lancet2018;391:2525-36. doi:10.1016/S0140-6736(18)30996-6. pmid:29976469
OpenUrl CrossRef PubMed
↵
1. Cordoba G,
2. Schwartz L,
3. Woloshin S,
4. Bae H,
5. Gøtzsche PC
. Definition, reporting, and interpretation of composite outcomes in clinical trials: systematic review. BMJ2010;341:c3920. doi:10.1136/bmj.c3920. pmid:20719825
OpenUrl Abstract/FREE Full Text
↵
1. Lim E,
2. Brown A,
3. Helmy A,
4. Mussa S,
5. Altman DG
. Composite outcomes in cardiovascular research: a survey of randomized trials. Ann Intern Med2008;149:612-7. doi:10.7326/0003-4819-149-9-200811040-00004. pmid:18981486
OpenUrl CrossRef PubMed Web of Science
↵
1. Wells GA,
2. Tugwell P,
3. Tomasson G,
4. et al
. Composite outcomes at OMERACT: Multi-outcome domains and composite outcome domains. Semin Arthritis Rheum2021;51:1370-7. doi:10.1016/j.semarthrit.2021.11.001. pmid:34863558
OpenUrl CrossRef PubMed
↵
1. Hochman M,
2. McCormick D
. Endpoint selection and relative (versus absolute) risk reporting in published medication trials. J Gen Intern Med2011;26:1246-52. doi:10.1007/s11606-011-1813-7. pmid:21842324
OpenUrl CrossRef PubMed
↵
1. Delgado A,
2. Guddati AK
. Clinical endpoints in oncology - a primer. Am J Cancer Res2021;11:1121-31.pmid:33948349
OpenUrl PubMed
↵
1. Sorbello A,
2. Komo S,
3. Valappil T,
4. Nambiar S
. Registration trials of antibacterial drugs for the treatment of nosocomial pneumonia. Clin Infect Dis2010;51(Suppl 1):S36-41. doi:10.1086/653038. pmid:20597669
OpenUrl CrossRef PubMed Web of Science
↵
1. Timsit JF,
2. de Kraker MEA,
3. Sommer H,
4. et al.,
5. COMBACTE-NET consortium
. Appropriate endpoints for evaluation of new antibiotic therapies for severe infections: a perspective from COMBACTE’s STAT-Net. Intensive Care Med2017;43:1002-12. doi:10.1007/s00134-017-4802-4. pmid:28466147
OpenUrl CrossRef PubMed
↵
1. Heindel W,
2. Weigel S,
3. Gerß J,
4. et al.,
5. TOSYMA Screening Trial Study Group
. Digital breast tomosynthesis plus synthesised mammography versus digital screening mammography for the detection of invasive breast cancer (TOSYMA): a multicentre, open-label, randomised, controlled, superiority trial. Lancet Oncol2022;23:601-11. doi:10.1016/S1470-2045(22)00194-2. pmid:35427470
OpenUrl CrossRef PubMed
↵
1. Baechle C,
2. Scherler W,
3. Lang A,
4. Filla T,
5. Kuss O
. Is HbA1c a valid surrogate for mortality in type 2 diabetes? Evidence from a meta-analysis of randomized trials. Acta Diabetol2022;59:1257-63. doi:10.1007/s00592-022-01887-y. pmid:35534726
OpenUrl CrossRef PubMed
↵
1. Ciani O,
2. Piepoli M,
3. Smart N,
4. et al
. Validation of Exercise Capacity as a Surrogate Endpoint in Exercise-Based Rehabilitation for Heart Failure: A Meta-Analysis of Randomized Controlled Trials. JACC Heart Fail2018;6:596-604. doi:10.1016/j.jchf.2018.03.017. pmid:29957192
OpenUrl Abstract/FREE Full Text
↵
1. Green E,
2. Yothers G,
3. Sargent DJ
. Surrogate endpoint validation: statistical elegance versus clinical relevance. Stat Methods Med Res2008;17:477-86. doi:10.1177/0962280207081863. pmid:18285438
OpenUrl CrossRef PubMed
↵
1. Vanderweele TJ
. Surrogate measures and consistent surrogates. Biometrics2013;69:561-9. doi:10.1111/biom.12071. pmid:24073861
OpenUrl CrossRef PubMed Web of Science
↵
1. Fleming TR,
2. DeMets DL
. Surrogate end points in clinical trials: are we being misled?Ann Intern Med1996;125:605-13. doi:10.7326/0003-4819-125-7-199610010-00011. pmid:8815760
OpenUrl CrossRef PubMed Web of Science
↵
1. Cohen D
. Rosiglitazone: what went wrong?BMJ2010;341:c4848. doi:10.1136/bmj.c4848 pmid:20819889
OpenUrl FREE Full Text
↵
1. Kumar S,
2. Rajkumar SV
. Surrogate endpoints in randomised controlled trials: a reality check. Lancet2019;394:281-3. doi:10.1016/S0140-6736(19)31711-8. pmid:31354129
OpenUrl CrossRef PubMed
↵
1. Kumar SK,
2. Harrison SJ,
3. Cavo M,
4. et al
. Venetoclax or placebo in combination with bortezomib and dexamethasone in patients with relapsed or refractory multiple myeloma (BELLINI): a randomised, double-blind, multicentre, phase 3 trial. Lancet Oncol2020;21:1630-42. doi:10.1016/S1470-2045(20)30525-8. pmid:33129376
OpenUrl CrossRef PubMed
↵
1. Dimopoulos M,
2. Quach H,
3. Mateos M-V,
4. et al
. Carfilzomib, dexamethasone, and daratumumab versus carfilzomib and dexamethasone for patients with relapsed or refractory multiple myeloma (CANDOR): results from a randomised, multicentre, open-label, phase 3 study. Lancet2020;396:186-97. doi:10.1016/S0140-6736(20)30734-0. pmid:32682484
OpenUrl CrossRef PubMed
↵
1. Savarirayan R,
2. Tofts L,
3. Irving M,
4. et al
. Once-daily, subcutaneous vosoritide therapy in children with achondroplasia: a randomised, double-blind, phase 3, placebo-controlled, multicentre trial. Lancet2020;396:684-92. doi:10.1016/S0140-6736(20)31541-5. pmid:32891212
OpenUrl CrossRef PubMed
↵
1. Bikdeli B,
2. Punnanithinont N,
3. Akram Y,
4. et al
. Two Decades of Cardiovascular Trials With Primary Surrogate Endpoints: 1990-2011. J Am Heart Assoc2017;6:e005285. doi:10.1161/JAHA.116.005285. pmid:28325713
OpenUrl Abstract/FREE Full Text
↵
1. Kim C,
2. Prasad V
. Strength of Validation for Surrogate End Points Used in the US Food and Drug Administration’s Approval of Oncology Drugs. Mayo Clin Proc2016;S0025-6196(16)00125-7. doi:10.1016/j.mayocp.2016.02.012. pmid:27236424
OpenUrl CrossRef PubMed
Koshizaka M, Ishikawa K, Ishikawa T, et al; PRIME-V Study Investigators. Efficacy and safety of ipragliflozin and metformin for visceral fat reduction in patients with type 2 diabetes receiving treatment with dipeptidyl peptidase-4 inhibitors in Japan: a study protocol for a prospective, multicentre, blinded-endpoint phase IV randomised controlled trial (PRIME-V study). BMJ Open 2017;7:e015766. doi:10.1136/bmjopen-2016-015766. pmid:28490565
NIHR School for Primary Care Research. What is Patient and Public Involvement and Public Engagement? 2023. https://www.spcr.nihr.ac.uk/PPI.
Geißler J, Isham E, Hickey G, Ballard C, Corbett A, Lubbert C. Patient involvement in clinical trials. Commun Med (Lond) 2022;2:94. doi:10.1038/s43856-022-00156-x. pmid:35903184
Selman LE, Clement C, Douglas M, et al. Patient and public involvement in randomised clinical trials: a mixed-methods study of a clinical trials unit to identify good practice, barriers and facilitators. Trials 2021;22:735. doi:10.1186/s13063-021-05701-y. pmid:34688304
Davies A, Ormel I, Bernier A, et al. A rapid review of community engagement and informed consent processes for adaptive platform trials and alternative design trials for public health emergencies [version 1; peer review: 2 approved]. Wellcome Open Res 2023;8:194. doi:10.12688/wellcomeopenres.19318.1. pmid:37654739
Molyneux S, Bull S. Consent and Community Engagement in Diverse Research Contexts: Reviewing and Developing Research and Practice: Participants in the Community Engagement and Consent Workshop, Kilifi, Kenya, March 2011. J Empir Res Hum Res Ethics 2013;8:1-18. doi:10.1525/jer.2013.8.4.1.
Zulu JM, Sandøy IF, Moland KM, Musonda P, Munsaka E, Blystad A. The challenge of community engagement and informed consent in rural Zambia: an example from a pilot study. BMC Med Ethics 2019;20:45. doi:10.1186/s12910-019-0382-x. pmid:31272489
World Medical Association. World Medical Association Declaration of Helsinki: ethical principles for medical research involving human subjects. JAMA 2013;310:2191-4. doi:10.1001/jama.2013.281053. pmid:24141714
Council for International Organizations of Medical Sciences. International ethical guidelines for biomedical research involving human subjects. Bull Med Ethics 2002;(182):17-23. pmid:14983848
Kahrass H, Bossert S, Schürmann C, Strech D. Details of risk-benefit communication in informed consent documents for phase I/II trials. Clin Trials 2021;18:71-80. doi:10.1177/1740774520971770. pmid:33231107
Daly TP. Informing consent to antibodies in Alzheimer’s disease. BMJ 2023;383:2350. doi:10.1136/bmj.p2350. pmid:37821123
Shipa M, Embleton-Thirsk A, Parvaz M, et al; BEAT-LUPUS Investigators. Effectiveness of Belimumab After Rituximab in Systemic Lupus Erythematosus: A Randomized Controlled Trial. Ann Intern Med 2021;174:1647-57. doi:10.7326/M21-2078. pmid:34698499
Bockstal V, Shukarev G, McLean C, et al. First-in-human study to evaluate safety, tolerability, and immunogenicity of heterologous regimens using the multivalent filovirus vaccines Ad26.Filo and MVA-BN-Filo administered in different sequences and schedules: A randomized, controlled study. PLoS One 2022;17:e0274906. doi:10.1371/journal.pone.0274906. pmid:36197845
Taylor R, Ciani O. Response to Wang et al. Quality of individual participant data (IPD) meta-analyses reporting might need improving but leveraging access to IPD is a more fundamental problem. BMJ 2021. Accessed 5 November 2023. https://www.bmj.com/content/373/bmj.n736/rr-0
Danchev V, Min Y, Borghi J, Baiocchi M, Ioannidis JPA. Evaluation of Data Sharing After Implementation of the International Committee of Medical Journal Editors Data Sharing Statement Requirement. JAMA Netw Open 2021;4:e2033972. doi:10.1001/jamanetworkopen.2020.33972. pmid:33507256
Esmail LC, Kapp P, Assi R, et al. Sharing of Individual Patient-Level Data by Trialists of Randomized Clinical Trials of Pharmacological Treatments for COVID-19. JAMA 2023;329:1695-7. doi:10.1001/jama.2023.4590. pmid:37010865
Butte AJ. Trials and Tribulations-11 Reasons Why We Need to Promote Clinical Trials Data Sharing. JAMA Netw Open 2021;4:e2035043. doi:10.1001/jamanetworkopen.2020.35043. pmid:33507252
