The study aimed to compare the effectiveness of various exercise modalities in improving exercise performance, echocardiographic parameters and quality of life in patients with heart failure with preserved ejection fraction.
MethodsA systematic review over 7 databases was conducted up to May 2024. Randomised controlled trials analysing the efficacy of any exercise modality in isolation were included. For each variable, a Frequentist and Bayesian Network Meta-Analysis were conducted. Mean differences between groups with 95% confidence and credible intervals were calculated for each outcome. Quality of the evidence was assessed with GRADE, being very low for every outcome.
Results12 RCTs were included, providing data from 602 patients. The Frequentist model showed effectiveness for: CT on peak VO2, VO2 at ventilatory threshold (VT) and E/é ratio; high-intensity interval training (HIIT) on peak VO2, VE/VCO2 slope, cardiopulmonary exercise testing (CPET) time and E/é; inspiratory muscle training (IMT) on peak VO2, VE/VCO2 slope, RER and 6-minute walking test (6MWT); moderate-intensity aerobic training (MIAT) on peak VO2, CPET time, peak power output (PPO), 6MWT and E/é ratio; and Tai Chi on peak VO2. The Bayesian model only supports these findings for CT on peak VO2; IMT on peak VO2; MIAT on CPET time, PPO, and 6MWT; and Tai Chi on peak VO2.
ConclusionsConsidering both models, the evidence, albeit of very low quality, supports the effectiveness of various exercise modalities except HIIT in exercise capacity parameters (peak VO2, CPET time, PPO and 6MWT). There is an effect over VO2 at VT, VE/VCO2 slope, RER, LVEF, E/é and MLHFQ, but only supported by the Frequentist model.
PROSPERO registration numberCRD42024560028
AbbreviationsA late mitral wave early mitral wave Consensus on exercise reporting template Confidence interval Cardiopulmonary exercise testing Credible interval Frequency, intensity, time, type. Grading recommendations Assessment, Development, and Evaluation. Heart failure with preserved ejection fraction High-intensity interval training Heart rate reserve Inspiratory muscle training Kansas City cardiomyopathy questionnaire Left atrial diameter Left ventricle ejection fraction Left ventricle mass index Left ventricle volume index Medical subject headings Moderate intensity aerobic training Minnesota living with heart failure questionnaire Network Meta-Analysis Quality of life Peak power output Randomized controlled trial Respiratory exchange ratio Risk of Bias short-form 36 health survey Mean difference Usual care Ventilatory threshold CO2 equivalent 6-minute walking test
Heart failure (HF) is a clinical syndrome characterised by high morbidity and mortality, with a median incidence of 3.20 cases per 1000 person-years and a prevalence of 1–2% of adults in Europe.1–3 HF also significantly impairs physical function and quality of life, with an annual healthcare costs reaching $284.17 billion globally.4 Traditionally it has been classified by left ventricle ejection fraction (LVEF), with worse pharmacological response in patients with >50% LVEF.5 However, differences have also been observed in mortality, sex, cardiac remodelling and systemic biomarkers.6
With an ageing population, HF with preserved ejection fraction (HFpEF) has become the most common type of HF, and its incidence is expected to increase. Its typical multimorbid nature and lack of established pharmacological treatments present significant challenges for healthcare systems.5 Current clinical Practice Guidelines.5,7 recommend focusing on comorbidity management in patients with HFpEF. Among non-pharmacological interventions, exercise programmes effectively control cardiovascular risk factors and key comorbidities such as obesity, diabetes, obstructive pulmonary disease, anaemia.5,7,8 and atrial fibrillation,9,10 making them a valuable adjunctive option.
Previous reviews.11–13 have analysed the comparative effectivity of different exercise modalities on HFpEF, showing positive results especially for peak VO2, which is correlated with fewer recurrent hospitalisations.14 However, some exercise modalities studied were not included in the assessment of effectiveness across all exercise performance, echocardiographic parameters, and quality of life (QoL) variables. In addition, they all included studies in which the baseline LVEF mean with its standard deviation exceeded the 50% threshold, which could imply a selection bias, affecting the implementation of the results.
In light of the prognostic differences among phenotypes, the studied interventions, and the growing body of scientific literature, we conducted a network meta-analysis (NMA). The objective was to analyse the effectiveness of different exercise modalities on exercise performance, echocardiographic parameters and quality of life in HFpEF patients. An additional objective was to compare the exercise modalities to analyse which ones were superior for each outcome.
MethodsThis systematic review followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses Protocols (PRISMA-P) extension for network meta-analyses.15 and the Cochrane Handbook for Systematic Reviews of Interventions.16 The protocol of this review was registered in PROSPERO. Literature search, studies selection, data extraction, the risk of bias and quality of the evidence assessment were conducted by two independent authors and discrepancies were solved by a third author.
Data sources and search strategyTwo authors independently searched in PubMed, SCOPUS, Web of Science, EMBASE, CINAHL Complete, Physiotherapy Evidence Database May 2024. Gray literature search was limited to OpenGray database and snowball technique. The search strategy and keywords used are shown in Supplementary material A. The study duplicates were managed using Covidence (https://www.covidence.org) and Rayyan (https://www.rayyan.ai/) software.
To complete the search, the reference lists of previous studies, abstracts and documents were all analysed. All the searches were conducted following the PICOS framework:
-Population: patients HFpEF.
-Intervention: any exercise modality.
-Comparators: any comparator, as it could be used for the network.
-Outcomes: peak VO2, VO2 at ventilatory threshold (VT), peak power output (PPO), cardiopulmonary exercise testing (CPET) time, peak heart rate (HR), respiratory exchange ratio (RER), CO2 equivalent (VE/VCO2) slope, and 6-minute walking test (6MWT). early mitral wave (E), late mitral wave (A), E/A ratio, early diastolic velocity of the mitral annulus (e′), E/e′ ratio, LVEF, left ventricular volume index (LVVI), left ventricular mass index (LVMI), left atrial volume index (LAVI), left atrial diameter (LAD), stroke volume, cardiac output, flow-mediated dilatation, and global longitudinal strain (GLS). Minnesota Living with Heart Failure Questionnaire (MLHFQ), Kansas City Cardiomyopathy Questionnaire (KCCQ), Short-Form 36 Health Survey (SF-36).
The inclusion criteria were as follows: (1) Randomised Controlled Trials (RCTs) published in scientific journals; (2) including patients with HFpEF, defined as having a mean left ventricular ejection fraction (LVEF) ≥ 50%, with standard deviation not crossing this threshold in either the intervention or control group; (3) included at least one treatment group receiving any kind of exercise; (4) compared to other interventions or usual care (UC); (5) studies that provided statistical data on the outcome variables described above.
The exclusion criteria were: (1) inadequate study design, (2) other pathologies, (3) absence of an exercise group, (4) studies including interventions with no voluntary actions such as functional electrical stimulation, (5) studies in which the treatment arm was not exercised in isolation.
Data extractionMicrosoft Excel was used for data collection. The following data were collected from the included studies: (1) general characteristics. (2) Characteristics of participants from each group. (3) Characteristics of the intervention. (4) Characteristics of the comparison group. (5) Data for each variable of interest: type of variable assessed, instrument used to measure it, point in time of the assessment, quantitative data for each outcome extracted in means and standard deviations (SD). When a study reported interquartile ranges and medians, these were converted into means and SD.17 When no information was provided in an article, we contacted the authors via e-mail. The Consensus on Exercise Reporting Template (CERT) was used to summarise the characteristics of the exercise interventions.18 The assessment process for CERT is described in Supplementary material B. When a study provided repeated measurements over time, only assessments at the end of the intervention were included in the analysis.
Risk of bias and quality of evidenceRisk of bias of the RCTs was assessed using the Cochrane Collaboration tool for assessing the risk of bias (RoB 2).19 The degree of concordance between both assessors in the RoB 2 was evaluated with Gwet’s AC2 coefficient.20
Meanwhile, quality of the evidence was assessed following the Grading recommendations Assessment, Development, and Evaluation (GRADE) tool and its checklist adaptation to NMA.21
The assessment process for the use of both instruments is described in Supplementary Material C and D.
Statistical analysisTo enhance robustness and address sensitivity of the results, two NMA were performed for each variable following a Frequentist and a Bayesian approach in RStudio (RStudio, PBC, Boston, MA, USA), employing R version 4.1.1. The “netmeta” package was used for the Frequentist approach (https://CRAN.R-project.org/package=netmeta) and the “gemtc” for the Bayesian model (https://CRAN.R-project.org/package=gemtc), following the guide developed by Harrer et al.22 Exercise modalities were categorised as combined training (CT) (aerobic training plus resistance training), high-intensity aerobic training (HIIT), inspiratory muscle training (IMT), moderate-intensity aerobic training (MIAT) and Tai Chi. Direct comparisons between exercise modalities were represented with a network graph in which the nodes represented the type of intervention. For each treatment comparison, the effect size was calculated in mean differences (MD) plus their 95% confidence intervals (CI) or credible intervals (CrI). Statistical significance threshold was set at p < 0.05. A modality was considered supported when both models estimate pointed the same direction and their 95% CIs and CrIs did not cross the null value. For each NMA were presented using forest plots to illustrate differences between each exercise modality and usual care (UC).
Further explanation on transitivity, consistency, publication bias, meta-regression, treatment ranking and sensitivity analyses are described in Supplementary Material E.
ResultsA total of 12 studies were included in the quantitative analysis.23–34 The selection process is summarised in the PRISMA flow diagram in Supplementary material F.
The included studies were carried out between 2011 and 2021, reporting data on 602 patients, with a mean age of 64.23 years, of whom 397 were women (65.94%). The type of exercise interventions found were HIIT,23,26,31 IMT,32,33 MIAT,23–26,28,30,31 CT27,29,34 and Tai Chi.34 The weekly frequency of the treatment was 3 days for HIIT, 2–5 days for MIAT, 2–7 days for CT, 7 days for IMT, and 2 days for Tai Chi. Other characteristics of the included articles are summarised in Table 1.
Characteristics of the included studies.
Data expressed as means (standard deviations), medians (interquartile range) or absolute (relative frequencies). late mitral flow, CG: control group, COPD: chronic obstructive pulmonary disease, CPET: cardiopulmonary exercise testing, DBP: diastolic blood pressure, E: early mitral flow, e´: early diastolic velocity of the mitral annulus, FITT: frequency/intensity/time/type, HR: Heart rate, HRR: heart rate reserve, HIIT: high intensity interval training, IG: intervention group, IMT: inspiratory muscle training, LAVI: left atrial volume index, LVEF: left ventricle ejection fraction, MIAT: moderate intensity aerobic training, min: minutes, MLHFQ: Minnesota living with heart failure questionnaire, NR: not reported, QoL: quality of life, RER: respiratory exchange ratio, RT: resistance training, SBP: systolic blood pressure, VE: minute ventilation, VCO2: carbon dioxide production, VO2 peak: peak oxygen uptake, VT1: oxygen uptake at first ventilatory threshold, 6MWT: 6-minute walking test.
With respect to exercise reporting, CERT complementation ranged between 26 and 74%. Only one study reported >60% (13 items) of the checklist. Fig. 1 shows the compliance for each item per individual study.
Supplementary material G shows the assessment for risk of bias. No studies were classified as low risk, only 1 study was classified as some concerns, and 11 studies were rated as high risk of bias. The concordance between revisers was good (AC2=0.86).
GRADE assessment showed very low quality for the efficacy of every intervention when compared to UC across all the NMA outcomes. Furthermore, when comparing the efficacy of each intervention respect to the others, the quality of evidence was also very low across the studied variables (Supplementary material H).
Exercise performance variablesCompared to UC: CT showed an effect on peak VO2, but only in the Frequentist model for VO2 at VT; IMT showed an effect on peak VO2, but only in the Frequentist model for VE/VCO2 slope, RER and 6MWT; MIAT showed an effect on peak CPET time, PPO, and 6MWT, but only in the Frequentist approach for peak VO2. Tai Chi showed an effect on peak VO2. HIIT only reached statistical significance in the Frequentist model for peak VO2, VE/VCO2 slope and CPET. No intervention reached had a significant effect on peak HR. According to the rank table, CT was most probable first-line treatment for VO2 at VT (P-score=0.77) and PPO (P-score=0.75), HIIT was the most probable first-line treatment for CPET time (P-score=0.81), IMT was the most probable first-line treatment for VE/VCO2 slope (P-score=0.99), RER (P-score=0.96) and 6MWT (P-score=0.80), and Tai Chi was the most probable first-line treatment for VO2 peak (P-score=0.92). Regarding between-intervention comparisons, HIIT was superior to IMT for VE/VCO2 slope. Only the Frequentist model supported that: for peak VO2 CT was superior to MIAT and to HIIT, and Tai Chi was superior to MIAT. For VE/VCO2 slope, MIAT was superior to IMT. Finally, IMT was superior to MIAT for RER. Table 2 shows the effect estimates across all comparisons. Fig. 2 shows the forest plot for each exercise modality on peak VO2. Forest plots for the rest of variables can be found in Supplementary material I.
Effect estimates for every possible treatment comparison.
The upper part of each table contains the effect estimate for direct comparisons, the lower part contains the effect estimate from the Network-Meta-Analysis. For each comparison, the upper cell presents the Frequentist estimate and the lower cell the Bayesian estimate. Green cells represent statistical significance for that specific comparison. A: late mitral flow, CT: combined training, CPET: cardiopulmonary exercise testing, E: early mitral flow, e´: early diastolic velocity of the mitral annulus, HR: Heart rate, HIIT: high intensity interval training, IMT: inspiratory muscle training, LAVI: left atrial volume index, LVEF: left ventricle ejection fraction, MIAT: moderate intensity aerobic training, MLHFQ: Minnesota living with heart failure questionnaire, RER: respiratory exchange ratio, RT: resistance training, VE: minute ventilation, VCO2: carbon dioxide production, VO2 peak: peak oxygen uptake, VT1: oxygen uptake at first ventilatory threshold, 6MWT: 6-minute walking test.
Heterogeneity was found for peak VO2, VO2 at VT, HR peak, and especially for 6MWT. Meanwhile, heterogeneity was lower for RER, PPO and VE/VCO2. Inconsistency was not significant for peak VO2 and VE/VCO2 and could not be calculated for the rest of the variables (Supplementary material J). Net and node-splitting functions revealed no inconsistency for VO₂ peak and the VE/VCO₂ slope; for the remaining variables, inconsistency could not be assessed due to the absence of loops. (Supplementary material K).
The leverage versus residual deviance plots indicated good model fit for all variables, except for VO₂ peak, where one study (16) was identified as contributing to a poorer model fit (Supplementary material L). Sensitivity analysis for peak VO2 showed that, when compared to UC: CT, HIIT, IMT, MIAT and Tai Chi were still effective (Supplementary material M). Heterogeneity significantly decreased and inconsistency remained non-significant (Supplementary material J).
Meta-regression for assessing if age, sex differences or weeks of treatment could influence the magnitude of the estimates was non-significant for peak VO2.
Comparison-adjusted funnel plot and Egger test suggested there is no publication bias. (Supplementary material N).
Echocardiographic variablesWhen compared to UC, only the Frequentist model supported: CT showed an effect on E/é ratio, HIIT showed an effect on E/é, IMT showed an effect on LVEF, MIAT showed an effect on E/é ratio. Tai Chi did not show any effect across the studied outcomes. None of the interventions reached statistical significance for LAVI, A, E, é and E/A. According to the rank table, HIIT was the most probable first-line treatment for LVEF (P-score=0.83) and LAVI (P-score=0.71), and Tai Chi was the most probable first-line treatment for E/é. Additionally, only the Frequentist model supported that: for LVEF, HIIT was superior to IMT and IMT was superior to MIAT and Tai Chi. Table 2 shows the effect across all comparisons. Forest plots for the outcomes can be found in Supplementary material I.
Heterogeneity was identified for é, LAVI and E. Meanwhile, heterogeneity was lower for LVEF, E/A ratio, E/é ratio and A. Inconsistency was found for é; however, Q value significantly decreased after assuming a full design-by-treatment interaction random effects model. Inconsistency was non-significant for LAVI and E/é, but was not calculated for E, LVED, E/A and A (Supplementary material J.). The net and node splitting showed no inconsistency for LAVI, E/é ratio and é, while for the rest of the variables they were not performed as there were no loops (Supplementary material K).
Leverage versus residual deviance plots showed good model fit for all the variables, so sensitivity analysis was not performed (Supplementary material L). Meta-regression for assessing if age, sex differences or weeks of treatment could influence the magnitude of the estimates was non-significant for LAVI and E/é. Comparisons adjusted funnel plots and Egger tests suggested there is no publication bias (Supplementary material N).
Quality of lifeWhen compared to UC, MIAT was the only intervention that showed an effect on MLHFQ, but only in the Frequentist approach (Supplementary material I). Heterogeneity was detected and inconsistency was not calculated (Supplementary material J). Net and node splitting were not performed as there were no loops in the Network (Supplementary material K). There were no significant differences in the effect estimates between interventions (Table 2).
Leverage versus residual deviance plot showed good model fit for MLHFQ, so sensitivity analysis was not performed (Supplementary material L).
Comparisons adjusted funnel plot and Egger test suggest no publication bias (Supplementary material N).
DiscussionOur results align with previous research for the study of peak VO2, as we found the effectiveness of CT,11,12 HIIT,11–13 MIAT.11,12 and IMT.12 Contrary to Gomes-Neto,12 we observed an effect of Tai Chi on this outcome, possibly due to the inclusion of only one study meeting the LVEF ≥ 50% criterion, resulting in low precision of the observed estimate.29 For CPET time and PPO, we found MIAT to be effective, consistent with the findings of the only previous NMA that addressed these variables.11 Conversely, we did not observe any effect on peak HR, nor HIIT across the outcomes. This could be due to our inclusion criteria, different studies and more comparators included in this NMA. For 6MWT, our results coincide with Cavero-Redondo.11 for the effectiveness of MIAT, with the novel finding of an effect on IMT on this outcome. Regarding RER, our NMA is the first to obtain an effect of IMT. Only one previous systematic review addressed this outcome, finding no differences when comparing HIIT to MIAT, which is also in line with our findings.35
Concerning the echocardiographic variables, our findings are in line with Cavero-Redondo.11 for the effectiveness of HIIT on E/é ratio and the absence of statistical significance on E/A; however we did not find any effect on é. Surprisingly, we found a negative effect for IMT on LVEF. As only one study reported information for this exercise modality, this should be further studied in future trials.32 Regarding LAVI, this is the first time an NMA has analysed it. Although we did not find a significant effect, consistent with a previous RS comparing HIIT and MIAT,35 there is a tendency for HIIT to reduce LAVI; future investigations may corroborate or refute this. The lack of an observed effect on é, E and A, may be attributed to differences in the included studies, low statistical power to detect differences, or the absence of it. The better results for the exercise performance variables in contrast to the echocardiographic parameters could be explained as myocardial muscle has a limited capacity for improvement when compared to skeletal muscle,36 furthermore, it seems that improvements in exercise performance are driven primarily by peripheral (arteriovenous O2 difference) rather than central adaptations.37,38 Longer interventions may be required to achieve better results for these outcomes.
Regarding QoL, our results contrast with those of Walters,13 as we found MIAT but not HIIT to be effective on MLHFQ. The more conservative nature of our findings compared to previous reviews may be attributed to: (1) the inclusion of only studies where the intervention was exercise in isolation, (2) the exclusion of articles involving HF patients with LVEF < 50%, and (3) the exclusion of duplicate studies with the same NCT, retaining only the most recent one, (19) thus lowering the precision of the effect estimates and not reaching statistical significance.
Regarding transitivity assumption, we did not find age, sex or duration of the intervention to influence on some estimates. In addition, patients ‘characteristics and exclusion criteria were similarly distributed across studies (Table 1), except for one study reporting lower BMI,29 other including patients with cancer.34 and two including patients with NYHA class IV.32,33 Furthermore, for several comparisons we could not perform inconsistency tests to assess incoherence, or there was substantial heterogeneity (Supplementary material J). Thus, we decided to downgrade for indirectness and inconsistency in some of the comparisons.21,39,40
Current guidelines.5,7 recommend exercise to improve exercise capacity, functional status, QoL and reduce hospitalisations with a grade IA recommendation for HF patients in general, although there are no exact recommendations regarding which kind of exercise modality is the best for HFpEF.41 The main finding of this NMA was the effectiveness of CT, IMT, MIAT, and Tai Chi, particularly in improving exercise performance, as its impairment is a hallmark of this population. Our findings support the implementation of MIAT in clinical practice, as it appears to have the greatest overall impact across the evaluated outcomes, while incorporating resistance training yields superior improvements in peak VO2 (Table 2). From a practical point of view, progressing from lower to higher intensities could have an impact across the continuum of metabolic adaptations such as mitochondrial oxidative function.42–45 and neurohormonal effects.46,47 Adding IMT seems plausible to further improve these results.
To our knowledge, this is the first review to present results from both models on the comparative effectiveness of all exercise interventions in isolation across the spectrum of exercise capacity, echocardiography and QoL. Notably, this is the first NMA providing information about the effect of CT on VO2 at VT, HIIT on VE/VCO2 slope, IMT on 6MWT, RER and LVEF, MIAT on MLHFQ and Tai Chi for peak VO2. Our findings offer a guide for selecting specific exercise interventions to target each parameter in HFpEF. Additionally, this review presents a characterisation of the FITT and exercise monitoring parameters summarised in the in Table 1, which are also relevant for clinicians. Another insight is that exercise interventions in RCTs including HFpEF patients are moderately reported according to CERT, hence limiting the replicability of the interventions. This underscores the need for scientific journals to mandate more detailed reporting of exercise interventions in future RCTs, particularly for the less frequently reported items outlined in Fig. 1.
Although our findings have clinical relevance, they are not without limitations. First, the quality of the evidence was very low for all the outcomes evaluated, partially due to the high risk of bias for almost all the included studies. Second, the heterogeneity observed in some of the estimates limits the applicability of the findings. Third, the low number of studies included in the NMA impacts the precision of the results, reduces the power for some analyses, and prevents the testing of overall consistency within the networks. Fourth, we were only able to perform meta-regressions for a limited number of variables. Although no significant effects modifiers were identified, it may have impacted the transitivity assumption. Fifth, the large number of variables analysed for exercise performance and echocardiography parameters implies more likelihood of false positives, which is why we highly recommend taking into account the convergence of both approaches. Sixth, in one study we had to estimate the mean and SD.32 Seventh, although we intended to minimise the presence of patients with an EF < 50% in the analysis, we cannot assume their absence because we didn’t have individual data.
ConclusionsConsidering both approaches very low quality of the evidence indicates an effect of CT, IMT, and Tai Chi on peak VO2, as well as an impact of MIAT on other exercise performance parameters. Furthermore, CT is superior to MIAT on peak VO2, and HIIT is superior to IMT on VE/VCO2 slope. Only the Frequentist model showed an effect for CT, HIIT and MIAT on exercise performance and echocardiography parameters, and an effect of MIAT on quality of life. Only the Frequentist model supported the superiority of CT over HIIT for peak VO2 and Tai Chi over MIAT; for VE/VCO2 slope, MIAT is superior to IMT, and for RER IMT is superior to MIAT. Future RCTs should aim to improve methodological quality and include bigger sample sizes (especially for HIIT and Tai-Chi) to enhance precision and overall quality of the evidence.
FundingFunding for open access charge: Universidad de Málaga / CBUA
Supplementary materialSupplementary material A. Search strategy in the bibliographical search.
Supplementary material B. Consensus on Exercise Reporting Template description.
Supplementary material C. Risk of bias assessment description.
Supplementary material D. Grading recommendations Assessment, Development, and Evaluation (GRADE) tool assessment description.
Supplementary material E. Sensitivity analysis, meta-regression and publication bias analyses.
Supplementary material F. PRISMA flow diagram of the bibliographical search.
Supplementary material G. Risk of bias assessment.
Supplementary material H. GRADE assessment for each Network Meta-Analysis.
Supplementary material I. Frequentist and Bayesian Forest plots for each Network Meta-Analysis.
Supplementary material J. Heterogeneity and Inconsistency for each Network Meta-Analysis
Supplementary material K. Network graph for various Network Meta-Analysis
Supplementary material L. Leverage versus residual deviance plots for each Network Meta-Analysis.
Supplementary material M. Sensitivity analysis for peak VO2.
Supplementary material N. Comparison adjusted funnel plots for each Network Meta-Analysis.
Authors’ ContributionsConcept and design (AICV, JMHJ), acquisition (JMHJ, IJFA), analysis or interpretation of the data (JMHJ, IVJFA, CGC), drafting of the manuscript (JMHJ, AICV, IJFA, CGC), critical revision of the manuscript (IJFA, AICV), statistical analysis (JMHJ), and supervision (AICV).
The authors declare no competing interest
We would like to express our gratitude to Alvaro Reina Varona and José Fierro Marrero from CSEU La Salle (Madrid), and to Rubén Fernández Matías from Universidad de Valencia for their technical expertise on this manuscript. We would also like to thank Sara Maldonado Martín in the name of Wake Forest University for kindly providing access to individual patient data, which was essential for conducting this NMA.









