Abstract

Aim/Background: After the introduction of sorafenib in the treatment of advanced hepatocellular carcinoma (HCC), different studies tried to evaluate whether other systemic therapies can improve survival. To provide a comprehensive indirect treatment comparison of efficacy and safety of novel drugs, a network meta-analysis (NMA) of phase III randomized controlled trials was performed. Methods: After pertinent literature search up to November 1, 2016, 6 studies were eligible for the analysis including 4,812 individual patients with advanced HCC: 2,454 received sorafenib, 577 received brivanib, 530 received sunitinib, 514 received linifanib, 358 received sorafenib + erlotinib and 379 received placebo. Frequentist NMA was used to compare treatments within a single analytical framework. Results: NMA showed that sorafenib alone, regardless of combination with erlotinib, and linifanib provide a significant survival advantage over placebo (p < 0.05) but without any significant difference between each other. Conversely, all regimens significantly ameliorate progression-free survival versus placebo (p < 0.05). The rank order of efficacy was: sorafenib ± erlotinib, linifanib, brivanib, sunitinib, and placebo. Sorafenib ± erlotinib was the regimen with the fewest number of adverse events that required discontinuation of treatment, whereas linifanib and brivanib resulted in the most adverse events. The risk-benefit summary identified one cluster of therapies with a similar balance between efficacy and safety which included sorafenib alone or in combination with erlotinib, having, at the same time, the highest efficacy and safety. Conclusions: Sorafenib remains the best systemic treatment for advanced HCC; linifanib also resulted in survival advantages over placebo but with a lower safety profile.

© 2017 S. Karger AG, Basel

Introduction

Sorafenib is an oral multikinase inhibitor with effective antiangiogenic and proapoptotic activity and marked antitumoral effect in patents with hepatocellular carcinoma (HCC) [1,2]. The improvement of survival in the patients treated with sorafenib is supported by the evidence derived from 2 phase III randomized controlled trials (RCTs) [3,4]. According to the European Association for the Study of Liver (EASL) and the American Association for the Study of Liver Disease (AASLD) guidelines, sorafenib is currently considered the standard systemic therapy for patients with advanced stage HCC and well-preserved liver function [5,6]. Based on the increasing knowledge of the large number of molecular pathways involved in HCC growth, numerous target-specific and/or broad-spectrum tyrosine kinase inhibitors (TKIs) have been developed and tested in different phase II and III trials in HCC [7]. These trials came after the licensing of sorafenib in 2007 attempting to test if further improvement in patient survival, with better safety profile, is possible.

In presence of new antitumoral agents, it would be useful to summarize and rank efficacy and safety of such different treatments. This is particularly important when more agents are approved for the same indication (e.g., first-line treatment), but not all direct comparisons have been carried out. Even though only one agent is approved for first-line therapy for HCC at present, a comparison of efficacy and safety of the different molecules tested so far, among each other, could also be relevant to highlight the roles and the importance of different mechanisms of action in this particular clinical setting. In fact, the various agents tested in first line, have different antiangiogenic and antiproliferative potential and target receptors, although they nearly all belong to the TKI class. To produce such a comparison of multiple different agents each with another, a multiarm trial would be necessary, but unfortunately it is unlikely that such level of evidence will ever become available, and to our knowledge there are no plans to conduct such a trial [8,9]. This lack of direct comparison can be analytically compensated by adopting a multiple (mixed) treatment indirect comparison through development of a network meta-analysis (NMA) [8,9]. NMA is a technique to meta-analyze networks of trials comparing two or more treatments at the same time. It has advantages over conventional pairwise meta-analysis, as this technique borrows strength from indirect evidence to gain certainty about all treatment comparisons, allowing estimations of comparative effects that have not been investigated in the head-to-head RCTs. Using hierarchical model, all direct and indirect comparisons are taken into account to attain at a single consistent estimate of the effect of all included treatments based on all included studies.

The aim of this study was thus to perform an NMA that takes into account efficacy and safety of different systemic therapies in the treatment of advanced HCC, as defined by the EASL/AASLD guidelines, which were tested in phase III RCTs. Such treatment network of available highest quality studies can offer the exceptional opportunity to compare the evidence available for each treatment with all others in the setting of advanced HCC.

Methods

Literature Search Strategy

A systematic search of PubMed, ISI Web of Knowledge and Cochrane Central Register of Controlled Trials (CENTRAL) databases was performed for articles published until 1 November 2016 relevant to the use of sorafenib and different pharmacotherapies in the systemic treatment of advanced HCC. PubMed database was searched using the following key words: (“sorafenib” [Supplementary Concept] OR “sorafenib” [All Fields]) AND (“carcinoma, hepatocellular” [MeSH Terms] OR “hepatocellular carcinoma”) with the flag “randomized controlled trial” on. ISI Web of Knowledge was searched using the following terms: TS = (“hepatocellular carcinoma” AND “sorafenib”) AND (TS = randomized controlled trial). CENTRAL database was searched using the following key words: “hepatocellular carcinoma”:ti,ab,kw and “sorafenib” and “randomized controlled trial”:pt (Word variations have been searched). The search was limited to human subjects and focused on RCTs. No language limitations were imposed. Additional articles were searched in the International Clinical Trials Registry Platform (ICTRP) and Google Scholar without retrieving any additional available study. The reference list of identified RCTs was also manually searched until no additional eligible trials could be identified. Results from the three electronic databases were compared to obtain a single list of individual articles for screening.

Literature Screening

One author (A.C.) conducted the initial screening of the generated single list of individual articles to identify all relevant studies meeting our eligibility criteria (see online suppl. Fig. 1; for all online suppl. material, see www.karger.com/doi/10.1159/000481314). Reviews, letters to the editors, study protocols, and congress abstracts were excluded at this step. Then, 2 independent reviewers (F.P. and F.M.) assessed the studies for relevance, inclusion, and methodological quality. Inclusion criteria were the following: (1) phase III RCT as study design; (2) study intervention in which the treatment arm received Sorafenib-only therapy and the control arms received other pharmacotherapies or sorafenib-based therapies or placebo as first-line treatment; (3) study population consisted of adult patients with advanced HCC with an Eastern Cooperative Oncology Group (ECOG) performance status score of 2 or less; adequate hepatic, hematological, and renal function as defined in each of retrieved studies and a life expectancy of 12 weeks or more; (4) primary outcome measure was overall survival, and the secondary outcomes included progression-free survival (PFS) and adverse events (AEs). For the last level of screening, the full text was obtained for relevant articles as well as for any citations for which a decision could not be made on the basis of the abstract. In case the same trial appeared in different publications, only the most informative article was retained to avoid duplicate information. Such linked studies were identified on the grounds of authorship, institutions, design, length of follow-up, and study populations. If additional data or results were needed, the corresponding author of each report was contacted by e-mail. Any discrepancies in inclusion were resolved by discussion between the reviewers and a third investigator (A.C.).

Data Collection and Statistical Analysis

Data derived from retrieved/included studies were collected in an Excel file. Two investigators (F.P. and F.M.) extracted in a piloted form the following data from each study: first author, year of publication, number of patients for each treatment arm, performance status, number of patients with macroscopic tumour vascular invasion, number of patients having extrahepatic disease, overall survival, time to progression, and AEs. The main outcome measures were patient survival (efficacy) and discontinuation due to drug-related AEs (safety) whereas PFS was considered as secondary outcome measure. Numbers of events, at-risk patients, and censored patients for survival and time-to-progression were estimated from Kaplan-Meier survival curves in the interval of 0-12 months, using the appropriate formulas proposed by Tierney et al. [10]. The total person-time at risk and the mortality or progression ×100 person-months were properly estimated [10]. Since studies were all RCTs, the methodological quality of each study was assessed using the RoB 2.0 tool [11]. The principal summary measure was hazard ratio (HR) for time-dependent efficacy events and risk ratio for safety events.

Frequentist NMA was used to compare available treatment strategies within a single analytical framework [12]. NMA was performed with Stata 14.0 (StataCorp., College Station, TX, USA) using the “network” command and routines developed by Chaimani and White [12,13]. The relative ranking probability of each treatment was estimated, and rankograms were used to estimate the hierarchy of each intervention in terms of both efficacy (patient survival and PFS) and safety (discontinuation due to AEs). The mean rank values and the surface under the cumulative ranking (SUCRA) curves were also calculated [9,12,13]. SUCRA is a relative ranking measure that derives from using the rank probabilities from rankograms (the cumulative rank curves) and that accounts for the uncertainty in treatment order, that is, accounts both for the location and the variance of all relative treatment effects. In analogy to the area under a receiver operating characteristic curve, the SUCRA value can range from a minimum of 0 (worst) to a maximum of 1 (best). Thus, the larger the SUCRA value, the better the rank for a specific treatment among the n available treatment options. As supplementary information, clustered ranking plot was constructed using SUCRA values for efficacy (overall survival) and safety (discontinuation due to AE) outcomes to obtain information on meaningful groups of treatments that maximize benefits [13]. The network was also checked for inconsistency to assess when the direct comparison of one treatment versus another one, derived from one or more studies included in the NMA, conflicts with evidence drawn via the indirect comparison estimated through the NMA; however, few direct comparisons were present to formally detect inconsistency in the present model [9,12,13]. The restricted maximum likelihood method was used to estimate heterogeneity, assuming a common variance estimate across different comparisons for each single outcome considered [12]. The extent of heterogeneity in each network analysis was evaluated and measured through a common heterogeneity variance for the network (tau [τ]) considering the range of expected treatment estimates (HRs and risk ratios). Values of τ between 0.1 and 0.5 were considered reasonable, between 0.5 and 1.0 were considered fairly high, and larger than 1.0 represented extreme heterogeneity. The NMA was reported in accordance with the modified PRISMA guidelines for network meta-analyses [9].

Results

Literature Search Results

A total of 89 unique articles were identified using our search criteria for screening (see online suppl. Fig. 1). Following assessment by the adopted exclusion criteria, 83 articles were excluded and 6 phase III RCTs were retained for NMA (Table 1) [3,4,14,15,16,17]. The complete list of excluded studies and the causes of exclusion are reported in the online supplementary Table 1. The risk of bias according to the RoB 2.0 tool was graded as “low” for most of domains (see online suppl. Fig. 2). The NMA was developed on a total of 4,812 individual patients with HCC fulfilling eligibility criteria. Of these, 2,454 received sorafenib (51.0%), 577 received brivanib (12.0%), 530 received sunitinib (11.0%), 514 received linifanib (10.7%), 358 received sorafenib in combination with erlotinib (7.4%) and 379 received placebo (7.9%). The structure of the NMA is reported in Figure 1, and further details of the included literature are provided in the online supplementary Table 2.

Table 1

Fig. 1

NMA of Efficacy

Efficacy from NMA was first investigated having placebo as comparator (Fig. 2). As expected, a significant benefit in terms of overall survival was estimated for sorafenib (HR = 0.69; 95% CI: 0.58-0.84); however, a benefit was also indirectly estimated for the adoption of sorafenib + erlotinib (HR = 0.67; 95% CI: 0.51-0.88) and for linifanib therapy (HR = 0.73; 95% CI: 0.57-0.93) over placebo. On the contrary, the indirect estimate of overall survival benefit for brivanib and for sunitinib against placebo was found not statistically significant (95% CI of the HR included the value of 1). When instead assessing the PFS, all the regimens tested were found to be significantly associated with a longer PFS than placebo; the heterogeneity for both overall survival and PFS was deemed to be low (τ <0.1) and no inconsistency was observed (Fig. 2).

Fig. 2

NMA results for each possible combination between different pharmacotherapies are reported in Table 2. This “league table” reports, in terms of HR of overall survival and PFS, the relative benefit (if any) obtainable from one therapy against all the other considered. In terms of overall survival, Sorafenib alone or in combination with erlotinib was found superior to sunitinib (HR = 0.80; 95% CI: 0.69-0.94 and HR = 0.77; 95% CI: 0.60-0.99), whereas no differences in comparison with linifanib or brivanib were observed. Furthermore, the addition of erlotinib to sorafenib did not result in any gain in patient survival in comparison to sorafenib alone. In terms of PFS, sorafenib either alone or in combination with erlotinib, and linifanib were found superior to sunitinib (HR = 0.81; 95% CI: 0.65-0.99; 95% CI: 0.71-0.91 and = 0.80; 95% CI: 0.67-0.97, respectively). Again, the addition of erlotinib to sorafenib did not modify the PFS (Table 2).

Table 2

The probabilities of being the best treatment in terms of overall survival are reported in the rankogram of Figure 3a. The combination of erlotinib with sorafenib was the most probable best approach considering overall survival, ranking first in 58.5% of hypothetical cases, whereas sorafenib was estimated to be the best approach in 21.3% of hypothetical cases. Either one or the other therapeutic regimen fitted as the first or the second best treatment in about 75% of hypothetical cases. The mean rank values for survival, among the 6 possible therapies, were as follows: sorafenib + erlotinib = 1.8 (SUCRA = 0.847), sorafenib alone = 2.1 (SUCRA = 0.788), linifanib = 2.8 (SUCRA = 0.636), brivanib = 3.7 (SUCRA = 0.469), sunitinib = 4.9 (SUCRA = 0.225), placebo = 5.8 (SUCRA = 0.036).

Fig. 3

NMA of Safety

The rankogram of Figure 3b reports results from NMA when considering discontinuation due to drug-related AEs. The safest treatment is the one showing the lowest incidence of discontinuation and was obviously estimated for placebo. Sorafenib without or with the addition of erlotinib showed a very good safety profile, being the first two most tolerated active drugs. On the contrary, brivanib and linifanib were the therapies most frequently associated with discontinuation of treatment due to AEs. The mean safety rank values, among the 6 possible therapies, were as follows: placebo = 1.0 (SUCRA = 0.995, least associated with discontinuation due to AEs), sorafenib + erlotinib = 2.6 (SUCRA = 0.685), sorafenib alone = 3.1 (SUCRA = 0.579), sunitinib = 3.5 (SUCRA = 0. 509), brivanib = 5.4 (SUCRA = 0.126) and linifanib = 5.5 (SUCRA = 0.105). The heterogeneity for safety was deemed to be low (τ <0.1) and, again, no inconsistency was observed.

Detailed mean rank and SUCRA values for grade 3-4 diarrhea, hand-foot syndrome, hypertension, vomiting, and liver dysfunction are reported in the online supplementary Table 3. Finally, by plotting SUCRA values for efficacy (overall survival) concurrently with the values for safety (discontinuation), a cluster of treatments was identified having similar risk-benefit balance, which included sorafenib ± erlotinib with the highest efficacy and safety being present at the same time (Fig. 4).

Fig. 4

Discussion

The present NMA compared all drugs and placebo that have been tested in first-line phase III RCTs up to end of 2016. Through the present analytical methodology, it was possible to produce new original information, which confirmed sorafenib as the most efficacious molecule, and claiming it as the only one licensed drug for HCC in Western countries. No other drugs can be recommended to be used in off-label setting, and sorafenib should remain the mainstay of systemic treatment for advanced HCC.

After the pivotal registration trials that showed a prolongation of survival under sorafenib in comparison to placebo [2,3] and before the current study, a number of drugs have been tested against sorafenib [19]. None of these alternative drugs produced an overall survival benefit meeting the criteria for the Food and Drug Administration for approval (namely, that the upper value of the 95% CI of the hazard ratio does not exceed 1.08) [20]. At the same time, it is not clear if these drugs would have been better than placebo, if directly tested. The present study showed that a hypothetical direct comparison with placebo would likely have not produced a significant superiority for brivanib and sunitinib (HR spanning over the unit value; Table 2; Fig. 2), although all compounds produced improvement in the PFS compared to placebo (Table 2; Fig. 2). Our NMA also allowed to confirm in a very large number of subjects (more than 2,000 individuals treated with sorafenib much larger than those included in the registration trials) that sorafenib-based therapies are not only better than placebo but that are associated with the higher efficacy than any other tested molecule (Table 2). The combination of sorafenib with erlotinib produced minimally and negligible better efficacy to sorafenib alone, at the expenses of greater toxicity and costs. Reasons for failure of new drugs to be superior, or at least noninferior, to sorafenib were hypothesized to be various, including the lack of understanding of critical drivers of tumor progression/dissemination, liver toxicity, flaws in trial design, or marginal antitumoral potency [20]. The present analysis provided a comprehensive overview of efficacy and safety of all drugs tested in RCTs so far.

The current results can be viewed in the light of the current knowledge of mechanisms of actions of drugs compared in this NMA. In fact, although all the tested drugs are TKI and almost all of them include an antiangiogenic effect, they all have slightly different targets and different modalities of action (suppl. Table 4), and even when acting on the same final target they possess different potencies in vitro. In the setting of HCC, that is most often superimposed on a background of liver cirrhosis, involving a severe intra-hepatic and splanchnic vascular and angiogenic derangements, maximal potency (either due to the mechanism of action or drug dosage) does not always translate into maximal efficacy due to the potential risk of AEs [20,21]. For instance, sunitinib was claimed to be a more potent antiangiogenic compound than sorafenib based on the zebrafish model (see online suppl. Table 4) [22], but does not target as efficiently the antiproliferative pathway as sorafenib and, differently from the expectations, the final clinical effect at the tested dosage was a more toxic drug than sorafenib, with a trend to poorer, rather than better survival (Fig. 3) [15].

The potential of some drugs, such as brivanib or sunitinib, to inhibit fibroblast growth factor (FGF) concurrently with vascular endothelial growth factor (VEGF) raised high hopes that the inhibition of FGF could maintain the antiangiogenic effect in tumor clones that had become resistant to VEGF inhibition, thereby improving the length of tumor response [23]. Unfortunately, the assessment of clinical effects did not confirm the hypothesis derived from cellular data [14,15]. Probably, it is not just a matter of the affected tumor pathways, but also of the potency of the molecule to achieve one or another action and of the effect of the drug on background liver function, portal hypertension as well as on other organs, which are hard to predict from laboratory data. Unfortunately, an established and reliable model of HCC in the setting of liver cirrhosis still does not exist in the preclinical animal setting, stressing the need for well-performed and large enough phase II trials, at best with a randomized design [24]. In the absence of the possibility to accurately predict in vivo results from in vitro findings, the only way to test the effect of promising molecules are clinical trials. Thus, the results of the current meta-analysis could provide insights about the role of potential pathways, to speculate if a new future molecule could be associated with a theoretical higher or lower likelihood of clinical effectiveness, hence deserving or not to be tested in future clinical trials.

The present study is subject to limitations inherent to all NMAs in terms of both quality of the included studies, heterogeneity of studied populations, and transitivity assumption. Defects in the design, management, and analysis of RCTs can lead to bias, raising concerns about the strength of the included findings. The trials included in the present NMA were, however, all phase III trials, with similar inclusion criteria and low risk of bias for most of domains analyzed (RoB 2.0, see online suppl. Fig. 2). Despite similar inclusion criteria, a nonnegligible heterogeneity can be perceived from observing clinical baseline characteristics of the 6 different RCTs (Table 1). This may question the NMA assumption of transitivity. However, the τ values from NMA of efficacy and safety grossly remained below the threshold value of 0.1 (Table 2; Fig. 3) which provides additional assurance about validity of our analysis. In addition, due to the “star-structure” of the present NMA, it was not possible to get any inconsistency (that is when the direct comparisons conflict with evidence drawn via the indirect comparisons estimated through the NMA), and this feature represent a further strength of the present study [9]. All these aspects confirm the goodness of the present model but also advocate that present results should be considered in light of these uncertainties intrinsic to the methodology of the NMA.

A particular mention should be made of the role of hepatitis status in determining response to sorafenib therapy. It has recently been shown that the impact of sorafenib is largely confined to the patients who are HCV positive [25,26]. The present NMA cannot account for this specific aspect to be ascertained, and the investigation of this issue would require subgroup efficacy and safety data segregated for HCV positive and negative patients. However, by analyzing the specific percentages of HCV-positive patients in each of the trial involved, we observed no statistical difference between HCV prevalence in the sorafenib group and in the comparator group (relative risk: 0.98; 95% CI: 0.86-1.11). On one side, this can reassure for the transitivity of the NMA with regard to HCV status, but on the other side we need to point out that the viral status should be considered when developing further meta-analytic approaches, such as the present NMA, with the possible access to completed trial data [27].

A final mention should be made of the new ongoing trials. In particular, the phase III REFLECT trial, involving 954 patients with unresectable HCC (January 25, 2017), achieved the primary end-point (overall survival) in the lenvatinib group, not inferior to that with sorafenib and with a longer, statistically superior, PFS [27,28]. At the time the present NMA was developed, results from the REFLECT trial were still not published, but considering the primary end-point (overall survival) it would be unlikely that lenvatinib inclusion would change present results (Fig. 4). However, it would be of interest to rank this drug in terms of PFS against placebo and the other antiangiogenic therapies included in the current NMA. Thus, further high-level evidence is keenly awaited.

In conclusion, the results of the present NMA support the use of sorafenib as the mainstay systemic treatment for advanced HCC. All other molecular compounds can improve PFS but only linifanib demonstrated improvement in the overall survival in these patients. The balance between efficacy and safety favors sorafenib as the optimal choice for management of HCC. Our analysis can be useful in development of new molecules for treatment of HCC.

Disclosure Statement

F. Piscaglia and L. Bolondi are on the speakers' bureau of and received grants from Bayer.

Financial Support

No commercial interest, financial source, or material support to disclose.

Author Contributions

A. Cucchetti and F. Mazzotti had the original idea and planned the study; F. Piscaglia and F. Mazzotti performed the literature review; A. Cucchetti and B. Djulbegovic performed the network meta-analysis; F. Piscaglia and A. Cucchetti interpreted the results and wrote the manuscript; A.D. Pinna and L. Bolondi provided expertise for important intellectual content of the manuscript.



Related Articles:

References

Author Contacts

Dr. Alessandro Cucchetti S. Orsola - Malpighi Hospital, Department of Medical and Surgical Sciences - DIMEC, Alma Mater Studiorum - University of Bologna Via Pietro Albertoni 15, IT-40138 Bologna (Italy) E-Mail aleqko@libero.it

Article / Publication Details

Published online: October 19, 2017

Issue release date: November 2017 Number of Print Pages: 12

Number of Figures: 4

Number of Tables: 2 ISSN: 2235-1795 (Print)

eISSN: 1664-5553 (Online) For additional information: https://www.karger.com/LIC

Copyright / Drug Dosage / Disclaimer

Copyright: All rights reserved. No part of this publication may be translated into other languages, reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying, recording, microcopying, or by any information storage and retrieval system, without permission in writing from the publisher.

Drug Dosage: The authors and the publisher have exerted every effort to ensure that drug selection and dosage set forth in this text are in accord with current recommendations and practice at the time of publication. However, in view of ongoing research, changes in government regulations, and the constant flow of information relating to drug therapy and drug reactions, the reader is urged to check the package insert for each drug for any changes in indications and dosage and for added warnings and precautions. This is particularly important when the recommended agent is a new and/or infrequently employed drug.

Disclaimer: The statements, opinions and data contained in this publication are solely those of the individual authors and contributors and not of the publishers and the editor(s). The appearance of advertisements or/and product references in the publication is not a warranty, endorsement, or approval of the products or services advertised or of their effectiveness, quality or safety. The publisher and the editor(s) disclaim responsibility for any injury to persons or property resulting from any ideas, methods, instructions or products referred to in the content or advertisements.