Out of 1,385 citations, eight studies were included. Based on the interventions and comparators, studies were analyzed in 3 different groups. GB compared with placebo (3 studies, n = 185) showed minimal improvement of pain (MD = 0.22 units, 95% CI [−0.5 to 0.07] I 2 = 0%; GRADE: very low). Three studies compared PG with other types of analgesic medication (n = 332) and showed greater improvement in the other analgesic group (MD = 0.42 units, 95% CI [0.20 to 0.64] I 2 = 0; GRADE: very low). Studies using PG as an adjuvant (n = 423) were not pooled due to heterogeneity, but the largest of them showed no benefit of adding PG to tapentadol. There were no deaths or hospitalizations reported. Compared with placebo, the following adverse events were more commonly reported with GB: dizziness-(RR = 1.99, 95% CI [1.17 to 3.37], I 2 = 49); fatigue (RR = 1.85, 95% CI [1.12 to 3.05], I 2 = 0); difficulties with mentation (RR = 3.34, 95% CI [1.54 to 7.25], I 2 = 0); and visual disturbances (RR = 5.72, 95% CI [1.94 to 16.91], I 2 = 0). The number needed to harm with 95% CI for dizziness, fatigue, difficulties with mentation, and visual disturbances were 7 (4 to 30), 8 (4 to 44), 6 (4 to 15), and 6 (4 to 13) respectively. The GRADE evidence quality was noted to be very low for dizziness and fatigue, low for difficulties with mentation, and moderate for visual disturbances. Functional and emotional improvements were reported by few studies and showed no significant improvements.

Electronic databases of MEDLINE, EMBASE, and Cochrane were searched from their inception until December 20 th , 2016. We included randomized control trials reporting the use of gabapentinoids for the treatment of CLBP of >3 months duration, in adult patients. Study selection and data extraction was performed independently by paired reviewers. Outcomes were guided by Initiative on Methods, Measurement and Pain Assessment in Clinical Trials guidelines, with pain relief and safety as the primary outcomes. Meta-analyses were performed for outcomes reported in 3 or more studies. Outcomes were reported as mean differences (MDs) or risk ratios (RRs) with their corresponding 95% confidence intervals (CIs), and I 2 in percentage representing the percentage variability in effect estimates that could be explained by heterogeneity. GRADE (Grading of Recommendations Assessment, Development, and Evaluation) was used to assess the quality of evidence.

Chronic Low Back Pain (CLBP) is very common, with a lifetime prevalence between 51% and 80%. In majority, it is nonspecific in nature and multifactorial in etiology. Pregabalin (PG) and Gabapentin (GB) are gabapentinoids that have demonstrated benefit in neuropathic pain conditions. Despite no clear rationale, they are increasingly used for nonspecific CLBP. They necessitate prolonged use and are associated with adverse effects and increased cost. Recent guidelines from the National Health Service (NHS), England, expressed concerns on their off-label use, in addition to the risk of misuse. We aimed to assess the effectiveness and safety of gabapentinoids in adult CLBP patients.

Chronic Low Back Pain (CLBP) is very common and is associated with significant patient burden and heath resource expenditure [ 1 – 3 ]. It is largely nonspecific in nature and in up to 85% of patients lacks a clear pathoanatomical diagnosis when present in isolation [ 1 – 4 ]. We have previously highlighted the etiological and treatment considerations for CLBP, along with the limitations within the existing evidence [ 5 ]. A large proportion of CLBP patients are treated with routine analgesic medications with unsatisfactory results leading to frequent exploration of second line options including gabapentinoids [ 6 , 7 ]. In particular, the use of gabapentin (GB) and pregabalin (PG) is made on the rationale of modulating the enhanced neurotransmission at the level of presynaptic receptors of the afferent neurons. Both of these medications primarily act on the α-2 delta-2 subunit of the voltage-dependent calcium channels [ 8 , 9 ] and can be considered to have very similar pharmacodynamic actions on pain and other symptoms. They are considered to be very effective for neuropathic pain (NP) conditions. Attempts at exploiting their therapeutic potential for other pain conditions have shown mixed results [ 10 , 11 ]. Use of gabapentinoids for CLBP requires slow titration to therapeutic doses and establishing maintenance on a long-term basis. With prolonged treatment, the potential gain over possible adverse effects and risks could become unclear [ 9 ]. There have been concerns over the excessive off-label use of GB, despite there being a clear lack of clinical studies [ 12 ], necessitating advisory guidelines by the National Health Services (NHS), United Kingdom on the risk of the misuse of gabapentinoids [ 13 ]. Our primary objectives were to assess the benefits of GB and PG in CLBP in decreasing pain and to examine the risk of adverse effects. Secondarily, we assessed the effects of PG and GB on the Initiative on Methods, Measurement and Pain Assessment in Clinical Trials (IMMPACT) outcomes [ 14 ]. The outcomes considered were physical and emotional functioning, participant ratings of global improvement and satisfaction with treatment, and participant disposition. Additionally, we attempted to assess whether the use of gabapentinoids selectively improve pain relief in patients with predominant neuropathic CLBP.

Data were pooled only if there are 3 or more studies contributing to an outcome domain. Our selection criteria allowed for a relatively homogeneous population of CLBP who tend to be approached similarly from a clinical situation. However, we recognized the potential for heterogeneity based on study interventions and comparator interventions. In view of these obvious sources of heterogeneity, we decided a priori to pool studies using PG or GB, either alone or in combination, separately. Extracted data were compiled and checked for accuracy using Microsoft Excel. RoB was assessed using a modified Cochrane RoB tool that is described below. For the primary analysis, we used a complete case analysis, as reported in individual studies. Sensitivity analyses for incomplete outcome data were performed. Analysis and synthesis was carried out using Review Manager (RevMan) [Computer program], Version 5.3. Copenhagen: The Nordic Cochrane Centre, The Cochrane Collaboration, 2014; and Microsoft Excel 2011 (Mac version). Based on the comparator and interventions, if we did not expect much between study variance, a fixed effects model was used for pooling. However, if we suspected between study variance, or in the presence of unexplained heterogeneity, a random effects model was chosen [ 18 ]. For crossover studies, we prioritized the results from a paired test. If not provided, results of unpaired tests were considered. If there was a potential for carryover effect, or if there is a significant drop out rate (>20%), the results from the first period only were considered [ 18 ]. Statistical heterogeneity was estimated using Cochrane’s Q test, with a threshold of p-value at 0.10, and the percentage variability in individual effect estimates was described by I 2 statistic [ 18 ]. Risk Ratio (RR), and mean difference (MD) or standardized mean differences (SMDs) as appropriate, were estimated along with their 95% confidence intervals (CI). We planned to report the findings in measures of absolute risk, if they were observed to be statistically significant. Rating of quality of evidence was done using GRADE approach, with a summary of findings (SOF) table.

A priori, we specified pain relief and safety (adverse effects) as our primary outcomes and others as secondary outcomes, and prioritized the use of intent to treat analysis. Pain relief expressed as both continuous and categorical outcomes, and at various time points, was extracted for all reported time points. For pooling, we considered the most common type and the longest duration of follow-up reported. A priori, we prioritized change scores over end scores for pooling analysis. Change scores are considered more efficient and powerful than comparison of final scores, as it removes a component of between-person variability from the analysis [ 18 ]. For pain relief expressed as continuous scores, we converted all study outcomes into a common 0–10 numerical rating scale, as it is commonly used and easy to interpret [ 14 ]. The approach to conversion into a common scale is shown in S2 Text . Safety was assessed by comparing the risk of serious adverse events causing death, hospitalisation, or study withdrawal. If unclear, we considered reporting the most commonly reported adverse effects. Due to the expected differences within measurement scales, secondary outcomes of improvement in physical and emotional functioning, and participant ratings of global improvement and satisfaction were not converted into a single common scale.

Additionally, we considered the domains for chronic pain studies as suggested by Moore et al. [ 20 ] and added the domains of outcome assessment time (12 weeks or more as low risk), outcome assessment threshold (>30% improvement in pain relief as low risk), and potential for publication bias based on the sample size threshold (>50 as low risk) to identify a trial as having the potential for publication bias based on low sample size. Trials with low sample size can increase the chances of erroneously large treatment effect sizes and indirectly contribute to publication bias [ 21 , 22 ].

RoB was assessed using the Cochrane RoB tool modified to capture the components of random sequence generation; allocation concealment; blinding of participants; blinding of outcome assessment; and analysis of incomplete outcome data. Further, we modified the response options of domains as “definitely yes,” “probably yes,” “probably no,” and “definitely no.” For each domain, the responses of “definitely yes” and “probably yes” categories were assigned a high RoB and those in the “probably no” and “definitely no” categories a low RoB[ 17 ]. Crossover studies were assessed for reasonable washout period [ 18 ]. No attempt was made to contact authors for clarification on the RoB items. Selective outcome reporting was judged based on the outcomes described in the methods section but not reported in the results section [ 19 ].

We searched the electronic databases of EMBASE, MEDLINE, and the Cochrane Central Registry of Controlled Trials (CENTRAL), from their inception until January 26 th , 2016. WHO clinical trial registry ( http://apps.who.int/trialsearch/Default.aspx ), and clinical trial registry ( https://clinicaltrials.gov/ ), were also searched to look for any registered studies, fulfilling our eligibility criteria, and crosschecked for their resulting publications. To be comprehensive, bibliographies of relevant reviews and selected studies were examined. Since performing the original search, we also repeated our search on December 20 th , 2016 to ensure that we have not missed any recent publications.

We included randomized controlled trials (RCTs) involving adult patients (>18 years of age) with predominant CLBP of 3 months or more, with or without leg pain. We did not have any language exclusions. Studies with mixed population of chronic pain were only included if they report outcomes separately for our study population of interest, or if at least 90% of the trial patients are >18 years with predominant CLBP. Studies were further screened for interventions and were included if they randomized patients to receive “PG” or “GB,” either “alone” or “in combination with other treatment,” and compared it with any active or inactive treatments.

NP was assessed using a screening questionnaire in 3 studies. Sakai et al. observed pain scores to decrease more with tramacet compared to PG in NP patients [ 32 ]. Baron et al. observed no differences in the components of neuropathic pain symptom inventory scores using PG plus TP in comparison to TP alone [ 28 ]. Whereas, Romano et al. observed that pain scores decreased significantly in patients of NP with PG as well as in combination with CX [ 31 ].

These are summarized in Tables 4 and 5 . All studies except Pota et al. had patients who were LTFU [ 30 ]. There were 5 studies that did include LTFU, even with >5% of their randomized sample, in their final analysis [ 25 – 27 , 31 , 32 ]. Functional improvement was observed in 5 studies using various scales [ 25 , 26 , 28 , 29 , 32 ]. The results indicate that there were improvements from the baseline in both treatment and control groups, without much difference between the groups. Emotional functioning was observed by 3 studies, but 2 studies reported the final scores, with no between-group differences [ 25 , 28 ]. Global improvement of change was reported as physician-reported by Atkinson et al. [ 25 ] and patient-reported by Baron et al. [ 28 ]. There were no between-group differences in studies with GB or PG, respectively.

There were no deaths or hospitalizations reported. The reasons for study withdrawal were not provided in all studies. All adverse effects reported in more than 1 study are summarized in Table 3 . Compared with placebo, the following adverse events were more commonly reported with GB: dizziness-(RR = 1.99, 95% CI [1.17 to 3.37], I 2 = 49); fatigue (RR = 1.85, 95% CI [1.12 to 3.05], I 2 = 0); difficulties with mentation (RR = 3.34, 95% CI [1.54 to 7.25], I 2 = 0); and visual disturbances (RR = 5.72, 95% CI [1.94 to 16.91], I 2 = 0) ( Fig 4 ). The GRADE quality of evidence was noted to be very low for dizziness and fatigue, low for difficulties with mentation, and moderate for visual disturbances ( Table 2 ). The resulting absolute risk increase (ARI) percentage and necessary number needed to harm (NNH) with 95% CI for dizziness, fatigue, mental difficulties, and visual disturbances were 14% and 7 (4 to 30), 13% and 8 (4 to 44), 16% and 6 (4 to 15), and 15% and 6 (4 to 13), respectively. With PG, dizziness was more common compared to the active comparator (RR = 2.70, 95% CI [1.25 to 5.83], I 2 = 0), with very low quality of evidence. The ARI% and NNH were 9% and 11(6 to 30).

Pain relief expressed in NRS or VAS scales were converted into a common scale of 0–10 NRS. Authors of 2 studies were successfully contacted to obtain final results of pain scores, as it was not clear in their reporting [ 25 , 32 ]. We were unable to use the change scores as many studies did not report their change in standard deviations (SDs), and imputing them based on another study or by using a correlation coefficient of change was observed to be inappropriate and not precise [ 18 ]. So, pooling was performed using end scores. Based on the variability in the study comparisons, we decided to pool studies for the use of GB and PG. In the first group ( Fig 3a ), studies using GB (n = 91) versus placebo (n = 94) were combined using a fixed effects model. Compared with placebo, the GB group had a small reduction in pain (MD = 0.22 units, 95% CI [−0.51 to 0.07], I 2 = 0%). There were no studies comparing PG with placebo. PG (n = 163) was compared with an active comparator (n = 169) in 3 studies ( Fig 3b ), using random effects model. This analysis showed an improvement in pain favoring the use of the active comparator group (MD = 0.42 units, 95% CI [0.20 to 0.64], I 2 = 0). Both the above comparisons were rated as very low quality evidence by GRADE ( Table 2 ). The third group consisted of comparisons that used PG as an adjunct to another analgesic medication (n = 215), such as buprenorphine (BUP) [ 30 ], tapentadol (TAP) [ 28 ], and CX [ 30 ], and compared it with the use of analgesic medication alone (n = 208). We decided that it was not appropriate to pool these studies considering the clinical heterogeneity involved within the studies, on the sides of both intervention and comparator. This was supported by the substantial statistical heterogeneity observed with such an attempt using random effects model, I 2 = 77%. The forest plot for this comparison is shown as S1 Fig . Among these 3 studies, the largest study by Baron et al. did not find any difference by adding PG to TP at their 10-week follow-up [ 28 ]. However, the smaller studies by Pota et al. [ 30 ] and Romano et al. [ 31 ] observed important differences in pain scores (difference of more than 2 points in 0–10 NRS) by using PG as an adjunct to BP and CX, respectively. There were also no significant differences when patients were assessed as success or failure with either GB versus placebo ( Fig 3c ) or PG versus active comparator ( Fig 3d ).

Important characteristics of the study population and treatments are provided in Table 1 . Of the 8 studies, 3 compared the use of GB to placebo treatment [ 25 , 27 ], and 5 used PG [ 28 – 32 ]. There were 2 crossover studies using GB [ 26 ] and PG [ 31 ]. Only 2 studies were multicentered and had external funding for the conduct of their trial [ 25 , 28 ]. Among the PG trials, 3 trials used an active comparator (amitryptline, celebrex, tramacet) versus PG alone [ 29 , 31 , 32 ]. As the study by Romano et al. had 3 arms [ 31 ], they compared PG alone versus celebrex (CX) versus a combination of PG plus CX. So, there were 3 comparisons involving PG as an adjunct to an analgesic medication versus their respective analgesic medication [ 28 , 30 , 31 ]. The mean age ranged between 41.6 to 58.5 years, except in the study by Sakai et al. [ 32 ]. However, the duration of pre-existing CLBP had a much wider range of 13 to 213 months. The treatment doses were titrated for clinical effect in all studies, except for Sakai et al., who had a fixed dosing of PG [ 32 ]. The doses ranged from 300 to 3,600 mg/day with GB and 100 to 600 mg/day with PG, in divided doses. Only 3 studies assessed specifically for NP using a screening questionnaire [ 28 , 31 , 32 ].

Discussion

Despite the widespread use, our systematic review with meta-analysis found that there are very few RCTs that have attempted to assess the benefit of using GB or PG in patients of CLBP. Use of GB and PG, compared to placebo and active analgesic comparators, respectively, were associated with significant increase in adverse effects without limited evidence for improvement in pain scores or other outcomes. We were unable to examine the pooled effect of using PG as an adjuvant analgesic medication given the limited evidence and heterogeneity of studies. It is reasonable to assume that the clinical benefit would depend upon the primary medication and its potency within each study. The differences within the results of Pota et al. [30] and Romano et al. [31], compared to Baron et al. [28] could be attributed to methodological differences. The study by Baron et al. had a larger sample size along with longer duration of follow up. Hence, the existing evidence does not support the use of gabapentinoids for predominant CLBP, and calls for larger, high quality RCTs to more definitively inform this issue.

Considering the expanding use of gabapentinoids for chronic pain and CLBP [33, 34], this review fulfils the immediate need to scrutinize and closely examine the existing evidence. Noting that there is a published Cochrane protocol [35], ours is the first review combined with meta-analysis to examine the benefits and safety of gabapentinoids in CLBP. Results of our review are in contrast with nonrandomized studies that have shown benefit with PG in patients of CLBP [36, 37]. Gabapentinoids have proven efficacy in NP conditions [38]. However, they are also widely used for conditions in which the neuropathic component is difficult to establish, most of which are off label uses [12]. This development perhaps reflects the penumbra sort of effect (clinicians generalizing the selection criteria of clinical studies into their patient population without recognizing the limitations) [39]. In England, there was a 46% and 53% rise in the prescription use of GB and PG respectively from 2011 to 2013 alone [13]. A recent Canadian study showed that the off-label use of PG is as high as 75%, and the most prevalent condition of use was CLBP [40]. The true burden of NP in CLBP is hard to establish [41]. Distribution of pain can be considered as a corollary of the pathological process, and it is important to broadly classify patients based on their predominance of axial or leg pain for diagnosis and management [2]. A common assumption is of leg pain indicating NP. However, in most cases leg pain is nonspecific and inconsistent with radicular pain, and only a painful radiculopathy with sensory signs would fulfill the diagnosis of definite NP [41]. Even if one considers that gabapentinoids are effective against NP related to CLBP, contrasting evidences are observed in literature. In patients of radicular pain or pain of spinal stenosis, observational studies of CLBP demonstrate significant improvements with PG [42, 43]. However, RCTs performed by Baron et al. in patients of lumbar radiculopathy and Markman et al. in patients of spinal stenosis did not find clinical improvements when PG was compared with placebo [44, 45]. Cohen et al. examined the benefit of GB in patients of leg pain and found no difference as compared to epidural steroid injections [46]. Even within the included study by Baron et al., the reduction of pain and NP symptoms was similar with the combination of PG with TP, compared to TP [28]. Our results are important for practitioners across several specialties who treat patients with CLBP and have to decide on the relative merits and demerits of treatment with gabapentinoids.

Our review is not without its limitations. We excluded studies in patients of predominant leg pain or spinal stenosis. This was done to limit the heterogeneity within our study population. Although the measure of heterogeneity (I2-proportion of variability that can be explained by individual studies) was low in many comparisons, the CIs around those I2 were very wide, reflecting that there is uncertainty in any claim of homogeneity. Heterogeneity has been shown to be an issue with meta-analyses involving a smaller number of trials or events [47]. Topiramate was not considered in this review, as it has a slightly different mechanism of action and is not commonly used, although some controlled studies have shown benefit [48]. The use of PG or GB is associated with significant adverse effects, cost [13], and potential for misuse [34, 49].

Our review demonstrates that there is limited evidence on the use of gabapentinoids in nonspecific CLBP, and the existing evidence in the form of RCTs does not support their use. It is possible that ongoing or unpublished studies [50, 51] may more definitively inform us on this issue, although one such study specific to CLBP was withdrawn prior to enrollment [52].