Research on gender differences in intelligence has led to inconsistent results. Different methods seem to provide different estimates using the Wechsler scales. The present study analyzed WAIS-IV and WAIS-R standardization data. We assessed gender differences in subtest scores, indices, and IQs and used a Multi-Group Confirmatory Factor Analysis (MGCFA) to investigate whether gender differences were due to differences in ability factors or test bias. The larger effect sizes in favor of men were found in the Arithmetic subtest and the Working Memory Index of the WAIS-IV. Likewise, for the WAIS-R, Arithmetic, Information, and Block Design subtests and all IQ scores were the larger effect sizes. The MGCFA supported the metric invariance of both editions. For the WAIS-IV, partial scalar invariance was achieved by releasing intercept equality for Information, Comprehension, and Arithmetic. For the WAIS-R, also Coding and Digit Span intercepts had to be unconstrained. Men outperformed women on the perceptual reasoning ability factor. No gender differences were found for the processing speed factor. Men had higher working memory ability, while no difference emerged for verbal comprehension. However, the latent mean differences could be affected by a lack of scalar invariance. A comparison of observed effect sizes at the subtest level to those expected according to MGCFA revealed that Information, Arithmetic, and Comprehension subtests were gender-biased in both editions. As a whole, gender differences appeared more pronounced in the WAIS-R. Gender differences in perceptual reasoning seem genuine, as does their absence in processing speed. Implications for clinical assessment are discussed.