Earlier last month Cavium announced that ThunderX2 has hit general availability, a new family of Xeon-class ARM server processors optimized for HPC workloads. Contrary to its namesake and despite succeeding the original ThunderX line, the new processors are of an entirely different design, unlike anything Cavium has introduced in the past.

Overview

The new ThunderX2 family includes over 40 SKUs for scale-up and scale-out applications. All models are based on the same design. The die incorporates 32 custom ARM cores (detailed later) along with eight DDR4 memory channels, two USB ports, two SATA ports, x56 lanes of PCIe Gen 3, and 2nd-generation CCPI (also detailed later).

With eight memory channels, each supporting DDR4 at rates of up to 2666 MT/s for both 2 DPC and 1 DPC, those processors have an aggregated bandwidth of up to 170.7 GB/s – same as AMD’s EPYC. RDIMM, LRDIMM, and NVDIMM-P in single, dual and quad rank modules are supported. Cavium has many of the enhanced RAS features one should expect from a modern server chip including failed DIMM identification, demand and patrol scrubbing, address parity protection, corrupt data containment (data poisoning), SECDED ECC, and SDDC.

As far as I/O go, ThunderX2 incorporates 56 lanes of PCIe Gen 3. That’s x8 lanes more than Skylake SP.

With 14 PCIe controllers, those lanes can be bifurcated as 3×16, 7×8, and various other combinations. Other I/O include 2 SATA3 ports, 2 USB 3 ports, and general-purpose I/O.

Models

Cavium is launching over forty SKUs which have been sorted out by how they compare to Intel’s Skylake offerings. In paticular, the models were compared against Xeon Gold and Platinum based on their SPECIntRate and TDP scores (tested using gcc â€“o3). To simplify things, we can actually compress the stack down to just six models:

ThunderX2 SKU Stack Model Cores / Threads Frequency PCIe Lanes Memory CN9980 32 / 128 2.0 – 2.5 GHz x56 8 Ã— DDR4-2666 CN9978 30 / 120 1.8 – 2.3 GHz x56 4/8 Ã— DDR4-2666 CN9975 28 / 112 1.8 – 2.4 GHz x56 6/8 Ã— DDR4-2666 CN9970 24 / 96 1.8 – 2.5 GHz x56 4/6/8 Ã— DDR4-2666 CN9965 20 / 80 1.8 – 2.5 GHz x56 4/6 Ã— DDR4-2666 CN9960 16 / 64 1.6 – 2.2 GHz x48 4 Ã— DDR4-2666

For the most part, the stack segmentation is straightforward – cores, frequency, and memory channels. SKUs range from $800-$3000. The top-binned CN9980 part is a 32-core chip running at 2.5, topping out at 200 Watts. The 2.2 GHz variant of the CN9980 with a TDP of 180 W is priced at $1,795 in quantities of a 1,000 units. At the other end of the scale is the 16-core CN9960 part operating at 1.6 GHz with a TDP of 75 W which costs $800.

A few things are notably missing from the table. The technical specs Cavium has given us mentions that the top binned part operates at 2.5 GHz with a turbo mode of up to 3 GHz. Turbo modes for all other parts supposedly exist. Unfortunately, we were unable to obtain this information. For now, Cavium is not sharing the full TDP specs for the other models either. The full SKU stack disclosed is shown below.

The table above, provided by Cavium, has what they consider a direct mapping between their SKUs and Intel’s Skylake. We have expanded on their table to show the full spec from both models for a better comparison.

Note that we have only listed the base clock frequencies for Intel (non-AVX, AVX2, and AVX-512). Since Cavium did not provide turbo frequencies and TDPs, a direct comparison is quite difficult. It’s also worth pointing out that Intel models have three turbo frequencies, making the comparison highly workload-specific. Regardless of the model, all Intel models in the table have six memory channels, x48 PCIe lanes, and all features enabled. It’s also worth noting that the Xeon Gold 6000 series, as well as all Xeon Platinum models, come with a second AVX-512 FMA unit.



–

Spotted an error? Help us fix it! Simply select the problematic text and press Ctrl+Enter to notify us.

–

