I recently built a small cluster for running Solr. The cluster consists of 12 Supermicro blades, each running an E3-1270V2 with 32GB of RAM.

11 of these servers are running fine. One of them crashes on me constantly. When the server crashes it typically produces some output on the terminal. The first time it was:

double fault: 0000 [#1]

Hmm... that's pretty cryptic. Since then I've recreated the problem and gotten some more interesting messages.

Here's another equally cryptic message...

Another interesting wrinkle: I can fire up sysbench and max out the CPU without a crash. The server only crashes reliably once I start Java.
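To separate Solr from the JVM itself, a standalone Java load might be useful. The following is only a rough sketch of such a reproducer, not something taken from the Solr setup: the class name HeapStress, the method names, and the buffer size are all made up for illustration. It runs the same deterministic fill-and-sum work on every core and compares each result with a value recomputed on the main thread, so a mismatch (or a crash) on the bad blade but not on the other 11 would point at hardware rather than software.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical reproducer: multi-threaded CPU + heap load with a self-check.
public class HeapStress {

    // Fill a buffer with a deterministic xorshift64 sequence and return its sum.
    static long fillAndSum(int seed, int words) {
        long[] buf = new long[words];
        long x = seed * 0x9E3779B97F4A7C15L + 1; // non-zero start value for small seeds
        for (int i = 0; i < words; i++) {
            x ^= x << 13;
            x ^= x >>> 7;
            x ^= x << 17;
            buf[i] = x;
        }
        long sum = 0;
        for (long v : buf) {
            sum += v;
        }
        return sum;
    }

    // Run the work on `threads` worker threads and count results that differ
    // from the same computation repeated on the calling thread.
    static int runRound(int threads, int words) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<Long>> results = new ArrayList<>();
            for (int t = 0; t < threads; t++) {
                final int seed = t;
                results.add(pool.submit(() -> fillAndSum(seed, words)));
            }
            int mismatches = 0;
            for (int t = 0; t < threads; t++) {
                long expected = fillAndSum(t, words);
                if (results.get(t).get() != expected) {
                    mismatches++;
                }
            }
            return mismatches;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        int threads = Runtime.getRuntime().availableProcessors();
        int rounds = args.length > 0 ? Integer.parseInt(args[0]) : 100;
        for (int r = 0; r < rounds; r++) {
            int bad = runRound(threads, 1 << 22); // 32 MiB of longs per thread
            System.out.println("round " + r + " mismatches=" + bad);
        }
    }
}
```

Running it with something like "java HeapStress 1000" on both a good blade and the bad one would show whether plain JVM load is enough to trigger the crash, without Solr in the picture.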

I've tried turning off the following CPU features:

Turbo Mode

C States

T States

XHCI

Is this just a bad CPU?

Many thanks!