The controversy surrounding “Ballghazi” or “Deflate-gate” (depending on your politics, I guess) has only intensified. Press conferences were given. Locker room attendants’ bathroom visits were analyzed. The Columbia University physics department was called in. And, in a series of posts that made the media rounds late last week, the Patriots’ ability to avoid fumbling was declared “nearly impossible” (without, say, the systematic deflation of footballs).

The gist of the posts, written by a tout named Warren Sharp, is that the Patriots are a phenomenal statistical outlier when it comes to hanging onto the ball. Sharp presents a chart showing how far New England has stood above its peers in “offensive plays per (lost) fumble” over the last five seasons, giving the odds against such a performance happening by chance as 1 in about 16,234. He also notes that, over the same span of years, the Patriots have fumbled (whether the ball was lost or recovered) far less often than their peers, after excluding dome-based teams from the comparison. And finally, he notes that individual members of the Patriots appear to fumble far less with New England than when they play for other franchises.

The data science community responded with a number of rebuttals (I put together a roundup of my favorite ones below). Collectively, these posts did a great job of breaking down the Statistics 101 problems with Sharp’s original analyses. But even if Sharp had been less sloppy, it would have been right to take issue with the larger implication of his work — that any major outlier, if shown to be statistically significant, should be seen as evidence of rule-breaking.

Barry Bonds and Lance Armstrong were outliers. But so is Lionel Messi. And Phil Jackson. And the San Antonio Spurs. It would be irresponsible — and depressing — to assume every incredible performance equals cheating. Celebrating outliers is one of the best parts about being a sports fan.

I’d be remiss if I didn’t note that, in cases such as these, our traditional methods of determining statistical significance can severely underestimate the odds of something happening due to chance. That’s because of the so-called Wyatt Earp Effect, named after the frontier lawman known for taking part in lots of gunfights without getting hurt. Earp’s feat seems improbable in hindsight, but given the sheer number of shootouts in the Old West, it was actually pretty likely that somebody would make it out alive.

Likewise, it’s difficult to estimate the true odds against a team preventing fumbles to the extent Sharp originally suggested New England did. Knowing particulars about the Patriots after the fact can bias us into computing the odds that a specific team would have a specific fumble record over a specific period of years. But the real question regarding New England’s outlier-ness should surround the odds that any team would post any outlier statistic over any span of seasons. And the probability of that happening, as you may imagine, is a lot higher than the odds of a very specific set of circumstances.

Here were some of the responses to Sharp’s posts: