I saw a post a while back about drug use in different SF neighborhoods. The most basic example was how there was a higher percentage of crack and cocaine users in the Tenderloin compared to higher percentages of weed users in the Haight.

It affirmed stuff that people already knew, but it was still pretty interesting how the data reflected the theory in the police reports. I decided to use the same dataset to just see if any of the neighborhoods had higher percentages that stood out in terms of certain crimes. Essentially I wanted to ask the question: Does each neighborhood have a stand-out crime it would be known for?

With the SF Kaggle competition going on right now on predicting crimes in San Francisco, there's got to be some feature selection that's looking at some of the more common crimes that would get classified based on area and location.

The SF crime dataset is hosted by Socrata (Note: I used to work for them as an intern), and consists of open SF crime data since 2003. I used a 800 meter radius for an approximate half a mile and picked the center points in each neighborhood based on my own discretion and where the Google Maps neighborhood label was (You can check out the coordinates at the bottom of this post). I grouped the data by category and took crime data since January of 2014 as a recency mark.