— Tools and Techniques — Document Mining with Overview Interested in mining documents? Overview is a tool for reading and analyzing thousands of documents super quickly. It offers full text search, topic modeling, coding, tagging, visualizations and more. Several investigative stories have been accomplished using this tool, including one that won a Pulitzer. There's a lot to explore here and it's free to create your own account. MetricsGraphics.js Yah! A plotting library for D3. MetricsGraphics.js is optimized for visualizing time-series data and currently supports line charts, scatterplots and histograms as well as features like rug plots and basic linear regression. I'd like to see more interactive options but it looks like this library will continue to evolve and is definitely worth watching. dariusk/corpora Here's a repository of small datasets - for those times when you don't want to manage something big. It's intended to help with rapid prototyping but I can imagine a lot of uses for something like this. Currently, you'll find data under broad headings like "animals," "governments," "geographies," "plants," and "words." The data has been cleaned and is formatted as JSON - looks super useful. FnordMetric | Create charts and dashboards from SQL FnordMetric allows you to write SQL queries that return SVG charts rather than tables. Turning a query result into a chart is literally one line of code.

— Resources — New to Data Science? Nice collection of tutorials, courses, Meetups, and books covering Data Science.