Divvy is a 64-bit Mac OS X 10.6 or 10.7 (Snow Leopard or Lion) application for performing unsupervised machine learning and visualization. We focus on the clustering (separating data into groups) and dimensionality reduction (finding low dimensional structure in high dimensional data) subfields of machine learning. For visualization we provide support for both the whole dataset (e.g. a scatter plot) and points (e.g. transforming a particular point into an image).

Free and open.

The NSF funded this work so that you don't have to. Go get our MIT-licensed code at GitHub. Hack it; improve it; run with it.

Endlessly extensible.

Every clusterer, reducer, point visualizer and dataset visualizer in Divvy is a plugin. We've provided a few big ones (K-means, PCA, scatter plot, &c.) and we're hoping the community will use our plugin protocol to build many more. Each plugin defines its own UI, so your algorithm can look and behave the way that you want it to without top-down constraints.

Have lots of cores?

Divvy is both task and data parallel. No longer will you be waiting for one algorithm to complete before you start another. Start as many as you want and keep using the UI. Only started one? With data parallelism we'll still push your new MacBook Pro to 800% CPU utilization.

Part of your workflow.

Export your clusterings and reductions to .csv and your visualizations to .png. Use your R or Matlab data with our R and Matlab to Divvy export tools.