Gary Marcus and Ernest Davis wrote this useful news article on the promise and limitations of “big data.”

And let me add this related point:

Big data are typically not random samples, hence the need for “big model” to map from sample to population. Here’s an example (with Wei Wang, David Rothschild, and Sharad Goel):

