Google’s BigQuery is a large-scale, interactive query environment that can handle billions of records in seconds. Now, wouldn’t it be cool to process the 26+ billion triples from the LOD cloud with BigQuery?

I guess so 😉

So, I took a first step in this direction by setting up the BigQuery for Linked Data project, which contains:

A Python script called nt2csv.py that converts RDF/N-Triples into BigQuery-compliant CSV;

BigQuery schemas that can be used together with the CSV data from above;

Step-by-step instructions on how to use nt2csv.py along with Google’s gsutil and bq command-line tools to import the above data into Google Storage and query the uploaded data in BigQuery.
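
To give a rough idea of what the conversion step involves, here is a minimal sketch of turning N-Triples into CSV. It is not the actual nt2csv.py; the function names, the regular expression and the three-column subject/predicate/object layout are my assumptions for illustration, and the pattern only covers simple, well-formed statements.

```python
import csv
import io
import re

# Assumed pattern for a simple N-Triples statement:
# subject (IRI or blank node), predicate (IRI), object (anything), final dot.
TRIPLE_RE = re.compile(r'^\s*(<[^>]*>|_:\S+)\s+(<[^>]*>)\s+(.+?)\s*\.\s*$')


def nt_to_rows(lines):
    """Yield (subject, predicate, object) tuples from N-Triples lines."""
    for line in lines:
        line = line.strip()
        # Skip blank lines and comments.
        if not line or line.startswith('#'):
            continue
        match = TRIPLE_RE.match(line)
        if match is None:
            # This sketch silently drops lines it cannot parse.
            continue
        yield match.groups()


def nt_to_csv(nt_text):
    """Return CSV text with one fully quoted row per triple."""
    out = io.StringIO()
    # Quote every field so commas and quotes inside literals stay intact.
    writer = csv.writer(out, quoting=csv.QUOTE_ALL, lineterminator='\n')
    for row in nt_to_rows(nt_text.splitlines()):
        writer.writerow(row)
    return out.getvalue()


if __name__ == '__main__':
    sample = '<http://example.org/a> <http://example.org/p> "hello, world" .\n'
    print(nt_to_csv(sample), end='')
```

Quoting every field is a deliberate choice here: literals often contain commas and double quotes, and the csv module doubles embedded quotes so that each triple still ends up as exactly three columns.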