Proof that Less can be More

Snowflake Data Warehouse release 2.11 (September 7, 2017) added an OVERWRITE parameter to the INSERT command. INSERT OVERWRITE effectively consolidates two commands, TRUNCATE TABLE and INSERT, into one. INSERT OVERWRITE also deletes the file load history and retains access control privileges on the target table, neither of which is an insignificant detail. […]

Installing StreamSets Data Collector on Amazon Web Services EC2

Stream Me Up (to the Cloud), Scotty

I’ve had some fun working with StreamSets Data Collector lately and wanted to share how to quickly get up and running on an Amazon Web Services (AWS) Elastic Compute Cloud (EC2) instance and build a simple pipeline. For anyone unaware, StreamSets Data Collector is, in their own words, a low-latency ingest infrastructure tool that lets […]

Kicking with JSON in the Snow

Employees at the Minneapolis Red Pill Analytics office recently had the luxury of sitting with Snowflake’s Steve Herskovitz for some on-site education. One of my key takeaways from the session was how simple it is to query JSON data in the Snowflake Data Warehouse. The syntax is straightforward and makes the process […]
