Installing StreamSets Data Collector on Amazon Web Services EC2

Stream Me Up (to the Cloud), Scotty

I’ve had some fun working with StreamSets Data Collector lately and wanted to share how to quickly get up and running on an Amazon Web Services (AWS) Elastic Compute Cloud (EC2) instance and build a simple pipeline. For anyone unaware, StreamSets Data Collector is, in their own words, a low-latency ingest infrastructure tool that lets […]

Kicking with JSON in the Snow

Employees at the Minneapolis Red Pill Analytics office recently had the luxury of sitting with Snowflake’s Steve Herskovitz for some on-site education. One of my key takeaways from the exercise was how simple it is to query JSON data in the Snowflake Data Warehouse. The syntax is straightforward and makes the process […]
