Spring Events with Red Pill Analytics
The sun is shining, and the data-driven community is coming out of hibernation to gather across the country. Check out where Red Pill Analytics will be in the coming months, and let us know if you will be attending. We’d love to connect in person!
Kafka Summit 2019:
April 2, 2019
New York, NY
The Kafka Summit is the premier event for data architects, engineers, DevOps professionals, and developers who want to learn about streaming data. It brings the Apache Kafka community together to share best practices, write code and discuss the future of streaming technologies. We are honored to have Stewart Bryson, founder and CEO, join the list of presenters and sessions at this event.
Connect with us here:
Use Apache Gradle to Build and Automate KSQL and Kafka Streams
April 2, 2019, 5:15 pm — 5:55 pm
KSQL is an easy-to-use and easy-to-understand streaming SQL engine for Apache Kafka built on top of Kafka Streams. The ability to write streaming applications using only SQL makes Apache Kafka available to a whole range of new developers and potential use cases, either as a stand-alone solution or as a single component to a broader Kafka Streams implementation. Inspired by a customer project now in production, experience the lifecycle of a streaming application developed using KSQL and Kafka Streams. With Apache Gradle as our build framework, we’ll explore the open-source Gradle plugin we built during this project to improve developer efficiency and automate the deployment of KSQL pipelines, user-defined functions, and Kafka Streams microservices.
We’ll demonstrate the deployment process live, and discuss design decisions around incorporating SQL-based processes into an overall streaming application.
April 7-11, 2019
San Antonio, TX
Red Pill is thrilled to be involved in COLLABORATE 19, the largest user-led, user-focused event for Oracle users of all levels. Join Red Pill Analytics and IT decision makers, super users, Oracle, and Oracle vendors as they gather to share real-world insights and practical solutions to maximize return on their Oracle investments.
Expand your Oracle skill set by creating a customized agenda from 1,000+ interactive sessions all about Oracle. Network with experienced professionals who have the same challenges as you, or ask questions directly to Oracle and Oracle vendors at the Exhibitor Showcase. The user-focus of COLLABORATE 19 ensures that it’s full of first-hand experiences, real-world scenarios, and honest advice. Whether you want to optimize your Oracle Cloud investment, need help transitioning, or are considering a move to Oracle Cloud, there’s something here for everyone.
Connect with us here:
Join Stewart Bryson (owner and co-founder of Red Pill Analytics) as he leads this interactive “Architecture Live” session, a discussion of the current state of on-premise data warehousing and why cloud data warehousing is that much better. Attendees will leave empowered with knowledge that will help prepare them for the future of data warehousing.
June 3-6, 2019
San Francisco, CA
We are sponsoring the upcoming inaugural Snowflake Summit, a comprehensive learning opportunity for data-driven practitioners and leaders. With a host of content covering every skill level and interest, anyone who has a desire to develop deeper insights from their data is welcome. Most importantly, there will be ample opportunity to learn from and network with industry peers. We are excited to be participating as a partner for the event and hope to see you in San Francisco in June.
June 23-27, 2019
Team up with Red Pill and Oracle experts from all around the world at ODTUG Kscope19, the premier Oracle developer conference. Individuals from over 30 different countries gather in June each year to learn, share technical content, network, and have fun with like-minded professionals. Since it’s launch in 2006, ODTUG Kscope has continued to grow, drawing more presenters and attendees every year. We’d like to recognize and appreciate Mike Jelen of Red Pill Analytics for helping with the organization and content for the Analytics Track Lead at Kscope19. We’re proud to be a part of ODTUG and hope to see you there.
Connect with us at these Red Pill Sessions:
Data Engineering and the Death of Visual ETL
June 24, 2019 (11:45 AM — 12:45 PM)
A funny thing happened on the way to modern, data-driven analytics: we ditched the GUI for ETL development. There are drawbacks to moving data using code instead of clicks, especially for organizations heavily invested in visual tools. But writing code has an upside as well.
In this presentation, we’ll explore the data engineering discipline and how it compares to conventional ETL. We’ll look at classic batch as well as modern streaming use cases, and we’ll consider DataOps requirements around testing and delivery where code really shines. We’ll do it all inside of an interactive notebook.
Machine Learning Live — Let’s Build a Taxi Fare Predictor
June 25, 2019 (03:45 PM — 04:45 PM)
Regardless of what you’ve seen or heard, machine learning is not that complicated, and simple but valuable use cases that extend traditional analytical capabilities are easily unlocked. In this session, we’ll build a prediction model for taxi fares based on existing, public data. Watch as we explore the dataset, derive estimates with basic analytics, and then train our model with several iterations of linear regression algorithms to outperform those estimates. You’ll understand the methods, tools and algorithms used in this predictor, and witness the live development and training of our machine learning model. Are you ready to finally understand what machine learning is all about?
Data Profiling Is OK, Really It Is
June 24, 2019 (03:45 PM — 04:45 PM)
In a targeted political advertising world, the words “data profiling” have been hijacked to become something less than socially desirable. However, true data profiling has a major role in the delivery of analytics and reporting projects, even Agile ones!
This talk goes back to a more functional approach to data profiling and how we can use that to build better BI systems. It’s better in terms of performance, better in terms of the need for less reworking during development, and has fewer “surprises.” Starting from the importance of building models and discovering data relationships, the presentation expands on the need to actually look at the data to consider things such as:
• Event transition states — the way that factual statuses progress through a sequence of states over time.
• Hierarchies, parentage, ways to handle “adoption,” and the time when data is reclassified.
• Multiple data sources to one data store and how we can be sure we are talking about the same thing.
Throughout the talk, there will be real-world examples of recent projects using both locally and cloud-hosted data stores. As we’ll discover, data profiling is not a bad thing and used well it delivers enormous benefits to a BI project.
Rethinking What We Do: How Streams of Events have Forced the Evolution of the Star Schema
June 25, 2019 (03:45 PM — 04:45 PM)
At the beginning of the century, data warehouses tended to be small (less than 10TB) and followed one of the two main schools of design philosophy — Kimball’s Star Schema and Inmon’s Information Factory. Both methodologies had their merits, but Kimball had a few advantages over Inmon, which gave it the edge — how the business thought about its data, it worked with incremental development, and a lot of commercial reporting tools worked best with star structures.
Now data volumes have exploded, data sources have multiplied, and information is required to be available sooner than ever. These changes put pressure on concepts from two decades ago; technology capabilities, business needs, and the reduced (relative) costs of hardware provision make us think of new ways to do things. One major change is the move from batch or micro-batch orientated data load to one of information streaming, whether this is database log-based change capture, for example, Oracle GoldenGate or something more event oriented such as message queues and Kafka streams. Although event streams were a great enabler of near real-time reporting, they also represented a way that the sheer volume of today’s data could be ingested without compromising on reporting availability.
We'll see you there
If you are planning on attending Kafka Summit, Collaborate 2019, or Snowflake Summit, let us know. Contact us at firstname.lastname@example.org or through our many social media channels. We’re proud to be involved in the sponsorship, speaking, and planning of these events and love collaborating with the data community. Come see us!