DataDotz BigData Weekly

Scheduling Notebooks at Netflix
===============================

At Netflix we’ve put substantial effort into adopting notebooks as an integrated development platform. The idea started as a discussion of what development and collaboration interfaces might look like in the future. It evolved into a strategic bet on notebooks, both as an interactive UI and as the unifying foundation of our workflow scheduler.

UDAF in KSQL 5.0
================

KSQL is the open source streaming SQL engine that enables real-time data processing against Apache Kafka. KSQL makes it easy to read, write, and process streaming data in real time, at scale, using SQL-like semantics. KSQL already has plenty of available functions, such as SUBSTRING, STRINGTOTIMESTAMP, and COUNT. Even so, many users need additional functions to process their data streams.

Data Pipeline Patterns in the Decoupled Processing Era
======================================================

A data pipeline is a sequence of transformations that converts raw data into actionable insights. In the past, the processing and storage engines were coupled together; for example, a traditional MPP warehouse combines both a processing engine and a storage engine.
