Data Pipelines Content on InfoQ

Presentations

Architecture & Design

Streaming Reactive Systems & Data Pipes w. Squbs

Anil Gursel and Akara Sucharitakul focus on modeling and building software that treats all input and all output as streams of events, and introduce Squbs.

Akara Sucharitakul, Anil Gursel
on May 04, 2018

39:48
Architecture & Design

Scaling Uber's Elasticsearch Clusters

Danny Yuan talks about how a three-person team at Uber scaled its Elasticsearch clusters and ingestion pipelines, covering ingestion, queries, data storage, and operations.

Danny Yuan
on Apr 11, 2018

48:18
AI, ML & Data Engineering

Effective Data Pipelines: Data Mngmt from Chaos

Katharine Jarmul discusses implementation decisions for those looking for a practical recommendation on the "what" and "how" of data automation workflows.

Katharine Jarmul
on Mar 29, 2017

45:22
AI, ML & Data Engineering

Building Data Pipelines in Python

Marco Bonzanini discusses the process of building data pipelines and all the steps necessary to prepare data, focusing on data plumbing and going from prototype to production.

Marco Bonzanini
on Mar 28, 2017

48:49
AI, ML & Data Engineering

Cloud Native Streaming and Event-driven Microservices

Marius Bogoevici demonstrates how to create complex data processing pipelines that bridge big data and enterprise integration, and how to orchestrate them with Spring Cloud Data Flow.

Marius Bogoevici
on Jan 14, 2017

01:10:46
AI, ML & Data Engineering

Spring and Big Data

Thomas Risberg discusses developing big data pipelines with Spring, focusing on the code needed, and covers how to set up a test environment both locally and in the cloud.

Thomas Risberg
on Jan 08, 2017

55:24
AI, ML & Data Engineering

Data Microservices in the Cloud

Mark Pollack introduces Spring Cloud Data Flow, which enables creating pipelines for data ingestion, real-time analytics, and data import/export, and demos apps deployed onto multiple runtimes.

Mark Pollack
on Jan 08, 2017

01:07:25
AI, ML & Data Engineering

Hydrator: Open Source, Code-Free Data Pipelines

Jonathan Gray introduces Hydrator, an open source framework and user interface for creating data lakes and for building and managing data pipelines on Spark, MapReduce, Spark Streaming, and Tigon.

Jonathan Gray
on Oct 23, 2016

41:39