InfoQ Homepage Data Content on InfoQ

Presentations

RSS Feed

Newer Older

Development

Homoiconicity: It Is What It Is

Stuart Sierra demonstrates the power that comes from having the same data representation at all layers: programming language, specification, database, inter-process communication, and user interface.

Stuart Sierra
on Oct 31, 2017

Icon

47:06
Culture & Methods

Data-Driven Coaching - Safely Turning Team Data into Coaching Insights

Troy Magennis shows how to expose data to teams in order for them to retrospect productively, determine if a process experiment is panning out as expected, and to explore process change opportunities.

Troy Magennis
on Oct 29, 2017

Icon

45:00
AI, ML & Data Engineering

Machine Learning in Academia and Industry

Deborah Hanus discusses some of the challenges that can arise when working with data.

Deborah Hanus
on Oct 10, 2017

Icon

40:26
AI, ML & Data Engineering

AI-Based Data Extraction

George Roth presents the challenges of data extraction from unstructured content in the context of preparing the data for Data Analytics.

George Roth
on May 28, 2017

Icon

39:23
AI, ML & Data Engineering

Data Preparation for Data Science: A Field Guide

Casey Stella presents a utility written with Apache Spark to automate data preparation, discovering missing values, values with skewed distributions and discovering likely errors within data.

Casey Stella
on Apr 23, 2017

Icon

45:00
AI, ML & Data Engineering

Straggler Free Data Processing in Cloud Dataflow

Eugene Kirpichov describes the theory and practice behind Cloud Dataflow's approach to straggler elimination, and the associated non-obvious challenges, benefits, and implications of the technique.

Eugene Kirpichov
on Apr 11, 2017

Icon

46:03
Architecture & Design

Scaling up Near Real-Time Analytics @Uber &LinkedIn

Chinmay Soman and Yi Pan discuss how Uber and LinkedIn use Apache Samza, Calcite and Pinot along with the analytics platform AthenaX to transform data to make it available for querying in minutes.

Chinmay Soman Yi Pan
on Mar 30, 2017

Icon

46:03
AI, ML & Data Engineering

Effective Data Pipelines: Data Mngmt from Chaos

Katharine Jarmul discusses implementation decisions for those looking for a practical recommendation on the "what" and "how" of data automation workflows.

Katharine Jarmul
on Mar 29, 2017

Icon

45:22
AI, ML & Data Engineering

Building Data Pipelines in Python

Marco Bonzanini discusses the process of building data pipelines and all the steps necessary to prepare data, focusing on data plumbing and going from prototype to production.

Marco Bonzanini
on Mar 28, 2017

Icon

48:49
AI, ML & Data Engineering

Data Science in the Cloud @StitchFix

Stefan Krawczyk discusses how StitchFix used the cloud to enable over 80 data scientists to be productive and have easy access, covering prototyping, algorithms used, keeping schema in sync, etc.

Stefan Krawczyk
on Feb 17, 2017

Icon

40:48
AI, ML & Data Engineering

Scaling the Data Infrastructure @Spotify

Mārtiņš Kalvāns and Matti Pehrs overview the Data Infrastructure at Spotify, diving into some of the data infrastructure components, such us Event Delivery, Datamon and Styx.

Mārtiņš Kalvāns Matti Pehrs
on Jan 28, 2017

Icon

44:06
AI, ML & Data Engineering

Data Microservices in the Cloud

Mark Pollack introduces Spring Cloud Data Flow enabling one to create pipelines for data ingestion, real-time analytics and data import/export, demoing apps that are deployed onto multiple runtimes.

Mark Pollack
on Jan 08, 2017

Icon

01:07:25

Newer Presentations

Older Presentations

InfoQ Software Architects' Newsletter

Presentations