InfoQ Homepage Netflix Content on InfoQ
-
Disenchantment: Netflix Titus, Its Feisty Team, and Daemons
Andrew Spyker talks about Netflix's feisty team’s work across container runtimes, scheduling & control plane, and cloud infrastructure integration.
-
Crisis to Calm: Story of Data Validation @ Netflix
Lavanya Kanchanapalli discusses safe data propagation at Netflix, circuit breakers, data canaries and staggered rollout effective, and efficient validations via sharing data and isolating change.
-
Human-centric Machine Learning Infrastructure @Netflix
Ville Tuulos discusses the tools Netflix built for the data scientists and some of the challenges and solutions made to create a paved road for machine learning models to production.
-
Building Resilience in Production Migrations
Sangeeta Handa shares Netflix’s migration stories, what helped them build resilience, why resilience is important, and what Netflix Billing Infrastructure is doing to avoid taking downtime.
-
Netflix Play API - An Evolutionary Architecture
Suudhan Rangarajan talks about what patterns Netflix observed in their previous architectures and how they arrived at a list of practices to create an Evolutionary Architecture.
-
Full Cycle Developers @Netflix
Greg Burrell presents Netflix’s journey from siloed teams to their Full Cycle Developer model for building and operating their services at Netflix.
-
Better DevEx at Netflix: Polyglot and Containers
Mike McGarr talks about the evolution of developer tooling at Netflix, focusing on command line tools they built to address evolving needs around programming languages, containers and more.
-
How Machines Help Humans Root Case Issues @ Netflix
Seth Katz discusses ways to build tools designed to enhance the cognitive ability of humans through automated analysis to speed root cause detection in distributed systems.
-
Scaling Push Messaging for Millions of Devices @Netflix
Susheel Aroskar talks about Zuul Push - a massively scalable push notification service that handles millions of "always-on" persistent connections from all Netflix apps.
-
Incident Management at Netflix Velocity
Dave Hahn talks about how Netflix engineering teams think about failure, why they believe chaos is their friend, failure is guaranteed, and why Netflix is better off having both.
-
Custom, Complex Windows @Scale Using Apache Flink
Matt Zimmer discusses Apache Flink, how to use it to aggregate events into windows customized along varying definitions of a session, handling out-of-order events, and more.
-
Automating Netflix ML Pipelines with Meson
Davis Shepherd and Eugen Cepoi discuss the evolution of ML automation at Netflix and how that lead them to build Meson, challenges faced and lessons learned automating thousands of ML pipelines.