InfoQ Homepage Architecture & Design Content on InfoQ
-
How Netflix Ensures Highly-Reliable Online Stateful Systems
Joseph Lynch discusses the architecture of Netflix's stateful caches and databases, including how they capacity plan, bulkhead, and deploy software to their global, full-active, data topology.
-
LSEG Cloud Lessons Learned: after Nearly a Decade of Being Cloud-First, What Have We Learned?
Oli Bage shares LSEG’s organizational, economic and technical tips about the journey to cloud. He talks about the CDMC standard, and where analytics might head in the future.
-
Managing 238M Memberships at Netflix
Surabhi Diwan discusses how the Netflix’ membership team outgrew many of its technology and architectural choices as memberships went from a few hundred thousand to 200 million.
-
Maximizing Performance and Efficiency in Financial Trading Systems through Vertical Scalability and Effective Testing
Peter Lawrey discusses achieving vertical scalability by minimizing accidental complexity and using an event-driven architecture.
-
Living on the Edge: Boosting Your Site's Performance with Edge Computing
Erica Pisani discusses what the edge is, how running code and serving data on the edge can improve site performance, and how to leverage these options to maximize site performance.
-
The Rise of the Serverless Data Architectures
Gwen Shapira explores the implications of serverless workloads on the design of data stores, and the evolution of data architectures toward more flexible scalability.
-
From Open Source to SaaS: the Journey of ClickHouse
Sichen Zhao and Shane Andrade discuss architectural design decisions and some of the pitfalls one may run into along the way.
-
Reliable Architectures through Observability
Kent Quirk shows an overview of observability tools and techniques, and specific recommendations for how to fit observability into their system designs and day-to-day development process.
-
Banking on Thousands of Microservices
Suhail Patel covers lessons learned creating a banking platform on the cloud that serves over 7 million customers daily and relies on a lean engineering team, microservices, Cassandra, and Kubernetes.
-
How to Build a Reliable Kafka Data Processing Pipeline, Focusing on Contention, Uptime and Latency
Lily Mara shares how OneSignal improved the performance and maintainability of its highest-throughput HTTP endpoints (backed by a Kafka consumer in Rust) by making it an asynchronous system.
-
Deconstructing an Abstraction to Reconstruct an Outage
Chris Sinjakli explores the aftermath of a complex outage in a Postgres cluster, retracing the steps taken to reliably reproduce the failure in a local environment.
-
Hard Problems in Front-End Platforms
Katie Sylor-Miller discusses the world of Front-end Platform Engineering, exploring the unique challenges, strategies, and best practices involved in creating robust, scalable, and reliable systems.