InfoQ Homepage Project Management Content on InfoQ
-
The Journey to a Million Ops / Sec / Node in Venice
Alex Dubrouski, andGaojie Liu discuss some of the tricks used in their pursuit to lower read latency and to reach 1M operations per second per node.
-
Lessons from Leading the Serverless First Journey at CapitalOne
George Mao discusses their journey into serverless, the best practices they picked up, the lessons learned along the way, and the optimizations for Lambda.
-
Sigstore: Secure and Scalable Infrastructure for Signing and Verifying Software
Billy Lynch and Zack Newman discuss the architecture and internals of Sigstore and keyless signing, along with the security considerations that drove the design.
-
Building Sub-Second Latency Video Infrastructure at Cloudflare
Renan Dincer provides insight on how Cloudflare has deployed a sub-second latency live streaming system at scale, focusing on protocols used: HLS, DASH, RTMPS, SRT and WebRTC.
-
Going beyond the Case of Black Box AutoML
Kiran Kate covers the basics of AutoML and then presents Lale (https://github.com/IBM/lale), an open-source scikit-learn compatible AutoML library which implements Gradual AutoML.
-
How Netflix Ensures Highly-Reliable Online Stateful Systems
Joseph Lynch discusses the architecture of Netflix's stateful caches and databases, including how they capacity plan, bulkhead, and deploy software to their global, full-active, data topology.
-
Eventual Consistency – Don’t Be Afraid!
Susanne Braun shares her experiences from different case studies with industry clients, and open access design guidelines developed using action research.
-
Using Traffic Modeling to Load-Balance Netflix Traffic at Global Scale
Niosha Behnam and Sergey Fedorov discuss how Netflix shifted from geo-based DNS load-balancing to a latency-based approach, relying on real-user measurements and building a model of Netflix traffic.
-
Performance: Adventures in Thread-per-Core Async with Redpanda and Seastar
John Spray describes an experience of building high performance systems with C++20 in an asynchronous runtime, and explores the challenges & tradeoffs in adopting a thread-per-core architecture.
-
Providing a Personalized Experience to Millions of Users @BBC
Manisha Lopes discusses levers available to tune a cloud application, aligning the Data Model with the usage patterns for the data, using Performance Tests to benchmark an application, and others.
-
Strategy & Principles to Scale and Evolve MLOps @DoorDash
Hien Luu shares their approach to MLOps, and the strategy and principles that have helped them to scale and evolve their platform to support hundreds of models and billions of predictions per day.
-
How to Build a Successful Cloud Capability on a Heavy Regulated Organization
Ana Sirvent discusses their cloud capability journey, highlighting lessons learned and best practices on culture, processes and technology.