Performance & Scalability Content on InfoQ
-
Modern Compute Stack for Scaling Large AI/ML/LLM Workloads
Jules Damji discusses which infrastructure should be used for distributed fine-tuning and training, how to scale ML workloads, how to accommodate large models, and how CPUs and GPUs can be utilized.
-
Sleeping at Scale - Delivering 10k Timers per Second per Node with Rust, Tokio, Kafka, and Scylla
Lily Mara and Hunter Laine walk through the design of a system, its performance characteristics, and how they scaled it.
-
Several Components are Rendering: Client Performance at Slack-Scale
Jenna Zeigen discusses front-end performance issues encountered by Slack as they continue to grow and evolve the desktop app.
-
Effective Performance Engineering at Twitter-Scale
Yao Yue recapitulates scaling a project at Twitter while summarizing some key lessons learned about effective performance engineering.
-
Scaling Organizations with Platform Engineering
Lesley Cordero focuses on how Platform Engineering can drive sustainability for growing organizations through DevOps principles, centralization, and scalable technical practices.
-
The Journey to a Million Ops / Sec / Node in Venice
Alex Dubrouski and Gaojie Liu discuss some of the tricks used in their pursuit to lower read latency and to reach 1M operations per second per node.
-
Sigstore: Secure and Scalable Infrastructure for Signing and Verifying Software
Billy Lynch and Zack Newman discuss the architecture and internals of Sigstore and keyless signing, along with the security considerations that drove the design.
-
Managing 238M Memberships at Netflix
Surabhi Diwan discusses how the Netflix membership team outgrew many of its technology and architectural choices as memberships went from a few hundred thousand to 200 million.
-
Maximizing Performance and Efficiency in Financial Trading Systems through Vertical Scalability and Effective Testing
Peter Lawrey discusses achieving vertical scalability by minimizing accidental complexity and using an event-driven architecture.
-
Performance: Adventures in Thread-per-Core Async with Redpanda and Seastar
John Spray describes his experience of building high-performance systems with C++20 in an asynchronous runtime, and explores the challenges and tradeoffs of adopting a thread-per-core architecture.
-
Scaling Defenses Amidst Evolving Threat Landscape
Aditi Gupta discusses the design choices made early on during service development that were crucial to scaling operations later on at Netflix.
-
Azure Cosmos DB: Low Latency and High Availability at Planet Scale
Mei-Chin Tsai and Vinod Sridharan discuss the internal architecture of Azure Cosmos DB and how it achieves high availability, low latency, and scalability.