Facilitating the Spread of Knowledge and Innovation in Professional Software Development

QCon San Francisco 2023 Content on InfoQ

  • How Netflix Ensures Highly-Reliable Online Stateful Systems

    Building reliable stateful services at scale isn’t a matter of building reliability into the servers, the clients, or the APIs in isolation. By combining smart and meaningful choices for each of these three components, we can build massively scalable, SLO-compliant stateful services at Netflix.

  • How to Get Tech-Debt on the Roadmap

    Only doing product-led work can lead to an unmaintainable system with lots of downtime. Unfortunately, getting time to work on the things that would prevent that can be challenging. Engineering roadmaps balance many competing demands, and the people making the decisions aren’t always the closest to the code. Engineers need to learn how to speak their language.

  • Unpacking How Ad Ranking Works at Pinterest

    Aayush Mudgal describes how Pinterest serves advertisements and explains in detail how machine learning is used to serve ads at large scale. He covers ads marketplaces and the ad delivery funnel, the ad serving architecture, and two of the main problems: ad retrieval and ranking. Finally, he discusses some of the challenges and solutions for training and serving large models.

  • Managing 238M Memberships at Netflix

    In this article, Surabhi Diwan shares how the Netflix membership team approaches distributed systems: the architecture bets, technology choices, and operational semantics that serve the needs of Netflix’s ever-growing member base.

  • Architecting for High Availability in the Cloud with Cellular Architecture

    Cellular architecture is a design pattern that helps achieve high availability in multi-tenant applications. The goal is to design your application so that you can deploy all of its components into an isolated "cell" that is fully self-sufficient. It can benefit your customers regarding availability and ensure you hit your SLAs.

  • Relational Data at the Edge: How Cloudflare Operates Distributed PostgreSQL Clusters

    Explore Cloudflare's distributed PostgreSQL clusters and learn how a cross-region architecture ensures resilience. Discover how data storage and access at the edge deliver massive performance gains by reducing location-sensitive latency and why architecting for degraded states is much harder than for failure states.

  • AWS Lambda under the Hood

    Mike Danilov, a senior principal engineer at AWS, presented on AWS Lambda and what is under the hood at QCon San Francisco 2023. This article is based on that talk. It starts with an introduction to Lambda that outlines the key concepts and fundamentals of the service, then uses them as the basis for a deep dive into how the system works.

  • Generative AI: Shaping a New Future for Fraud Prevention

    This article explores how generative AI affects fraud detection by reducing false positives and dynamically adapting to changing fraud patterns. This combination offers a potent preventive solution when integrated with machine learning. The efficacy and scalability of fraud prevention initiatives are enhanced by this innovative approach.

  • Multiplying Engineering Productivity in the Face of Constant Change

    Leaders know that they need to support organizational efficiency, but how they do that can vary based on the situation and context of the organization. Learning how to utilize a few key leadership frameworks in certain environments can help maintain the appropriate level of support, urgency, and independence when situations shift.

  • Understanding Architectures for Multi-Region Data Residency

    This article focuses on implementing data residency strategies for a positive stakeholder experience. It underscores the need to diversify data locations, driven by motivations like disaster recovery and geo-redundancy. The core principle is data distribution: ensuring specific sets reside in distinct regions without overlap, a practice termed data residency.

  • Generative AI and Organizational Resilience

    Generative AI will profoundly transform communication and information sharing over the next decade, but the change will be uneven across industries and roles. Organizations should empower workers to use AI augmentation thoughtfully, while building literacy on capabilities and limits. A balanced, conscientious integration, using iterations and customer feedback, will produce the best outcomes.

  • Orchestrating Resilience: Building Modern Asynchronous Systems

    In this article, we discuss the problems we had to solve at Twilio to efficiently build a resilient and scalable asynchronous system that handles a complex workflow. We also cover the advantages we gained from adopting a workflow orchestration solution, including abstracting away state management and out-of-the-box support for retries, observability, and auditability.
