InfoQ Homepage Cloud Architecture Content on InfoQ
-
QCon London: Scaling Microservices Architecture and Technology Organization at Trainline
During the recent QCon London conference, Trainline’s CTO spoke about the evolution of the company’s system architecture and organizational structure over the last five years. The company had to adapt to market changes and growing customer expectations by improving the performance and reliability of its technology platform.
-
Netflix Uses Metaflow to Manage Hundreds of AI/ML Applications at Scale
Netflix recently published how its Machine Learning Platform (MLP) team provides an ecosystem around Metaflow, an open-source machine learning infrastructure framework. By creating various integrations for Metaflow, Netflix already has hundreds of Metaflow projects maintained by multiple engineering teams.
-
Uber's CacheFront: Powering 40M Reads per Second with Significantly Reduced Latency
Uber developed an innovative caching solution, CacheFront, for its in-house distributed database, Docstore. CacheFront enables over 40M reads per second from online storage and achieves substantial performance improvements, including a 75% reduction in P75 latency and over 67% reduction in P99.9 latency, demonstrating its effectiveness in enhancing system efficiency and scalability.
-
Slack Migrates to Cell-Based Architecture on AWS to Mitigate Gray Failures
Slack migrated most of the critical user-facing services from a monolithic to a cell-based architecture over the last 1.5 years. The move was triggered by the impact of networking outages affecting a single availability zone, causing user-impacting service degradation. The new architecture allows incrementally draining all the traffic away from the affected availability zone within 5 minutes.
-
Amazon OpenSearch Zero ETL with S3 and New OR1 Instances
Amazon has announced the preview of the Amazon OpenSearch Service's zero-extraction, transformation, and loading (ETL) integration with Amazon S3, offering a novel method to analyze operational logs in Amazon S3 and S3-based data lakes without the need to switch between services. Amazon also announced the new OR1 instances for Amazon OpenSearch Service.
-
The Frugal Architect: AWS Promotes Cost Awareness for Sustainability
Dr. Werner Vogels, AWS' CTO, dedicated the first part of his re:Invent keynote to discussing the laws of frugal architectures, cloud-native architectures aiming at delivering cost-aware, sustainable, and maintainable solutions. Vogels laid out seven simple laws based on his and AWS’ experience building and evolving cloud platform services, with cost implications as one of the primary drivers.
-
Partitioned Namespaces for Azure Service Bus Premium Are Now Generally Available
During the recent Ignite conference, Microsoft announced the general availability (GA) of partitioned namespaces feature for Azure Service Bus, which allows customers to use partitioning for the premium messaging tier.
-
Microsoft Refreshes its Well-Architected Framework
Microsoft recently announced a comprehensive refresh of the Well-Architected Framework (WAF) for designing and running optimized workloads on Azure.
-
Reddit Unveils REV2: Modernised Rule-Execution with Kubernetes, Kafka, and Flink Stateful Functions
Reddit's Safety Engineering team recently published how it modernised its Rule-Execution system, which detects and acts on policy-violating content in real time. The new architecture includes improvements like transitioning from legacy EC2-based systems to Kubernetes, better rule version control with Github and S3 storage, and the capability to scale more efficiently with Flink Stateful Functions.
-
AWS Introduces Amazon Route 53 Resolver on AWS Outposts Rack
AWS recently announced that Amazon Route 53 Resolver is now available on AWS Outposts rack providing on-premises services and applications with local Domain Name Service (DNS) resolution directly from Outposts. In addition, local Route 53 Resolver endpoints also enable DNS resolution between Outposts and on-premises DNS servers.
-
Upbound Releases Managed Control Plane Service
Upbound, the creators of Crossplane, has released a fully-managed control plane based on Crossplane. Crossplane is a Cloud Native Computing Foundation incubating project that provides a low-code experience for building control planes. Upbound Managed Control Planes (MCPs) manage the control plane components and provide access via both a UI and a CLI.
-
Prime Video Switched from Serverless to EC2 and ECS to Save Costs
Prime Video, Amazon’s video streaming service, has explained how it re-architected the audio/video quality inspection solution to reduce operational costs and address scalability problems. It moved the workload to EC2 and ECS compute services, and achieved a 90% reduction in operational costs as a result.
-
AWS Updates the Well-Architected Framework
AWS recently released an update to the Well-Architected Framework. The new version strengthens prescriptive guidance with over 100 best practices updated across all six pillars of the framework.
-
From Cloud-Hosted to Cloud-Native: Rosemary Wang at QCon London
Rosemary Wang, developer advocate at HashiCorp, delivered a presentation at QCon London that focused on five key considerations for technology practitioners looking to optimize the advantages of running platforms and applications in the cloud: adaptability, observability, immutability, elasticity, and changeability.
-
Tales of Kafka at Cloudflare: Andrea Medda and Matt Boyle at QCon London
At QCon London, Andrea Medda, senior systems engineer at Cloudflare, and Matt Boyle, engineering manager at Cloudflare, shared the lessons their platform services team learned from enabling the use of Apache Kafka at the scale of 1 trillion messages.