InfoQ Homepage Cloud Computing Content on InfoQ
-
Google Cloud Launches Sixth Generation Trillium TPUs: More Performance, Scalability and Efficiency
Google Cloud's Trillium, its sixth-generation TPU, is now available. It enhances AI workloads with unmatched performance and 67% better energy efficiency. Integral to the AI Hypercomputer, Trillium boasts training speeds over 4x faster and triples inference throughput. This leap positions Google as a contender against Nvidia in the AI data center market.
-
KubeCon + CloudNativeCon NA 2024: Yahoo’s Kubernetes Journey On-Prem to Multi-Cloud
At KubeCon+CloudNativeCon North America 2024, Nandhakumar Venkatachalam and Payal Patel shared Yahoo's Kubernetes journey from on-premises to multi-cloud at scale, underscoring challenges faced and lessons learned during this transition.
-
Anna Berenberg Talks at QCon San Francisco on Google's One Network
Anna Berenberg, an Engineering Fellow at Google Cloud, unveiled One Network, a cloud-agnostic architecture that simplifies complex interconnected systems. Unifying disparate environments and leveraging open-source technologies enhances operational efficiency and consistency in security policies, empowering developers to focus on service endpoints while ensuring seamless platform integration.
-
Vercel Expands AI Toolkit with AI SDK 4.0 Update
Vercel has announced version 4.0 of its open-source AI SDK toolkit designed for building AI applications in JavaScript and TypeScript. The update introduces key features like PDF support, computer use integration, and a new xAI Grok API.
-
First Google Axion Processor Now Available: Claims Best Performance in Cloud Market
Google has announced the general availability of its C4A virtual machines, marking the debut of Axion-based instances. The cloud provider claims these instances deliver up to 10% better price-performance compared to the latest Arm-based alternatives from competitors, including Amazon Graviton4.
-
How Allegro Reduced the Cost of Running a GCP Dataflow Pipeline by 60%
Allegro achieved significant savings for one of the Dataflow Pipelines running on GCP Big Data. The company continues working on improving the cost-effectiveness of its data workflows by evaluating resource utilization, enhancing pipeline configurations, optimizing input and output datasets, and improving storage strategies.
-
Microsoft Unveils Azure Cobalt 100-Based Virtual Machines: Enhanced Performance and Sustainability
Microsoft's Azure Cobalt 100 VMs are now generally available. They deliver up to 50% improved price performance with energy-efficient Arm architecture. Tailored for diverse workloads, these VMs offer various configurations, including general-purpose and memory-optimized options. Their release supports sustainable computing, aligning with Microsoft's commitment to lower carbon footprints.
-
System Initiative Launches DevOps Platform to Address Cloud Stack Drift
System Initiative aims to tackle some long-standing problems in conventional industry practices for deploying and managing cloud infrastructure. Specifically, existing solutions requiring static configurations for cloud infrastructure cannot reliably reflect fast-changing and dynamic cloud services, leading to "stack drift": unpredictability when engineers apply changes to infrastructure.
-
AWS CodeBuild Now Supports Mac Builds
Amazon has recently announced that AWS CodeBuild, its managed build service, now supports building applications on macOS. However, due to Apple's licensing requirements, developers must still reserve a dedicated macOS fleet to utilize this new option.
-
How AWS Well-Architected Framework Supports Frugal Architecture
AWS shared how to deliver frugal architecture using Well-Architectured Framework guidance. The authors outline areas of alignment between the WAF and the Frugal Architect laws Dr. Werner Vogels introduced in his re:Invent 2023 keynote. By implementing the frugal approach, organizations can achieve sustainability and cost optimization goals for their cloud architectures.
-
Google Cloud Launches C4 Machine Series: High-Performance Computing and Data Analytics
Google Cloud recently announced the general availability of its new C4 machine series, powered by 4th Gen Intel Xeon Scalable Processors (Sapphire Rapids). The series offers a range of configurations tailored to meet the needs of demanding applications such as high-performance computing (HPC), large-scale simulations, and data analytics.
-
How Amazon Aurora Serverless Manages Resources and Scaling for Fleets of 10K+ Instances
AWS engineers published a paper describing the evolution and latest design of resource management and scaling for the Amazon Aurora Serverless platform. Aurora Serverless uses a combination of components at different levels to create a holistic approach for dynamically scaling and adjusting resources to satisfy the needs of customer workloads.
-
Increased Popularity of Artist Platform Cara Led to Substantial Vercel Functions Expenses
Cara's web and mobile app, a platform for artists, combines the experience of Instagram and X. It provides a space for users to showcase their portfolios and share updates on their feeds. The app's serverless backend, powered by Vercel Functions, recently incurred a significant cost of $ 98,280 due to a sudden surge in usage.
-
People, Planet, Cloud and AI: Key Takeaways from QCon London
This year’s QCon London brought a wealth of talks directly or indirectly related to software architecture, ranging from the rise of AI to more established areas like anything cloud-related to the usual classics like architecture quality traits . The conference also featured many talks about sociotechnical aspects of software architecture and engineering and broadly considered sustainability.
-
Confluent Cloud for Apache Flink is Now Generally Available with AI Features
Confluent announced last month the general availability (GA) of Confluent Cloud for Apache Flink. This fully-managed service enables real-time data processing and the creation of high-quality, reusable data streams. The service is available across Amazon Web Services (AWS), Google Cloud, and Microsoft Azure.