Google Cloud Platform Content on InfoQ
-
Google Cloud Launches A4 VMs with NVIDIA Blackwell GPUs for AI Workloads
Google Cloud has launched A4 VMs, powered by NVIDIA's Blackwell B200 GPUs, which it says deliver a 2.25x performance boost for AI workloads along with advanced networking. The VMs integrate tightly with Google Kubernetes Engine and Vertex AI to simplify the deployment and management of large-scale projects.
-
Google Cloud's AI Protection: a Solution to Securing AI Assets
Google Cloud has introduced AI Protection, a solution for safeguarding against generative AI threats. It manages AI risk through vulnerability assessments, security policies, and proactive threat management. Integration with Google’s Security Command Center provides a centralized view of IT posture and advanced security intelligence for defending AI systems.
-
Google Enhances AlloyDB Vector Search with Inline Filtering and Enterprise Observability
Google has enhanced its AlloyDB service with inline filtering and enterprise observability for vector search. The fully managed, PostgreSQL-compatible database can now apply filters directly during vector search queries, improving speed and efficiency. Enhanced monitoring features provide deeper insight into query behavior, addressing the challenges of scaling vector search operations.
-
Google Cloud Introduces Quantum-Safe Digital Signatures in Cloud KMS to Future-Proof Data Security
Google has introduced quantum-safe digital signatures in its Cloud Key Management Service, following NIST post-quantum cryptography standards. The update addresses the threat that quantum computing poses to traditional cryptographic methods and lets organizations adopt quantum-resistant signatures within their existing key management workflows.
-
How Monzo Bank Built a Cost-Effective, Unorthodox Backup System to Ensure Resilient Banking
Monzo Bank recently revealed Stand-in, an independent backup system on GCP that keeps essential banking services operational during application and AWS infrastructure outages. Unlike a traditional backup, it is a minimal stand-alone system that supports only key operations, and its cost-effective design means it costs about 1% as much to run as the primary deployment.
-
Google Releases Spanner Graph into General Availability
Google Cloud's Spanner Graph is now generally available, with enhancements including Graph Notebook, GraphRAG integration, and advanced schema management. The unified database combines graph, relational, and AI capabilities with high availability and scalability. It is suited to real-time analysis in use cases such as fraud detection and recommendation engines.
-
Google Cloud Launches Sixth Generation Trillium TPUs: More Performance, Scalability and Efficiency
Google Cloud's Trillium, its sixth-generation TPU, is now available. Compared with the previous generation, Google reports 67% better energy efficiency, training speeds over 4x faster, and triple the inference throughput. Trillium is an integral part of Google's AI Hypercomputer and positions the company as a contender against NVIDIA in the AI data center market.
-
How Allegro Reduced the Cost of Running a GCP Dataflow Pipeline by 60%
Allegro cut the cost of one of its Dataflow pipelines running on GCP by 60%. The company continues to improve the cost-effectiveness of its data workflows by evaluating resource utilization, tuning pipeline configurations, optimizing input and output datasets, and improving storage strategies.
-
Scaling Uber’s Batch Data Platform: a Journey to the Cloud with Data Mesh Principles
Some months ago, Uber began migrating its batch data analytics and machine learning platform to Google Cloud Platform (GCP). In a recent post on its engineering blog, the company shared further details of the migration, which incorporates key data mesh principles.
-
Google Cloud Launches C4 Machine Series: High-Performance Computing and Data Analytics
Google Cloud recently announced the general availability of its new C4 machine series, powered by 5th Gen Intel Xeon Scalable Processors (Emerald Rapids). The series offers a range of configurations for demanding applications such as high-performance computing (HPC), large-scale simulations, and data analytics.
-
Google Cloud Enhances Spanner with Dual-Region Configuration
Google Cloud has introduced a dual-region configuration option for Spanner, its fully managed distributed SQL database service. The enhancement aims to help enterprises comply with data residency requirements in countries with limited cloud support while maintaining high availability.
-
UniSuper’s Entire Infrastructure Deleted by Internal Google Cloud Error
UniSuper, an Australian superannuation fund manager that uses Google Cloud under an Infrastructure-as-a-Service (IaaS) contract, found it had no disaster recovery (DR) recourse when its entire infrastructure subscription was deleted.
-
Gemini Code Assist to Create APIs, Integrations, and Automation Flows in Public Preview
At the Cloud Next conference, Google unveiled Gemini Code Assist, its enterprise-focused AI code completion and assistance tool, which is now available in public preview for Apigee API Management and Application Integration.
-
Building SaaS from Scratch Using Cloud-Native Patterns: a Deep Dive into a Cloud Startup
Joni Collinge, Diagrid's founding software engineer, presented at QCon London and discussed a case study on the evolutionary design and implementation of the Diagrid Cloud platform, which underpins Diagrid’s SaaS offerings.
-
Google Announces Agent Builder, Expanded Gemini 1.5, Open-Source Additions
At the Google Cloud Next 2024 event, Google announced the launch of Vertex AI Agent Builder, the public preview of Google's most advanced generative AI model, Gemini 1.5 Pro, and the addition of open-source language models to the Vertex AI platform.