Kubernetes Content on InfoQ
-
Grab Improves Kafka on Kubernetes Fault Tolerance with Strimzi, AWS AddOns and EBS
Grab updated its Kafka on Kubernetes setup to improve fault tolerance and completely eliminate human intervention in case of unexpected Kafka broker terminations. To address the shortcomings of the initial design, the team integrated with AWS Node Termination Handler (NTH), used the Load Balancer Controller for target group mapping, and switched to EBS volumes for storage.
-
Docker Build Cloud: Using Cloud Compute and Cache to Decrease Image Build Times
Docker recently announced the general availability of their cloud-based Container Image builder, Docker Build Cloud. Docker Build Cloud offers a remote shared-cache and native builders for AMD64 and ARM64 CPU architectures.
-
TikTok Owner Open-Sources Next Gen Kubernetes Federation Tool
ByteDance, the company behind popular global platforms like TikTok, has unveiled KubeAdmiral, its next-generation cluster federation system for Kubernetes, designed to manage multiple clusters with the efficiency and effectiveness comparable to a seasoned navy admiral commanding a fleet. KubeAdmiral scales to run more than 10 million pods across dozens of federated Kubernetes clusters.
-
ScaleOps Automates Rightsizing to Reduce Kubernetes Costs
ScaleOps, a startup working in the cloud resource management sector, has unveiled a fully-automated cloud-native cost saving platform. ScaleOps claims to slash cloud costs by up to 80%, with a fully-automated platform that continuously optimizes and manages cloud-native resources during runtime.
-
Kubernetes 1.29 Released with KMS V2 Improvements and nftables Support
The Cloud Native Computing Foundation (CNCF) released Kubernetes 1.29, named Mandala, last month. The latest release introduces new features such as load balancer IP mode for services, mutable pod resources for Windows containers, and nftables support for kube-proxy.
-
Hugging Face and Google Cloud Announce Collaboration
Hugging Face and Google Cloud have announced a strategic alliance to advance machine learning and open AI research. Google Cloud customers, Hugging Face Hub users, and open source are the three main focuses of the strategic partnership. Google wants to make cutting-edge AI discoveries available through Hugging Face's open-source frameworks.
-
DoorDash Uses Service Mesh and Cell-Based Architecture to Significantly Reduce Data Transfer Costs
In a recent move, DoorDash has significantly optimized its cloud infrastructure costs. The company faced increased cross-AZ data transfer costs when transitioning to a microservices architecture. To substantially reduce this cost, DoorDash implemented zone-aware routing with its Envoy-based service mesh, taking advantage of its Cell-Based Architecture.
-
Griffin 2.0: Instacart Revamps Its Machine Learning Platform
Instacart created the next-generation platform based on experiences using the original Griffin machine-learning platform. The company wanted to improve user experience and help manage all ML workloads. The revamped platform leverages the latest developments in MLOps and introduces new capabilities for current and future applications.
-
Expedia Uses WebSockets and Kafka to Query Near Real-Time Streaming Data
Expedia created a solution for querying clickstream data from its platform in near real time, so that product and engineering teams can explore live data while building new data-driven use cases and enhancing existing ones. The team used a combination of WebSockets, Apache Kafka, and PostgreSQL to stream query results continuously to users’ browsers.
-
AWS Improves Kubernetes Monitoring with New Enhanced Observability for Amazon EKS
AWS has released several monitoring and observability-themed improvements. The releases include Amazon CloudWatch Container Insights with Enhanced Observability for Amazon EKS on EC2, a fully managed service that provides visibility into containerized workloads. Other releases include a new Amazon Managed Service for Prometheus collector and Amazon Managed Grafana community plugins.
-
HashiCorp Terraform Cloud Operator for Kubernetes Improves Scalability and Performance
HashiCorp has released v2 of their Terraform Cloud Operator for Kubernetes. The Operator enables managing Terraform Cloud workspaces through a single Kubernetes custom resource. The release adds support for multiple custom resources, setting the watch scope to specific namespaces, improved synchronization, and newly exposed metrics.
-
Addressing Orphaned Pods on Netflix’s Titus Container Platform
Netflix's engineering team described how it investigated, identified, and resolved an issue with "orphaned" pods that inconvenienced engineers on Titus, shedding light on the journey from kernel panics to Kubernetes (k8s) and ultimately providing operators with the tools to understand why nodes are going away.
-
KubeCon NA 2023: Ishan Sharma on Real-Time Generative AI for Gaming Apps Running on Kubernetes
Kubernetes provides a great platform for gaming applications using generative artificial intelligence (GenAI) for both game development and gameplay. Ishan Sharma from Google spoke at the recent KubeCon CloudNativeCon NA 2023 Conference about real-time GenAI inference integrated with distributed game servers running on Kubernetes.
-
ZEDEDA Launches Managed Kubernetes for the Distributed Edge
ZEDEDA, a San Jose-based developer and provider of edge computing virtualization software, announced during KubeCon NA 2023 a managed Kubernetes solution for the edge. The service is based on a partnership with SUSE and offers a centrally-managed, turn-key User edge solution built on the open source operating system Eve and Rancher’s lightweight Kubernetes distribution K3s.
-
KubeCon NA 2023: Kubernetes Storage Platform to Run Real-Time Analytic Databases
A Kubernetes storage platform provides a portable and flexible foundation for data management to help developers build their own data solutions. Robert Hodges spoke last week at the KubeCon CloudNativeCon North America 2023 Conference about the techniques his teams developed to build their own data platform.