Operations management Content on InfoQ
-
Data Teams Survey: Lag in DataOps and Value Delivered
We report on Jesse Anderson's 2024 Data Teams Survey, which showed a lag in DataOps capabilities, slow LLM adoption, and a concerning decline in the value data teams are perceived to create. It called out the importance of teams that span data science, engineering, and operations capabilities. We also cover Petr Janda's recent podcast on the need for more engineering rigour to reach parity with other teams.
-
The Impact of Cloudflare's Sudden Service Change at an Online Casino
Recently, an online casino website experienced a severe disruption when Cloudflare abruptly disabled its services. Robin Dev, a systems operations engineer at the casino, provided a detailed account of the sequence of events in a blog post, shedding light on the extent of the disruption and its aftermath.
-
Public Preview of Azure Compute Fleet: Streamlining Azure Compute Capacity Management
At the annual Build conference, Microsoft announced the public preview of Azure Compute Fleet, a new service that streamlines the provisioning and management of Azure compute capacity across different virtual machine (VM) types, availability zones, and pricing models to achieve desired scale, performance, and cost.
-
Google Cloud Launches Security Command Center Enterprise
Google Cloud has launched Security Command Center (SCC) Enterprise, a cloud risk management solution that offers proactive cloud security with enterprise security operations. The solution helps customers manage and mitigate risk across multi-cloud environments and is enhanced by Mandiant expertise.
-
Arc-Enabled Servers Run Command Public Preview Feature: Remote Management for Various Environments
Microsoft recently announced Run Command, a preview feature for Azure Arc-enabled servers. The feature lets customers manage Azure Arc-enabled servers remotely and securely.
-
Intuitive Application Resource Management with myApplications in the AWS Management Console
At its re:Invent conference, AWS recently announced the general availability of myApplications. Available in the AWS Management Console, myApplications can help customers manage and monitor the cost, health, security posture, and performance of their applications on AWS more effectively.
-
OpenTelemetry Logging Marked Stable: Morgan McLean at KubeCon NA
Logging is a core capability of applications today. OpenTelemetry (OTel) has stabilized logging as another available signal within the project. OTel Logging offers improvements to traditional logging.
-
AWS Introduces Amazon Route 53 Resolver on AWS Outposts Rack
AWS recently announced that Amazon Route 53 Resolver is now available on AWS Outposts rack, providing on-premises services and applications with local Domain Name System (DNS) resolution directly from Outposts. Local Route 53 Resolver endpoints also enable DNS resolution between Outposts and on-premises DNS servers.
-
Platform Engineering, DevOps, and Cognitive Load: a Summary of Community Discussions
Operations engineering is moving in the direction of platform engineering, according to Charity Majors, CTO at Honeycomb. Majors sees platform teams tending to work higher up the stack than operations, DevOps, and SRE teams do. This shift enables organizations to focus their limited development resources on their core product to drive maximum business value.
-
How AI Supports IT Operators to Resolve Issues Faster and Keep Systems Running
AIOps equips IT teams with algorithms that use historical data to speed up evaluation and remediation and to surface actionable insights, without the need to solicit feedback from users directly. AI can help IT operators work smarter, resolve issues faster, and keep systems up and running to deliver a great end-user experience.
-
AWS Launches a New Console Home Page to Manage Cloud Resources
Recently, AWS launched a new home page for the AWS Management Console. From the home page, customers can reach each service console, and it offers a single place to access the information they need to perform their AWS-related tasks.
-
NGINX Controller Application Delivery Modules Improve Health Checks and Caching Configurations
NGINX has released new versions of its NGINX Controller Application Delivery Module, a control plane solution for NGINX Plus load balancers. The new features include enhanced workload health checks, improvements to caching configuration, and instance groups.
-
Linkerd Showcases Rust in Cloud-Native Infrastructure
The Linkerd project has recently become a graduated project in the CNCF. One of the most interesting aspects of Linkerd that differentiates it from other service mesh products is the Rust-based Linkerd2-proxy. Rust has made Linkerd significantly faster and lighter than other service mesh solutions.
-
Ambassador Developer Control Plane Integrates Common Kubernetes Full Lifecycle Tooling
Ambassador Labs announced the release of their Developer Control Plane (DCP). The DCP brings together tooling to support the full development and operations of Kubernetes-based services. This includes popular Cloud Native Computing Foundation (CNCF) tools such as Argo, Telepresence, and Envoy Proxy.
-
Cloudflare Improves Automated Terraform Generation Tool
Cloudflare recently released an updated version of their cf-terraforming tool. This tool streamlines generating Terraform HCL from existing Cloudflare resources. The new release simplifies the generation process and introduces changes to better future-proof the tool.