Monitoring Content on InfoQ
-
Amazon Introduces CloudWatch Cross-Account Alarms to Consolidate Management
Amazon CloudWatch recently announced cross-account alarms, a new feature that enables customers to set alerts and take actions based on changes to metrics across different AWS accounts.
-
Need Help Tracking Cloud Emissions? Microsoft Previews Microsoft Cloud for Sustainability
At the recent Inspire 2021 conference, Microsoft announced the preview of Microsoft Cloud for Sustainability, a new service to help companies measure and manage their carbon emissions, set sustainability goals and take measurable action.
-
BasisAI Open Sources Boxkite Machine Learning Monitoring Tool
Boxkite is an open source instrumentation library designed to track concept drift in highly available model servers. It integrates with DevOps tools such as Grafana, Prometheus, Fluentd and Kubeflow, and scales horizontally to multiple replicas without needing changes to code or infrastructure. The project claims to be fast, correct and simple.
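As a rough illustration of what tracking drift can involve, one common proxy is to compare the distribution of a model input (or prediction) at training time with the same distribution at serving time. The sketch below is not Boxkite's API: the function names, bucket edges, and sample values are hypothetical, and KL divergence is only one of several measures that could be used.

```python
import math
from bisect import bisect_right


def histogram(values, edges):
    """Return normalized bucket frequencies, with add-one smoothing."""
    # One bucket below the first edge, one above the last, and one between
    # each pair of edges. Starting at 1 avoids log(0) for empty buckets.
    counts = [1] * (len(edges) + 1)
    for v in values:
        counts[bisect_right(edges, v)] += 1
    total = sum(counts)
    return [c / total for c in counts]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions over the same buckets."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))


# Hypothetical feature values; real systems would stream these from logs or metrics.
edges = [0.0, 1.0, 2.0, 3.0]
training = [0.5, 1.5, 1.2, 2.5, 0.7, 1.9]
serving_same = list(training)
serving_shifted = [v + 2.0 for v in training]

baseline = histogram(training, edges)
drift_none = kl_divergence(histogram(serving_same, edges), baseline)
drift_shifted = kl_divergence(histogram(serving_shifted, edges), baseline)
```

A divergence near zero suggests serving traffic resembles the training data; a growing value is a signal worth alerting on, for example through a dashboard threshold.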
-
Artificial Intelligence for IT Operations: an Overview
Artificial intelligence for IT operations (AIOps) combines sophisticated methods from deep learning, data stream processing, and domain knowledge to analyse infrastructure data from internal and external sources, automate operations, and detect anomalies (unusual system behavior) before they impact the quality of service.
-
AWS Announces the General Availability of Lambda Extensions
AWS recently announced the general availability (GA) of Lambda extensions, which allow users to integrate Lambda functions with their preferred tooling for monitoring, observability, security, and governance. The GA release includes performance improvements and an expanded set of partners.
-
Cloudflare Grows Its Serverless Ecosystem with New Observability Integrations
Cloudflare recently announced several new observability integrations for Workers, its serverless product. In a blog post, the company described integrations with six vendors: Sentry, New Relic, Datadog, Splunk, Sumo Logic, and Honeycomb. The new capabilities are a boost to the Workers ecosystem and will appeal to enterprises that already use these observability vendors.
-
OpenTelemetry Moves Python and Swift Tracing API/SDKs to 1.0
OpenTelemetry released version 1.0 of the Python and Swift distributed tracing API and SDK. They both include OpenTelemetry API support, SDKs, exporters to common telemetry formats, and getting started materials. The Python release is considered stable whereas the Swift release is still in beta.
-
Grafana Labs Changes Licenses to AGPLv3 for Grafana, Loki, and Tempo
Grafana Labs has recently announced the plan to change the licenses for their core products. They will relicense Grafana, Grafana Loki, and Grafana Tempo from the Apache License 2.0 to the Affero General Public License (AGPL) v3. Plugins, agents, and certain libraries will remain Apache-licensed.
-
Announcing Refinery by Honeycomb: a Trace-Aware Sampling Proxy
Honeycomb, the company providing observability tools, recently announced Refinery. Supporting multiple sampling methods by default, Refinery runs on the customer's infrastructure to provide critical debugging data.
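To make "trace-aware" concrete: the sampling decision is taken once per trace rather than per span, so a kept trace is never missing pieces. The sketch below is a generic illustration under assumptions, not Refinery's implementation or configuration. The span dictionaries, the `error` field, and the `keep_trace` and `sample` helpers are all hypothetical.

```python
import hashlib
from collections import defaultdict


def keep_trace(trace_id, spans, sample_rate):
    """Decide once per trace: keep any trace with an error, else about 1 in sample_rate."""
    if any(span.get("error") for span in spans):
        return True
    # Hashing the trace ID makes the decision deterministic for a given trace.
    digest = hashlib.sha256(trace_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % sample_rate == 0


def sample(spans, sample_rate=10):
    """Group spans by trace ID, then keep or drop each trace as a whole."""
    traces = defaultdict(list)
    for span in spans:
        traces[span["trace_id"]].append(span)
    kept = []
    for trace_id, group in traces.items():
        if keep_trace(trace_id, group, sample_rate):
            kept.extend(group)
    return kept


# Hypothetical spans: trace "a" contains an error, trace "b" does not.
spans = [
    {"trace_id": "a", "name": "root"},
    {"trace_id": "a", "name": "db", "error": True},
    {"trace_id": "b", "name": "root"},
]
kept = sample(spans, sample_rate=10)
```

Because the rule looks at every span in a trace before deciding, interesting traces (here, ones with errors) survive in full while routine traffic is thinned out.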
-
Cheryl Hung on Trends in Cloud Native and DevOps for 2021
In a recent keynote for The DEVOPS Conference, Cheryl Hung, VP of ecosystem at the Cloud Native Computing Foundation (CNCF), shared her top 10 predictions for cloud native in the upcoming year. These include improvements in cross-cloud support, growth in GitOps and chaos engineering practices, and an increase in the adoption of FinOps.
-
AWS Releases Health Aware Providing Automated Health Alerts for Accounts
AWS recently announced the release of AWS Health Aware (AHA), an incident management and communications framework. AHA is an automated notification tool that sends AWS Health Alerts to a variety of endpoints. AHA is able to integrate with AWS Organizations to provide aggregated alerts across all accounts within the organization.
-
Microsoft Announces Azure Monitor SQL Insights for Azure SQL in Public Preview
Recently Microsoft announced Azure Monitor SQL Insights for Azure SQL in public preview. With the preview, customers will get a flexible canvas for telemetry collection, analysis, and rich custom visualization.
-
PagerDuty Adds AWS DevOps Guru and Microsoft Teams Integrations
PagerDuty has released a number of updates and enhancements to its incident response platform. These include new integrations with Amazon DevOps Guru, AWS Control Tower, and Microsoft Teams. Other enhancements include better mapping of failures back to changes, automatic triggers, and content-based alert grouping.
-
Grafana Adds Enterprise Logs to Its Managed Observability Stack
Grafana Labs recently released a new log aggregation module for Grafana Enterprise Stack, its commercial observability platform. Grafana Enterprise Logs ingests and stores logs from applications and other components. When using the module with other components of the Grafana Enterprise Stack, users can configure dashboards to display log data alongside metrics.
-
OpenTelemetry Announces Roadmap for Metrics Specification
The OpenTelemetry project announced a roadmap for its metrics specification. The roadmap includes a stable metrics API/SDK, a metrics data model and protocol, and compatibility with Prometheus.