Metrics Content on InfoQ
-
Lightstep Connects Tracing and Metrics with New Change Intelligence Feature
Lightstep has released a number of improvements to their observability platform. These include native support for OpenTelemetry metrics, a new underlying time series database, and Change Intelligence, a new feature that looks to connect unusual patterns with impacting changes by bringing together system metrics and trace data.
-
Airbnb: Using Guardrails to Identify Changes with Negative Impact across Teams
Airbnb rolled out an internal Experiment Guardrails system to identify potentially negative impacts of changes across different teams. Whenever a proposed change fails any of the guardrails, it is escalated for further analysis by affected teams and stakeholders, explains Airbnb data scientist Tatiana Xifara.
-
What's New in MicroProfile 4.0
Delivered under the newly formed MicroProfile Working Group, the much-anticipated release of MicroProfile 4.0 was made available to the Java community. Features include alignment with Jakarta EE 8 and updates to all APIs. The standalone APIs remain unchanged. MicroProfile 4.0 was delivered with incompatible changes to five of the APIs, namely Config, Fault Tolerance, Health, Metrics and OpenAPI.
-
Logz.io Extends Monitoring Platform with Hosted Prometheus and Jaeger
Logz.io recently announced the addition of Prometheus-as-a-Service to their infrastructure monitoring product. The service combines the metrics collection of Prometheus with the Logz.io platform, which includes Grafana, ELK, and the recently added Jaeger. The data correlation features in Logz.io allow metrics, traces, and logs to be connected within a single platform.
-
AWS Publishes Best Practices Guide for Operational Dashboards
AWS recently added to the Amazon Builders' Library their best practices for building dashboards for operational visibility. The document includes a detailed description of the different types of dashboards that exist at Amazon as well as a discussion of the design best practices used to create dashboards.
-
VMware Tanzu Service Mesh from a Developer's Perspective
Deepa Kalani and Ramiro Salas from the VMware team spoke at the SpringOne 2020 conference last week about the Tanzu Service Mesh product. They described how its Global Namespaces help developers implement access control and security policies, and how its visualization tools show application-centric metrics.
-
Using Measurement to Optimise Remote Work
Dave Longman of HeadForwards wrote about using data to demonstrate the effectiveness of remote work during lockdown. Michael Schrage of MIT’s School of Management also wrote about the importance of using data-driven insights to improve performance in the current climate. Laura Giurge of London Business School talked about how isolation is teaching individuals and firms to be remotely effective.
-
Metrics Collection at Scale: Learning from Uber's M3
In a recent InfoQ podcast, Rob Skillington, co-founder and CTO at Chronosphere, shared his experience and opinions on the topic of observability in modern distributed systems. Key topics covered: metrics collection at scale, multi-dimensional metrics and high-cardinality, the importance of the developer experience, and the value of open standards, such as OpenMetrics.
-
Predicting the Future with Forecasting and Agile Metrics
Common estimation approaches often fail to give us the predictability we want. Forecasting instead provides a range of possible outcomes, each with the likelihood that it will become reality. It can answer questions like “When will it be done?” or “What can we deliver by xx?” with a stated level of confidence.
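One common way to produce such a range is a Monte Carlo simulation over past throughput. The summary does not say which technique the article uses, so the following is only a minimal sketch under assumptions: the function names, the sample history, and the backlog size are all hypothetical and chosen for illustration.

```python
import random


def forecast_weeks(backlog, weekly_throughput, runs=10_000, seed=0):
    """Simulate how many weeks it takes to finish `backlog` items.

    Each simulated week draws a random value from the observed weekly
    throughput history (hypothetical data, see below).
    """
    if not any(t > 0 for t in weekly_throughput):
        raise ValueError("history needs at least one week with completed items")
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    results = []
    for _ in range(runs):
        remaining, weeks = backlog, 0
        while remaining > 0:
            remaining -= rng.choice(weekly_throughput)
            weeks += 1
        results.append(weeks)
    return sorted(results)


def percentile(sorted_results, p):
    """Return the value below which roughly p percent of the runs fall."""
    index = min(len(sorted_results) - 1, int(p / 100 * len(sorted_results)))
    return sorted_results[index]


# Hypothetical history: items completed in each of the last six weeks.
history = [3, 5, 4, 6, 2, 5]
outcomes = forecast_weeks(backlog=40, weekly_throughput=history)

for p in (50, 85, 95):
    print(f"{p}% of simulated runs finished within {percentile(outcomes, p)} weeks")
```

Reading the result at several percentiles, rather than as a single number, is what turns "When will it be done?" into an answer with an explicit confidence level.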
-
What's New in MicroProfile 3.3
The Eclipse Foundation released MicroProfile 3.3 featuring updates to five APIs - Rest Client, Config, Fault Tolerance, Metrics and Health. Other improvements include clarifications and enhancements to specifications and documentation, improved integration among all the MicroProfile APIs, interoperability across different MicroProfile implementations, and a complete set of artifacts for each API.
-
Elastic Stack 7.6 Released with Security, Performance, and Observability Improvements
Elastic announced the release of Elastic Stack 7.6. This release contains a number of security improvements including a new SIEM detection engine and a redesigned SIEM overview dashboard page. This release also includes performance improvements to queries that are sorted by date, enhanced supervised machine learning capabilities, and support for ingesting Jaeger trace data.
-
Amazon Announces AWS Firelens – a New Way to Manage Container Logs
Recently, Amazon announced a new log aggregation service called AWS Firelens. The service unifies log filtering and routing across all AWS container services including Amazon ECS, Amazon EKS, and AWS Fargate.
-
Managing eBay Vast Service Architecture Using Knowledge Graphs
Knowledge graphs describe knowledge domains based on expert input, data, and machine learning algorithms. eBay is using an application/infrastructure knowledge graph to manage its vast service architecture and provide a better experience for the roughly 200M buyers visiting the site.
-
Microsoft Announces 1.0 Release of Kubernetes-Based Event-Driven Autoscaling (KEDA)
Microsoft has announced the 1.0 version of the Kubernetes-based event-driven autoscaling (KEDA) component, an open-source project that can run in a Kubernetes cluster to provide "fine grained autoscaling (including to/from zero)" for every container. KEDA also serves as a Kubernetes Metrics Server and allows users to define autoscaling rules using a dedicated Kubernetes custom resource.
-
Scaling Graphite at Booking.com
Booking.com's engineering team scaled their Graphite deployment from a small cluster to one that handles millions of metrics per second. Along the way, they modified and optimized Graphite's core components - the carbon-relay and carbon-cache, and the rendering API.