Observability Content on InfoQ
-
InfoQ Live Virtual Event on Aug 25th: Session Spotlights and Roundtables
The inaugural InfoQ Live (Aug 25th) is a one-day virtual learning event that deep-dives into building and operating microservices and distributed systems. Discover practical strategies for the current environment that you can put into use straight away. Join world-class practitioners for inspiration, connections, and actionable ideas. See the InfoQ Live full schedule and the speaker line-up.
-
Brenda - an Artificial Intelligence Team Member
Brenda uses artificial intelligence and machine learning to monitor infrastructure, run quality assurance checks, support troubleshooting, handle alerts, communicate critical issues, and apply auto-healing. At Swiss Testing Day 2020, Sree Rama Murthy Pakkala and Collin Mendons from Swisscom will talk about Brenda, an AI/ML framework that helps their teams increase quality.
-
Metrics Collection at Scale: Learning from Uber's M3
In a recent InfoQ podcast, Rob Skillington, co-founder and CTO at Chronosphere, shared his experience and opinions on the topic of observability in modern distributed systems. Key topics covered: metrics collection at scale, multi-dimensional metrics and high-cardinality, the importance of the developer experience, and the value of open standards, such as OpenMetrics.
-
Applying Observability to Ship Faster
To get fast feedback, ship work often, as soon as it is ready, and use automated systems in the live environment to test the changes. Monitoring can verify that things are working and raise an alarm if they are not. Shipping fast in this way can result in fewer tests and make you more resilient to problems.
-
How Netlify’s Infrastructure Team Improved Observability While Increasing Deployment Speed
Netlify's infrastructure team shared their story of how they increased their customer deployment speeds by up to 2x by optimizing their deployment algorithm and increased observability into their systems in the process.
-
Splunk Launches New Release of SignalFx APM
Splunk, a platform for searching, monitoring, and examining machine-generated big data, has launched a new release of application monitoring tool SignalFx Microservices APM™. The new release combines NoSample™ tracing, open standards based instrumentation and artificial intelligence (AI)-driven directed troubleshooting from SignalFx and Omnition into a single solution.
-
Instana Launches Context Guide: Enabling Visual Navigation of Infrastructure & Services
Instana, a provider of automated application performance management (APM) solutions for microservices, has launched the Instana Context Guide, which provides GUI-based access to the company’s underlying system model, called the Dynamic Graph. Instana’s solution discovers application service components and application infrastructure, including cloud infrastructure.
-
Periskop: SoundCloud's Exception Monitoring Service
SoundCloud's engineering team wrote about their exception monitoring software called Periskop, which collects and aggregates exceptions across servers and reports to a central server for analysis.
-
Grafana Labs Announces GA of Cortex v1.0 and Discusses Architectural Changes
Grafana Labs, the company behind popular open-source monitoring projects Grafana and Loki, announced the General Availability of Cortex v1.0. Cortex is a clustered Prometheus implementation that includes features such as horizontal scalability, multi-tenancy, durability, and long-term storage.
-
Elastic Stack 7.6 Released with Security, Performance, and Observability Improvements
Elastic announced the release of Elastic Stack 7.6. This release contains a number of security improvements including a new SIEM detection engine and a redesigned SIEM overview dashboard page. This release also includes performance improvements to queries that are sorted by date, enhanced supervised machine learning capabilities, and support for ingesting Jaeger trace data.
-
Logz.io Survey Finds Tool Sprawl and Complex Architecture Key Challenges for Observability
Logz.io released their annual survey of the DevOps industry with the spotlight this year on observability. The key findings include that DevOps and observability tool sprawl is becoming an issue and complex architectures present the key challenge in implementing an observability solution. In the next year, they predict greater investment in observability with a focus on distributed tracing.
-
Reducing Build Time with Observability in the Software Supply Chain
Tools commonly used in production can also be applied to gain insight into the CI/CD pipeline and reduce build time. Ben Hartshorne, engineer at honeycomb.io, gave the presentation "Observability in the SSC: Seeing into Your Build System" at QCon San Francisco 2019.
-
AWS CloudWatch Adds Observability Tool for Visualizing Distributed Applications
AWS released ServiceLens, a fully managed observability solution built within CloudWatch. ServiceLens is designed to visualize and analyze the health, performance, and availability of distributed applications. It is currently available in all commercial regions but requires the use of AWS X-Ray.
-
Elastic Releases New Security Suite Integrating SIEM with Endpoint Protection
Elastic recently released Elastic Endpoint Protection, a new feature for integrated security built upon Elastic’s acquisition of Endgame. With Endpoint, Elastic is combining their SIEM product and endpoint security into a single solution built on the Elastic stack.
-
Managing Microservice "Deep Systems": Q&A with Ben Sigelman
InfoQ interviewed Ben Sigelman, CEO of LightStep, about managing microservice "depth" at scale.