InfoQ Software Architects' Newsletter

A monthly overview of things you need to know as an architect or aspiring architect.

Enter your e-mail address

Select your country

We protect your privacy.

InfoQ Homepage Monitoring Content on InfoQ

Articles

RSS Feed

Newer Older

Culture & Methods

How to Fight Climate Change as a Software Engineer

We need to reduce and eliminate greenhouse gas emissions to stop climate change. But what role does software play, and what can software engineers do? Let’s take a look under the hood to uncover the relationship between greenhouse gas emissions and software, learn about the impact that we can have, and identify concrete ways to reduce emissions when creating and running software.

Martin Lippert
on May 09, 2022
Culture & Methods

Chaos Engineering and Observability with Visual Metaphors

This article introduces a new actor for visualising chaos engineering and observability: metaphors. It provides the conceptual foundations of chaos engineering and observability, presents a state of art of visualisation techniques available in the market and shows how treemaps, gauge charts, geocentric and city metaphors can enrich the spectrum of the visual strategies to observe the chaos.

Yury Niño Roa
on May 02, 2022
DevOps

How to Best Use MTT* Metrics to Optimize Your Incident Response

Selecting the correct MTT* metric to improve your incident response is important. If the wrong metric is chosen, the improvements may get lost in the noise of a multivariable equation. This article reviews the various MTT* metrics available and discusses the best scenarios for selecting each one.

Alex Ewerlöf
on Mar 17, 2022
DevOps

Why Change Intelligence is Necessary to Effectively Troubleshoot Modern Applications

Change Intelligence is often a missing component in incident management. Successfully correlating monitoring and observability data to arrive allows engineers to arrive at the root cause more rapidly. Telemetry provides the building blocks that enable change intelligence to identify and map the root cause, based on changes in the system and their broader impact.

Mickael Alliel
on Jan 24, 2022
DevOps

Why the Future of Monitoring Is Agentless

Traditionally, monitoring software has relied heavily on agent-based approaches for extracting telemetry data from systems. Observability requires better telemetry than agents currently provide. OpenTelemetry is driving advances in this area by creating a standard format and APIs to create, transmit, and store telemetry data. This unlocks new opportunities in observability.

Austin Parker
on Oct 15, 2021
Architecture & Design

How Unnecessary Complexity Gave the Service Mesh a Bad Name

There is immense value in adopting a service mesh, but it must be done in a lightweight manner to avoid unnecessary complexity. Take a pragmatic approach when implementing a service mesh by aligning with the core features of the technology, such as standardized monitoring and smart routing, and watching out for distractions.

Chris Campbell
on Sep 28, 2021
Culture & Methods

Improving Speed and Stability of Software Delivery Simultaneously at Siemens Healthineers

In this article, we focus on the software delivery process at Siemens Healthineers Digital Health. The process is subject to strict regulations valid in the medical industry. We show our journey of transforming the process towards speed and stability. Both measures improved at the same time during the transformation, confirming research from the “Accelerate” book.

Vladyslav Ukis
on Aug 24, 2021
DevOps

DevOps and Cloud InfoQ Trends Report - July 2021

This article summarizes how we see the "cloud computing and DevOps" space in 2021, which focuses on fundamental infrastructure and operational patterns, the realization of patterns in technology frameworks, and the design processes and skills that a software architect or engineer must cultivate.

Matt Campbell Steef-Jan Wiggers Shaaron A Alvares Helen Beal Daniel Bryant Lena Hall Rupert Field Aditya Kulkarni Jared Ruckle Renato Losio Holly Cummins
on Jul 19, 2021
Cloud

Solving Mysteries Faster with Observability

At QCon plus, a virtual conference for senior software engineers and architects covering the trends, best practices, and solutions leveraged by the world's most innovative software organizations, Elizabeth Carretto discussed observability at Netflix and how their internal tool, Edgar, comes into play.

Elizabeth Carretto
on Jun 30, 2021
DevOps

Using the Plan-Do-Check-Act Framework to Produce Performant and Highly Available Systems

The PDCA (plan-do-check-act) framework can be used to outline the performance, availability, and monitoring to enable teams to ensure performant and highly available applications. These include infrastructure design and setup, application architecture and design, coding, performance testing, and application monitoring.

Kulkarni Girish
on Jun 09, 2021
Cloud

Cloud Native and Kubernetes Observability: Expert Panel

InfoQ recently caught up with Observability experts to discuss several topics including fundamental questions about what Observability really entails, the misconceptions and challenges that the users are facing, the open standards that are influencing the industry in general and why there is more interest in this area off late.

Rags Srinivas Liz Fong-Jones Bartłomiej Plotka Josh Suereth Frederic Branczyk
on May 06, 2021
Culture & Methods

Site Reliability Engineering Experiences at Instana

With the popularity of distributed architectures, distributed databases, containers and container orchestrators, an approach that emphasizes automation and a culture of collaboration is a natural fit for modern day operations. Site Reliability Engineering takes engineering practices that have been established and proven in software engineering and applies them to the field of operations.

Bastian Spanneberg
on Apr 29, 2021

Newer Articles

Older Articles

InfoQ Software Architects' Newsletter

Login with:

Don't have an InfoQ account?

Articles