InfoQ Software Architects' Newsletter

A monthly overview of things you need to know as an architect or aspiring architect.

Enter your e-mail address

Select your country

We protect your privacy.

InfoQ Homepage Monitoring Content on InfoQ

News

RSS Feed

Newer Older

Architecture & Design

O11ycon Discusses Benefits and Challenges of Observability

The first o11ycon provides a comprehensive look at the emerging concept of observability in software and systems which allow people to understand if things are working as expected, and to diagnose problems and identify solutions.

Dylan Schiemann
on Aug 09, 2018
DevOps

Plaid.com’s Monitoring System for 9600+ Integrations

Plaid.com has integrations with over 9600 financial institutions, and their monitoring challenges arise from the heterogeneous nature of these integrations and as well as their large number. They rebuilt their monitoring system on Kinesis, Prometheus, Alertmanager and Grafana to solve the challenges of scalability and low latency.

Hrishikesh Barua
on Aug 01, 2018
DevOps

How SendGrid Scales Its Email Delivery Systems

SendGrid, a cloud based email service, has seen its backend architecture evolve from a small Postfix installation to a system hosted on their own data-centers as well as on the public cloud. Rewriting of services in Go, a gradual move to AWS, and a distributed Ceph-based queue allows the team to hand over 40 billion emails per month.

Hrishikesh Barua
on Jul 28, 2018
DevOps

Bloomberg’s Standardization and Scaling of Its Monitoring Systems

One of the outcomes of Bloomberg’s adoption of SRE practices across its development teams is the monitoring system, backed by the Cassandra-based Metrictank time-series database, that they put in place.

Hrishikesh Barua
on Jul 21, 2018
Cloud

AWS Config Gains Cross-Account, Cross-Region Data Aggregation

Amazon Web Services (AWS) recently added the capability to aggregate compliance data produced by AWS Config rules across multiple accounts and/or regions to enable centralized auditing and governance of AWS resources. A new aggregated dashboard view displays non-compliant rules across the organization. Users can then drill down to view details about resources that are violating any rules.

Steffen Opel
on Jun 30, 2018
Architecture & Design

Observability and Microservices: The Need for Effective Tracing and Metrics

Zach Jory has written an article discussing how microservices and service mesh implementations need observability to ensure that developers can build cloud-native applications which scale and can be more easily managed. This ties into a number of articles and interviews we have spoken about over recent months too.

Mark Little
on Jun 17, 2018
DevOps

AppDynamics Launches New European Software-as-a-Service Offering

Application intelligence vendor, AppDynamics, has launched a new European Software-as-a-Service (SaaS) offering, built on the Amazon Web Services (AWS) EU (Frankfurt) Region.

Helen Beal
on Jun 15, 2018
DevOps

Understanding Production with DevOps Archeology

Lee Fox spoke at Continuous Lifecycle London about tools and methods to help make sense of today’s complex systems and infrastructure; he calls it DevOps archeology.

Manuel Pais
on Jun 14, 2018
DevOps

Building Observable Distributed Systems

Today's systems are more and more complex; microservices distributed over the network and scaling dynamically, resulting in many more ways of failure, ways we can't always predict. Investing in observability gives us the ability to ask questions to systems, things we never thought about before. Some of the tools that can be used for this are metrics, tracing, structured and correlated logging.

Ben Linders
on Jun 12, 2018
DevOps

Thanos - a Scalable Prometheus with Unlimited Storage

The Improbable engineering team open sourced Thanos, a set of components that adds high availability to Prometheus installations by cross-cluster federation, unlimited storage and global querying across clusters.

Hrishikesh Barua
on Jun 09, 2018
DevOps

AppDynamics Extends Business Transaction Tracing to SAP Environments

AppDynamics, an application intelligence and performance management vendor owned by Cisco, has announced the availability of AppDynamics for SAP. New ABAP code-level monitoring provides visibility of customer experiences, from digital touch-points through mission-critical SAP business applications, from code-level insights to customer taps, swipes and clicks.

Helen Beal
on May 31, 2018
DevOps

Google's Stackdriver Monitoring Announces Better Support for Kubernetes Deployments

At the recently concluded KubeCon, Google announced the beta release of Stackdriver monitoring for Kubernetes. The key features include central visibility of Kubernetes-orchestrated container metrics and logs along with other metrics in the existing Stackdriver dashboard, and better Prometheus support.

Hrishikesh Barua
on May 19, 2018
Culture & Methods

Automating Visual Testing with Appraise

Developing applications where the look and feel is key for success might help to automate visual testing. Appraise, an open source tool on Github licensed under MIT, applies the approach of specification by example to visuals. It helps teams to review and approve changes to web pages using visual inspection.

Ben Linders
on May 03, 2018
DevOps

What It Means to Be a Site Reliability Engineer According to a Survey from Catchpoint

Site Reliability Engineering intersects software engineering with IT Operations and is an approach created at Google in 2003 and described in detail in their 2016 book, Site Reliability Engineering, How Google Runs Production Systems. Digital experience intelligence provider, Catchpoint, surveyed 416 Site Reliability Engineers (SREs) with the goal of understanding what it means to be a SRE.

Helen Beal
on Apr 13, 2018
DevOps

Monitoring Microservices at Scale at Crisp

Crisp’s engineering team shared their experience in monitoring their microservices stack. Vigil, their open sourced project in Rust, is a set of pull/push probes to collect health data with support for multiple languages, a status dashboard and integration with some external alerting tools.

Hrishikesh Barua
on Mar 24, 2018

Newer News

Older News

InfoQ Software Architects' Newsletter

Login with:

Don't have an InfoQ account?

News