Operations Management Content on InfoQ
-
Consul-Terraform-Sync Enables Automating of Common Networking Tasks
HashiCorp has moved Consul-Terraform-Sync (CTS) into full general availability. CTS allows tasks to be defined as Terraform modules that run as services are added to or removed from Consul. CTS is part of a solution called Network Infrastructure Automation (NIA), which focuses on automating day-two network tasks such as updating load balancer pools or firewall policies.
-
AWS Publishes Best Practices Guide for Operational Dashboards
AWS recently added its best practices for building dashboards for operational visibility to the Amazon Builders' Library. The document describes in detail the different types of dashboards used at Amazon and discusses the design best practices used to create them.
-
Microsoft Introduces the Azure Well-Architected Framework
In a recent blog post, Microsoft introduced the Azure Well-Architected Framework, which provides customers with a set of Azure architecture best practices to help them build and deliver well-designed solutions.
-
Improving Incident Management through Role Assignments and Game Days
John Arundel, principal consultant at Bitfield Consulting, shared his thoughts on how to ensure incidents are handled smoothly and quickly. He suggests assigning specific roles to each team member responding to the incident. Red team versus blue team exercises can also be leveraged to ensure the team is prepared to respond accurately and quickly.
-
Failure Modes and Building Resilient Systems: Adrian Cockcroft at QCon SF
Adrian Cockcroft recently shared his thoughts on how to produce resilient systems that operate successfully despite failures. At the recent QCon San Francisco event, he also shared what he considers good cloud resilience patterns for building with a continuous resilience mindset.
-
DataOps and Operations-Centric Data Architecture
Eric Estabrooks from DataKitchen spoke at the Data Architecture Summit 2019 conference about how DevOps tasks should be managed for data architecture. DataOps is a collaborative data management practice and is emerging as an area of interest in the industry.
-
OpsRamp Releases Improved Alert Correlation and Better Insights into Event Management Models
OpsRamp, a SaaS platform for datacenter operations management, announced its Fall 2019 release, which includes a number of enhancements to its machine learning models for intelligent event management and correlation. This release also includes multi-cloud infrastructure monitoring capabilities, synthetic monitoring, and a custom integration framework.
-
Bringing VMware Environments to Azure, Microsoft and VMware Establish Partnership
At the recent Dell Technologies World conference, Microsoft and VMware announced an expanded partnership that enables certified VMware cloud infrastructure to run in Microsoft Azure. The Microsoft first-party capability is made possible through a solution provided by CloudSimple, a VMware certified partner, and is officially called Azure VMware Solution by CloudSimple.
-
Infrastructure Automation Company Chef Commits to Open Source
Chef, an infrastructure automation company, has committed to developing all of its software as open source under the Apache 2.0 license.
-
OpsRamp Announces Improved Service Centricity, AIOps and Cloud Monitoring
OpsRamp, a service-centric AIOps software-as-a-service (SaaS) platform for the hybrid enterprise, has announced new topology maps, enhanced artificial intelligence for IT operations (AIOps) features and new monitoring capabilities for cloud native workloads.
-
Amazon Introduces CloudFormation Drift Detection
In a recent blog post, Amazon announced CloudFormation Drift Detection, which organizations can use to automate configuration consistency across AWS cloud resources. The feature allows organizations that have templated their configurations and deployments, known as stacks, to detect when configuration drift occurs from out-of-band changes.
-
Amazon Releases a New Session Manager in AWS Systems Manager
Amazon released a new Session Manager in AWS Systems Manager. Session Manager provides a new way of gaining shell-level access to EC2 instances. IT administrators can now use a browser-based interactive shell and a command-line interface (CLI) to manage their Windows and Linux instances.
-
IT Operations Is the Most Predictable DevOps Differentiator Says Damon Edwards at DOES18 London
InfoQ spoke to Damon Edwards, co-founder and chief product officer at Rundeck, at DevOps Enterprise Summit London about his talk ‘Operations - The Last Mile Problem for DevOps in the Enterprise’ and the sneak preview of the new version of Rundeck, V3.0.
-
Avoiding Alerts Overload from Microservices: Sarah Wells at QCon London
At QCon London, Sarah Wells presented “Avoiding Alerts Overload from Microservices”, and cautioned that developers and operators must fundamentally change the way they think about monitoring when building a microservice system. Key takeaways included: build a system that can be supported; focus on ‘stuff that matters’ when creating monitoring and alerts; and cultivate and improve alerts.
-
Operational Data Stream and Batch Processing at Netflix with Mantis