The DevOps 2.2 Toolkit by Farcic, Viktor -- Read -- Imperial Library of Trantor

Index

Title Page Copyright

The DevOps 2.2 Toolkit

Dedication Contributor

About the author Packt is searching for authors like you

Packt Upsell

Why subscribe? PacktPub.com

Preface

Overview

Download the example code files Download the color images Conventions used

Get in touch

Introduction to Self-Adapting and Self-Healing Systems

What is a self-adaptive system? What is a self-healing system? What now?

Choosing a Solution for Metrics Storage and Query

Non-dimensional versus dimensional metrics

Graphite InfluxDB Nagios and Sensu Prometheus Which tool should we choose?

What now?

Deploying and Configuring Prometheus

Deploying Prometheus stack Designing a more dynamic monitoring solution Deploying Docker Flow Monitor Integrating Docker Flow Monitor with Docker Flow Proxy What now?

Scraping Metrics

Creating the cluster and deploying services Deploying exporters Exploring exporter metrics Querying metrics Updating service constraints Using memory reservations and limits in Prometheus What now?

Defining Cluster-Wide Alerts

Creating the cluster and deploying services Creating alerts based on metrics Defining multiple alerts for a service Postponing alerts firing Defining additional alert information through labels and annotations Using shortcuts to define alerts What now?

Alerting Humans

Creating the cluster and deploying services Setting up Alertmanager Using templates in Alertmanager configuration What now?

Alerting the System

The four quadrants of a dynamic and self-sufficient system

Self-Healing Applied to Services

Creating the cluster and deploying services Using Docker Swarm for self-healing services Is it enough to have self-healing applied to services? What now?

Self-Adaptation Applied to Services

Choosing the tool for scaling Creating the cluster and deploying services Preparing the system for alerts Creating a scaling pipeline Preventing the scaling disaster Notifying humans that scaling failed Integrating Alertmanager with Jenkins What now

Painting the Big Picture – The Self-Sufficient System Thus Far

Developer's role in the system Continuous deployment role in the system Service configuration role in the system Proxy role in the system Metrics role in the system Alerting role in the system Scheduler role in the system Cluster role in the system What now?

Instrumenting Services

Defining requirements behind service specific metrics Differentiating services based on their types Choosing instrumentation type Creating the cluster and deploying services Instrumenting services using counter Instrumenting services using gauges Instrumenting services using histograms and summaries What now?

Self-Adaptation Applied to Instrumented Services

Setting up the objectives Creating the cluster and deploying services Scraping metrics from instrumented services Querying metrics from instrumented services Firing alerts based on instrumented metrics Scaling services automatically Sending error notifications to slack What now?

Setting Up a Production Cluster

Creating a Docker for AWS cluster Deploying services Securing services Persisting state Alternatives to CloudStor volume driver Setting up centralized logging Extending the capacity of the cluster What now?

Self-Healing Applied to Infrastructure

Automating cluster setup Exploring fault tolerance What now?

Self-Adaptation Applied to Infrastructure

Creating a cluster Scaling nodes manually Creating scaling job Scaling cluster nodes automatically Rescheduling services after scaling nodes Scaling nodes when replica state is pending What now?

Blueprint of a Self-Sufficient System

Service tasks

Self-healing flow Deployment flow Reconfiguration flow Request flow Self-adaptation flow

Infrastructure tasks

Self-healing flow Request flow Self-adaptation flow

Logic matters, tools might vary What now?

Other Books You May Enjoy

Leave a review - let other readers know what you think

← Prev
Back
Next →

← Prev
Back
Next →