Log In
Or create an account ->
Imperial Library
Home
About
News
Upload
Forum
Help
Login/SignUp
Index
Title Page
Copyright
The DevOps 2.2 Toolkit
Dedication
Contributor
About the author
Packt is searching for authors like you
Packt Upsell
Why subscribe?
PacktPub.com
Preface
Overview
Download the example code files
Download the color images
Conventions used
Get in touch
Introduction to Self-Adapting and Self-Healing Systems
What is a self-adaptive system?
What is a self-healing system?
What now?
Choosing a Solution for Metrics Storage and Query
Non-dimensional versus dimensional metrics
Graphite
InfluxDB
Nagios and Sensu
Prometheus
Which tool should we choose?
What now?
Deploying and Configuring Prometheus
Deploying Prometheus stack
Designing a more dynamic monitoring solution
Deploying Docker Flow Monitor
Integrating Docker Flow Monitor with Docker Flow Proxy
What now?
Scraping Metrics
Creating the cluster and deploying services
Deploying exporters
Exploring exporter metrics
Querying metrics
Updating service constraints
Using memory reservations and limits in Prometheus
What now?
Defining Cluster-Wide Alerts
Creating the cluster and deploying services
Creating alerts based on metrics
Defining multiple alerts for a service
Postponing alerts firing
Defining additional alert information through labels and annotations
Using shortcuts to define alerts
What now?
Alerting Humans
Creating the cluster and deploying services
Setting up Alertmanager
Using templates in Alertmanager configuration
What now?
Alerting the System
The four quadrants of a dynamic and self-sufficient system
Self-Healing Applied to Services
Creating the cluster and deploying services
Using Docker Swarm for self-healing services
Is it enough to have self-healing applied to services?
What now?
Self-Adaptation Applied to Services
Choosing the tool for scaling
Creating the cluster and deploying services
Preparing the system for alerts
Creating a scaling pipeline
Preventing the scaling disaster
Notifying humans that scaling failed
Integrating Alertmanager with Jenkins
What now
Painting the Big Picture – The Self-Sufficient System Thus Far
Developer's role in the system
Continuous deployment role in the system
Service configuration role in the system
Proxy role in the system
Metrics role in the system
Alerting role in the system
Scheduler role in the system
Cluster role in the system
What now?
Instrumenting Services
Defining requirements behind service specific metrics
Differentiating services based on their types
Choosing instrumentation type
Creating the cluster and deploying services
Instrumenting services using counter
Instrumenting services using gauges
Instrumenting services using histograms and summaries
What now?
Self-Adaptation Applied to Instrumented Services
Setting up the objectives
Creating the cluster and deploying services
Scraping metrics from instrumented services
Querying metrics from instrumented services
Firing alerts based on instrumented metrics
Scaling services automatically
Sending error notifications to slack
What now?
Setting Up a Production Cluster
Creating a Docker for AWS cluster
Deploying services
Securing services
Persisting state
Alternatives to CloudStor volume driver
Setting up centralized logging
Extending the capacity of the cluster
What now?
Self-Healing Applied to Infrastructure
Automating cluster setup
Exploring fault tolerance
What now?
Self-Adaptation Applied to Infrastructure
Creating a cluster
Scaling nodes manually
Creating scaling job
Scaling cluster nodes automatically
Rescheduling services after scaling nodes
Scaling nodes when replica state is pending
What now?
Blueprint of a Self-Sufficient System
Service tasks
Self-healing flow
Deployment flow
Reconfiguration flow
Request flow
Self-adaptation flow
Infrastructure tasks
Self-healing flow
Request flow
Self-adaptation flow
Logic matters, tools might vary
What now?
Other Books You May Enjoy
Leave a review - let other readers know what you think
← Prev
Back
Next →
← Prev
Back
Next →