Log In
Or create an account ->
Imperial Library
Home
About
News
Upload
Forum
Help
Login/SignUp
Index
Preface
Who Should Read This Book
Why I Wrote This Book
A Word on Monitoring Today
Navigating This Book
Online Resources
Conventions Used in This Book
Using Code Examples
O’Reilly Safari
How to Contact Us
Acknowledgments
I. Monitoring Principles
1. Monitoring Anti-Patterns
Anti-Pattern #1: Tool Obsession
Monitoring Is Multiple Complex Problems Under One Name
Avoid Cargo-Culting Tools
Sometimes, You Really Do Have to Build It
The Single Pane of Glass Is a Myth
Anti-Pattern #2: Monitoring-as-a-Job
Anti-Pattern #3: Checkbox Monitoring
What Does “Working” Actually Mean? Monitor That.
OS Metrics Aren’t Very Useful—for Alerting
Collect Your Metrics More Often
Anti-Pattern #4: Using Monitoring as a Crutch
Anti-Pattern #5: Manual Configuration
Wrap-Up
2. Monitoring Design Patterns
Pattern #1: Composable Monitoring
The Components of a Monitoring Service
Data collection
Metrics
Logs
Data storage
Visualization
Analytics and Reporting
Alerting
Pattern #2: Monitor from the User Perspective
Pattern #3: Buy, Not Build
It’s Cheaper
You’re (Probably) Not an Expert at Architecting These Tools
SaaS Allows You to Focus on the Company’s Product
No, Really, SaaS Is Actually Better
Pattern #4: Continual Improvement
Wrap-Up
3. Alerts, On-Call, and Incident Management
What Makes a Good Alert?
Stop Using Email for Alerts
Write Runbooks
Arbitrary Static Thresholds Aren’t the Only Way
Delete and Tune Alerts
Use Maintenance Periods
Attempt Automated Self-Healing First
On-Call
Fixing False Alarms
Cutting Down on Needless Firefighting
Building a Better On-Call Rotation
Incident Management
Postmortems
Wrap-Up
4. Statistics Primer
Before Statistics in Systems Operations
Math to the Rescue!
Statistics Isn’t Magic
Mean and Average
Median
Seasonality
Quantiles
Standard Deviation
Wrap-Up
II. Monitoring Tactics
5. Monitoring the Business
Business KPIs
Two Real-World Examples
Yelp
Reddit
Tying Business KPIs to Technical Metrics
My App Doesn’t Have Those Metrics!
Finding Your Company’s Business KPIs
Wrap-Up
6. Frontend Monitoring
The Cost of a Slow App
Two Approaches to Frontend Monitoring
Document Object Model (DOM)
Frontend Performance Metrics
Navigation Timing API
Speed Index
OK, That’s Great, but How Do I Use This?
Logging
Synthetic Monitoring
Wrap-Up
7. Application Monitoring
Instrumenting Your Apps with Metrics
How It Works Under the Hood
Monitoring Build and Release Pipelines
Health Endpoint Pattern
Application Logging
Wait a Minute…Should I Have a Metric or a Log Entry?
What Should I Be Logging?
Write to Disk or Write to Network?
Serverless / Function-as-a-Service
Monitoring Microservice Architectures
Wrap-Up
8. Server Monitoring
Standard OS Metrics
CPU
Memory
Network
Disk
Load
SSL Certificates
SNMP
Web Servers
Database Servers
Load Balancers
Message Queues
Caching
DNS
NTP
Miscellaneous Corporate Infrastructure
DHCP
SMTP
Monitoring Scheduled Jobs
Logging
Collection
Storage
Analysis
Wrap-Up
9. Network Monitoring
The Pains of SNMP
What Is SNMP?
How Does It Work?
A Word on Security
How Do I Use SNMP?
Installation and configuration on Linux
Installation configuration on macOS
Testing
net-snmp
Installing vendor MIBs
That’s great, Mike. But where’s the list of OIDs I should monitor?
Interface Metrics
Interface and Logging
Recap
Configuration Tracking
Voice and Video
Routing
Spanning Tree Protocol (STP)
Chassis
CPU and Memory
Hardware
Flow Monitoring
Capacity Planning
Working Backward
Forecasting
Wrap-up
10. Security Monitoring
Monitoring and Compliance
User, Command, and Filesystem Auditing
Setting Up auditd
auditd and Remote Logs
Host Intrusion Detection System (HIDS)
rkhunter
Network Intrusion Detection System (NIDS)
Wrap-Up
11. Conducting a Monitoring Assessment
Business KPIs
Frontend Monitoring
Application and Server Monitoring
Security Monitoring
Alerting
Wrap-Up
A. An Example Runbook: Demo App
Demo App
Metadata
Escalation Procedure
External Dependencies
Internal Dependencies
Tech Stack
Metrics and Logs
Alerts
B. Availability Chart
Index
← Prev
Back
Next →
← Prev
Back
Next →