- Campbell, Matthew: Scaling to a Million Machines with Prometheus (PromCon 2016): https://promcon.io/2016-berlin/talks/scaling-to-a-million-machines-with-prometheus
- Consul: Secure service networking: https://consul.io
- Docker: Enterprise container platform: https://www.docker.com
- Grafana: The open observability platform: https://grafana.com/
- Graphite: An enterprise-ready monitoring tool that runs equally well on cheap hardware or a cloud infrastructure: https://graphiteapp.org/
- InfluxDB: A time-series database designed to handle high write and query loads: https://www.influxdata.com/products/influxdb-overview
- Nagios: The industry standard In IT infrastructure monitoring: https://www.nagios.org
- Prometheus: Configuration options: https://prometheus.io/docs/prometheus/latest/configuration/configuration
- Prometheus: Exporter for machine metrics: https://github.com/prometheus/node_exporter
- Prometheus: Monitoring system and time-series database: https://prometheus.io
- StatsD: Daemon for easy but powerful stats aggregation: https://github.com/statsd/statsd