A blog on monitoring, scale and operational Sanity
April 20, 2020
Federation can be quite useful, but it's not replication.
April 13, 2020
The quoted storage numbers for Prometheus are usually for the blocks, not including the WAL.
April 6, 2020
The node exporter provides kernel file descriptor metrics.
March 30, 2020
Prometheus 2.17.0 is now out, following on from 2.16.0 with many fixes and improvements.
March 23, 2020
You've seen metrics like prometheus_build_info, but why do they have a value of 1?
March 16, 2020
Your HTTP router is usually the best place to measure your application latency.
March 9, 2020
The node exporter exposes the various hardware monitoring metrics of Linux, including temperature, fans, and voltages.
March 2, 2020
Alert thresholds can be surprisingly tricky to get right.
February 24, 2020
Have you ever found yourself having to keep on updating and tweaking certain regexes in PromQL?
February 17, 2020
Prometheus 2.16.0 is now out, following on from 2.15.0 with many fixes and improvements.