A blog on monitoring, scale and operational Sanity
March 30, 2020
Prometheus 2.17.0 is now out, following on from 2.16.0 with many fixes and improvements.
March 23, 2020
You've seen metrics like prometheus_build_info, but why do they have a value of 1?
March 16, 2020
Your HTTP router is usually the best place to measure your application latency.
March 9, 2020
The node exporter exposes the various hardware monitoring metrics of Linux, including temperature, fans, and voltages.
March 2, 2020
Alert thresholds can be surprisingly tricky to get right.
February 24, 2020
Have you ever found yourself having to keep on updating and tweaking certain regexes in PromQL?
February 17, 2020
Prometheus 2.16.0 is now out, following on from 2.15.0 with many fixes and improvements.
February 10, 2020
We've previously looked at scraping services from consul and ssh checks. How can we combine those?
February 3, 2020
NaN is just a number in Prometheus.
January 27, 2020
Not all applications produce useful metrics, but some of them do produce logs.
Blog | Training | Book | Careers | Privacy | Demo