Reliable Insights

A blog on monitoring, scale and operational sanity

April 20, 2020

Don’t federate instance labels

Federation can be quite useful, but it's not replication.

April 13, 2020

How much space does the WAL take up?

The storage numbers usually quoted for Prometheus cover only the blocks and do not include the write-ahead log (WAL).

April 6, 2020

Kernel file descriptor metrics from the node exporter

The node exporter provides kernel file descriptor metrics.

March 30, 2020

New Features in Prometheus 2.17.0

Prometheus 2.17.0 is now out, following on from 2.16.0 with many fixes and improvements.

March 23, 2020

Why info-style metrics have a value of 1

You've seen metrics like prometheus_build_info, but why do they have a value of 1?

March 16, 2020

Prometheus Middleware for Gorilla Mux

Your HTTP router is usually the best place to measure your application latency.

March 9, 2020

Temperature and hardware monitoring metrics from the node exporter

The node exporter exposes the various hardware monitoring metrics of Linux, including temperature, fans, and voltages.

March 2, 2020

Setting Thresholds on Alerts

Alert thresholds can be surprisingly tricky to get right.

February 24, 2020

Regex Selectors are a Smell

Have you ever found yourself having to keep on updating and tweaking certain regexes in PromQL?

February 17, 2020

New Features in Prometheus 2.16.0

Prometheus 2.16.0 is now out, following on from 2.15.0 with many fixes and improvements.
