Reliable Insights

A blog on monitoring, scale and operational Sanity

March 23, 2020

Why info-style metrics have a value of 1

You've seen metrics like prometheus_build_info, but why do they have a value of 1?

Read more

March 16, 2020

Prometheus Middleware for Gorilla Mux

Your HTTP router is usually the best place to measure your application latency.

Read more

March 9, 2020

Temperature and hardware monitoring metrics from the node exporter

The node exporter exposes the various hardware monitoring metrics of Linux, including temperature, fans, and voltages.

Read more

March 2, 2020

Setting Thresholds on Alerts

Alert thresholds can be surprisingly tricky to get right.

Read more

February 24, 2020

Regex Selectors are a Smell

Have you ever found yourself having to keep on updating and tweaking certain regexes in PromQL?

Read more

February 17, 2020

New Features in Prometheus 2.16.0

Prometheus 2.16.0 is now out, following on from 2.15.0 with many fixes and improvements.

Read more

February 10, 2020

Testing SSH of hosts from Consul

We've previously looked at scraping services from consul and ssh checks. How can we combine those?

Read more

February 3, 2020

Get thee to a NaNnary

NaN is just a number in Prometheus.

Read more

January 27, 2020

Getting metrics from Apache logs using the grok exporter

Not all applications produce useful metrics, but some of them do produce logs.

Read more

January 20, 2020

Graphite’s summarize and smartSummarize in PromQL

How do you convert summarizeinto PromQL?

Read more

twitter
youtube
linkedin

Blog   |   Training   |   Book   |   Privacy