Reliable Insights

A blog on monitoring, scale and operational Sanity

August 22, 2016

Exposing the software version to Prometheus

I've previously mentioned that you shouldn't have the version of your software as either a target label, or exposed via a label on all metrics of your server as it'll make using the metrics more challenging. What should you do instead?

Read more

August 8, 2016

On the naming of things

How you choose to name metrics is important. If everyone choose different schemes it'd lead to confusion, irritation and prevent us from sharing and reusing each others' work. I'd like to share some guidelines to help keep things sane for everyone.

Read more

April 8, 2016

How does a Prometheus Counter work?

There are four standard types of metric in Prometheus instrumentation: Gauge, Counter, Summary and Histogram. Today we'll have a look at the principles around Counters, and how Prometheus differs from other monitoring systems.

Read more

October 9, 2015

Monitoring Batch Jobs in Python

Prometheus monitoring is usually against on long-lived daemons, but what if you've a batch job that you want to monitor?

Read more


Blog   |   Training   |   Book   |   Careers   |   Privacy   |   Demo