Reliable Insights

A blog on monitoring, scale and operational sanity

February 20, 2017

Translating between monitoring languages

There are so many monitoring systems out there these days that it's difficult to figure out what's actually different and what just has a different name or falls under a different concept. Let's look at the Graphite, InfluxDB and Prometheus query languages and see how the same ideas are represented in each.

January 16, 2017

Federation, what is it good for?

There are various ways Prometheus federation can be used. To ensure your monitoring is scalable and reliable, let's look at how best to use it.

December 26, 2016

How does a Prometheus Gauge work?

We previously looked at the counter. How does the Prometheus gauge work?

December 7, 2016

Which targets have the most samples?

We previously looked at finding your biggest metrics, though that involves an expensive query. A new feature in Prometheus 1.3 offers another approach.

December 5, 2016

Composing range vector functions in PromQL

If you try max_over_time(rate(my_counter_total[5m])[1h]) or predict_linear(rate(my_counter_total[5m])[1d], 3600) in Prometheus, it won't work. How can you combine these functions?

September 5, 2016

New Features in Prometheus 1.1.0

Following on from 1.0.0, Prometheus 1.1.0 has been released. Let's have a look at the main improvements!

August 29, 2016

Undoing the benefits of labels

It can seem like a good idea to use recording rules to make the content of a time series more explicit, particularly for those not used to labels. However, this usually leads to confusing names and loses the benefits of labels.

August 22, 2016

Exposing the software version to Prometheus

I've previously mentioned that you shouldn't expose the version of your software either as a target label or as a label on all of your server's metrics, as it'll make using the metrics more challenging. What should you do instead?

May 25, 2016

New Features in Prometheus 0.19.0

It has been five weeks since 0.18.0 was released, and 0.19.0 is now out. Let's have a look.

May 9, 2016

Rate then sum, never sum then rate

There's a common misunderstanding when dealing with Prometheus counters: how to apply aggregation and other operations when using rate and other counter-only functions.
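
To give a feel for why the order matters, here is a rough Python sketch. It is a simplified model, not how Prometheus actually computes rate (there is no extrapolation, for one thing), and the sample values, the 60-second scrape interval and the helper names increase and simple_rate are all made up for illustration. The only idea it borrows is that a counter-aware rate treats any drop in value as a reset to zero.

```python
def increase(samples):
    """Total increase of one counter series, treating any drop as a reset to zero."""
    total = 0.0
    for prev, cur in zip(samples, samples[1:]):
        # A counter only goes up, so a lower value means the process restarted
        # and everything seen since the restart (cur) is new increase.
        total += cur - prev if cur >= prev else cur
    return total


def simple_rate(samples, interval_seconds):
    """Per-second rate over the whole series (simplified, no extrapolation)."""
    window = interval_seconds * (len(samples) - 1)
    return increase(samples) / window


# Two hypothetical instances, scraped every 60 seconds.
instance_a = [100, 160, 220, 280, 340]  # steady, never restarts
instance_b = [500, 560, 20, 80, 140]    # restarts after the second scrape

# Rate then sum: each series handles its own reset correctly.
rate_then_sum = simple_rate(instance_a, 60) + simple_rate(instance_b, 60)

# Sum then rate: the summed series drops when instance_b restarts, so the
# reset handling wrongly treats instance_a's whole value as new increase.
summed = [a + b for a, b in zip(instance_a, instance_b)]
sum_then_rate = simple_rate(summed, 60)

print(f"rate then sum: {rate_then_sum:.3f}/s")
print(f"sum then rate: {sum_then_rate:.3f}/s")
```

In this toy data the sum-then-rate figure comes out noticeably higher than the rate-then-sum one, because a single instance restarting looks like the entire aggregate resetting.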
