There’s so many monitoring systems out there these days that it’s difficult to figure out what’s actually different, and what just has a different name or falls under a different concept. Let’s look at the Graphite, InfluxDB and Prometheus query languages and see how the same ideas are represented in each.read more
There’s various ways Prometheus federation can be used. To ensure your monitoring is scalable and reliable, let’s look at how to best use it.read more
We previously looked at finding your biggest metrics, that involves an expensive query though. A new feature in Prometheus 1.3 offers another approach.read more
If you try and do
predict_linear(rate(my_counter_total[5m])[1d], 3600) in Prometheus it won’t work. How can you combine these functions?
It can seem like a good idea to use recording rules to make more explicit the content of a time series, particularly for those not used to labels. However this usually leads to confusing names and losing the benefits of labels.read more
I’ve previously mentioned that you shouldn’t have the version of your software as either a target label, or exposed via a label on all metrics of your server as it’ll make using the metrics more challenging. What should you do instead?read more
There’s a common misunderstanding when dealing with Prometheus counters, and that is how to apply aggregation and other operations when using the
rate and other counter-only functions.