Reliable Insights

A blog on monitoring, scale and operational Sanity

February 4, 2019

Using tsdb analyze to investigate churn and cardinality

The Prometheus TSDB's code base includes a tool to help you find "interesting" metrics in terms of storage performance.

January 21, 2019

Optimising Prometheus 2.6.0 Memory Usage with pprof

The 2.6.0 release of Prometheus includes optimisations to reduce the memory taken by indexes and compaction.

January 7, 2019

Optimising startup time of Prometheus 2.6.0 with pprof

The 2.6.0 release of Prometheus includes WAL loading optimisations to make startup faster.

January 23, 2017

Optimising Go allocations using pprof

As I mentioned in a previous post, I made some memory-related improvements to Prometheus that'll be in the 1.5 release. Let's look at how I came across unneeded memory allocations and ultimately improved the code.

December 7, 2016

Which targets have the most samples?

We previously looked at finding your biggest metrics, but that involves an expensive query. A new feature in Prometheus 1.3 offers another approach.

June 21, 2016

Analysing Prometheus Memory Usage

Ever wondered how Prometheus is using its memory? Let's find out!


