reliability

All posts tagged reliability

Dropping metrics at scrape time with Prometheus

It’s easy to get carried away by the power of labels with Prometheus. Taken to the extreme, this can overload your Prometheus server, for example if you create a time series for each of hundreds of thousands of users. Thankfully there’s a way to deal with this without having to turn off monitoring or deploy a new version of your code.
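
As a rough illustration of what dropping metrics at scrape time can look like, here is a minimal sketch of a scrape config that uses `metric_relabel_configs`, which Prometheus applies to scraped samples before they are ingested. The job name, target address, and metric name are hypothetical placeholders, not taken from the post.

```yaml
scrape_configs:
  - job_name: my_app                      # hypothetical job name
    static_configs:
      - targets: ['localhost:8080']       # hypothetical target
    metric_relabel_configs:
      # Drop every scraped sample whose metric name matches the regex.
      # The regex is anchored, so it must match the whole name.
      - source_labels: [__name__]
        regex: 'http_requests_by_user_total'   # hypothetical high-cardinality metric
        action: drop
```

Because the rule lives in the Prometheus configuration rather than in the application, it should take effect after a config reload, with no change to the instrumented code.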

Brian Brazil