best practices – Robust Perception | Prometheus Monitoring Experts

December 28, 2020

Monitoring is a means, not an end

Does it really have to be perfect?

Published by Brian Brazil in Posts

Tags: best practices, prometheus

December 21, 2020

Policy is for configuration, not metric names

Metric names are part of a time series's identity, so shouldn't include information unrelated to identity.

Published by Brian Brazil in Posts

Tags: best practices, federation, prometheus, relabelling, remote write

December 14, 2020

Prefer without and ignoring

Which of by/without and on/ignoring should you use?

Published by Brian Brazil in Posts

Tags: best practices, prometheus, promql

September 21, 2020

Don’t Try to Swim Upstream

Have you ever felt that a piece of software just isn't doing what you need?

Published by Brian Brazil in Posts

Tags: best practices, prometheus, push

July 20, 2020

Delete All Your Alerts

Trying to improve alerting piecemeal can be difficult.

Published by Brian Brazil in Posts

Tags: alerting, best practices, prometheus

June 22, 2020

Remote read and partial failures

What happens when your clustered storage fails?

Published by Brian Brazil in Posts

Tags: best practices, prometheus, remote read

May 18, 2020

Atomic Writes and the Textfile Collector

To avoid weirdness, write your files atomically.

Published by Brian Brazil in Posts

Tags: best practices, node exporter, prometheus

April 20, 2020

Don’t federate instance labels

Federation can be quite useful, but it's not replication.

Published by Brian Brazil in Posts

Tags: best practices, prometheus

March 2, 2020

Setting Thresholds on Alerts

Alert thresholds can be surprisingly tricky to get right.

Published by Brian Brazil in Posts

Tags: alerting, best practices, prometheus

February 24, 2020

Regex Selectors are a Smell

Have you ever found yourself having to keep on updating and tweaking certain regexes in PromQL?

Published by Brian Brazil in Posts

Tags: best practices, instrumentation, promql

Reliable Insights