There’s a common misunderstanding when dealing with
Prometheus counters, and that is how to apply aggregation and other operations when using the
rate and other counter-only functions.
read more Rate then sum, never sum then rate 05.09.2016
March 31, 2016
When you’ve a complicated manual process that you want to improve, your first instinct as a developer might be to jump in and start coding. Hold off a bit, the first step is to document.
read more The first step is to document 03.31.2016
February 9, 2016
When running a production system there’s an endless stream of issues that have the potential to cause you significant hassle. How should you deal with this?
read more I’ve got 99 Failure Modes, Yours is Just One 02.09.2016
January 20, 2016
As part of designing and building
Prometheus, hundreds of technical decisions have to be made. Every one of them is important in building a sustainable consistent ecosystem. Today, let’s look at one small decision that was made by the Prometheus developers in Consul service discovery. read more Little Things Matter 01.20.2016
December 21, 2015
Whether you’re on bare metal or using a cloud provider, there’s a question you should always be able to answer. What machines do I have, and what is meant to be running on them?
read more You look good, have you lost machines? 12.21.2015
December 11, 2015
Failed requests are a fact of life, network weirdness and machine failures are inevitable. It can be tempting to simply retry the request when this happens, but this may cause more harm than good.
read more It’s overloaded? Try harder! 12.11.2015
November 28, 2015
When getting something working for the first time, it’s easy to get caught up in Docker or Vargant. Before you run it in production with full access and user data, do you know what code you’re running?
read more Do you know what software you’re running? 11.28.2015
November 18, 2015
When starting out it’s easy to think that you need Docker, Kubernetes, Microservices, Continuous Deployment and all the other trending topics on Hacker News/Reddit/Lobsters. What do you really need?
read more Do you have basic infrastructure? 11.18.2015
November 4, 2015
This week Microsoft
removed unlimited storage from their OneDrive offering, because surprise surprise people were using it as unlimited storage. Does your product have features that cost you time and money, without your users paying accordingly? read more Unlimited costs, Limited revenue 11.04.2015
November 2, 2015
Your service’s traffic is steadily growing, latency has increased a bit but it’s within reason. One day you launch a new customer and the latency jumps through the roof causing an outage. What happened? You hit the knee.
read more Avoid outages: Beware the Knee 11.02.2015