Prometheus monitoring is usually against on long-lived daemons, but what if you've a batch job that you want to monitor?
A blog on monitoring, scale and operational Sanity
A common question is is there a way to ingest JSON metrics from a random system into Prometheus? It's not possible to extract useful metrics from an arbitrary JSON blob, so that's not something the can be offered out of the box. However it's easy to write an exporter in Python to produce meaningful metrics.