Prometheus at scale with Thanos

If you’re using Prometheus at scale , you’ve probably run into one or more of the following issues 1. Data retention costs money, the more historical data you keep, the more SSD you need, and that gets expensive. 2. You scale prometheus by sharding, and the ensuing contortions you go through to get a unified view involves all sorts of madness with federation, grafana queries, and whatnot. 3. An HA setup inevitably runs into the entertainment associated with de-duplicating prometheus data. 4. If you run into more than one of the above, then whatever you do ends up being somewhat Rube-Goldberg-ish, with associated maintenance headaches, replicability, inefficiency, etc. Enter Thanos , which integrates transparently with Prometheus, and pretty much takes care of all of the above issues, and remarkably elegantly at that. It consists of a bunch of modular components that can be snapped together to — transparently! — provide a unified view across your existin...