# Automating Kubernetes Observability: Scaling Your Metrics with Dynamic Discovery
Let’s say you have a Kubernetes cluster with Prometheus and multiple workloads running on it. You want to monitor the health of both the cluster and the workloads.
Picture this: I’m sitting in a room packed with the infrastructure team, the vendor, and our developers. Tension is high. We had just gone through a platform bridge change that caused IPs to cycle.…
There is a stark difference between working at a company that builds technology and a company that merely uses it. In organizations where technology is not the core business, IT and engineering…
We have all seen it happen. An API endpoint gets slow. The database CPU starts spiking during peak hours. The immediate, knee-jerk reaction from the dev team?
Once upon a time, my engineering team was stuck in a vicious cycle.
I recently dealt with an incident that took a development environment down for an entire week.