Today was day 5 and no alerts again today 🎉🎉
So happy. I however spent a lot of time looking at the infrastructure in my tribe and try to make sure to see and restart any failing pods before they become an issue. Call it proactiveness, call it a sense of responsibility.
I was paying attention to a pod and saw that the deployment was running at maximum capacity. I tried to scale up the number of pods in the deployment and watched as the latency spiked and dipped.
Being on call allows me the freedom to perform these experiments and know that if anything goes wrong, it’ll be me that’ll have to deal with it so no qualms.
That’s it for today