I've had the luck to see some great incident responses, and tried to capture the most important cultural things that laid the foundations for them.
Would love to chat if you have any comments.
I changed career from consulting into engineering, and found incidents a super useful way to level up as an engineer.
Now working at a super small startup focussing on incidents - hope y'all find this interesting!
Being on-call is hard, and letting your pager load get out of control really sucks.
We ran a short project to reduce our pager load, and found some good, low-hanging fruit that cut it down while still leaving us confident we'd know when things went wrong.
Have other people run similar projects? What worked well?