10 mins
You have 0 further articles remaining this month. Join LeadDev.com for free to read unlimited articles.

How confident are you in your prod servers staying up without your help? Too often in tech we mistakenly interchange three important concepts when describing our socio-technical systems: how resilient they are, the reliability they exhibit in day to day work, and how robust they are under duress. Though interrelated, they are not equivalent.

How can we successfully gain insights in post-incident reviews, execute chaos engineering experiments, and build scalable infrastructure if we're misinterpreting our approaches? By separating out these core concepts, we can isolate better approaches in adapting to unforeseen circumstances. We'll look at common misconceptions when describing our systems as resilient and focus on proven methods to help us improve our understanding of our systems.

Optimizing the 'glue work' in your team
Episode 06 Optimizing the 'glue work' in your team
Principles for managing product quality
Episode 08 Principles for managing product quality