Breaking down our understanding of system resilience

10 mins

How confident are you in your prod servers staying up without your help? Too often in tech we mistakenly interchange three important concepts when describing our socio-technical systems: how resilient they are, the reliability they exhibit in day to day work, and how robust they are under duress. Though interrelated, they are not equivalent.

How can we successfully gain insights in post-incident reviews, execute chaos engineering experiments, and build scalable infrastructure if we're misinterpreting our approaches? By separating out these core concepts, we can isolate better approaches in adapting to unforeseen circumstances. We'll look at common misconceptions when describing our systems as resilient and focus on proven methods to help us improve our understanding of our systems.

Episode 06 Optimizing the 'glue work' in your team

Episode 08 Principles for managing product quality

Breaking down our understanding of system resilience

Posted in:

Featuring:

Share:

Related content

How to build an effective technical strategy

WebAssembly is still waiting for its moment

Generate buy-in with compelling engineering strategies

PostgreSQL: The database that quietly ate the world

How Zalando uses its own Tech Radar to make better technology choices

4 things you need to know from the latest Thoughtworks Tech Radar

AI and Kubernetes are pushing cloud costs out of control

Who holds the edge in the JavaScript framework wars?

12 things to consider when assessing open source software

Leading open-source teams in large organizations

Working with leadership to plan for a successful new year

How to get leadership buy-in on your tech strategy

The 6 biggest generative AI risks for developers

Being a tech lead doesn’t mean having all the answers

Can platform engineering help you do more with less?

When to migrate from a monolithic to a distributed frontend architecture

Kubernetes for engineering managers

Using workshops to align technical vision and team principles

Want to stay technical as a manager? Stay curious

Crafting an effective technical strategy: Business success through targeted investment

Building an effective technical strategy

Crafting an effective tech strategy and getting buy-in for it

How to make plans for an uncertain future

The five stages of digital maturity

The difficult teenage years: setting your tech strategy after the launch

Setting a vision, mission, and strategy for your team

Using Open Source safely and effectively

Learnings from 'Carving a modern engineering org out of an enterprise’

Building a successful and sustainable CI/CD pipeline

Scaling Incident Management: How we grew Google Meet 50x during COVID19

Technical strategy power chords

Making ‘Big Changes’ Successfully

Forging the path to faster shipping in enterprise orgs

When planning long-term, favor accuracy over precision

Best practice for seamless product integration

Laying the foundations for a successful build

A terrible, horrible, no-good, very bad day at Slack

Building a more globally inclusive internet

Carving a modern engineering org out of an enterprise

The thin line between technology advocacy and ideology

To build, or to buy, that is the question

Measuring and improving the efficiency of software delivery

Four key metrics for measuring DevOps success

Managing technical risk

Creating technology products that your customers love

Getting GitOps right

Learnings from 'Maintaining speed while minimizing risk'

Achieving speed and quality without sacrifice in engineering

Scaling held knowledge to unblock teams and untangle software complexity

How to adapt your UI testing strategy to your product's stage

Hypothesis-driven development

The problem with "the platform"

The Boring Stack

Avoiding the pitfalls of rebuilding software

Building and conveying vision

Avoiding “shiny object” syndrome when building software

Lessons for frontend development at scale

Creating Architecture and Teams at Less-than-Google Scale

Telling stories through your commits

The importance of pragmatism when building and maintaining systems

Plug in to LeadDev