Steve Upton
Building a culture from chaos
#1 about 6 minutes
The origin of chaos engineering at Netflix
Netflix created chaos engineering to validate its principle of having no single points of failure during its microservices transformation.
#2 about 4 minutes
Understanding the core loop of chaos engineering
Chaos engineering is a disciplined cycle of inducing failures, learning from them, and improving system resilience to build confidence in production.
#3 about 4 minutes
Defining the characteristics of complex systems
Complex systems are characterized by a high degree of multiplicity, interdependence, and diversity among their elements and connections.
#4 about 7 minutes
The unpredictable behavior of complex adaptive systems
Complex adaptive systems are fundamentally unpredictable because they adapt over time and produce unintended consequences, making past solutions unreliable.
#5 about 7 minutes
Adopting new approaches for complex systems
Traditional planning methods like "big design up front" are ineffective for complex systems, which instead call for agile approaches that prioritize responding to change and accepting failure.
#6 about 4 minutes
Building a resilient culture through consistent habits
Culture is shaped by repeated actions, not just stated values, and chaos engineering provides a practical habit for validating plans and embracing failure.
#7 about 7 minutes
How to design safe and effective chaos experiments
Designing effective chaos experiments requires careful target selection, limiting the blast radius, robust monitoring, and automated rollbacks to ensure they are safe to fail.
#8 about 4 minutes
The second-order effects of chaos engineering on culture
Beyond improving system resilience, chaos engineering's second-order effect is building a culture and skillset for navigating complex systems.
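The experiment-design points from chapter #7 (target selection, a limited blast radius, monitoring, and automated rollback) can be sketched roughly as follows. This is a minimal illustration and not code from the talk: the `Experiment` class, the `run_experiment` function, and the `inject`, `rollback`, and `read_error_rate` callbacks are all hypothetical names, and a real setup would plug in its own fault-injection and monitoring tooling.

```python
import random
from dataclasses import dataclass, field


@dataclass
class Experiment:
    """Hypothetical description of a single chaos experiment."""
    name: str
    targets: list[str]       # candidate instances the experiment may touch
    blast_radius: float      # maximum fraction of targets to disturb (0..1)
    max_error_rate: float    # monitoring threshold that triggers an abort
    affected: list[str] = field(default_factory=list)


def select_targets(exp: Experiment, rng: random.Random) -> list[str]:
    """Pick a random subset of targets, capped by the blast radius."""
    count = max(1, int(len(exp.targets) * exp.blast_radius))
    return rng.sample(exp.targets, count)


def run_experiment(exp, inject, rollback, read_error_rate, checks=5, rng=None):
    """Inject a failure, watch a metric, and always roll back.

    inject / rollback take a target name; read_error_rate returns a float.
    rollback is assumed to be idempotent (safe to call on untouched targets).
    """
    rng = rng or random.Random()
    exp.affected = select_targets(exp, rng)
    try:
        for target in exp.affected:
            inject(target)
        for _ in range(checks):
            # Abort as soon as monitoring crosses the agreed threshold.
            if read_error_rate() > exp.max_error_rate:
                return "aborted"
        return "completed"
    finally:
        # Automated rollback runs on success, abort, or exception.
        for target in exp.affected:
            rollback(target)
```

In this sketch the `finally` block is what makes the experiment "safe to fail": whatever the outcome, every affected target is restored before the function returns.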
Matching moments
09:15 MIN
Moving from reactive fixes to proactive chaos engineering
Empathy: The secret sauce of Resilience
20:21 MIN
Validating system robustness with chaos engineering
Building resilient .NET applications for the modern age
26:25 MIN
Q&A on building chaos days and managing on-call stress
Empathy: The secret sauce of Resilience
01:51 MIN
Understanding the chaos of tech stack fragmentation
Introducing a Digital Service Catalog for speed and scale
26:12 MIN
Building a productive failure and learning culture
Leading Through Stagility: Human Capital Trends That Redefine Work
15:09 MIN
DevOps as an outcome of a healthy engineering culture
DevOps at Netflix
15:05 MIN
Why a healthy failure culture is essential for agility
Employee Happiness: The Underrated Growth Engine
14:07 MIN
Fostering a culture of experimentation and learning from failure
From Words to Action: Advancing Inclusion Through Leadership
Related Videos
Chaos in Containers - Unleashing Resilience
Maish Saidel-Keesing
Empathy: The secret sauce of Resilience
Malin Litwinski
Unveiling the Dark Side: Navigating the Pitfalls of Digital Ambitions
Johannes Hansen
DevOps at Netflix
Tejas Chopra
We adopted DevOps and are Cloud-native, Now What?
Bruno Amaro Almeida
The Rise of Reactive Microservices
David Leitner
3 Key Steps for Optimizing DevOps Workflows
Daniel Tao
Platform Engineering vs. DevOps Why not both?
Christian Strack