Mihaela-Roxana Ghidersa
System Resilience: Surviving the Software Storm
#1about 3 minutes
The business necessity of system resilience
An e-commerce site failure during a Black Friday sale illustrates how downtime leads to financial loss and why resilience is essential.
#2about 5 minutes
Understanding faults, failures, and tolerance mechanisms
A fault is a latent bug in the code, while a failure is the observable crash it causes, which can be mitigated with fault tolerance and fail-safe systems.
#3about 5 minutes
Navigating the challenges of modern software development
Modern systems face challenges from increasing complexity, evolving technology, and high user expectations, requiring a balance to avoid over-engineering.
#4about 3 minutes
Building resilience across all software stack layers
True resilience requires a holistic approach that addresses the infrastructure, application, and database layers, as well as the crucial human layer of team culture.
#5about 4 minutes
Core strategies for building resilient systems
Key architectural strategies for resilience include implementing redundancy, failover mechanisms, load balancing, and using availability zones or microservices.
#6about 5 minutes
Implementing disaster recovery and secure coding practices
Proactive resilience involves creating a disaster recovery plan through risk assessment and empowering developers to contribute through secure coding practices.
#7about 7 minutes
Using monitoring and continuous testing for improvement
A continuous improvement cycle is driven by monitoring system health, using automated testing to catch issues early, and analyzing failures to learn from them.
#8about 2 minutes
A practical starting point for individual developers
Developers can significantly impact resilience by focusing on core software quality attributes like performance, security, scalability, and maintainability.
#9about 3 minutes
Adopting a proactive mindset for future resilience
The future of resilience lies in a proactive approach, embracing innovations like AI for predictive failure analysis and fostering a culture of continuous adaptation.
#10about 4 minutes
Balancing security practices with system performance
Security and performance are not a trade-off but a balance that must be determined by the specific context and priorities of the system.
#11about 4 minutes
Prioritizing components when designing for resilience
Focus resilience efforts on foundational components like infrastructure and architecture, as these "shearing layers" are the most difficult and costly to change later.
#12about 5 minutes
Communicating the value of resilience to stakeholders
To get buy-in from decision-makers, present a data-driven business case that clearly documents the financial losses and risks associated with poor resilience.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
03:39 MIN
Understanding the key layers of application resilience
Building resilient .NET applications for the modern age
00:05 MIN
The high cost of ignoring system resilience
Building resilient .NET applications for the modern age
07:42 MIN
Using application design patterns for system resilience
Azure-Well Architected Framework - designing mission critical workloads in practice
24:38 MIN
Using long-running patterns to build resilient systems
Are you done yet? Mastering long-running processes in modern architectures
28:37 MIN
Why domain models create resilient and adaptable software
How to Domain Model – An example from manufacturing
20:21 MIN
Validating system robustness with chaos engineering
Building resilient .NET applications for the modern age
35:31 MIN
Implementing advanced patterns for resilience and decoupling
Cloud-nativeApplications- What’s the buzz about
00:04 MIN
The inevitability of software failures in real life
Azure-Well Architected Framework - designing mission critical workloads in practice
Featured Partners
Related Videos
Building resilient .NET applications for the modern age
Sander ten Brinke
What makes a great Software Engineer? Challenging our Assumptions Instead of Talking about Imposter Syndrome
Irina Anastasiu
Climate vs. Weather: How Do We Sustainably Make Software More Secure?
Panel Discussion
Empathy: The secret sauce of Resilience
Malin Litwinski
Building high performance and scalable architectures for enterprises
Luka Samaržija & Ivan Sokol
The Rise of Reactive Microservices
David Leitner
Durable Execution: A Revolutionary Abstraction for Building Resilient Applications
Maxim Fateev
The Resilience of the World Wide Web
Niels Leenheer
Related Articles
View all articles
.webp?w=240&auto=compress,format)


From learning to earning
Jobs that call for the skills explored in this talk.






{"@context":"https://schema.org/","@type":"JobPosting","title":"Software Developer
Live Wire
Unreal Engine


