Tobias Dunn-Krahn

Applying Agile Principles to Incident Management

What if you treated every incident like a compressed agile sprint? Learn how this approach helps you resolve outages faster and minimize customer impact.

#1 about 6 minutes

Defining digital service incidents and key stakeholders

An incident is any interruption to a digital service, from a full outage to an SLO breach. Its stakeholders include service teams, IT, support, and management.

#2 about 4 minutes

Applying agile and SRE principles to incident response

Improve incident management by borrowing from agile and related practices: iterative mitigation, the DevOps habit of bridging team cultures, Scrum-style retrospectives, and SRE-driven automation.

#3 about 3 minutes

Using Failure Friday to practice incident management

Regularly practicing incident response through simulated outages, known as Failure Friday, builds team confidence and refines resolution processes.

#4 about 2 minutes

Demo setup of a company's modern and legacy toolchains

The demo scenario involves a company with an agile team using tools like Slack and Jira, and a major incident team using ServiceNow.

#5 about 5 minutes

Demo of receiving an alert and initiating an incident

An automated workflow enriches an incoming alert with diagnostic data and, upon escalation, creates linked artifacts in Slack, Jira, and ServiceNow.
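The enrich-then-escalate pattern described above can be sketched roughly as follows. This is a minimal illustration, not the product's actual workflow: the `Alert` and `Incident` classes, the `enrich` and `escalate` functions, and the collector and creator callables are all hypothetical stand-ins, and no real Slack, Jira, or ServiceNow API is called.

```python
from dataclasses import dataclass, field


@dataclass
class Alert:
    service: str
    summary: str
    # Diagnostic data attached by the enrichment step.
    diagnostics: dict = field(default_factory=dict)


@dataclass
class Incident:
    alert: Alert
    # Maps a tool name to the identifier of the artifact created there,
    # so every artifact is linked through the same incident.
    artifacts: dict = field(default_factory=dict)


def enrich(alert: Alert, collectors: dict) -> Alert:
    """Attach diagnostic data to the alert before a responder sees it."""
    for name, collect in collectors.items():
        alert.diagnostics[name] = collect(alert.service)
    return alert


def escalate(alert: Alert, creators: dict) -> Incident:
    """Create one artifact per tool and record each on a shared incident."""
    incident = Incident(alert=alert)
    for tool, create in creators.items():
        incident.artifacts[tool] = create(alert)
    return incident


# Hypothetical stand-ins for real integrations; nothing here touches a network.
collectors = {
    "recent_deploys": lambda service: [f"{service}-deploy-placeholder"],
}
creators = {
    "slack": lambda alert: f"channel-for-{alert.service}",
    "jira": lambda alert: f"issue-for-{alert.service}",
    "servicenow": lambda alert: f"record-for-{alert.service}",
}

alert = enrich(Alert("checkout", "Error rate above threshold"), collectors)
incident = escalate(alert, creators)
print(incident.artifacts)
```

Keeping the artifact identifiers on one incident object is what makes them "linked": a responder in any one tool can find the matching record in the others.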

#6 about 6 minutes

Using an incident console to manage response and resolvers

The incident console provides a central hub for tracking status, managing on-call resolvers, and accessing collaboration channels to streamline remediation.

#7 about 2 minutes

Conducting a post-incident review to drive improvement

After resolution, a post-incident review helps analyze the timeline, document learnings, and create trackable action items to prevent future occurrences.
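One way to picture the data behind such a review is a timeline of timestamped events plus a list of action items with owners. The sketch below is only an assumption about how that might be modeled; the class names, the `time_to_resolve` helper, the timestamps, and the action items are invented for illustration and do not come from the talk.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class TimelineEvent:
    at: datetime
    note: str


@dataclass
class ActionItem:
    description: str
    owner: str
    done: bool = False


def time_to_resolve(timeline: list) -> timedelta:
    """Elapsed time between the earliest and latest timeline events."""
    ordered = sorted(timeline, key=lambda event: event.at)
    return ordered[-1].at - ordered[0].at


def open_items(items: list) -> list:
    """Action items that still need follow-up after the review."""
    return [item for item in items if not item.done]


# Illustrative data only.
timeline = [
    TimelineEvent(datetime(2024, 1, 5, 10, 0), "Alert received"),
    TimelineEvent(datetime(2024, 1, 5, 10, 45), "Incident resolved"),
    TimelineEvent(datetime(2024, 1, 5, 10, 10), "Incident declared"),
]
items = [
    ActionItem("Add alert on queue depth", owner="service-team"),
    ActionItem("Update runbook", owner="support", done=True),
]

print(time_to_resolve(timeline))
print([item.description for item in open_items(items)])
```

Giving each action item an owner and a completion flag is what makes it trackable rather than a note that is forgotten once the review ends.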

#8 about 9 minutes

Building custom automation with a low-code flow designer

The low-code flow designer lets teams build custom automation workflows by connecting triggers and steps, integrating with other tools, including on-premises systems.
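Conceptually, a flow of this kind is a trigger that decides whether to run, followed by an ordered chain of steps that each transform a shared context. The sketch below expresses that idea in plain Python. It is not the flow designer's actual model or API; the `Flow` class, the event fields, and the priority labels are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional


@dataclass
class Flow:
    # The trigger decides whether an incoming event starts the flow.
    trigger: Callable[[dict], bool]
    steps: list = field(default_factory=list)

    def step(self, fn: Callable[[dict], dict]) -> "Flow":
        """Append a step; returning self allows chained calls."""
        self.steps.append(fn)
        return self

    def run(self, event: dict) -> Optional[dict]:
        """Run every step in order if the trigger matches, else return None."""
        if not self.trigger(event):
            return None
        context = dict(event)  # copy so the caller's event is not mutated
        for fn in self.steps:
            context = fn(context)
        return context


# Hypothetical flow: set a priority on new incidents, then record who to notify.
flow = Flow(trigger=lambda event: event.get("type") == "incident.created")
flow.step(
    lambda ctx: {**ctx, "priority": "P1" if ctx.get("customer_facing") else "P3"}
).step(lambda ctx: {**ctx, "notify": ["on-call"]})

result = flow.run({"type": "incident.created", "customer_facing": True})
print(result)
```

Because each step only receives and returns a context dictionary, a step that calls a cloud tool and one that calls an on-premises system look identical to the flow itself.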
