Overview

This advanced course is designed for experienced site reliability engineers (SREs) looking to deepen their knowledge and practical skills in implementing and managing reliability engineering principles at scale. Participants will explore anti-patterns, service level objectives (SLOs), observability, chaos engineering, incident response, and automation. The course includes real-world case studies, hands-on exercises, and group discussions to reinforce learning and application in professional environments.

Read more +

Prerequisites

Participants should have:

  • A foundational understanding of Site Reliability Engineering (SRE) principles.
  • Prior completion of the SRE Foundation certification (mandatory).
  • Experience with DevOps practices, system administration, or software development.
  • Familiarity with incident response, monitoring, and automation.

Target Audience

This course is ideal for:

  • Site Reliability Engineers (SREs) aiming to advance their expertise.
  • DevOps engineers and software developers seeking reliability best practices.
  • IT operations professionals responsible for maintaining highly available services.
  • Engineering managers and technical leaders looking to implement SRE strategies.
Read more +

Delegates will learn how to

By the end of this course, learners will be able to:

  • Identify and mitigate SRE anti-patterns to improve reliability.
  • Define and implement Service Level Objectives (SLOs) aligned with business needs.
  • Apply full-stack observability to monitor system health and detect failures.
  • Use AIOps and platform engineering to enhance automation and efficiency.
  • Implement incident response management best practices.
  • Explore chaos engineering techniques to build resilient systems.
  • Understand how SRE integrates with DevOps methodologies.
Read more +

Outline

SRE Anti-Patterns

  • Common reliability pitfalls and how to avoid them.
  • Case study: Monzo Bank's reliability failures and lessons learned.
  • Conducting blameless postmortems and retrospectives.

Service Level Objectives (SLOs) – The Proxy for Customer Happiness

  • Establishing SLOs, SLIs, and error budgets.
  • Case studies: Kudos Engineering and Home Depot’s SLO implementation.
  • Practical exercise: Obtaining service credits.

Full-Stack Observability

  • Implementing end-to-end monitoring, logging, and alerting.
  • Reducing false positives and alert fatigue.

Using Platform Engineering & AIOps

  • Leveraging automation and AI-driven operations to enhance system reliability.

SRE & Incident Response Management

  • Best practices for incident response and on-call management.
  • The role of incident command systems.

Chaos Engineering

  • Designing fault injection experiments to improve system resilience.
  • Case study: How Netflix uses chaos engineering.

SRE as a Form of DevOps

  • Bridging software engineering and system operations.
  • Implementing SRE culture and best practices in organisations.

Exams and Assessments

  • An exam voucher is included which can be used to attempt the exam separately from the course.
  • SRE Practitioner Exam (DevOps Institute accreditation).
    • 40 multiple-choice questions
    • 90 minutes duration
    • 65% pass mark
  • Hands-on exercises and knowledge checks throughout the course.
Read more +

Why choose QA

  • Expert instructors with real-world SRE experience.
  • Hands-on learning with practical exercises and case studies.
  • Industry-recognised certification from DevOps Institute.
  • Comprehensive resources, including post-course materials and best practices.

Dates & Locations

Want to boost your career in DevOps? Click on the roles below to see QA‘s learning pathways, specially designed to give you the skills to succeed.

= Required
= Certification

DevOps Institute

Get DevOps Institute certified to validate your knowledge and understanding of various DevOps skills and practices in-demand today and advance your career.

= Required
= Certification
Need to know

Frequently asked questions

How can I create an account on myQA.com?

There are a number of ways to create an account. If you are a self-funder, simply select the "Create account" option on the login page.

If you have been booked onto a course by your company, you will receive a confirmation email. From this email, select "Sign into myQA" and you will be taken to the "Create account" page. Complete all of the details and select "Create account".

If you have the booking number you can also go here and select the "I have a booking number" option. Enter the booking reference and your surname. If the details match, you will be taken to the "Create account" page from where you can enter your details and confirm your account.

Find more answers to frequently asked questions in our FAQs: Bookings & Cancellations page.

How do QA’s virtual classroom courses work?

Our virtual classroom courses allow you to access award-winning classroom training, without leaving your home or office. Our learning professionals are specially trained on how to interact with remote attendees and our remote labs ensure all participants can take part in hands-on exercises wherever they are.

We use the WebEx video conferencing platform by Cisco. Before you book, check that you meet the WebEx system requirements and run a test meeting to ensure the software is compatible with your firewall settings. If it doesn’t work, try adjusting your settings or contact your IT department about permitting the website.

How do QA’s online courses work?

QA online courses, also commonly known as distance learning courses or elearning courses, take the form of interactive software designed for individual learning, but you will also have access to full support from our subject-matter experts for the duration of your course. When you book a QA online learning course you will receive immediate access to it through our e-learning platform and you can start to learn straight away, from any compatible device. Access to the online learning platform is valid for one year from the booking date.

All courses are built around case studies and presented in an engaging format, which includes storytelling elements, video, audio and humour. Every case study is supported by sample documents and a collection of Knowledge Nuggets that provide more in-depth detail on the wider processes.

When will I receive my joining instructions?

Joining instructions for QA courses are sent two weeks prior to the course start date, or immediately if the booking is confirmed within this timeframe. For course bookings made via QA but delivered by a third-party supplier, joining instructions are sent to attendees prior to the training course, but timescales vary depending on each supplier’s terms. Read more FAQs.

When will I receive my certificate?

Certificates of Achievement are issued at the end the course, either as a hard copy or via email. Read more here.

Let's talk

By submitting this form, you agree to QA processing your data in accordance with our Privacy Policy and Terms & Conditions. You can unsubscribe at any time by clicking the link in our emails or contacting us directly.