SRE Foundation (SREF)℠

The Site Reliability Foundation two-day course is an introduction to the principles and practices that enable an organisation to reliably and economically scale critical services.
product
2 day course
Supporting material
Private
Private
A private training session for your team. Groups can be of any size, at a location of your choice including our training centres.

The course highlights the evolution of SRE and its future direction, and equips participants with the practices, methods, and tools to engage people across the organisation involved in reliability and stability.

The course has been developed by leveraging key SRE sources, engaging with thought-leaders in the SRE space and working with organisations embracing SRE to extract real-life best practices, and has been designed to teach the key principles and practices necessary for starting SRE adoption.

Upon completion of the course, you will have tangible takeaways to leverage when back in the office such as understanding, setting and tracking Service Level Objectives (SLOs).

This course positions learners to successfully complete the SRE Foundation certification exam.

Our SRE Foundation course is available as a private training session and will be run over two consecutive days. It can be delivered at a location of your choice or via Virtual Classroom.

Course overview
Who should attend:
This course is aimed at professionals including Business Managers, Business Stakeholders, Change Agents, Consultants, DevOps Practitioners, IT Directors, IT Managers, IT Team Leaders, Product Owners, Scrum Masters, Software Engineers and System Integrators.
Walk away with the ability to:

Demonstrate a practical understanding of:

  • The history of SRE and its emergence at Google
  • The inter-relationship of SRE with DevOps and other popular frameworks
  • The underlying principles behind SRE
  • Service Level Objectives (SLOs) and Service Level Indicators (SLIs)
  • Error budgets, toil and its effect on an organisation’s productivity
  • Observability as something to indicate the health of a service
  • SRE tools, automation techniques and the importance of security
  • The organisational impact that introducing SRE brings
Course agenda
Module 1: SRE Principles & Practices
  • What is Site Reliability Engineering?
  • SRE & DevOps: What is the Difference?
  • SRE Principles & Practices
Module 2: Service Level Objectives & Error Budgets
  • Service Level Objectives (SLOs)
  • Error Budgets
  • Error Budget Policies
Module 3: Reducing Toil
  • What is Toil?
  • Why is Toil Bad?
  • Doing Something About Toil
Module 4: Monitoring & Service Level Indicators
  • Service Level Indicators (SLIs)
  • Monitoring
  • Observability
Module 5: SRE Tools & Automation
  • Automation Defined
  • Automation Focus
  • Hierarchy of Automation Types
  • Secure Automation
  • Automation Tools
Module 6: Anti-Fragility & Learning from Failure
  • Why Learn from Failure?
  • Benefits of Anti-Fragility
  • Shifting the Organisational Balance
Module 7: Organisational Impact of SRE
  • Why Organisations Embrace SRE
  • Patterns for SRE Adoption
  • Sustainable Incident Response
  • Blameless Post-Mortems
  • SRE & Scale
Module 8: SRE, Other Frameworks, Trends
  • SRE & Other Frameworks
  • SRE Evolution
Book this course
Call our sales team today
close
Don't miss out
Keep up to date with news, views and offers from Jellyfish Training.
Your data will be handled in accordance with our Privacy Policy