Get in Touch

Course Outline

SRE Anti-patterns

  • Identifying counterproductive practices.
  • Recognizing the impact of anti-patterns on system reliability.
  • Exploring best practices and corrective alternatives.

SLO as a Proxy for Customer Satisfaction

  • Defining Service Level Indicators (SLIs) and Service Level Objectives (SLOs).
  • Managing error budgets and balancing innovation with reliability.
  • Understanding the limitations of distributed systems.

Building Secure and Reliable Systems

  • Designing for fault tolerance and resilience.
  • Integrating security into reliability engineering.
  • Implementing scalability and data protection strategies.

Full-stack Observability

  • Instrumentation and metrics collection techniques.
  • Distributed tracing and synthetic monitoring.
  • Observability-driven development practices.

Platform Engineering and AIOps

  • Platform-centered engineering approaches.
  • Automation and orchestration in SRE.
  • Leveraging DataOps and operational intelligence.

Incident Management in SRE

  • Clarifying roles and responsibilities in incident response.
  • Applying frameworks such as OODA.
  • Automated remediation and AI/ML-assisted resolution techniques.

Chaos Engineering

  • Principles and strategies for resilience testing.
  • Planning and executing "game day" exercises.
  • Gaining insights from controlled failure experiments.

SRE as a Pure Form of DevOps

  • Integrating SRE into DevOps workflows.
  • Fostering cultural alignment and collaboration practices.
  • Driving organizational transformation through SRE.

Post-class Exercises

  • Case studies on large-scale system design.
  • Advanced instrumentation and monitoring scenarios.
  • Solving real-world reliability problems.

Review and Exam Preparation

  • Final review of the DevOps Institute SRE Practitioner syllabus.
  • Practice with sample questions and mock tests.
  • Strategies and recommendations for exam performance.

Summary and Next Steps

Requirements

  • Solid understanding of core Site Reliability Engineering principles.
  • Practical experience with DevOps practices and associated tools.
  • Familiarity with system monitoring, incident management, and automation techniques.

Target Audience

  • SRE professionals preparing for the DevOps Institute SRE Practitioner certification.
  • DevOps engineers looking to transition into reliability-focused roles.
  • Operations leaders tasked with defining and executing reliability strategies.
 35 Hours

Number of participants


Price per participant

Testimonials (2)

Upcoming Courses

Related Categories