Online Course For Free - DevOps Foundations: Site Reliability Engineering

LinkedIn Learning

Free Trial Available

English

Certificate Available

1-2 hours of material

Self-paced

Overview

Explore the basics of site reliability engineering for DevOps. Learn SRE techniques for release, change and incident management, self-service automation, and more.

Site reliability engineering (SRE) is an emerging paradigm in DevOps. The biggest names in tech, including Google, Netflix, Microsoft, and LinkedIn, all use SRE. In fact, industry-wide, "site reliability engineer" is replacing "DevOps engineer" in job posts. Simply put, SRE is software engineering applied to operations for the cloud-native era. This course introduces the basics of site reliability engineering, including how SRE fits into DevOps and how it can be integrated into your unique business environment. Instructors Ernest Mueller and James Wickett cover the major areas of expertise, including release engineering, change management, incident management and retrospectives, self-service automation, troubleshooting, performance, and deliberate adversity. Learn how to define reliability through SLAs and SLOs, handle crises, design distributed systems, and scale your systems and your team. Plus, explore time and project management strategies that bring humanity back to the SRE's job.

Syllabus

Introduction

Welcome
What you should know

1. SRE Basics

Your job as a DevOp
You aren't Google or Netflix

2. SRE Practice Areas

Release engineering
Change management
Self-service automation
SLAs and SLOs
Incident management
Introducing postmortems
The postmortem process
Troubleshooting
Performance engineering
Capacity and scalability
Distributed design
Deliberate adversity

3. SRE Organization

Organizing SREs
The softer side of SRE

Conclusion

Next steps

Taught by

James Wickett and Ernest Mueller

Related Courses

Developing a Google SRE Culture

Site Reliability Engineer

Developing a Google SRE Culture en Français

Site Reliability Engineering: Measuring and Managing Reliability

Incorporating Site Reliability Engineering (SRE) in Your System Design