Staff Site Reliability Engineer - Platform

Quizlet
Full-time
San Francisco, CA
$196,320 – $268,100
Posted on a month ago

Job Description

Quizlet is seeking a Staff Site Reliability Engineer to lead reliability engineering efforts across the platform, focusing on automation, scaling systems, and ensuring infrastructure supports AI-powered learning innovation. The role involves architectural direction for resilience, observability, and performance, along with mentoring engineers and influencing platform standards. This is an onsite position requiring a minimum of three days a week in the San Francisco office.

Responsibilities

  • Lead design and implementation of self-healing, auto-scaling infrastructure
  • Architect and implement CI/CD reliability improvements
  • Define and enforce SLOs and operational excellence standards
  • Build systems for proactive reliability and capacity management
  • Drive incident analysis and postmortems
  • Mentor engineers and establish best practices

Requirements

  • 8+ years of experience in SRE, systems, or infrastructure engineering
  • Expertise in Kubernetes, Terraform, and CI/CD pipelines
  • Deep programming skills in Go and/or Python
  • Strong experience in Datadog, system monitoring, and distributed tracing
  • Familiarity with GCP services, Linux internals, and large-scale networking
  • Experience leading cross-team reliability initiatives

Benefits

  • No benefits