Senior Site Reliability Engineer – Platform

Quizlet
Full-time
San Francisco, CA
$170,880 – $231,600
Posted a month ago

Job Description

Quizlet is seeking a Senior Site Reliability Engineer to design and build the automation, observability, and systems architecture that keep its AI-powered learning platform reliable and scalable. The role involves engineering software, tools, and processes to improve service performance, reduce operational toil, and maintain strict SLOs for global learners.

Responsibilities

  • Develop and maintain automation for uptime
  • Build self-healing systems for Kubernetes clusters and service mesh
  • Optimize CI/CD toolchain
  • Design and deploy observability and diagnostics
  • Conduct capacity planning and performance tuning
  • Collaborate with product engineering to define SLOs

Requirements

  • 5+ years of experience in SRE, infrastructure, or systems software engineering
  • Proficiency in Go and/or Python
  • Deep understanding of Kubernetes, Istio, and distributed systems
  • Experience with CI/CD systems and Terraform
  • Expertise in Datadog and incident response
  • Solid foundation in Linux systems, networking, and cloud environments
  • Proven ability to improve reliability and reduce MTTR

Benefits

  • No benefits