Site Reliability Engineer

OpenEye
Full-time
Liberty Lake, WA
$70,000 - $100,000
Posted on 2 months ago

Job Description

OpenEye is seeking a Site Reliability Engineer to elevate the reliability, quality, and consistency of their software and release processes. The role involves maintaining system reliability, automating operations, monitoring key metrics, participating in incident response, and collaborating with development and DevOps teams to enhance observability and tooling.

Responsibilities

  • Engineer and implement solutions for system integrity, scalability, security, and reliability
  • Monitor, analyze, and improve system metrics
  • Implement and improve monitoring, alerting, and observability systems
  • Participate in incident response and postmortems
  • Partner with teams to deliver reliability and quality initiatives
  • Define and improve service-level indicators (SLIs) and objectives (SLOs)
  • Identify system performance risks and engineer solutions
  • Champion operational excellence and quality
  • Collaborate on CI/CD pipeline optimization
  • Participate in capacity planning and performance tuning

Requirements

  • 1-5 years related experience
  • Technical proficiency in cloud environments (AWS preferred)
  • Experience with CI/CD technologies and infrastructure automation
  • Experience in scripting languages (C#, Java, C++)
  • Solid understanding of development practices and TCP/IP network protocols
  • Experience with service monitoring tools (Coralogix, Datadog, Prometheus, Grafana)
  • Familiarity with Agile methodologies and Jira
  • Ability to learn new technologies quickly
  • Excellent critical thinking and problem-solving skills
  • Strong quality ethic and test-first attitude
  • Great communication and teamwork skills

Benefits

  • No benefits