SENIOR SITE RELIABILITY ENGINEER I, DATA PROTECTION PRODUCTS

ConnectWise
Full-time
ARM Remote
Posted 19 days ago

Job Description

The Senior Site Reliability Engineer I is responsible for ensuring the availability, performance, and scalability of systems and applications. This role partners with Engineering teams to design, implement, and maintain robust infrastructure and automation solutions to improve reliability and performance.

Responsibilities

  • Provide support to Engineering teams
  • Research, analyze, and document findings
  • Design, implement, and maintain scalable infrastructure solutions
  • Develop and optimize automation scripts
  • Monitor and analyze system performance
  • Collaborate on best practices for deployment and configuration
  • Troubleshoot and resolve system issues
  • Perform capacity planning and scalability assessments
  • Implement and maintain monitoring and alerting systems
  • Participate in incident response and root cause analysis
  • Optimize infrastructure and deployment processes for cost efficiency

Requirements

  • Ability to work independently under general supervision
  • Knowledge of applicable work area
  • Ability to adapt to new technology
  • Scripting and automation skills (Python, Bash, PowerShell)
  • Understanding of networking concepts and security practices
  • Linux/Unix systems administration knowledge
  • Familiarity with containerization technologies (Docker, AWS Fargate)
  • Understanding of container orchestration platforms (Kubernetes, AWS ECS)
  • Collaborative teamwork and communication skills
  • Strong problem-solving skills
  • Attention to detail

Benefits

  • No benefits