Job Description
Senior Manager – Site Reliability Engineering
# Positions: 1
Location: Remote
Visa Status: USC/Green Card ONLY
Length: 6-month contract to hire
Salary: 170K
Summary:
The Senior Manager of Site Reliability Engineering (SRE) is responsible for managing a team of SREs in an agile environment. They will motivate, empower, and drive innovation within the team to build infrastructure automation across various components of company’s products and services. You’ll work in a fast-paced domestically distributed team to build a culture that attracts talent and moves the business forward
Duties and Responsibilities:
- Engage in and improve the whole life cycle of services—from inception and design, through deployment, operation, and refinement
- Responsible for consistently meeting service-level agreements (SLAs) for our services
- Drive a culture of reliability, and ensuring teams are aligned around common priorities and approaches
- Educate the team on SREs principles: automation, visibility improvements, toil reduction, self-healing, and root cause analysis
- Proactively identify deficiencies and continuously improve
- Assist in capacity planning and cost optimization
- Skills in documenting work and deliverables in a collaborative and clear manner, keeping them up to date as changes are made
- Establish standard practices and processes for planning and prioritizing reliability work
- Able to lead Proof-of-Concept activities.
Requirements:
Education: Four (4) year degree or equivalent experience
Experience: 7-12 years
Skills: High Proficiency in (several): Network design, Kubernetes, IaC, Configuration Management, Scripting, Observability, Service Mesh, SRE Principles
Certifications: GCP, AWS, or Azure Cloud Professional or Associate Certification
< Back to Job Search