Job Details

ID #12190743
State Colorado
City Littleton
Job type Permanent
Salary USD Depends on Experience Depends on Experience
Source Prosum
Showed 2021-04-13
Date 2021-04-12
Deadline 2021-06-11
Category Et cetera
Create resume

Lead Site Reliability Engineer

Colorado, Littleton, 80120 Littleton USA

Vacancy expired!

Senior/Lead SRERemote from home with occasional in office in Littleton, CODirect Hire

DESCRIPTIONSite Reliability Engineers (SREs) are responsible for keeping all systems up and running reliably. We have a mindset for constant improvement and implementing good practices for running our distributed systems. SREs will use a combination of infrastructure knowledge paired with software practices to automate and engineer solutions to achieve our goals. SREs work in a DevOps environment with close collaboration to software engineers and traditional infrastructure groups. SREs maintain a diverse set of technologies including in-house developed SaaS platforms for our customers, IoT for our electronic monitoring devices, cloud and traditional datacenters.

SKILLSETThe following list shows skillsets that an ideal candidate would have. These are not all required, but you will need a majority of these to be successful in the position.
  • Cloud experience (IaaS and PaaS, preferably in Azure)
  • Container orchestration experience (Docker, Kubernetes, Azure Kubernetes Service)
  • Infrastructure as Code (Terraform, Ansible, Git)
  • Enterprise Architecture
  • Windows Server experience (2012 – 2019)
  • Linux shell experience
  • Strong troubleshooting skills to debug and troubleshoot during production issues
  • Programming experience ideally in one of the following languages (Python, Ruby, Go, C#)
  • Layer 7 reverse proxy / Load Balancing experience (NGINX, Azure Application Gateway, F5)
  • A mindset for constant improvement and documentation
  • Initiative to fix things that you find are broken

ESSENTIAL FUNCTIONS AND BASIC DUTIES
  • Assist with CI/CD builds and releases of our software and infrastructure
  • Monitoring and alerting of our production environment
  • GitOps approach for Infrastructure as Code (IaC)
  • Enable developers with good practices
  • Help with containerization efforts for production systems
  • Build and manage cloud environments
  • Help track and maintain uptime for production systems
  • Help lead and document Root Cause Analysis (RCA)
  • If there is an issue, fix the problem once so we never have it again
  • Participate in an on-call rotation to triage and fix production issues

QUALIFICATIONS

Education/Certification:
  • College education preferred, but not required
  • Technical certification preferred in cloud, networking, or server technologies

Experience Required:
  • Technical experience troubleshooting servers, networks, and applications
  • Experience or familiarity in at least one of the following tools/technologies (Ansible, Terraform, Helm, Docker, Kubernetes, Git)

PREFERRED Experience:
  • Azure Cloud experience (IaaS and PaaS)
  • Load Balancing experience
  • Experience running distributed systems

Vacancy expired!

Subscribe Report job