Vacancy expired!
100% REMOTE. We're looking for a Site Reliability Engineering Lead to play a key role in a data center automation initiative.
We can facilitate W2 and corp-to-corp consultants. For our W2 consultants, we offer a great benefits package that includes medical, dental, and vision benefits, 401k with company matching, and life insurance.
Responsibilities of the SRE Lead:
- Act as a technical lead specialist for data center operations
- Work with operations leads and SMEs to provide solutions in line with best practices
- Work with DevOps teams to ensure reliability of services to minimize business impact
- Collaborate with MSPs to identify root causes of critical incidents and opportunities for automation
- Provide technical and process guidance to team members
- Identify opportunities to improve operational stability and performance
- Establish, maintain, and evolve concepts in CI/CD pipelines for new and existing services
Qualifications of the SRE Lead:
- Bachelor's degree and at least 10 years of experience in IT system maintenance or administration
- At least 5 years of experience solving problems across the entire stack
- At least 5 years of experience with monitoring tools and script development for system automation/network administration
- Experience with design and implementation of high availability/fault tolerant orchestration tools
- Experience with monitoring and observability solutions and methodologies
- Experience with observability and AIOps tools and methodologies, including products such as Splunk and Dynatrace
- Strong integration experience focused on REST, SOAP, JSON, CLI, and SQL
- Strong ITSM/ITOM domain knowledge
- Experience designing, building, and maintaining cloud environments across IaaS, PaaS, and SaaS service models, as well as AKS
- Experience with cloud-based storage and database concepts
- Expertise in Shell scripting and Python