Job Details

ID #20724261
State Missouri
City St. Louis
Job type Contract
Salary USD TBD
Source Sage IT Inc
Showed 2021-10-06
Date 2021-10-04
Deadline 2021-12-02
Category Et cetera

Immediate Requirement: Site Reliability Engineer (SRE), St. Louis, MO (Remote During COVID), Long-Term Contract

St. Louis, Missouri 63104, USA

Vacancy expired!

Job title: Site Reliability Engineer (SRE)
Location: St. Louis, MO (Remote During COVID)
Term: Long-term contract

Job Description:

Screening Highlights:

  • 3+ years of developing and/or administering software in public cloud (AWS, Azure or GCP).
  • 6+ months of hands-on experience in GCP.
  • Hands-on experience with Informix/Red Hat workloads running in GCE.
  • Hands-on experience managing Infrastructure as Code using Terraform.
  • Hands-on experience setting up, managing and modifying CI/CD pipelines, preferably using Jenkins.
  • Hands-on experience with scripting languages such as Python and Bash.
Roles and Responsibilities:
  • Influence and design cloud infrastructure, architecture, standards and methods for large-scale systems
  • Support services prior to production via infrastructure design, software platform development, load testing, capacity planning and launch reviews
  • Maintain services during deployment and in production by measuring and monitoring key performance and service level indicators including availability, latency, and overall system health
  • Automate system scalability and continually work to improve system resiliency, performance and efficiency
  • Practice sustainable incident response as part of an on-call rotation and through blameless postmortems
  • Remediate tasks within the corrective action plan via sustainable, preventative, and automated measures whenever possible
  • Provision and manage GCP infrastructure, including deploying and implementing Google Compute Engine (GCE) resources
  • Automate infrastructure builds and configurations
  • Build and manage CI/CD pipelines using Jenkins
  • Define, implement and assign ownership for stability/reliability targets (SLIs, SLOs, error budgets)
  • Collaborate with tribes/dev teams on reliability development (fixes, logging, delivery metrics)
Key Skillsets:
  • 3+ years of experience developing and/or administering software in public cloud, including 6+ months of hands-on experience in GCP.
  • Experience monitoring infrastructure and application uptime and availability to ensure functional and performance objectives are met.
  • Experience in languages such as Python, Ruby, Bash, Java, Go, Perl, JavaScript and/or Node.js.
  • Demonstrable cross-functional knowledge of systems, storage, networking, security and databases.
  • System administration skills, including automation and orchestration of Linux/Windows using Chef, Puppet, Ansible, SaltStack and/or containers (Docker, Kubernetes, etc.).
  • Proficiency with continuous integration and continuous delivery tooling and practices.
  • Experience managing Infrastructure as Code via tools such as Terraform or CloudFormation.
  • Experience setting up, managing and modifying CI/CD pipelines using Jenkins.
  • Significant experience configuring industry-leading infrastructure/application monitoring tools (Stackdriver, Kibana, Grafana, Datadog, Splunk, Dynatrace, AppDynamics, etc.).
