Vacancy expired!
Job Description
Direct HireTitle- Lead SRE Site Reliability EngineerLocation -Atlanta GA/ Remote/ Essential Functions:- Leads the adoption and implementation of cloud-based application reliability, resiliency, observability and deployment best practices for production & non-prod environments.
- Discover & Define SLA/SLO and identify business as well as systems KPIs.
- Enable robust instrumentation, collation, monitoring and utilization of such metrics along with operations/C&O teams. Define thresholds & help with alert orchestration.
- Provide 24x7 production support for owned applications on a rotational basis.
- Lead Blameless Post Mortem sessions, collaborate with cross functional teams and identify areas for improvement.
- Assist with designing and executing chaos/destructive testing, related analysis and provide feedback to requisite teams.
- Works independently and provides guidance within technical area, applying in-depth knowledge of multiple technologies as appropriate.
- Leads the adoption and implementation of cloud-based application reliability, resiliency, observability and deployment best practices for production & non-prod environments.
- Client & Define SLA/SLO and identify business as well as systems KPIs.
- Hands-on development experience with Java, Python or Go
- Experience with tools & technologies such as Prometheus, Grafana, StackDriver, Distributed tracing, AppDynamics, Dynatrace, NewRelic, PagerDuty, WireShark is a plus
- Experience with Cloud Architecture and Operations including: migration, resilience, maintainability, and cost efficiency. Knowledge of the Google Cloud Platform is a strong plus. If no Google Cloud Platform experience then experience with public cloud such as AWS or Azure
- Experience with CI/CD tools such as GIT, Maven, Jenkins, Concourse, Sonar, Artifactory, Chef, Puppet, Spinnaker
- Excellent troubleshooting skills including software, systems, and network.
- Experience with application Profiling Skills (Core Java, Thread Dumps etc.).
Vacancy expired!