Job Details

ID #43720832
State Pennsylvania
City Oaks
Job type Permanent
Salary USD Depends on Experience
Source Coforge
Showed 2022-06-30
Date 2022-06-22
Deadline 2022-08-21
Category Et cetera

Site Reliability Consultant - SME

Pennsylvania, Oaks, 19456 Oaks USA

Vacancy expired!

Role: Site Reliability Consultant - SME

Location: Oaks, PA

Mode of Hire: Full Time

Skills Required
  • Independently designs, implements, productionizes and maintains site reliability guidelines, processes and systems
  • Service Level Definition, Configuration and Measurement:
      • Define SLIs, SLOs and SLAs specific to each application or system
      • Configure monitoring and alerting tools suitable for each product and/or platform team
      • Measure reliability and resilience (through pre-defined SLIs and SLOs) using monitoring/alerting tools to drive continuous improvement based on data analysis
  • Incident Management:
      • Facilitate incident response by engaging various teams and stakeholders, while providing robust communication and visibility to the organization during service interruptions
      • Provide Root Cause Analysis for failures
      • Experience with a modern incident management platform to effectively drive incident response and problem resolution
  • Monitoring & Alerting:
      • Debug defects and develop dashboards using modern monitoring tools (e.g. New Relic, Splunk, AIOps) to reduce MTTD (detection time) and MTTR (resolution time)
      • Build monitors and alerts designed to manage SLAs, optimize performance, and minimize outages
      • Construct E2E customer journey dashboards and alerts for customized transactions and applications
  • Automates reliability requirements into system and application implementations and updates, including the implementation of self-healing solutions (Ansible, Terraform, etc.).
  • Work with the product management team to contribute to 1) the identification of reliability features and requirements and 2) level-of-effort estimates
  • The ideal candidate should have advanced coding skills in Python, Shell and YAML, preferably with a minimum of 5-7 years of experience in all of these or similar languages.
  • Candidates should have 10+ years’ experience in SRE and either or both of the following roles: DevOps, Software Engineering, leveraging automation extensively to achieve key deliverables.
The role of Sr. Site Reliability Consultant is to support and enforce reliability elements in technological solutions that deliver an exceptional customer experience. As part of the Site Reliability Engineering team, you’ll leverage your development background to promote a framework that delivers optimal levels of performance and reliability throughout systems and services. You will collaborate with product teams and software developers to improve the resiliency of our applications through development based on reliability requirements.
