Job Details

ID #12256153
State New York
City New york city
Job type Permanent
Salary USD TBD TBD
Source Mitchell Martin, Inc.
Showed 2021-04-15
Date 2021-04-14
Deadline 2021-06-13
Category Architect/engineer/CAD
Create resume

SRE/DevOps Engineer

New York, New york city, 10005 New york city USA

Vacancy expired!

Our client, a global provider of information technology products and services, is seeking an SRE/DevOps Engineer

Location: Remote

Position Type: Full Time

Job Summary:

The ideal candidate is self-driven, data-driven, and can work in a distributed team. This professional hold strong knowledge of Site Reliability Engineering and DevOps methodologies related to Delivery solutions & Platform Automation.

In this role you will be part of the Site Reliability team, sharing your experience in the field with our Delivery, Support, Product Engineering, and Infrastructure teams. You will simultaneously focus on technical excellence and quickly deliver value to customers who have deployed our software in production. The person who fills this role is a subject matter expert who excels in collaboration, open communication, and reaching across functional borders.

Key responsibilities:

-Show ownership of customer success with the platform management.

-Partner with Delivery, Engineering, and Product to steer SRE alignment and strategy to ensure reliability of the platform deployments

-Respond to client reliability concerns and agile problem resolution.

-Lead teams that design, code, test, and deliver software to ensure application performance and resiliency

-Ability to communicate with various customer teams and navigate them with WF best practices (IT, DevOps, Security, Tech).

-Strives for environment management automation either by coding it or by leading and influencing developers to build systems that are easy to run in production.

-Proactively work on the efficiency and capacity planning to set clear requirements and reduce the system resources usage to make cheaper to run for all our customers.

-Identify parts of the system that do not scale, provides immediate palliative measures, and drives long term resolution of these incidents.

-Identify Service Level Indicators (SLIs) that will align the team to meet the availability and latency objectives.

-Proposes and drives architectural changes that affect the whole company to solve scaling and performance problems

-Measure the risk of introduced features to plan and improve the infrastructure.

Qualifications/Experience:

-Bachelor's degree in Computer Science, related technical field or equivalent practical experience

-6+ years of experience with system design, algorithms, data structures, analysis, and software design

-Deep knowledge in DevOps tools and practices, Enterprise standards and security.

-Knowledge of scripting languages (e.g. shellscript, groovy) and main concepts of Object-Oriented programming (Python or Java is a plus).

-Solid knowledge on Ansible and CI/CD.

-At least 5 years' experience with AWS,Azure, Google Cloud Platform cloud technologies (S3 bucket administration; EC2; ELB; Security Groups; IGW; ACLs; etc.)

-At least 3 years of SRE experience.

-Knowledge of large-scale distributed systems in practice, including multi-tier.

-6+ years of experience managing a distributed team of engineers

-Experience conducting technical deep dives into the system configuration or code.

Vacancy expired!

Subscribe Report job