Vacancy expired!
- Define a roadmap for all engineering teams to utilize fully automated, self-service, highly scalable, cost-efficient, observable, auditable and reliable infrastructure services as standard practice
- Drive the execution of this roadmap across the engineering organization, collaborating with SREs and senior engineers across engineering while also performing hands-on work on the most critical challenges
- Provide expert technical guidance and ongoing engineering design review to teams planning and implementing large migrations, service-oriented architecture, broad architectural shifts, and capacity growth
- Build a metrics-driven operational culture standardizing our practices for SLO definition and review as well as for logging, monitoring, alerting, and on-call practices
- Make iterative improvements to blameless incident management processes, root cause analyses, outage prevention, and service recovery strategies across the engineering organization
- Partner closely with Security, Quality, and Product teams to achieve high priority security, privacy, compliance, reliability, and business-continuity objectives on our overall roadmap
- Propose and drive large improvements to production systems to achieve a significant impact to our business and engineering teams
- Mentor and coach engineers to be curious and effective at discovering and solving technical challenges
- 8- 10+ years ' experience demonstrating hands-on technical leadership and business impact in combining software engineering skills with systems engineering skills to solve complex automation and reliability challenges
- D eep technical experience with various cloud providers, containerization technologies, automated deployment frameworks, orchestration frameworks, monitoring, logging, alerting, system internals, networking, databases, distributed systems, and service-oriented architecture
- Ability to implement load, stress, performance, and reliability testing standards at scale to improve service, platform, and infrastructure resiliency
- A drive to promote openness, diversity of opinions, and inclusive discussions at all times to evaluate a wide variety of ideas and perspectives in solving challenging problems
- Clear decision making skills and good trade-offs in complex situations comprising multiple opinions, needs, teams, technologies, cloud providers, and architectural settings
- Ability to communicate effectively with stakeholders ranging from executives to junior engineers across the breadth and depth of the engineering organization
- Ability to exemplify high accountability, integrity, and resilience to maintain focus on both big-picture goals and milestones to get there
- Drive to enable the engineering organization to innovate and deliver with greater speed and safety
Vacancy expired!