Vacancy expired!
- Design and develop tools that will aide in improving reliability of our infrastructure.
- Engage with engineering teams to improve on-call efficiencies, drive incident management and post-mortem analysis.
- Develop expertise in Infrastructure and best practices and bring that to ad-platforms to run a world class distributed systems.
- Create frameworks that enable engineers to interact with Infrastructure
- Improve areas like capacity planning, configuration management and monitoring.
- Design and improve architectures of new and existing systems based on the principles of reliability and high availability with extensive logging and observability.
- Create tooling to improve the observability of ad systems.
- Own the reliability of ad-platforms.
- Design our next generation container platforms to run ad services.
- Create robust deployment and delivery pipelines
- Create systems to develop black box testing capabilities for ad delivery.
- Confirmed experience supporting internet-facing production services and distributed systems.
- Experience operating container based platforms like Kubernetes, Mesos, Nomad
- Experience in troubleshooting docker based deployments.
- Good programming skills in one of C, Java, Python or Go.
- Expertise in operating Linux based systems, with a solid understanding of its internals.
- Bachelor’s degree in Computer Science or equivalent industry experience
- Experience with container based platform, Nomad preferred
Vacancy expired!