Vacancy expired!
- Design and develop tools that will aide in improving reliability of our infrastructure.
- Engage with engineering teams to improve on-call efficiencies, drive incident management and post-mortem analysis.
- Develop expertise in Client's Infrastructure and best practices and bring that to ad-platforms to run a world class distributed systems.
- Improve areas like capacity planning, configuration management and monitoring.
- Design and improve architectures of new and existing systems based on the principles of reliability and high availability with extensive logging and observability.
- Own the reliability of ad-platforms.
- Confirmed experience supporting internet-facing production services and distributed systems.
- Experience operating container based platforms like Kubernetes, Mesos, Nomad (preferred)
- Experience in troubleshooting docker based deployments.
- Good programming skills in one of C, Java, Python or Go.
- Expertise in operating Linux based systems, with a solid understanding of its internals.
- Demonstrated problem solving ability utilizing creative and innovating thinking but also adhering to a strong sense of ownership, customer service, and integrity demonstrated through clear communication.
- Drive to be self-motivated, and eager to learn.
- Design our next generation container platforms to run ad services.
- Create tooling to improve the observability of ad systems. Create frameworks that enable engineers to interact with clients Infrastructure
- Create robust deployment and delivery pipelines Create systems to develop black box testing capabilities for ad delivery
Vacancy expired!