Vacancy expired!
Be part of a new function at Tradeweb, building and operating our data platform, which is responsible for ingesting, processing and managing data from all of Tradeweb's businesses. The platform will have to accommodate a wide range of use cases, from simple customer-facing data APIs to large-scale machine learning models. As an SRE, you will bring operational discipline to the running of critical data pipelines and systems, and reduce toil through standardization and automation. This is a remote-based role and would suit someone who is comfortable with synchronous and asynchronous communication styles and with working with colleagues in other time zones.
Job Responsibilities:
- Support the release of new services through capacity planning, rollout planning and release management.
- In collaboration with data engineers, define and implement monitoring strategies, define SLAs and error budgets.
- Build and deploy automation tooling for supported services and data pipelines.
- Troubleshoot and remediate issues with the services you manage.
- Manage and run critical production services.
- Track and execute continuous improvements.
Qualifications:
- Strong understanding of Linux. Windows a plus.
- Strong proclivity for automation and DevOps practices and tooling such as Git, Ansible, Terraform
- Strong experience working with monitoring and logging tools: Prometheus, ELK, Grafana.
- Good programming experience in at least one of: Bash, Python, C or Java.
- Familiarity with container orchestration platforms such as Kubernetes, Nomad.
- Understanding of general networking protocols such as TCP/IP, DNS, TLS.
- Broad exposure to at least one cloud platform: AWS, Google, Azure
- Experience with PostgreSQL, SQL Server, Oracle, Redis or Kafka a strong plus
- Familiarity working with the open source software community a strong plus.
- Strong verbal and written communication skills.
- Financial Services experience a plus but not required.
- BS or higher in a technical field: CS, Physics, Maths etc.