Vacancy expired!
- Use Engineering approach to solve operational problems, letting the machine do operational tasks as much as possible (automate monitoring, troubleshooting and build & deployment process).
- Develop meaningful monitoring, actionable alerting, logging, and availability dashboard/metrics that provide service health, usage, and performance data about the service to reduce or eliminate outage.
- Be on a PagerDuty rotation to respond to the incidents and provide support
- Engage in and improve the software development lifecycle – from inception and design, through development, deployment, operation and refinement
- Work with Network, infrastructure, DBAs, and other Support groups for the on-premise data center migration from one location to other.
- You will influence and design infrastructure, architecture, standards and methods for large-scale systems
- Support services prior to production via infrastructure design, software platform development, load testing, capacity planning and launch reviews
- Maintain services during deployment and in production by measuring and monitoring key performance and service level indicators including availability, latency, and overall system health
- You will automate system scalability and continually work to improve system resiliency, performance and efficiency
- You will practice sustainable incident response as part of an on-call rotation and through blameless postmortems
- You will remediate tasks within corrective action plan via sustainable, preventative, and automated measures whenever possible
- BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent job experience required
- 6+ years of experience developing and/or administering software in Windows with Dotnet applications
- Prior experience setting up, configuring and deploying applications to Windows server environments
- Solid hands-on experience with Windows Service, IIS, F5, and Chef is a MUST
- Experienced with Splunk monitoring and creating alerts, CI/CD, Jenkins
- Prior experience in Data Center migration is an added plus
- Experience in monitoring infrastructure and application uptime and availability to ensure functional and performance objectives
- Able to work in multiple shifts
- Experience/knowledge in languages such as Dotnet, Python, Ruby, Bash, Java, Go, Perl, JavaScript and/or node.js
- Demonstrable cross-functional knowledge with systems, storage, networking, security and databases
- System administration skills, including automation and orchestration of Linux/Windows using Chef, Puppet, Ansible, Salt Stack and/or containers (Docker, Kubernetes, etc.)
- Proficiency with continuous integration and continuous delivery tooling and practices
- Strong analytical and troubleshooting skills
- System problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
Vacancy expired!