Job Details

ID #44967433
State Minnesota
City Minneapolis / st paul
Job type Contract
Salary USD TBD TBD
Source Tanson Corp
Showed 2022-08-18
Date 2022-08-18
Deadline 2022-10-17
Category Et cetera
Create resume

C2H - (2) - Sr Site Reliability Engr (Grafana, Splunk, Prometheus, Kafka, Google Cloud) - Remote

Minnesota, Minneapolis / st paul, 55401 Minneapolis / st paul USA

Vacancy expired!

Description: Combine two of the fastest-growing fields on the planet with a culture of performance, collaboration and opportunity and this is what you get. Leading edge technology in an industry that's improving the lives of millions. Here, innovation isn't about another gadget, it's about making health care data available wherever and whenever people need it, safely and reliably. There's no room for error. Join us and start doing your life's best work.(sm) You'll enjoy the flexibility to telecommute from anywhere within the U.S. as you take on some tough challenges. Primary Responsibilities: Layer in instrumentation in the development process so that applications can be monitored Establish measurements that are used to detect internal problems before they result into user visible outages Build processes and diagnostics tools to troubleshoot, maintain and optimize solutions and respond to customer and production issues Embrace continuous learning of engineering practices to ensure industry best practices and technology adoption, including DevOps, Cloud and Agile thinking Tech debt reduction/ Tech transformation including Open source adoption, Cloud adoption, HCP assessment and adoption Contribution to client Inner source / industry community Can you please provide a summary of the project/initiative which describes what's being done?. 5+ years of experience as a Site Reliability Engineer 5+ years of experience creating runbooks, processes, and test plans around reliability, performance, etc. of infra/applications 5+ years of experience in integrating monitoring and alerting into cloud software solutions 5+ years of experience Defining Service Identify and measure SLOs, SLAs and SLIs 5+ years of experience performing root cause analysis/postmortem after each Incident and delivering resolution for tools and automation failures 3+ years of experience in implementing dashboards to help teams visualize logs, instrumentation and other data to ensure optimal performance of the applications. What does the ideal candidate background look like (ex: healthcare specific background, etc.)? We want to be as specific as possible with our firms so they can find the type of candidate you're looking for. Healthcare would be the first preference Of the required skills listed, which would you consider the top 3? Please list your expectations regarding years of experience for each requirement. 1. Grafana 2. Splunk. 3. Prometheus, Kafka, Google Cloud. What experience will set candidates apart from one another? -The Candidates should have strong knowledge on Python, Google Cloud, Kafka, Grafana and Prometheus. Are you open to candidates that would need to be 100% remote for the duration of the engagement -Yes. Are you open to candidates that cannot convert to FTE without sponsorship -Yes What does the team structure look like how many members and what is the break-down of the team's skill sets (ex: 1 PM, 4 Developers, etc. We have total 5 members team members with Engineering Team(Development), Operation team, Performance team for load testing. What does the interview process look like? a. How many rounds? 2. b. Video vs. phone? Video c. How technical will the interviews be? : We have strong core technical people with Role of Principal Site Reliability Engineering. d. When do you anticipate starting the interview process? As soon as possible.

Vacancy expired!

Subscribe Report job