Job Details

ID #45355017
State California
City Newark
Job type Contract
Salary USD TBD
Source SBS Corp.
Showed 2022-09-01
Date 2022-08-30
Deadline 2022-10-29
Category Et cetera

Sr Engineer, IT Infrastructure Observability

Newark, California 94560, USA

Vacancy expired!

Title: Sr Engineer, IT Infrastructure Observability
Location: Newark, CA (Day 1 onsite)
Job Description:
The Role:

  • Highly skilled in operational efficiency, optimal utilization, and system resiliency for a real-time streaming analytics platform
  • Use knowledge of and experience with monitoring systems and applications to conduct initial troubleshooting and root cause analysis
  • Support the integration of new technologies with monitoring systems, including deployment and decommissioning of monitoring agents
  • Implement service-level metrics and service-level objectives that act as service health indicators
  • Produce metrics and capacity planning reports as needed to support the monitored environments
  • Expert in collecting metrics for performance related monitoring
  • Design processes that help improve observability and system resiliency
  • Work with other system administrators, engineers, and vendors to resolve hardware and software issues
  • Maintain monitoring documentation, diagrams and standard operating procedures as required
  • Coordinate and participate in key process improvements as they relate to operations monitoring
  • Experience defining, creating, and supporting monitoring dashboards
  • Identify potential trends in performance gaps and recommend modifications to the standardized work documents by using process improvement principles
  • Triage site availability incidents and proactively work towards reducing MTTR for customer-impacting incidents
  • Coordinate with Project Management as a subject matter expert (SME) on infrastructure and IT operations projects; automate as required
  • Capable of working and collaborating on multiple projects or tasks with high attention to detail
  • Keep abreast of emerging technologies to identify, research, evaluate and present concepts and solutions to management for implementation considerations
  • Advocate standards, best practices, policies, and procedures
Qualifications:
  • Bachelor's degree in Information Technology, Computer Science, or related field
  • 6+ years of experience as a Site Reliability Engineer or Production Engineer
  • 5+ years of experience in Windows and Linux server environments
  • 2+ years of experience working with networks
  • Excellent Systems Observability & Monitoring experience
  • Strong coding experience in Python, Perl, Bash
  • Knowledge of orchestration engines and package management including Kubernetes and Helm
  • Good understanding of modern application development, version control, and development workflows
  • Customer service oriented with strong interpersonal and leadership skills
  • Experience with InfluxDB, Grafana, Prometheus, Splunk, SolarWinds, ELK, and APM tools
  • Knowledge of containers and cloud platforms (AWS, Azure and/or Google Cloud Platform)
Reach me at sudheer(at)mysbscorp(dot)com / Ph
