Job Details

ID #8326395
State California
City Santaclara
Job type Full-time
Salary USD TBD TBD
Source Palo Alto Networks
Showed 2021-01-18
Date 2021-01-14
Deadline 2021-03-15
Category Et cetera
Create resume

Senior Manager, Site Reliability Engineering (Prisma Access)

California, Santaclara, 95050 Santaclara USA

Vacancy expired!

Job Description

With the Prisma Access solution, enterprise customers can confidently embrace a cloud-first approach for networking and security to seamlessly connect and secure your mobile users, branch offices and retail locations. Customers can connect their mobile workforce and consolidate their existing security products with on-demand, scalable, secure remote access.

The Site Reliability Engineering team has end to end ownership of Application availability, performance, and scalability.

Your Career

Site reliability and DevOPS Engineering are key functions of Prisma Access, and theis Leadership role will be responsible for infrastructure platforms and application support strategy, roadmap, and technical implementation of existing and future product lines.

Your Impact

  • Manage Compute Platform as a service with end-to-end responsibility for delivering and supporting the on-prem and cloud compute platforms ( GCP, AWS), Kubernetes, Terraform, Ansible, CI/CD, Artifactory etc for continuously deploying applications.
  • Own automation for delivery of Platform services using Infrastructure as Code. Build standard playbooks for Platform which can be consumed across multiple teams in the organization.
  • Lead delivery of Cloud Infrastructure strategies aligned with business objectives with a focus on mass Application movements into the Cloud involving design, implementation and Infrastructure automation.
  • Build a high performing team of Cloud Platform SMEs and platform leads while mentoring traditional platform SMEs on cloud computing best practices, technology, and adoption.
  • Build and manage an SRE function that owns application availability and performance and manage it through automation and proactive/predictive alerts by having a strong data analytical tool set to identify areas of improvement.
  • Implement comprehensive service monitoring to ensure uptime and performance, including synthetic, real user, system, application performance, dashboards etc.
  • Define, measure, and meet key Service Level Objectives including availability, performance, incidents and chronic problems.
  • Own end-to-end availability and performance of mission critical services and build automation to prevent problem recurrence; eventually automate response to all non-exceptional service conditions.
  • Partner with application and business stakeholders to ensure high quality product is developed and released into production. Establish and periodically update the Release Policy which governs the release process and details release categories, release activities, role & responsibilities, exception, etc.
  • Work closely with Enterprise Architecture and Information Security to specify and document solutions and practices.
  • Keep abreast with evolving threats/risks, industry trends and work to implement best practices in the organization.

Qualifications

  • BA/BS degree in Computer Science or related technical field, or equivalent practical experience.
  • 10+ years of hands-on technical experience combined with strong management and communication skills.
  • Solid understanding of Linux, Networking, TCP-IP, Routing, Switching, Firewalls, Load balancers and other infrastructure components
  • Solid understanding of modern cloud technologies and developer family of products: GKE, Serverless, Cloud Build, Monitoring and Logging, as well as the Microservices, DevSecOps etc.
  • Experience running revenue generating applications in a public cloud and IaaS, including real world experience with at least one public cloud provider: AWS, Google Cloud or Microsoft Azure.
  • Experience building, scaling, and running production operations for heterogeneous applications.
  • Strong troubleshooting experience and skillset to resolve incidents across multiple domains.
  • Ability to nurture and support a strong operations culture: customer/service focus excellent technology; high quality implementations; self-motivated innovation and problem-solving.
  • Demonstrated ability of establishing and maintaining metrics-based process improvement.
  • Demonstrated ability to develop strong alliances with those outside of your immediate organization.
  • Experience in building and managing strong technical teams.
  • Excellent communications, organization, and time management skills.

Additional Information

Our Commitment

We’re trailblazers that dream big, take risks, and challenge cybersecurity’s status quo. It’s simple: we can’t accomplish our mission without diverse teams innovating, together.

We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at[emailprotected]

Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

All your information will be kept confidential according to EEO guidelines.

#LI-TD1

Vacancy expired!

Subscribe Report job