Job Details

ID #54733413
State South Carolina
City Capetown
Job type Full-time
Salary USD TBD
Source Sana Commerce
Showed 2025-10-28
Date 2025-10-28
Deadline 2025-12-27
Category Et cetera

Senior Site Reliability Engineer

Capetown, South Carolina, USA

What you’ll do

Lead incident response and postmortems: drive investigations, document learnings, and implement permanent fixes to prevent recurrence.
Manage and optimize Azure Kubernetes environments: own cluster configurations, performance, cost control, and security best practices.
Build observability systems: develop dashboards, alerts, and metrics using Dynatrace, Honeycomb, Elasticsearch, Grafana/Kibana, and Azure Monitor (KQL).
Automate for resilience: write reliable scripts in PowerShell, Bash, Python, or C#, embedding logging, rollback, and version control.
Implement Infrastructure-as-Code: design and maintain Terraform, Bicep, or ARM templates to standardize and automate deployments.
Optimize system performance: identify bottlenecks through deep monitoring, dump analysis, and right-sizing of cloud resources.
Collaborate across engineering teams: integrate reliability principles into CI/CD pipelines and the broader SDLC.
Participate in on-call rotations: lead during critical incidents, ensuring lasting fixes and operational excellence.
