Senior Site Reliability Engineer (SRE)
Senior Site Reliability Engineer (SRE) Remote12-month contract (high chance of extension) Job Description Join a global pioneer in the video game industry and own the reliability of high-traffic, revenue-critical platforms used by millions worldwide. As a Senior SRE, you''ll shape the architecture, improve platform-wide resiliency, and ensure services stay performant, scalable, and secure. This isn''t just about maintaining a single system, you''ll influence reliability across multiple services, driving improvements that touch the entire ecosystem.Key ResponsibilitiesLead incident response and troubleshooting for production systems, resolving high-severity issues and driving post-incident improvements.Influence architecture to improve platform-wide reliability, resiliency, and operational efficiency, ensuring services remain available under heavy load.Drive containerisation best practices and manage Kubernetes-based workloads at scale.Build and maintain event-driven architectures that scale globally while ensuring fault-tolerance and high availability.Automate infrastructure provisioning, deployment, and monitoring using Infrastructure as Code (Terraform, CloudFormation, Ansible, CDK).Collaborate with engineering, product, and security teams to define SLOs, SLIs, and error budgets across services.Provide mentorship, advocate SRE best practices, and ensure teams are empowered to deliver resilient, reliable systems. Experience / Must-Have SkillsExtensive experience in AWS and ..... full job details .....
Other jobs of interest...
Perform a fresh search...
-
Create your ideal job search criteria by
completing our quick and simple form and
receive daily job alerts tailored to you!