Senior Site Reliability Engineer

OverviewLocation:
Remote / Hybrid – UK
Sector:
E-Commerce and Retail Platforms
A global retail brand is scaling its e-commerce and digital customer platforms, handling millions of daily transactions and peak seasonal traffic. To support this growth, they are hiring a Site Reliability Engineer with deep expertise in observability, cloud scalability, and performance tuning.
What you’ll do
Build and maintain highly scalable cloud infrastructure for large-scale e-commerce platforms.
Develop monitoring and observability frameworks to ensure fast response to performance bottlenecks.
Optimise CDN, caching, and APIs for high-traffic shopping events (e.g., Black Friday).
Drive automation and CI/CD pipelines to accelerate feature delivery without compromising stability.
Partner with software engineering teams to ensure always-on shopping experiences.
What we’re looking for
Proven track record in high-scale distributed systems (retail, e-commerce, digital platforms).
Expertise in observability stacks (Grafana, Prometheus, Datadog, NewRelic, Elastic).
Strong cloud skills (AWS/GCP/Azure) including Kubernetes and serverless.
Solid coding skills for automation (Python, Go, JavaScript, Bash).
Experience optimising performance in high-traffic digital platforms.
This is your chance to build reliability at retail scale, where seconds of downtime mean millions in lost revenue.
Highly competitive salary plus bonus % paid yearly.
Venquis is acting as an Employment Agency in relation to this ..... full job details .....