img
Contract

Lead PySpark Engineer

City of London
money-bag £281 - £292/day 388
Posted 2 weeks ago

PySpark Engineer Lead

As the Technical Lead, you will drive the high-stakes migration of legacy SAS analytics to a modern, cloud-native PySpark ecosystem on AWS. This isn''t just a lift and shift you will refactor complex procedural logic into scalable, production-ready distributed pipelines for a Tier-1 financial services environment.



Core Responsibilities

  • Engineering Leadership: Design and develop complex ETL/ELT pipelines and Data Marts using PySpark, EMR, and Glue.

  • Legacy Modernisation: Architect the conversion of SAS Base/Macros into modular, testable Python code using SAS2PY and manual refactoring.

  • Performance Tuning: Optimise Spark execution (partitioning, shuffling, caching) to ensure cost-efficient processing of massive financial datasets.

  • Quality & Governance: Implement rigorous CI/CD, unit testing, and data reconciliation frameworks to ensure "penny-perfect" accuracy.



Technical Stack

  • Engine: PySpark (Expert), Python (Clean Code/SOLID principles).

  • AWS: EMR, Glue, S3, Athena, IAM, Lambda.

  • Data Modeling: SCD Type 2, Fact/Dimension tables, Data Vault/Star Schema.

  • Legacy: Proficiency in reading/debugging SAS (Base, Macros, DI Studio).

  • DevOps: Git-based workflows, Jenkins/GitLab CI, Terraform.

Randstad Technologies is acting as an Employment Business in relation to this vacancy.

Other jobs of interest...

Randstad Technologies Recruitment
City2 weeks ago
money-bag281.00-281.00 Daily

Perform a fresh search...

  • Create your ideal job search criteria by
    completing our quick and simple form and
    receive daily job alerts tailored to you!

Jobs. Straight to your inbox!

if (string.IsNullOrEmpty(Model.Item1.RecruiterReference) || Model.Item1.ImportId == 50144) { if (!(!string.IsNullOrEmpty(Model.Item1.JobURL) && Model.Item1.ImportId != 101)) { } }