
Lead DevOps Engineer (Data)
- London
- Permanent
- Full-time
- Platform Ownership
- Own and manage the data platform infrastructure built on AWS services (EventBridge, Lambda, EC2, MWAA, S3).
- Oversee deployment and monitoring of Snowflake, and support its integration into the broader data ecosystem.
- Infrastructure and System Reliability
- Ensure platform reliability, availability, and scalability across environments.
- Design and maintain robust monitoring, alerting, and observability frameworks to reduce MTTR and improve visibility.
- Lead and manage initiatives related to data lineage, platform health, and alert hygiene.
- CI/CD and Automation
- Enhance and expand our CI/CD processes using Bitbucket pipelines.
- Drive consistent rollout of DevOps practices across all data engineering squads.
- Champion infrastructure as code (IaC) principles using tools like Terraform or CloudFormation.
- Cross-Team Collaboration
- Work cross-functionally with data engineering squads to identify platform pain points, inefficiencies, and areas for improvement.
- Translate ambiguous platform challenges into structured initiatives and deliver value iteratively.
- Security & Networking
- Manage VPCs, security groups, IAM roles, and cross-account networking and permissions within AWS.
- Support secure integration with Microsoft PowerApps and other third-party services where applicable.
- Proven experience owning and operating production data platforms within AWS.
- Strong understanding of AWS core services: EventBridge, Lambda, EC2, S3, and MWAA (Managed Workflows for Apache Airflow).
- Experience with infrastructure reliability, observability tooling, and platform automation.
- Solid experience with CI/CD pipelines, preferably Bitbucket Pipelines.
- Familiarity with Snowflake administration and deployment practices.
- Comfortable working through ambiguity and in cross-functional, collaborative teams.
- Experience with infrastructure-as-code tools such as Terraform or CloudFormation.
- Python scripting experience (for automation or lightweight tooling).
- Exposure to MLOps tooling and workflows.
- Familiarity with Power Platform (especially Microsoft PowerApps) in a data integration context.
- Understanding of data lineage, governance, and operational monitoring in modern data stacks.
- 24 days annual leave rising to 29 days
- Enhanced parental leave
- Medicash (Health Cash Plans)
- Wellness Days
- Flexible Fridays (Opportunity to finish early)
- Birthday day off
- Employee assistance program
- Travel loan scheme
- Charity days
- Breakfast provided
- Social Events throughout the year
- Hybrid Working