
Lead SRE - Team Lead
- London
- Permanent
- Full-time
- Internally, working closely with the Core SRE team, who own and maintain the SRE projects and services, you will build integrations and implement new features.
- Externally, collaborating with all teams and disciplines, including software developers, cloud engineers, and product managers, you will integrate SRE tools and create bespoke solutions for individual products, including payments and cards.
- Seek continuous improvement of reliability, monitoring, and alerting for our mission-critical microservices.
- Design monitoring and alerting that is customer journey-based and directly proportionate to customer experience, supporting our 'you build it, you own it' model. Our alerts must be highly precise, as developer teams are engaged immediately with no triage.
- Think outside of the box to eliminate toil and enable controls excellence, automating as much as possible.
- Contribute to internal tools, including our state-of-the-art framework for SLI and error budget aggregation.
- Enhance performance testing, forecasting, and capacity planning framework.
- Contribute to chaos engineering framework.
- Manage a growing team whilst remaining hands-on and close to the code
- Degree in computer science or another highly technical, scientific discipline.
- Proven experience as a software engineer, including proficiency in at least one systems programming language (e.g., Python, Go, Java).
- Working knowledge of microservice infrastructure components.
- Excellent debugging and troubleshooting skills.
- Experience with Kubernetes.
- Experience in cloud computing (preferably AWS).
- Experience with common SRE toolchains, such as Grafana, Prometheus, Elasticsearch, Kibana, and Jaeger, is a plus.