
Site Reliability Engineer
- Solihull, West Midlands
- £49,000-57,000 per year
- Permanent
- Full-time
- Build resilient, fault-tolerant, auto-healing systems and architecture throughout our infrastructure to ensure maximum usability for our clients.
- Add automation, continuous integration, and delivery (CI/CD) to the infrastructure and products, ensuring that we're adding safety every step of the way.
- Forsee problems and risks with the creation of, and operating the infrastructure, adding tests, measurements, and alarms where you see the optimal opportunity.
- Push your team and those around you to containerize and unwind monoliths to ensure that each piece of the system can scale independently.
- Bring your thirst to learn and your unnatural curiosity to bear by continually asking why and building systems, tools, and automation to make running the complex routine.
- Share what you've learned with us and have a desire to keep learning as we work through projects.
- Be available when the inevitable goes awry to help bring order from the chaos.
- Bachelor's in CS or related field (or equivalent experience)
- 3-5 years of experience
- Configuration management suites (Terraform, Chef, Ansible, Puppet, CI/CD)
- Networking, VPCs, proxies, CDNs
- Cloud resource provisioning through API/CLI
- Linux OS fundamentals (packaging, patching, configuration, triage)
- Kubernetes basic understanding, service-reprovisioning, CLIs
- Experience creating views, alerting from systems like Grafana, Greylog
- Artifact building (containers, machine images, life-cycle management)
- Atlassian tools (JIRA, Confluence)
- Hybrid Office Schedules
- Blameless Postmortems
- On-call rotations
- Cross-functional teams, standups, and plans
- Programming/scripting in: Shell, Go, Python, Ruby
- Unquenchable Curiosity
- SaaS experience preferred, or working in multi-product groups