
Senior Site Reliability Engineer
- London
- Permanent
- Full-time
- Advise:
- Work closely with engineering teams in designing and developing systems that are resilient and highly performant at a tremendous scale, and maintaining the foundational platform for running Reddit's infrastructure.
- Amplify:
- Identify and build capabilities into our foundational Infrastructure and Platform services, which are used by Reddit engineering teams to build, deploy, and operate Reddit.
- Deliver software to improve the availability, scalability, latency, and efficiency of observability components.
- Identify and engineer away risk across Reddit's systems.
- Automate:
- Take repetitive, manual, or risky tasks and automate them out of existence. Build tools and integrate systems to support Reddit's evolution.
- Automate critical aspects of the event driven development process
- Diagnose:
- Draw on your knowledge of distributed systems to identify and fix network, system, and service-level issues. Practice sustainable incident response, and drive structural improvement with blameless postmortem.
- Share on-call responsibilities.
- Optimize:
- Observe and improve performance, reduce cost, and improve the experience for millions of users
- Contribute upstream changes to the open source projects we use
- 5+ years of experience in Software Engineering, Site Reliability Engineering, or a development-focused DevOps role.
- Proficiency in one or more programming languages. We're predominantly writing code in Go and Python.
- Experience with Kubernetes and Cloud systems.
- Familiarity with distributed systems development, bonus if familiar with any of the specific tools (Prometheus, Thanos, Grafana, Vector, Clickhouse, Otel, Loki)
- Experience with the development and operation of high-traffic backend systems.
- A demonstrated ability to debug, fix, and optimize code.
- Troubleshooting skills that span applications, networking (TCP/IP), and systems.
- Strong working knowledge of Linux and containers.
- Excellent communication and collaborative skills.
- Pension Scheme
- Private Medical and Dental Scheme
- Life Assurance, Income Protection
- Workspace benefit for your home office
- Personal & Professional development funds
- Family Planning Support
- Commuter Benefits
- Flexible Vacation & Reddit Global Days Off