
Support Engineer, SRE for Trading Business
- London
- Permanent
- Full-time
- You will build telemetry and automation solutions that are in alignment with broader technology platforms; Support incident responses, blameless postmortem, design and implement improvements to prevent incident reoccurrence
- Modernize technology observability practices with an emphasis on top-down monitoring, white box monitoring.
- Analyze effort patterns (user queries, service requests, incidents, workflows) for optimization.
- Design, code, test, and deliver software to eliminate manual operational work.
- Implement self-healing and resiliency patterns, exercise failure cases regularly to validate resilience assumptions.
- Plan, lead, supervise and optimize the production related software and infrastructure for capacity and resiliency
- Relevant experience working in a similar capacity
- Ability to write scripts and web applications in multiple languages (react, python, JavaScript, .net)
- Knowledge of build and configuration tools (for example: Gitlab, Github, SolarWinds, CHEF, Puppet, Ansible, TeamCity)
- Knowledge of scheduling tools (for example: Autosys, Cron, Bob)
- Knowledge of profiling tools (for example: Datadog, Valgrind)
- Knowledge of monitoring tools (for example: Geneos, PagerDuty, Nagios)
- System and network administration and troubleshooting skills (Linux/Unix and Windows). Working knowledge of infrastructure components (e.g. routers, load balancers, cloud products, container systems, compute, storage, and networks)
- Strong communication skills to be able to manage stakeholder expectations
- Strong problem solving and troubleshooting skills
- Respond and prioritize multiple issues at a time in a timely matter and perform a structured analysis of the root cause
- Strong curiosity and bias for proactive planning, action, ownership, learning and continuous improvement.
- Strong interpersonal skills and ability to nurture relationships with all internal/external partners, promoting diversity of perspectives, ideas and culture
- Proficiency with any major RDBMS
- Bachelor’s degree in computer science or equivalent.
- Experience with 12 factor applications
- Strong experience with Python
- Experience with front end development with Node.js / React
- Experience with Datadog
- Experience with AWS CDK
- Experience with Kubernetes / Azure
- Knowledge of fixed income and/or equities products
- An understanding of ITIL support standard methodologies and experience with Service Management (Incident/Problem/Change Management, etc.)