Site Reliability Engineer
Matillion
- Manchester
- £53,000-79,500 per year
- Permanent
- Full-time
- You'll spend a significant amount of time working hands-on in the production of software for Matillion products
- You will discuss, establish and maintain the design patterns, framework selections and direction of the overall service reliability
- Collaborating with Engineering and Product teams to instrument system, application and business KPIs, you'll also work with members of the business teams to ensure customer-centric SLIs and SLOs are established
- There will also be opportunity to setup and define processes to support hosting and deploying of the live services
- You will provide engineering options, with unbiased pros and cons to meet problems shared by the business
- Passion for performance, observability, availability, scalability and security, with a solid understanding of networking systems and protocols
- Have previous experience of large-scale web operations in a public cloud environment and be competent in Ruby, Go, Java, Python or an equivalent programming language
- Have worked with some of the following key technologies: Prometheus, Grafana, Elasticsearch, Logstash, Kibana, OpenTelemetry, Micrometer, New Relic, Datadog
- Be experienced with cloudformation, terraform and any other infrastructure-as-code technologies
- Be confident in your ability to own and deliver projects and issues to resolution using Agile methodologies and demonstrate a definite bias for action and focus on results
- Be an excellent communicator and cross-team collaborator and strive for personal excellence through continuous development and by keeping current with developments and offerings in the observability field