Senior Site Reliability Engineer in London

Job description

Senior Site Reliability Engineer

Start: ASAP
Duration: 6-12 months
Location: hybrid, London (Tuesdays, Thursdays WFH)
Pay: negotiable, inside IR35

We're looking for an experienced DevOps Engineer to join our team on a contract basis, with a focus on AWS infrastructure, observability tooling, and CI/CD automation. This is a hands-on role supporting high-availability systems, rapid deployments, and production incident response.

Key Responsibilities
- Manage and monitor AWS infrastructure for performance and security
- Respond to production incidents, perform root cause analysis, and implement fixes
- Maintain observability tools (Prometheus, Grafana, Splunk) and write PromQL queries
- Improve and operate CI/CD pipelines using GitHub Actions and Kubernetes
- Automate infrastructure tasks with Python, Bash, Go or SQL
- Work with Git-based workflows for infrastructure as code
- Troubleshoot Kubernetes workloads and containerised services
- Participate in an on-call rotation to ensure system reliability

Your Profile

Essential:
- Solid hands-on AWS experience in a DevOps setting
- Background in incident, change, and problem management
- Strong with Prometheus, Grafana, Splunk, and PromQL
- Proficient in scripting (Python, Go, Bash, SQL)
- Skilled in GitHub, CI/CD, and Kubernetes operations

Desirable:
- Experience with Terraform or CloudFormation
- Advanced log analysis with Splunk
- Strong problem-solving and analytical thinking

Senior Site Reliability Engineer

Job description

Taylor Claxton

Consultant