Site Reliability Engineer
- Posted 30 January 2025
- Salary £65000.00 - £70000.00 per annum
- Location Birmingham
- Job type Permanent
- Discipline Cloud and DevOps
- Reference BBBH210518_1738250277
- Contact Name Emma Mayfield
Job description
Site Reliability Engineer
Permanent
£70,000
Midlands-based / Hybrid working
As the Site Reliability Engineer, you will join the client's Platform Engineering Team to help build, manage, and support some of the client's core infrastructure.
Key areas of responsibility:
- Ensuring the platform services meet high standards for availability, reliability, and performance
- Defining and promoting best practices for observability, incident management, and operational processes
- Leading incident management efforts
- Partnering with platform engineers and product teams
- Developing and maintaining monitoring, logging, and alerting solutions to provide actionable insights into platform health and performance
Key Skills
- You will have a deep understanding of concepts such as SLAs, SLOs, and error budgets
- You will have expertise in tools such as Prometheus, Grafana, Loki, or similar
- You will have experience in leading incident response processes, including root cause analysis and implementing preventative measures
- You will be proficient in scripting languages (e.g., Python, Bash)
- You will need to work effectively with cross-functional teams
- You will be a problem solver