Site Reliability Engineer (Prometheus, Grafana, Ansible, Terraform, Jenkins, AWS)
£70,000 - £80,000 + Annual Bonus
Hybrid - Manchester
We are currently working with a leading gambling company dedicated to providing exceptional gaming experiences. They are looking for an experienced Site Reliability Engineer with a strong skill set in system reliability to join its world class technology team. This role is ideal for someone who has 4+ years of experience within the observability and monitoring space, along with being a true mentor to the more junior engineers within the team.
As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability, performance, and scalability of our critical IT systems. You will be responsible for implementing and maintaining robust observability and monitoring infrastructure, analysing system data, automating routine tasks, and collaborating with development teams to optimize our systems.
Responsibilities:
- Design, implement, and maintain observability and monitoring infrastructure using industry-leading tools.
- Analyse system performance data to identify and resolve issues proactively.
- Automate routine tasks to improve efficiency and reliability.
- Collaborate with development teams to ensure that new features and changes are released reliably.
- Stay up to date on the latest Site Reliability Engineering best practices and technologies.
Requirements:
- Strong experience in Site Reliability Engineering or a related field.
- Proficiency in using observability and monitoring tools (e.g., Prometheus, Grafana, ELK Stack).
- Excellent analytical and critical thinking skills.
- Experience with automation tools (e.g., Ansible, Terraform).
- Strong understanding of cloud platforms (e.g., AWS, GCP, Azure).
- General infrastructure administration skills (e.g., Networking, Server Management with either Linux/Windows)
- Ability to work effectively in a collaborative team environment.
Benefits:
- Competitive salary and bonus package
- Hybrid work arrangement (2 days a week) with flexible office hours
- Opportunities for professional development and growth
- 25 days annual leave
If you are an enthusiastic and skilled Site Reliability Engineer looking to join a fast-paced and innovative company, we encourage you to apply.
Site Reliability Engineer (Prometheus, Grafana, Ansible, Terraform, Jenkins, AWS)
£70,000 - £80,000 + Annual Bonus
Hybrid - Manchester
IND_PC1
Carbon60, Lorien & SRG - The Impellam Group STEM Portfolio are acting as an Employment Business in relation to this vacancy.