SonicJobs Logo
Left arrow iconBack to search

Senior Site Reliability Engineering

Randstad Technologies
Posted 10 hours ago, valid for 9 days
Location

London, Greater London EC1R 0WX

Contract type

Full Time

In order to submit this application, a Reed account will be created for you. As such, in addition to applying for this job, you will be signed up to all Reed’s services as part of the process. By submitting this application, you agree to Reed’s Terms and Conditions and acknowledge that your personal data will be transferred to Reed and processed by them in accordance with their Privacy Policy.

Sonic Summary

info
  • The position is for a Senior Site Reliability Engineer, located remotely in the UK, on a full-time 1-year contract basis.
  • Candidates should have at least 5 years of experience managing distributed systems on Linux, with 2+ years of development experience in Ruby, Python, Go, or similar languages.
  • The role involves scaling and optimizing Prometheus architecture, maintaining large ElasticSearch clusters, and building high-throughput Kafka pipelines.
  • The successful candidate will be responsible for developing self-service APIs, robust alerting systems, and deploying infrastructure using Terraform.
  • Salary details are not provided in the job description, but interested applicants are encouraged to apply with their CV to raghav.manrai@randstad.co.uk.

Job Title: Site Reliability EngineerLocation: Remote (UK)Type: Full-Time (1-Year Contract)Working Hours: 11 AM - 7 PM

Are you passionate about building and managing reliable, large-scale cloud systems? We're looking for a Senior Site Reliability Engineer to join a high-performing Observability team. In this role, you'll play a critical part in ensuring our cloud services remain performant and scalable, supporting billions of daily requests.

Key Responsibilities
  • Scale and optimize Prometheus architecture to manage millions of active metrics.
  • Operate and maintain large ElasticSearch clusters (2000TB+).
  • Build and manage high-throughput Kafka pipelines processing hundreds of thousands of events per second.
  • Develop self-service APIs, robust alerting systems, and deploy infrastructure with Terraform.
  • Support observability initiatives to monitor and improve critical cloud services.
What We're Looking For
  • 5+ years of experience managing distributed systems on Linux (Debian/Ubuntu preferred).
  • 2+ years of development experience with Ruby, Python, Go, or similar languages.
  • Expertise in technologies such as ElasticSearch, Kafka, Prometheus, Terraform, Ansible, and more.
  • A strong passion for solving complex challenges in large-scale distributed systems.
  • A proactive, curious mindset with a focus on quality and customer experience.

This is an urgent vacancy where the hiring manager is shortlisting for an interview immediately. Please apply with a copy of your CV or send it raghav. Manrai @ randstad .co .uk

Randstad Technologies is acting as an Employment Business in relation to this vacancy.

Apply now in a few quick clicks

In order to submit this application, a Reed account will be created for you. As such, in addition to applying for this job, you will be signed up to all Reed’s services as part of the process. By submitting this application, you agree to Reed’s Terms and Conditions and acknowledge that your personal data will be transferred to Reed and processed by them in accordance with their Privacy Policy.