Mid-level GPU Cloud Support Engineer

Hays Specialist Recruitment Limited
Posted 13 hours ago, valid for 14 days
Location

Bournemouth, Dorset BH8 9BJ, England

Contract type

Full Time

In order to submit this application, a Reed account will be created for you. As such, in addition to applying for this job, you will be signed up to all Reed’s services as part of the process. By submitting this application, you agree to Reed’s Terms and Conditions and acknowledge that your personal data will be transferred to Reed and processed by them in accordance with their Privacy Policy.

Sonic Summary

  • The company is seeking a Mid-level GPU Cloud Support Engineer with at least 2 years of IT support experience, preferably in GPU cloud environments.
  • This fully remote position offers an attractive salary and benefits, including share options and an unlimited holiday policy.
  • Key responsibilities include incident management, GPU cloud support, cluster monitoring, and user assistance, while collaborating with internal and external stakeholders.
  • Candidates must have proficiency in scripting languages like Bash or Python and experience with ITSM tools such as ServiceNow or Jira Service Management.
  • The role requires flexibility to work on either a -8 or +8 shift pattern, ensuring operational excellence in supporting GPU cloud platforms.

Your new company

I have partnered exclusively with a pioneering company that is shaping the future of cloud infrastructure. Their innovative, high-performance, GPU-optimised platform not only drives advancements in AI and HPC but also champions sustainability for a greener, more efficient world.

This role is fully remote, with no expectation that you will ever work from an office. You'll also enjoy the perk of unlimited holiday, allowing you to recharge and thrive.

Your new role

As a Mid-level GPU Cloud Support Engineer, you will be responsible for providing support to customers on a GPU cloud platform as well as customer-dedicated GPU clusters. This role involves working closely with cross-functional teams, external vendors, and partners to uphold SLA commitments and maintain operational excellence.

Key Responsibilities:

  • Incident Management: Receive and triage support enquiries, and investigate unresolved complex issues related to storage (e.g. VAST, WEKA), networking (e.g. InfiniBand, RoCE), and GPU optimisation.
  • GPU Cloud Support: Triage issues and provide timely resolutions, working within defined SLAs for critical incidents, including system outages and performance issues.
  • Cluster Monitoring: Conduct health checks of multi-node clusters, ensuring node performance, GPU utilisation, and service availability are optimal.
  • Documentation: Maintain detailed records of incidents, troubleshooting steps, resolutions, and root cause analyses.
  • Collaboration: Work in real time with internal and external stakeholders.
  • User Assistance: Provide users with best-endeavour guidance on their interactive tools.

What you'll need to succeed

  • Crucially, you must be willing to work on either a -8 or +8 shift pattern.
  • Support Background: 2+ years of experience in an IT support role, preferably in GPU cloud environments.
  • Linux: System administration from the command line.
  • Scripting and Automation: Proficiency in scripting languages (e.g. Bash, Python).
  • Tools and Platforms: Familiarity with ITSM tools (e.g. ServiceNow, Jira Service Management) and monitoring solutions.

What you'll get in return

  • Share options.
  • Unlimited holiday policy.
  • 100% remote working.
  • Fantastic opportunities to develop - they make a habit of promoting in-house.
  • A great team with a passion for working collaboratively.
  • Enhanced family-friendly policies.
  • A truly flexible workplace.

What you need to do now

If you're interested in this role, click 'apply now' to forward an up-to-date copy of your CV, or call us now.

If this job isn't quite right for you, but you are looking for a new position, please contact us for a confidential discussion about your career.

Hays Specialist Recruitment Limited acts as an employment agency for permanent recruitment and as an employment business for the supply of temporary workers. By applying for this job you accept the T&Cs, Privacy Policy and Disclaimers, which can be found at hays.co.uk
