SonicJobs Logo
Left arrow iconBack to search

Mid-level GPU Cloud Support Engineer

Hays Specialist Recruitment Limited
Posted 18 hours ago, valid for a month
Location

Bournemouth, Dorset BH89BJ, England

Contract type

Full Time

In order to submit this application, a Reed account will be created for you. As such, in addition to applying for this job, you will be signed up to all Reed’s services as part of the process. By submitting this application, you agree to Reed’s Terms and Conditions and acknowledge that your personal data will be transferred to Reed and processed by them in accordance with their Privacy Policy.

Sonic Summary

info
  • The company is seeking a Mid-level GPU Cloud Support Engineer with at least 2 years of IT support experience, preferably in GPU cloud environments.
  • This fully remote position offers a salary that is competitive within the industry, alongside share options and unlimited holiday.
  • Key responsibilities include incident management, GPU cloud support, cluster monitoring, documentation, collaboration, and user assistance.
  • Candidates must possess proficiency in Linux system administration, scripting skills in languages such as Bash or Python, and familiarity with ITSM tools.
  • The role promises fantastic opportunities for career development within a collaborative team and a flexible workplace.

Your new companyI'm excited to partner with a trailblazing company that's revolutionising the future of cloud infrastructure! Their cutting-edge, high-performance, GPU-optimized platform is not only pushing the boundaries of AI and HPC but also making strides towards a greener, more sustainable world. This is a fully remote position, so you can work from anywhere without ever needing to step into an office. Plus, you'll love the fantastic perk of unlimited holiday, giving you the freedom to recharge and thrive whenever you need it. Your new roleAs a Mid-level GPU Cloud Support Engineer, you'll provide top-notch support to customers on a GPU cloud platform and customer-dedicated GPU clusters. You'll collaborate closely with cross-functional teams, external vendors, and partners to uphold SLA commitments and maintain operational excellence.Key Responsibilities:

  • Incident Management: Handle support enquiries, investigate complex issues related to storage (e.g., Vast, Weka), networking (e.g., Infiniband, RoCE), and GPU optimisation.
  • GPU Cloud Support: Resolve issues promptly, adhering to SLAs for critical incidents, including system outages and performance problems.
  • Cluster Monitoring: Perform health checks on multi-node clusters, ensuring optimal node performance, GPU utilisation, and service availability.
  • Documentation: Keep detailed records of incidents, troubleshooting steps, resolutions, and root cause analyses.
  • Collaboration: Work in real-time with internal and external stakeholders.
  • User Assistance: Provide best-effort guidance on interactive tools.

What you'll need to succeed

  • Shift Flexibility: Willingness to work on either a -8 or +8 shift pattern.
  • Support Background: 2+ years of experience in IT support, preferably in GPU cloud environments.
  • Linux Skills: Proficiency in Linux system administration from the command line.
  • Scripting and Automation: Skilled in scripting languages (e.g., Bash, Python).
  • Tools and Platforms: Familiarity with ITSM tools (e.g., ServiceNow, Jira Service Management) and monitoring solutions.

What you'll get in return

  • Share options.
  • Unlimited holiday policy.
  • 100% remote working.
  • Fantastic opportunities for career development with a strong internal promotion culture.
  • A collaborative team passionate about working together.
  • Enhanced family-friendly policies.
  • A truly flexible workplace.

What you need to do nowIf you're interested in this role, click 'apply now' to forward an up-to-date copy of your CV, or call us now.If this job isn't quite right for you, but you are looking for a new position, please contact us for a confidential discussion about your career.

Hays Specialist Recruitment Limited acts as an employment agency for permanent recruitment and employment business for the supply of temporary workers. By applying for this job you accept the T&C's, Privacy Policy and Disclaimers which can be found at hays.co.uk

Apply now in a few quick clicks

In order to submit this application, a Reed account will be created for you. As such, in addition to applying for this job, you will be signed up to all Reed’s services as part of the process. By submitting this application, you agree to Reed’s Terms and Conditions and acknowledge that your personal data will be transferred to Reed and processed by them in accordance with their Privacy Policy.