HPC Support Engineer
JOB_52881334649434
Job type: Permanent
Location: United Kingdom
Working Pattern: Full-time
Specialism: Infrastructure
Industry: Technology & Internet Services
Pay: 90000
Closing date: 3 Feb 2025
HPC Support Engineer | Fully remote | +8 or -8 time zone cover | Unlimited holiday
Your new company
Your new role
As an HPC Support Engineer, you will support customers on a GPU cloud platform and dedicated GPU clusters. You'll work with various teams, vendors, and partners to meet SLA commitments and ensure smooth operations. Your main tasks will include resolving complex issues and managing incidents related to storage, networking, and GPU optimisation. You'll also monitor multi-node clusters to ensure they run efficiently, keep detailed records of incidents and resolutions, and collaborate with stakeholders.
What you'll need to succeed
- IT Support Background: 5+ years of experience in an IT support role, preferably in HPC or cloud environments.
- Linux: System administration from the command line.
- Networking Knowledge: Familiarity with network protocols (e.g. TCP/IP, BGP), InfiniBand, and RoCE.
- Cluster Support: Experience working with cluster management tools like SLURM and GPU monitoring systems (e.g. NVIDIA DCGM).
- Scripting and Automation: Proficiency in scripting languages (e.g. Bash, Python).
- Tools and Platforms: Familiarity with ITSM tools (e.g. ServiceNow, Jira Service Management) and monitoring solutions (e.g. Grafana, Prometheus).
- Knowledge of NVIDIA AI Enterprise Suite and software stacks relevant to GPU environments.
What you'll get in return
- Share options.
- Unlimited holiday policy.
- 100% Remote working.
- Fantastic opportunities to develop - they make a habit of promoting in-house.
- A great team with a passion for working collaboratively.
- Enhanced family-friendly policies.
- A truly flexible workplace.
What you need to do now
If you're interested in this role, click 'apply now' to forward an up-to-date copy of your CV, or call us now.
If this job isn't quite right for you, but you are looking for a new position, please contact us for a confidential discussion about your career.
Talk to Jacob Clift, the specialist consultant managing this position
Located in Southampton, 3rd Floor, One Dorset Street, Southampton
Telephone 023 82 020 113