HPC Engineer

  • Xcede
  • Stevenage, Hertfordshire
  • 16/03/2026
Contractor Information Technology Telecommunications

Job Description

CONTRACT - 6-12 MONTHS INITIAL
LOCATION - STEVENAGE (3 DAYS A WEEK ONSITE)

A multinational pharma company is looking for a Senior Linux HPC Engineer.

The position requires hands-on expertise with high-end workstation hardware and scientific applications, as well as a strong background in HPC techniques, including clustering and workload management with tools like Slurm.
The ideal candidate will be proficient in Red Hat Enterprise Linux (RHEL 8 & 9) and have experience with scientific and high-performance computing environments. They will also have excellent stakeholder relationship skills and the ability to communicate complex technical concepts effectively to a range of audiences, ensuring our scientists receive top-tier in-person support onsite.

Key Responsibilities

  • Enterprise Linux Administration:
    • Administer, configure, and maintain RHEL environments (specifically RHEL 8 & 9) ensuring stability, performance, and security.
    • Provide hands-on support with high-end workstation hardware for scientists, promptly addressing hardware and software issues.
  • Scientific and HPC Support:
    • Offer technical support to scientific users, bridging the gap between research demands and IT infrastructure.
    • Leverage any scientific computing experience to optimise system performance and manage specialised applications.
    • Assist with the management of high-performance compute resources, drawing on experience with Slurm, clustering, and related HPC technologies.
  • Collaboration and Stakeholder Management:
    • Work closely with other technical teams and stakeholders to align IT services with organisational needs.
    • Build and maintain strong stakeholder relationships, communicating complex technical concepts clearly.
    • Provide in-person support onsite to ensure effective resolution of issues and a high level of customer satisfaction.
  • Service Management and Process Improvement:
    • Utilise ServiceNow for tracking incidents, managing change requests, and ensuring timely resolution of service tickets.
    • Implement and follow IT best practices for incident management, performance monitoring, and network troubleshooting.
  • Additional Technical Duties:
    • Manage SSL certificates and configure web servers as needed.
    • Monitor and troubleshoot system performance issues, including understanding the impact of GPUs, networking, and other hardware components.
    • Handle vendor relationships effectively, coordinating with external partners to resolve issues and optimise service delivery.

Required Qualifications/Expectations

  • Technical Expertise:
    • A minimum of 10 years of enterprise IT experience with extensive hands-on expertise in Red Hat Enterprise Linux (RHEL), specifically RHEL 8 & 9.
    • Proven experience with high-end workstation hardware setups and scientific application support.
    • Demonstrated knowledge of scientific computing and experience in high-performance compute environments, including Slurm and clustering, are highly desirable.
    • Strong troubleshooting skills for both hardware and software issues.
  • Interpersonal Skills:
    • Excellent communication skills with a proven ability to engage and build relationships with stakeholders at various levels.
    • Experience working collaboratively with other technical teams to resolve complex problems and drive operational improvements.
    • Strong stakeholder relationship-building skills and the ability to manage vendor relationships effectively.
  • Additional Desirable Skills:
    • Working knowledge of ServiceNow and its application in incident and service management.
    • Familiarity with networking concepts, performance monitoring tools, and GPU technologies.
    • Any experience with scientific applications will be a significant advantage.
  • Onsite Requirement:
    • Must be able to work onsite to provide in-person technical support to scientists and ensure optimal system performance.
    • A minimum of 3 days a week onsite is required to build relationships with stakeholders.
    • Must also be willing to come to site at short notice, as we have physical kit that we need to support onsite.