Role: HPC Cluster Support – CIBA 4 (Senior) Position Type: Part-Time Contract (20hrs/week) Contract Duration: 6 months Work Hours: EST or PST Location : 100% Remote We're seeking a Senior HPC Cluster Support Engineer to maintain and support large-scale production HPC environments running Bright Cluster Manager and Slurm. This role focuses on cluster operations, hardware troubleshooting, user support, and vendor coordination to ensure uninterrupted high-performance computing workloads. Key Responsibilities Manage and support HPC clusters: job submission issues, queue management, and user troubleshooting Monitor cluster health and resolve node failures, networking issues, and domain problems Diagnose hardware faults (GPUs, boards, power, nodes) and perform remote checks using BMC tools (Dell iDRAC, HPE iLOM, Supermicro) Troubleshoot InfiniBand, Panasas storage, and network integration issues Coordinate repairs and escalate with vendors (ParkPlace, VDura) Apply system updates, patches, and configurations Collaborate with users and provide regular status updates Required Skills Strong experience with Bright Cluster Manager and Slurm Linux systems administration and advanced troubleshooting Hardware diagnostics, BMC remote management tools Experience with InfiniBand, HPC storage systems (Panasas), and vendor escalation Active Directory integration for Linux is a plus