DevOps Software Engineer - HPC Specialist
General Description:
We are seeking a DevOps Software Engineer – HPC Specialist, where our DevOps team supports and maintains high-performance computing (HPC) environments and secure CI/CD infrastructure that support scientific research. This role demands expertise in Linux cluster administration, Slurm workload manager, and DevOps tools such as GitLab CI/CD, Python, and JFrog Artifactory, all within a highly secure, air-gapped environment. You’ll also document complex systems and processes clearly for a variety of technical and non-technical audiences.
Essential Duties:
Administer and troubleshoot Linux-based HPC clusters running Slurm.
Manage and maintain Slurm configurations and job scheduling policies.
Collaborate with researchers to support scalable and automated scientific workflows.
Monitor and optimize HPC performance, capacity, and reliability.
Develop and automate cluster management tasks, including node provisioning, software deployment, and user environment setup.
Administer and troubleshoot CI/CD infrastructure across open and air-gapped networks.
Contribute to Infrastructure-as-Code (IaC) automation and system administration.
Collaborate with developers, system administrators, and research staff to support integrated platforms.
Write and maintain high-quality technical documentation.
Participate in Agile team activities to support iterative problem-solving and project delivery.
Required Skills:
Proven ability to communicate complex technical concepts clearly in both written and verbal formats.
Hands-on experience administering Slurm in HPC environments.
Knowledge of HPC environment architecture and common challenges in scientific computing.
Strong Linux system administration skills.
Proficiency in Python programming and scripting languages (e.g., Bash or PowerShell).
Experience with software packaging and environment management (e.g., Conda) in HPC contexts.
Strong troubleshooting, analytical, and problem-solving abilities.
Familiarity with air-gapped or high-security computing environments.
Experience working in research or scientific computing environments is highly desired.
Required Education:
BS + 6 years of experience, or MS + 4 years of experience in computer science, computer engineering, or a related field. Candidates with different experience levels will be considered for other positions.
Special Requirements:
US citizenship and ability to obtain and maintain US Government security clearance
This is an on-site position due to the need to work with air-gapped networks and sensitive information.
Compensation:
The base salary range for this full-time position is $132,765 - $165,983 + bonus + benefits.
Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range during the hiring process. Please note that the compensation details listed reflect the base salary only, and do not include potential bonus or benefits.
We are proud to be an EEO/AA employer M/F/D/V. We maintain a drug-free workplace and perform pre-employment substance abuse testing.
Recommended Jobs
Multi-Platform Producer
#WeAreParamount on a mission to unleash the power of content… you in? We’ve got the brands, we’ve got the stars, we’ve got the power to achieve our mission to entertain the planet – now all we’re …
Music Therapist
Job Title: Music Therapist (Classroom & Clinic-Based). Reports To: Clinical Director + Preschool Director. Classification: Non-Exempt, Full-Time (40 hours/week). Hourly Rate: $30-36/hour…
Carpenter
About the Role We’re looking for experienced and detail-oriented Carpenters to join a growing construction team in the Los Angeles area. This role is ideal for someone who takes pride in quality…
Instrument Assembly Technician- 2
Roles & Responsibilities Responabilite Performs any combination of tasks involved in the fabrication, manufacture, assembly, testing and packaging of medical devices as well as setting up, oper…
Staff Fullstack Software Engineer
Senior / Staff Software Engineer Location: San Francisco, CA (Hybrid 3 days in office) Experience: 5-8 years About the Opportunity Join an early-stage healthtech startup thats tran…
Staff Software Engineer, Gameplay Generalist - Unpublished R&D Product
At Riot, we build games that inspire passionate communities around the world. Engineers here bring their deep technical expertise, and also value the opportunity to widen their craft across a variety…
Staff Software Engineer, Applications
Title: Staff Software Engineer, Applications This position is based in our Campbell, California offices. This position is on-site & full-time Why Telos Health? At Telos Health, an Imperative…
Director of Lean/Six Sigma
Education, Certifications, Experience : Experience in the General Electric and Pratt/Whitney supply chain Minimum of a 4-year technical degree in science, statistics, engineering, or quality …
Off-Road Development Engineer I
Job description: Off-Road Development Engineer Off-road focused software value added feature calibration owner Organize & own the off-road and traction performance development for North…
Quality Engineer II
Job Description Summary Quality Engineer II will assist the Manufacturing Department in the establishment and implementation of programs designed to assure control of processes and products toward…