Sr. DevOps Engineer (HPC)
SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars.
SR. DEVOPS ENGINEER (HPC)
SpaceX is looking for a Sr. DevOps Engineer with strong knowledge and experience in a world class engineering organization. This employee will be a member of the HPC team and will support SpaceX personnel and proprietary systems. The ideal candidate will be flexible and flourish in a fast paced and challenging environment. They should be a self-starter, self-motivator and possess ingenuity to excel at this position.
RESPONSIBILITIES:
- Administer and manage HPC clusters, storage systems, and high-speed networks.
- Provide application support to SpaceX employees across engineering disciplines.
- Install and integrate Linux-based compute clusters.
- Write instructional documentation and convey highly technical ideas in non-technical terms.
BASIC QUALIFICATIONS:
- 5+ years of hands-on experience with client and server hardware/software, management tools, enterprise networking, virtualization, and security technologies.
- Bachelor's degree in computer science, engineering, math, or scientific discipline and 5+ years of systems engineering experience; OR 7+ years of professional experience building software in lieu of a degree.
- Experience with Linux.
PREFERRED SKILLS AND EXPERIENCE:
- 5+ years of professional experience building, deploying and troubleshooting Linux systems.
- Experience with a scripting language (Bash, Python) to automate and solve reoccurring tasks.
- Experience building, deploying and troubleshooting HPC clusters.
- Familiarity with cluster resource managers (Slurm, PBS, LSF).
- Experience with monitoring and alerting technologies (Prometheus, Grafana, Nagios).
- Familiarity with scientific and engineering computing (CFD, FEA).
- Familiarity with ML frameworks (PyTorch, Tensorflow).
- Familiarity with GPU usage in a compute cluster and Cuda.
- Experience with containers (Docker, Podman, Singularity).
- Experience deploying and maintaining automated configuration management software (Puppet, Ansible).
- Comfortable working with mission critical and sensitive systems, with a sense of urgency appropriate to the responsibilities.
- Eligibility for access to classified material up to TS/SCI with Polygraph.
ADDITIONAL REQUIREMENTS:
- Must be willing to work extended hours and weekends as needed.
COMPENSATION AND BENEFITS:
Pay Range:
Sr. DevOps Engineer: $160,000.00-$220,000.00/per year
Your actual level and base salary will be determined on a case-by-case basis and may vary based on the following considerations: job-related knowledge and skills, education, and experience.
Base salary is just one part of your total rewards package at SpaceX. You may also be eligible for long-term incentives, in the form of company stock, stock options, or long-term cash awards, as well as potential discretionary bonuses and the ability to purchase additional stock at a discount through an Employee Stock Purchase Plan. You will also receive access to comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short and long-term disability insurance, life insurance, paid parental leave, and various other discounts and perks. You may also accrue 3 weeks of paid vacation and will be eligible for 10 or more paid holidays per year. Employees accrue paid sick leave pursuant to Company policy which satisfies or exceeds the accrual, carryover, and use requirements of the law.
ITAR REQUIREMENTS:
- To conform to U.S. Government export regulations, applicant must be a (i) U.S. citizen or national, (ii) U.S. lawful, permanent resident (aka green card holder), (iii) Refugee under 8 U.S.C. § 1157, or (iv) Asylee under 8 U.S.C. § 1158, or be eligible to obtain the required authorizations from the U.S. Department of State. Learn more about the ITAR here .
SpaceX is an Equal Opportunity Employer; employment with SpaceX is governed on the basis of merit, competence and qualifications and will not be influenced in any manner by race, color, religion, gender, national origin/ethnicity, veteran status, disability status, age, sexual orientation, gender identity, marital status, mental or physical disability or any other legally protected status.
Applicants wishing to view a copy of SpaceX’s Affirmative Action Plan for veterans and individuals with disabilities, or applicants requiring reasonable accommodation to the application/interview process should reach out to [email protected] .
Recommended Jobs
Personal Care Assistant (PCA)
Job summary: The PCA is a non-skilled worker that may possess a certificate of training either formally (i.e., Certified Nurse Assistant (CNA) or in-house formal training; competency and assessment …
Construction Project Accountant 2
Company Overview: RBA Builders is a full, General Contractor with self-performing tenant improvement division offering services in the commercial, industrial, entertainment, educational, aerospace…
Cook
We are seeking a dependable and detail-oriented Cook to join our behavioral health team. The Cook is responsible for preparing and serving nutritious, well-balanced meals that support the health an…
Infrastructure Engineer
About Northwood: Northwood is on a mission to transform connectivity between earth and space and bring the benefits of space to the masses through innovations in space communications technologies. I…
Sr. Software Test Engineer - Versa Messaging Server (VMS)
Description About Us At Versa Networks, we're revolutionizing the way businesses connect, secure, and optimize their networks. Our mission is to secure anywhere, anytime access to anything.…
Senior Product Manager (Zero Trust Identity)
Company Description Our Mission At Palo Alto Networks® everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vi…
Senior Software Engineer - Backend
About Hive Hive is the leading provider of cloud-based AI solutions to understand, search, and generate content, and is trusted by hundreds of the world's largest and most innovative organization…
On-Site Psychiatrist (MD)
You Matter • Make a difference every day in the lives of the underserved • Join a mission driven organization with a people first culture • Excellent career growth opportunities Join us an…
Control Systems Lead
Control Systems Lead Location Roseville, CA (Industrial Area East area) : Job Title Control Systems Lead Summary Our Purpose: At C&W Services, we live by the belief that Better Never Settles.…