DevOps Engineer

Center for AI Safety
San Francisco, CA

The Center for AI Safety (CAIS) is a leading research and advocacy organization focused on mitigating societal-scale risks from AI. We address AI’s toughest challenges through technical research, field-building initiatives, and policy engagement, along with our partner 501(c)(4) organization, the Center for AI Safety Action Fund (CAISAF).

We’re looking for a versatile DevOps Engineer to support our cloud-based GPU cluster and contribute to engineering projects for our research team. In this role, you’ll work with our cloud provider to maintain and scale our infrastructure and help users resolve the technical issues they face. Your work will enable our research team to run experiments productively. It will also empower researchers at Stanford, Berkeley, CMU, Cambridge, Harvard, and other top universities worldwide, who partner with CAIS by using our cloud infrastructure, to produce technical research on AI safety. You will also support our research team’s other engineering needs, such as developing and maintaining lightweight public-facing websites for various research projects. Depending on the organization’s needs and your skillset, you may also contribute to engineering projects for other teams.

This is a great opportunity for a generalist engineer who enjoys operating at the intersection of infrastructure and software development and is excited to work on a wide variety of technical challenges in a mission-driven environment.

Key Responsibilities:

  • Maintain our cloud infrastructure to ensure scalability, availability, and performance, and design and test upgrades and new features.
  • Collaborate with service providers to maintain high availability.
  • Monitor cluster resource usage, generate billing reports, and support capacity planning to ensure efficient utilization of cluster resources.
  • Develop and maintain lightweight web-based tools, dashboards, and other services end-to-end.
  • Maintain and update existing websites (e.g., static sites, research tools).
  • Collaborate with research and operations teams to scope and implement new technical projects as needed.

You might be a good fit if you:

  • Are a generalist engineer with experience in full-stack development.
  • Have previous SRE or DevOps experience managing customer-facing systems in a 24/7 environment.
  • Have built simple web applications or tools (e.g., using Flask, React, or static site generators).
  • Are excited to take ownership of diverse technical projects and collaborate across a small, fast-moving team.

The following skills and experiences would be beneficial, though you do not need all of them before starting the role:

  • Experience provisioning and maintaining distributed systems using containerization tools such as Docker or Apptainer.
  • A solid understanding of distributed systems, including storage, networking, and security.
  • Experience with ML pipelines, HPC systems, or SLURM-based workflows.

$100,000 - $140,000 a year

The Center for AI Safety is an Equal Opportunity Employer. We consider all qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, ancestry, age, disability, medical condition, marital status, military or veteran status, or any other protected status in accordance with applicable federal, state, and local laws. In alignment with the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records for employment.

If you require a reasonable accommodation during the application or interview process, please contact [email protected].

We value diversity and encourage individuals from all backgrounds to apply.

Posted 2025-09-22
