Machine Learning Engineer

Relace

San Francisco, CA

About Us
Relace is building the models and infrastructure that code agents reach for. We power the fastest model on OpenRouter (10,000 tok/s) and deliver optimized small language models designed for retrieval, application, and core code generation functions.
Our technology supports some of the world’s fastest-moving companies — including Lovable, Figma, and Vercel — as they deploy and scale code generation to hundreds of millions of users. We recently raised our Series A from a16z, and we’re growing quickly.
Our team is made up of mathematicians, physicists, and computer scientists who are deeply passionate about their craft. If you thrive on ambitious technical problems, care about elegant systems design, and want to build the foundation of how code gets written at scale, this is the place for you.

The Role

We’re looking for a Machine Learning Engineer who loves getting close to the metal. This is a hands-on engineering role focused on making models faster, more efficient, and more reliable through low-level optimizations and smart systems design.

The ideal candidate is excited by CUDA kernels, memory layouts, GPU scheduling, and squeezing performance out of complex training and inference workloads. They should be just as comfortable optimizing compute and networking paths as they are working alongside research teams to productionize new architectures.

This is a role for someone who enjoys deep performance tuning, understands the realities of running large-scale ML systems, and thrives in fast-moving, high-leverage environments.

Requirements

Strong background in systems-level ML engineering.
Experience with CUDA, GPU kernel optimization, and performance tuning.
Fluency in Python and at least one systems language (C++ or Rust preferred).
Familiarity with distributed training frameworks (e.g., PyTorch, JAX, or DeepSpeed).
Experience working with large-scale training or inference infrastructure.
Understanding of memory management, parallelization, and hardware-aware model optimization.
2+ years of experience working in ML infrastructure or performance-critical environments.
Willingness to work in-person from our SF office in FiDi.

Posted 2025-12-10
