Machine Learning Engineer
About Us
Relace is building the models and infrastructure that code agents reach for. We power the fastest model on OpenRouter (10,000 tok/s) and deliver optimized small language models designed for retrieval, application, and core code generation functions.
Our technology supports some of the world’s fastest-moving companies — including Lovable, Figma, and Vercel — as they deploy and scale code generation to hundreds of millions of users. We recently raised our Series A from a16z, and we’re growing quickly.
Our team is made up of mathematicians, physicists, and computer scientists who are deeply passionate about their craft. If you thrive on ambitious technical problems, care about elegant systems design, and want to build the foundation of how code gets written at scale, this is the place for you.
The Role
We’re looking for a Machine Learning Engineer who loves getting close to the metal. This is a hands-on engineering role focused on making models faster, more efficient, and more reliable through low-level optimizations and smart systems design.
The ideal candidate is excited by CUDA kernels, memory layouts, GPU scheduling, and squeezing performance out of complex training and inference workloads. They should be just as comfortable optimizing compute and networking paths as they are working alongside research teams to productionize new architectures.
This is a role for someone who enjoys deep performance tuning, understands the realities of running large-scale ML systems, and thrives in fast-moving, high-leverage environments.
Requirements
Strong background in systems-level ML engineering.
Experience with CUDA, GPU kernel optimization, and performance tuning.
Fluency in Python and at least one systems language (C++ or Rust preferred).
Familiarity with distributed training frameworks (e.g., PyTorch, JAX, DeepSpeed, or similar).
Experience working with large-scale training or inference infrastructure.
Understanding of memory management, parallelization, and hardware-aware model optimization.
2+ years of experience working in ML infrastructure or performance-critical environments.
Willingness to work in-person from our SF office in FiDi.
Recommended Jobs
FullStack Engineer
We are looking for a Full Stack Engineer to build the cockpit for our autonomous system. While our backend engines run the math, you will build the interface that builds trust. You will own th…
Senior Data Analyst
&##128205; Palo Alto, CA (Hybrid – 2 days/week in office) Reports to: Director Data Science + Analytics About Midi Health: Midi Health is the fastest-growing virtual clinic focused exclus…
Shipping Assistant
Shipping Assistant Location Livermore, CA : Responsibilities:- Strap and secure loads to company truck.- Unload and load trucks using a reach truck or other equipment as necessary- Prepare and packag…
Associate Teacher
Employment Type: Full-time. Operating Hours: Monday through Friday, schedules vary between 8:00 am to 5:30 pm; When: ASAP. Looking for an Associate Teacher for a progressive preschool …
Stock Associate - PT - Bloomingdale's Century City - US
Stock Associate - FT - Bloomingdale's Century City Los Angeles, California, United States THE ALLSAINTS TEAM At AllSaints we are in the business of feelings - making our custome…
Member of Technical Staff Fullstack Engineer
At Inflection AI, our public benefit mission is to harness the power of AI to improve human well-being and productivity. The next era of AI will be defined by agents we trust to act on our behalf.…
Staff Flight Test Engineer
The Test Engineering team works across the entire spectrum of products ranging from Altius, Lattice, Ghost UAS, Sentry Tower, cUAS and our other platforms. Our team conducts full system level develop…
Client Service & Operations Manager - Middle Market Banking
Overview: MANAGER CLIENT OPERATIONS WHAT IS THE OPPORTUNITY? Directs and manages the client and operational services of a Commercial, Specialty or Corporate Banking Services Division. Manages ri…
Systems Integration & Test Engineer
Turion Space is building spacecraft for national security and in-space operations. From debris removal to space domain awareness to building a more secure space economy, we are a team focused on gett…
IT Technician Intern - Fall 2025
The San Francisco Museum of Modern Art is one of the largest museums of modern and contemporary art in the United States and a thriving cultural center for the Bay Area. We cannot imagine life withou…