Backend Software Engineer (ML Infra)

Rockstar
San Francisco, CA

Rockstar is recruiting for a mobile-first digital product studio that turns ideas into extraordinary experiences. They are a team of dynamic and savvy professionals who know how to create killer digital products. Our lean structure and remote team mean we can move fast while still delivering top-notch technology and design.

Our client is building the AI backbone for the next generation of intelligent products. They help fast-growing AI startups design, fine-tune, evaluate, deploy, and maintain specialized models across text, vision, and embeddings.

Think of them as “AWS for AI models”—not data or raw compute, but a full-stack backend for fine-tuning, reinforcement learning, inference, and long-term model maintenance.

Their customers are Series A–C AI companies building enterprise-grade products. Their promise is simple: they make your AI system better.

They are hiring a Backend Software Engineer (ML Infrastructure) to help design, build, and scale the core systems that power large-scale model training and deployment.

The candidate will work on distributed training pipelines, cloud-native infrastructure, and internal developer platforms that support fine-tuning, reinforcement learning, and inference at scale. This role sits at the intersection of backend engineering and ML systems—the candidate will collaborate closely with ML engineers while owning production-grade infrastructure.

This is an ideal role for an early-career engineer who wants to work on real distributed systems, GPU workloads, and modern ML infrastructure—not dashboards or CRUD apps.

What You’ll Do

Build & Scale Core Infrastructure

- Design and implement backend systems that support large-scale ML workloads, including fine-tuning and reinforcement learning.

- Build distributed training and inference pipelines that are efficient, fault-tolerant, and observable.

- Develop internal developer tools and platforms that make it easier for ML engineers to train, evaluate, and deploy models.

Cloud & Systems Engineering

- Work on cloud-native systems using containers and orchestration (e.g., Kubernetes).

- Optimize systems for performance, reliability, and cost efficiency, especially for GPU-heavy workloads.

- Implement monitoring, logging, and observability for long-running training jobs and production services.

Collaborate with ML Engineers

- Partner closely with ML engineers to support evolving model architectures, training workflows, and evaluation needs.

- Translate ML requirements into scalable backend and infrastructure solutions.

Who You Are

Required

- 1–3 years of backend engineering experience, ideally working on production systems.

- Strong fundamentals in distributed systems, networking, and backend architecture.

- Experience building systems that scale under real load.

- Comfortable working in Python and/or Go (or similar backend languages).

- Excited to work on-site in San Francisco with a fast-moving early-stage team.

Strongly Preferred

- Experience with or exposure to ML infrastructure or ML platforms.

- Familiarity with GPU workloads, training pipelines, or inference systems.

- Experience with containerization and orchestration (Docker, Kubernetes).

- Contributions to or deep familiarity with ML infrastructure libraries such as:

- Ray

- vLLM

- SGLang

- or similar distributed ML systems

Bonus

- Computer science background from a top-tier program or equivalent demonstrated excellence.

- Open-source contributions, research projects, or side projects in systems or ML infrastructure.

- A track record of high ownership and technical curiosity.

Posted 2026-02-28

Recommended Jobs

Physical Therapist Assistant

Blue United Sourcing
San Jose, CA

Travel Physical Therapist Assistant (PTA) – Skilled Nursing Facility 📍 Salinas, CA 🕒 13-Week Assignment | 36 Hours per Week 💲 $44–$48 per hour 🚀 Start Date: ASAP 📆 Schedule Options: Su…

View Details
Posted 2026-01-15

Developer Program Manager

Roblox
San Mateo, CA

As a Developer Program Manager on the Developer Relations team you'll join a growing organization which enables the success of developers on our platform through a variety of programs. You will rep…

View Details
Posted 2026-02-25

Automotive Technician - Vista (Vista)

Evan's Tire & Service Centers
Vista, CA

Overview: Evans Tire & Service Centers strives to create a family-like atmosphere for our team, and our customers. We have been servicing cars and assisting drivers since 1976 making us your best ch…

View Details
Posted 2026-02-28

Veterinarian

Anaheim Hills Pet Clinic
Anaheim, CA

Anaheim Hills Pet Clinic is seeking an experienced Associate Veterinarian to join our highly collaborative, five-doctor team in beautiful Anaheim, CA. As one of North Orange County’s most establ…

View Details
Posted 2026-02-24

Lead Site Reliability Engineer

Stuut
San Francisco, CA

Stuut is transforming accounts receivable for B2B companies—making collections smarter and faster for companies that have historically relied on manual processes that are labor intensive and costly. …

View Details
Posted 2026-02-22

Senior Electrician

MalaceHR
Thousand Oaks, CA

Job Title: Senior Electrician Pay: $40 – $42 per hour   Open Positions / Shifts: Position 1: Monday – Friday | 7:00 AM – 3:00 PM (Overtime & weekends based on business needs) Pos…

View Details
Posted 2026-02-19

Senior Software Engineer, Data Platform

Parafin
San Francisco, CA

About Us: At Parafin, we’re on a mission to grow small businesses. Small businesses are the backbone of our economy, but traditional banks often don’t have their backs. We build tech that makes i…

View Details
Posted 2026-02-19

Infotainment Manager (North America Quality Center - NAQC)

Hyundai America Technical Center, Inc. (HATCI)
California

Job description: Infotainment Manager Hyundai's North America Quality Center (NAQC) is looking for a Manager for the Infotainment Team of the Investigation Team I Group The Team: NAQC…

View Details
Posted 2026-02-07

Site Reliability Engineer - US Government

Xai
Palo Alto, CA

About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on eng…

View Details
Posted 2026-02-13

Telemarketer - State Farm Agent Team Member (PT)

Peggy Langin - State Farm Agency
Diamond Bar, CA

Looking for a part-time job that provides meaningful work and competitive compensation? Consider a position in a State Farm Agent's office as a telemarketing specialist. Responsibilities W…

View Details
Posted 2026-02-04