Backend Software Engineer (ML Infra)

Rockstar

San Francisco, CA

Rockstar is recruiting for a mobile-first digital product studio that turns ideas into extraordinary experiences. They are a team of dynamic and savvy professionals who know how to create killer digital products. Our lean structure and remote team mean we can move fast while still delivering top-notch technology and design.

Our client is building the AI backbone for the next generation of intelligent products. They help fast-growing AI startups design, fine-tune, evaluate, deploy, and maintain specialized models across text, vision, and embeddings.

Think of them as “AWS for AI models”—not data or raw compute, but a full-stack backend for fine-tuning, reinforcement learning, inference, and long-term model maintenance.

Their customers are Series A–C AI companies building enterprise-grade products. Their promise is simple: they make your AI system better.

They are hiring a Backend Software Engineer (ML Infrastructure) to help design, build, and scale the core systems that power large-scale model training and deployment.

The candidate will work on distributed training pipelines, cloud-native infrastructure, and internal developer platforms that support fine-tuning, reinforcement learning, and inference at scale. This role sits at the intersection of backend engineering and ML systems—the candidate will collaborate closely with ML engineers while owning production-grade infrastructure.

This is an ideal role for an early-career engineer who wants to work on real distributed systems, GPU workloads, and modern ML infrastructure—not dashboards or CRUD apps.

What You’ll Do

Build & Scale Core Infrastructure

- Design and implement backend systems that support large-scale ML workloads, including fine-tuning and reinforcement learning.

- Build distributed training and inference pipelines that are efficient, fault-tolerant, and observable.

- Develop internal developer tools and platforms that make it easier for ML engineers to train, evaluate, and deploy models.

Cloud & Systems Engineering

- Work on cloud-native systems using containers and orchestration (e.g., Kubernetes).

- Optimize systems for performance, reliability, and cost efficiency, especially for GPU-heavy workloads.

- Implement monitoring, logging, and observability for long-running training jobs and production services.

Collaborate with ML Engineers

- Partner closely with ML engineers to support evolving model architectures, training workflows, and evaluation needs.

- Translate ML requirements into scalable backend and infrastructure solutions.

Who You Are

Required

- 1–3 years of backend engineering experience, ideally working on production systems.

- Strong fundamentals in distributed systems, networking, and backend architecture.

- Experience building systems that scale under real load.

- Comfortable working in Python and/or Go (or similar backend languages).

- Excited to work on-site in San Francisco with a fast-moving early-stage team.

Strongly Preferred

- Experience with or exposure to ML infrastructure or ML platforms.

- Familiarity with GPU workloads, training pipelines, or inference systems.

- Experience with containerization and orchestration (Docker, Kubernetes).

- Contributions to or deep familiarity with ML infrastructure libraries such as:

- Ray

- vLLM

- SGLang

- or similar distributed ML systems

Bonus

- Computer science background from a top-tier program or equivalent demonstrated excellence.

- Open-source contributions, research projects, or side projects in systems or ML infrastructure.

- A track record of high ownership and technical curiosity.

Posted 2025-12-25

Recommended Jobs

INDUSTRIAL SUPERVISOR, PRISON INDUSTRIES (DENTAL LAB) - CALIFORNIA MEDICAL FACILITY

California Correctional Health Care Services

Solano County, CA

Job Description and Duties Effective July 1, 2025, in accordance with the applicable Memorandum of Understanding, the Personal Leave Program 2025 (PLP 2025) was implemented. PLP 2025 requires each…

View Details

Posted 2025-12-18

Machine Learning Engineer

Voltai

Palo Alto, CA

About Voltai Voltai is developing world models, and agents to learn, evaluate, plan, experiment, and interact with the physical world. We are starting out with understanding and building hardware;…

View Details

Posted 2025-12-10

Software Engineer III, Shield

Box

Redwood City, CA

Shield is a team of engineers specializing in security and protecting the flow of our customers' content so that information doesn't fall into the wrong hands. Shield does that by detecting threats us…

View Details

Posted 2026-01-01

Chemical Operator

Henkel

Rancho Dominguez, CA

What you´ll do Operates machines and production equipment safely in accordance with instructions. Sets up or adjusts equipment according to manufacturing specifications. Monitors the quality…

View Details

Posted 2026-01-12

Principal Software Engineer - API Infrastructure

Rubrik

Palo Alto, CA

About the team Our team is responsible for building the foundational API layer for all user and system interaction with Rubrik products. We connect our distributed SaaS products, and federated…

View Details

Posted 2026-01-07

Product Manager - Pokémon GO

Scopely

San Francisco, CA

Our mission is to encourage exploration of the real world together with friends, family, and community through the universal appeal of Pokémon. The ideal candidate will be excited to build fun exp…

View Details

Posted 2025-11-28

Research Associate Support Resource - Molecular Biology & Cell-based Assay (Analytical Development)

SGS Consulting

California

Job Responsibilities: ~Develop, optimize, and execute analytical assays for proteins, DNA, adeno-associated viral (AAV) nanoparticles, and related cell-based assays. ~Perform routine testing usin…

View Details

Posted 2026-01-06

AR Collection Speacialist

Wizixtechnologygroupinc

Roseville, CA

Position: Accounts Receivable & Collections Location: Roseville, CA Our fast-paced headquarters office is seeking an Accounts Receivable Collections Specialist in Roseville, CA. Do you like …

View Details

Posted 2025-12-19

Complex Senior Event Manager - Hilton San Francisco Union Square and Parc 55

Hilton

San Francisco, CA

Come join the team at the Hilton San Francisco Union Square located in the heart of Downtown San Francisco! Our hotels are located a block from the Curran and ACT theaters, and just two blocks from U…

View Details

Posted 2026-01-12

BHO Technician I (Weekend AM )

Big Oil Co.

Lancaster, CA

BHO Technician I (Weekend AM ) Location Lancaster, CA : *Interested applicants MUST apply by filling out the form below: Job title : BHO Technician I - Weekend AM Compensation: $16.50/hr - $18…

View Details

Posted 2026-01-09