Senior ML Infrastructure Engineer

Kuzco
San Francisco, CA

Kuzco is seeking a Senior ML Infrastructure Engineer to join our team. This role involves building large-scale, fault-tolerant systems that handle millions of large language model inference requests per day. If you are passionate about developing next-generation ML systems that operate at scale, we want to hear from you.

About Kuzco

We are building a distributed LLM inference network that combines idle GPU capacity from around the world into a single cohesive plane of compute for running large language models like Llama and Mistral. At any given moment, we have over 5,000 GPUs and hundreds of terabytes of VRAM connected to the network.

We are a small, well-funded team of staff-level engineers who work in person in downtown San Francisco on difficult, high-impact engineering problems. Everyone on the team has been writing code for over 10 years and has founded and run their own software company. We are high-agency, adaptable, and collaborative. We value creativity alongside technical prowess and humility. We work hard and deeply enjoy the work that we do; we are almost always online at least six days per week.

About the Role

You will be responsible for designing and implementing the core systems that power our globally distributed LLM inference network. You'll work on problems at the intersection of distributed systems, machine learning, and resource optimization.

Key Responsibilities

  • Design and implement scalable distributed systems for our inference network
  • Develop models for efficient resource allocation across a network of heterogeneous hardware with a rapidly changing topology
  • Optimize network latency, throughput, and availability
  • Build robust logging and metrics systems to monitor network health and performance
  • Conduct reviews of architecture and system design to ensure use of best practices
  • Collaborate with founders, engineers, and other stakeholders to improve our infrastructure and product offerings

What We're Looking For

  • Very strong problem-solving skills and ability to work in a startup environment
  • 5+ years of experience building high-performance systems
  • Strong programming skills in TypeScript, Python, and one of Go, Rust, or C++
  • Solid understanding of distributed systems concepts
  • Knowledge of orchestrators and schedulers like Kubernetes and Nomad
  • Use of AI tooling in your development workflow (ChatGPT, Claude, Cursor, etc.)
  • Experience with LLM inference engines like vLLM or TensorRT-LLM is a plus
  • Experience with GPU programming and optimization (CUDA experience is a plus)

Compensation

We offer competitive compensation, equity in a high-growth startup, and comprehensive benefits. The base salary range for this role is $180,000 to $250,000, depending on experience.

Equal Opportunity

Kuzco is an equal opportunity employer. We welcome applicants from all backgrounds and don't discriminate based on race, color, religion, gender, sexual orientation, national origin, genetics, disability, age, or veteran status.

If you're excited about building the future of developer-first AI infrastructure, we'd love to hear from you. Please send your resume, LinkedIn, and GitHub to [email protected].

Posted 2025-09-22
