Senior ML infrastructure engineer

Kuzco
San Francisco, CA

Kuzco is seeking a Senior ML Infrastructure Engineer to join our team. This role involves developing large-scale, fault-tolerant systems that handle millions of large language model inference requests per day. If you are passionate about developing next-generation ML systems that operate at scale, we want to hear from you.

About Kuzco

We are building a distributed LLM inference network that combines idle GPU capacity from around the world into a single cohesive plane of compute that can be used for running large-language models like Llama and Mistral. At any given moment, we have over 5,000 GPUs and hundreds of terabytes of VRAM connected to the network. Learn more here .

We are a small, well-funded team of staff-level engineers who work in-person in downtown San Francisco on difficult, high-impact engineering problems. Everyone on the team has been writing code for over 10 years, and has founded and run their own software companies. We are high-agency, adaptable, and collaborative. We value creativity alongside technical prowess and humility. We work hard, and deeply enjoy the work that we do; we are almost always online at least six days per week.

About the Role

You will be responsible for designing and implementing the core systems that power our globally distributed LLM inference network. You'll work on problems at the intersection of distributed systems, machine learning, and resource optimization.

Key Responsibilities

  • Design and implement scalable distributed systems for our inference network
  • Develop models for efficient resource allocation across a network of heterogeneous hardware and quickly changing topology
  • Optimize network latency, throughput, and availability
  • Build robust logging and metrics systems to monitor network health and performance
  • Conduct reviews of architecture and system design to ensure use of best practices
  • Collaborate with founders, engineers, and other stakeholders to improve our infrastructure and product offerings

What We're Looking For

  • Very strong problem-solving skills and ability to work in a startup environment
  • 5+ years of experience in building high performance systems
  • Strong programming skills in Typescript, Python, and one of Go, Rust, or C++
  • Solid understanding of distributed systems concepts
  • Knowledge of orchestrators and schedulers like Kubernetes and Nomad
  • Use of AI tooling in development workflow (ChatGPT, Claude, Cursor, etc)
  • Experience with LLM inference engines like vLLM or TensorRT-LLM is plus
  • Experience with GPU programming and optimization (CUDA experience is a plus)

Compensation

We offer competitive compensation, equity in a high-growth startup, and comprehensive benefits. The base salary range for this role is $180,000 - $250,000, plus equity and benefits, depending on experience.

Equal Opportunity

Kuzco is an equal opportunity employer. We welcome applicants from all backgrounds and don't discriminate based on race, color, religion, gender, sexual orientation, national origin, genetics, disability, age, or veteran status.

If you're excited about building the future of developer-first AI infrastructure, we'd love to hear from you. Please send your resume, LinkedIn, and GitHub to [email protected].

Posted 2025-09-22

Recommended Jobs

Fumigation Technician

Advanced Integrated Pest Management
Rancho Cordova, CA

Are you looking to grow your career in a family owned and oriented environment? Advanced IPM is searching for driven and passionate professionals to join our Service Team as Commodity Fumigation Tech…

View Details
Posted 2025-10-19

Lead DevOps Engineer

Aeg Worldwide
Los Angeles, CA

Company Information For more than 20 years, AEG has played a pivotal role in transforming sports and live entertainment. Annually, we host more than 160 million guests, promote more than 10,000 sh…

View Details
Posted 2025-09-22

Receptionist (H2) - Days

Robinson Pharma
Santa Ana, CA

We are expanding and need an enthusiastic  Receptionist  to join our team. This individual must be professional and have excellent communication skills. The Receptionist is the first point of contact…

View Details
Posted 2025-09-17

Mathematical Scientist

Pit.AI Technologies Inc.
San Jose, CA

Pit.AI Technologies is taking on one of the most exciting challenges of our time, namely solving intelligence, in one of the most competitive industries ever, investment management!  The team enjoys…

View Details
Posted 2025-09-17

Senior Software Engineer

Assort Health
San Francisco, CA

About the Company Assort’s vision is to make exceptional healthcare accessible anytime, anywhere, for everyone. We are building the most trusted patient-facing multimodal AI agent with industry-lead…

View Details
Posted 2025-09-14

Principal Associate, Data Scientist - Privacy-preserving Machine Learning and Analytics

Capital One
San Jose, CA

Principal Associate, Data Scientist - Privacy-preserving Machine Learning and Analytics Data is at the center of everything we do. As a startup, we disrupted the credit card industry by indiv…

View Details
Posted 2025-10-31

Senior Data Engineer (US-Remote)

Mdpanel
Los Angeles, CA

Our Mission: MDpanel is one of the largest providers of expert medical opinions in the United States. We are committed to being the most coveted partner for physicians, carriers, attorneys, and pa…

View Details
Posted 2025-10-22

Senior Optical System Test Engineer

Arista Networks
Cupertino, CA

Company Description Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets us apart is our relentless …

View Details
Posted 2025-09-22

Accounting Assistant (Accounts Payable)

Madera Community Hospital
Madera, CA

Madera Community Hospital Located in the heart of Central California, Madera Community Hospital is a General Acute Care, private, not-for-profit hospital dedicated to improving and maintaining the…

View Details
Posted 2025-09-22

Route Driver

1-800-GOT-JUNK? - San Francisco Bay
San Jose, CA

Job Opportunity with 1-800-GOT-JUNK? Want to get paid to work outdoors, stay active, and make a difference? Join 1-800-GOT-JUNK?, the World’s Largest Junk Removal Service, known for our clean, shiny…

View Details
Posted 2025-10-31