ML Infrastructure Engineer

Phizenix
Menlo Park, CA

Menlo Park, CA | On-Site | Full-Time/Direct Hire


Client Opportunity | Through Phizenix

Phizenix, a certified minority and women-led recruiting firm, is hiring on behalf of an AI startup pioneering diffusion-based large language models—built for faster generation, multimodal integration, and scalable enterprise deployment.

We’re looking for an ML Infrastructure Engineer to help build the infrastructure that powers large-scale model training and real-time inference. You’ll collaborate with world-class researchers and engineers to design high-performance distributed systems that bring advanced LLMs into production.

Responsibilities

  • Design and manage distributed infrastructure for ML training at scale
  • Optimize model serving systems for low-latency inference
  • Build automated pipelines for data processing, model training, and deployment
  • Implement observability tools to monitor performance in production
  • Maximize resource utilization across GPU clusters and cloud environments
  • Translate research requirements into robust, scalable system designs

Must-Haves

  • PhD in Computer Science, Engineering, or a related field (or equivalent experience)
  • Strong foundation in software engineering, systems design, and distributed systems
  • Experience with cloud platforms (AWS, GCP, or Azure)
  • Proficiency in Python and at least one systems-level language (C++, Rust, or Go)
  • Hands-on experience with Docker, Kubernetes, and CI/CD workflows
  • Familiarity with ML frameworks like PyTorch or TensorFlow from a systems perspective
  • Understanding of GPU programming and high-performance infrastructure

Nice-to-Haves

  • Experience with large-scale ML training clusters and GPU orchestration
  • Knowledge of LLM-serving tools (vLLM, TensorRT, ONNX Runtime)
  • Experience with distributed training strategies (e.g., data/model/pipeline parallelism)
  • Familiarity with orchestration tools like Kubeflow or Airflow
  • Background in performance tuning, system profiling, and MLOps best practices

At Phizenix, we’re committed to supporting diverse and inclusive teams. This is your chance to shape the systems that power the next generation of AI innovation. Let’s build the future, together.

California Pay Range

$180,000 - $200,000 USD

Posted 2026-02-16
