ML Infrastructure Engineer

Phizenix
Menlo Park, CA

ML Infrastructure Engineer
Menlo Park, CA | On-Site | Full-Time/Direct Hire


Client Opportunity | Through Phizenix

Phizenix, a certified minority and women-led recruiting firm, is hiring on behalf of an AI startup pioneering diffusion-based large language models—built for faster generation, multimodal integration, and scalable enterprise deployment.

We’re looking for a ML Infrastructure Engineer to help build the infrastructure that powers large-scale model training and real-time inference. You’ll collaborate with world-class researchers and engineers to design high-performance, distributed systems that bring advanced LLMs into production.

Responsibilities




  • Design and manage distributed infrastructure for ML training at scale



  • Optimize model serving systems for low-latency inference



  • Build automated pipelines for data processing, model training, and deployment



  • Implement observability tools to monitor performance in production



  • Maximize resource utilization across GPU clusters and cloud environments



  • Translate research requirements into robust, scalable system designs


Must-Haves




  • PhD in Computer Science, Engineering, or a related field (or equivalent experience)



  • Strong foundation in software engineering, systems design, and distributed systems



  • Experience with cloud platforms (AWS, GCP, or Azure)



  • Proficient in Python and at least one systems-level language (C++/Rust/Go)



  • Hands-on experience with Docker, Kubernetes, and CI/CD workflows



  • Familiarity with ML frameworks like PyTorch or TensorFlow from a systems perspective



  • Understanding of GPU programming and high-performance infrastructure


Nice-to-Haves




  • Experience with large-scale ML training clusters and GPU orchestration



  • Knowledge of LLM-serving tools (vLLM, TensorRT, ONNX Runtime)



  • Experience with distributed training strategies (e.g., data/model/pipeline parallelism)



  • Familiarity with orchestration tools like Kubeflow or Airflow



  • Background in performance tuning, system profiling, and MLOps best practices


At Phizenix , we’re committed to supporting diverse and inclusive teams. This is your chance to shape the systems that power the next generation of AI innovation. Let’s build the future—together.

California Pay Range

$180,000 - $200,000 USD

Posted 2025-11-25

Recommended Jobs

Line Cook

Doublz
Whittier, CA

Company Overview Doublz has been proudly serving the greater Los Angeles area for the last 25 years with the freshest food. At Doublz, quality and freshness are everything. Doublz is not your ordi…

View Details
Posted 2025-12-21

Retail Store Supervisor

Sunnyside
Stanton, CA

Retail Store Supervisor Location Pittsburgh, PA  (Stanton Heights area) : COMPANY OVERVIEW Cresco Labs is one of the largest public, vertically integrated, multistate operators in the cannabis indus…

View Details
Posted 2026-01-10

Sales Representative - Uniform

Cintas Corporation
Pittsburg, CA

Requisition Number: 214594  Job Description Cintas is seeking a Sales Representative to focus on new business-to-business account development in our Uniform Division. Responsibilities include p…

View Details
Posted 2026-01-03

QA Manager

Veeva Systems
Pleasanton, CA

Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in histo…

View Details
Posted 2026-01-10

Senior Software Engineer (Flight Software)

Anduril Industries
Costa Mesa, CA

About the Team Anduril’s Maritime Division has assembled a diverse team of experts in software, robotics, artificial intelligence, sensor fusion, and data analysis to create software and hardwar…

View Details
Posted 2026-01-07

Data Scientist AI Strategy & Implementation

Adidev Technologies
Los Angeles, CA

Role: Data Scientist – AI Strategy & Implementation (This role is open to US Citizens, Green Card holders, GC-EAD only. We do not sponsor visas.)   Adidev Technologies Inc Adidev Technolo…

View Details
Posted 2025-11-25

Senior Accountant

Ejam
Santa Ana, CA

Location: Santa Ana, CA Type: Full-Time Salary: $88,000 - $95,000 Overview: The Senior Accountant will support the month-end close process, reconciliations, and daily accounting operati…

View Details
Posted 2025-12-16

Staff Software Engineer, Infrastructure

Waymo
Mountain View, CA

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildi…

View Details
Posted 2025-12-25

Principal Product Manager, Partnerships

Udemy
San Francisco, CA

Where we work Udemy is a global company headquartered in San Francisco, with additional U.S. offices in Denver and Austin, and international hubs in Australia, India, Ireland, Mexico, and Türkiye.…

View Details
Posted 2025-11-28