ML Infrastructure Engineer
Menlo Park, CA | On-Site | Full-Time/Direct Hire
Client Opportunity | Through Phizenix
Phizenix, a certified minority and women-led recruiting firm, is hiring on behalf of an AI startup pioneering diffusion-based large language models—built for faster generation, multimodal integration, and scalable enterprise deployment.
We’re looking for an ML Infrastructure Engineer to help build the infrastructure that powers large-scale model training and real-time inference. You’ll collaborate with world-class researchers and engineers to design high-performance distributed systems that bring advanced LLMs into production.
Responsibilities
Design and manage distributed infrastructure for ML training at scale
Optimize model serving systems for low-latency inference
Build automated pipelines for data processing, model training, and deployment
Implement observability tools to monitor performance in production
Maximize resource utilization across GPU clusters and cloud environments
Translate research requirements into robust, scalable system designs
Must-Haves
PhD in Computer Science, Engineering, or a related field (or equivalent experience)
Strong foundation in software engineering, systems design, and distributed systems
Experience with cloud platforms (AWS, GCP, or Azure)
Proficient in Python and at least one systems-level language (C++/Rust/Go)
Hands-on experience with Docker, Kubernetes, and CI/CD workflows
Familiarity with ML frameworks like PyTorch or TensorFlow from a systems perspective
Understanding of GPU programming and high-performance infrastructure
Nice-to-Haves
Experience with large-scale ML training clusters and GPU orchestration
Knowledge of LLM-serving tools (vLLM, TensorRT, ONNX Runtime)
Experience with distributed training strategies (e.g., data/model/pipeline parallelism)
Familiarity with orchestration tools like Kubeflow or Airflow
Background in performance tuning, system profiling, and MLOps best practices
At Phizenix, we’re committed to supporting diverse and inclusive teams. This is your chance to shape the systems that power the next generation of AI innovation. Let’s build the future, together.
California Pay Range
$180,000 - $200,000 USD